US Buddhist Center data scraping

Scrape data from http://www.buddhanet.info/wbd/country.php?country_id=2.

Scraping code adapted from ChatGPT4, with this prompt:

Write a Python program that web scrapes http://www.buddhanet.info/wbd/country.php?country_id=2 and returns the result as JSON, with one entry per Buddhist center. 

Each entry has HTML like this:

<p class="entryName">96th Street Sangha</p>
<p class="entryDetail">
<strong>Address:</strong> 275 W. 96th Street, #4C New York, NY 10025                   &nbsp;  NY <br>
...
</p>
<hr>

The text inside the `<p class="entryName">` is the name of the Buddhist center.

Inside the `<p class="entryDetail">`, every `<strong>` is a new key name. The text or the <a> following that `<strong>` is the value for that key; if it's an <a>, I want to extract the text. The `<br>` separates each of those details.

Finally, the `<hr>` separates each Buddhist center entry.

To run

First, install Pants: https://www.pantsbuild.org/docs/installation

Scrape: pants run scrape.py
Tests: pants test :
Formatters: pants fix :

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
.gitignore		.gitignore
BUILD		BUILD
README.md		README.md
buddhist_centers.json		buddhist_centers.json
chicago.py		chicago.py
chicago_centers.json		chicago_centers.json
pants.toml		pants.toml
requirements.txt		requirements.txt
scrape.py		scrape.py
scrape_test.py		scrape_test.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

.gitignore

.gitignore

BUILD

BUILD

README.md

README.md

buddhist_centers.json

buddhist_centers.json

chicago.py

chicago.py

chicago_centers.json

chicago_centers.json

pants.toml

pants.toml

requirements.txt

requirements.txt

scrape.py

scrape.py

scrape_test.py

scrape_test.py

Repository files navigation

US Buddhist Center data scraping

To run

About

Releases

Packages

Languages

Eric-Arellano/buddhist-center-data-scraping

Folders and files

Latest commit

History

Repository files navigation

US Buddhist Center data scraping

To run

About

Resources

Stars

Watchers

Forks

Languages