An agile data science project let’s a web app do it’s talking - take a look here.
What is the climate-news-db?
climate-news-db is a database of climate change newspaper articles.
Why are we building it?
The primary motivation is to provide a dataset for researchers to analyse how climate change is being covered in the media.
Where can I find it?
How is it built?
The climate-news-db is built mainly in Python:
- requests & Beautiful Soup to download and parse the article HTML into JSON
- S3 to store the raw HTML & parsed article JSON
- Jinja & Bootstrap for templating
- Flask App hosted on PythonAnywhere