Skip to content

Latest commit

 

History

History

scripts

Development

Data documentation

Here you will find all the different scripts and tools that we use to generate the complete dataset. Most of the pipelines have been integrated into our cowidev library. For more details, please visit the official documentation

Directory overview

Folder Description
docs/ Sphinx documentation source files. See it live.
grapher/ Internal OWID files to power our grapher visualizations.
input/ External files used to compute derived metrics, such as X-per capita, and aggregate groups, such as 'Asia', etc.
output/ Temporary files. Only for development purposes. Use it at your own risk.
src/cowidev/ cowidev library. It contains the code for almost all project's pipelines.
scripts Legacy folder. Contains some parts of the code, such as the COVID-19 testing collection scripts. The code is a mixture of R and Python scripts.
config.yaml Data pipeline configuration file. The default values should be working.

Our data pipeline exports its outputs to public/data.

Data pipeline

Our data pipeline is built from several pipelines (e.g. vaccinations, testing, etc.), which are executed independently. for an overview, refer to our documentation

Contribute

We welcome contributions for all of our processes. There are two types of contributions:

  • Maintenance/Enhancements: Improve processes currently available in the library (e.g. add a new country scrapper for the vaccinations data).
  • New features: Create a new process in the library.

For more details, read our contribution guideline.