A repository of data-cleaning scripts designed to process the JHU CSSE COVID-19 time-series datasets. The scripts handle global and US-specific CSV files containing daily reports of confirmed cases, deaths, and recoveries across various administrative regions.
Use Cases
- Clean raw JHU CSSE CSV files to create a unified time-series for longitudinal trend analysis.
- Map global infection rates using the Lat and Long coordinate columns provided in the source data.
- Aggregate daily case counts by Country/Region to compare national pandemic trajectories.
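
As a rough illustration of the first and third use cases, the sketch below reshapes a wide CSV (one column per date) into long records and sums them by Country/Region. It is not the repository's own code: the function names `wide_to_long` and `totals_by_country` are hypothetical, the sample rows are made up, and only the Python standard library is assumed.

```python
import csv
import io
from collections import defaultdict

# Made-up sample in the wide layout (one column per date); values are illustrative only.
SAMPLE = """Province/State,Country/Region,Lat,Long,1/22/20,1/23/20
,Examplia,10.0,20.0,1,3
North,Otherland,30.0,40.0,2,5
South,Otherland,31.0,41.0,0,4
"""

# Identifier columns named in the source data; every other column is treated as a date.
ID_COLUMNS = ("Province/State", "Country/Region", "Lat", "Long")


def wide_to_long(text):
    """Turn each (row, date column) pair into one long-format record."""
    records = []
    for row in csv.DictReader(io.StringIO(text)):
        for column, value in row.items():
            if column in ID_COLUMNS:
                continue
            records.append({
                "country": row["Country/Region"],
                "province": row["Province/State"],
                "date": column,
                "count": int(value),
            })
    return records


def totals_by_country(records):
    """Sum counts over provinces for each (country, date) pair."""
    totals = defaultdict(int)
    for rec in records:
        totals[(rec["country"], rec["date"])] += rec["count"]
    return dict(totals)


long_rows = wide_to_long(SAMPLE)
country_totals = totals_by_country(long_rows)
```

Keeping the reshape and the aggregation as separate steps means the long-format records can be reused for other groupings, such as per-province series.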
Strengths
- Processes time-series CSV files sourced from the JHU CSSE GitHub repository.
- Standardizes geographic columns including Province/State, Country/Region, Lat, and Long.
- Normalizes daily counts for the Confirmed, Deaths, and Recovered metrics when date column headers are formatted inconsistently.
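
One way the date-header normalization could work is sketched below. This is an assumption about the approach, not the repository's actual implementation: the helper names `normalize_date_header` and `normalize_headers` are hypothetical, and the list of candidate formats is a guess that would need extending for other header styles.

```python
from datetime import datetime

# Candidate formats are an assumption; extend the tuple for other header styles.
DATE_FORMATS = ("%m/%d/%y", "%m/%d/%Y", "%Y-%m-%d")


def normalize_date_header(header):
    """Return the header as YYYY-MM-DD, or None if it is not a recognised date."""
    cleaned = header.strip()
    for fmt in DATE_FORMATS:
        try:
            return datetime.strptime(cleaned, fmt).date().isoformat()
        except ValueError:
            continue
    return None


def normalize_headers(headers):
    """Rewrite date headers to ISO form and leave other headers untouched."""
    return [normalize_date_header(h) or h.strip() for h in headers]


headers = ["Province/State", "Lat", "1/22/20", "01/23/2020", "2020-01-24 "]
normalized = normalize_headers(headers)
```

Non-date headers such as Province/State fail every format and pass through unchanged, so the same function can be applied to a full header row.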