Over the past couple weeks I’ve been collecting various Covid-19 related datasets. It would be great if there’s a single repository that collects all of these in one place, but I thought I’d share my findings here, with a brief comment/description about each. These are from my personal notes, so please excuse the rough form of these. It could be a starting point to which others can add.
-
Comprehensive Covid-19 resources from ESRI/GIS
-
Basic Daily updated Johns Hopkins U data
-
Covid-19 - fast.ai forum
-
Auto-updated Dashboards built using fastpages and updated using github actions, from above JHU data:
- This is a fantastic set of notebooks that pull the JHU data, so we can re-use some of the pre-processing they do
- https://covid19dashboards.com/
- This one is especially interesting: Uses additional country-level covariates to estimate mortality rates:
-
Help With Covid – call for project proposals and volunteers
-
Web-app built from Streamlit
-
Impressive visualizer of country numbers
-
Kaggle data (snapshot of JHU data)
- It has multiple time series data, including age, gender
- https://www.kaggle.com/sudalairajkumar/novel-corona-virus-2019-dataset
-
Another Kaggle dataset:
- This is the simplest dataset, updated daily: [EDIT; FIXED THIS LINK]
date, region, lat, long, confirmed, dead, recovered
- https://www.kaggle.com/imdevskp/corona-virus-report
- This is the simplest dataset, updated daily: [EDIT; FIXED THIS LINK]
-
“Humanitarian data”, also possibly related to JHU data
-
Weather + Covid
- Reddit – [Project] I’ve compiled weather/climate date for the confirmed COVID19 infection sites, if anyone wants it
https://reddit.com/r/MachineLearning/comments/fh2rr6/project_ive_compiled_weatherclimate_date_for_the/
- Reddit – [Project] I’ve compiled weather/climate date for the confirmed COVID19 infection sites, if anyone wants it
-
Covid19 Tracker (US ONLY!) (https://covidtracking.com/)https://covidtracking.com/
date, state, positive, negative, pending, death, total
- 2020-Mar-04 to now
-
Our World In Data Coronavirus Source Data (from WHO)
date, country, new_cases, new_dead, tot_cases, tot_dead
- 2020-Feb-25 to now
- Nicely formatted version:
- How to pull data from data.world:
-
Microsoft covid19 tracker (but no raw data)
-
Call to Action to the Tech Community on New Machine Readable COVID-19 Dataset
- This is not numerical data, but text data from research articles, and the goal is to answer various questions with NLP
-
China data age sex fatality
-
Nice simulations
-
Singapore covid surprisingly detailed data and dashboard (MIT Tech Review)
-
Project notebook showing [[covid19]] detection from chest x-ray images
-
Scale-AI will provide free data-labeling/annotation service for researchers working with [[covid19]] datasets: