About the Covid-19 category

jeremy · March 16, 2020, 9:20pm

Please use this category to discuss covid-19. Focus particularly on content likely to be of interest to this community - i.e. data driven, technical, practical, accessible. If you’re working on a project, feel free to create a topic and ask for help! Note: you need to read for 10 mins and look at 3 posts before the system lets you create a new topic.

This is a wiki post, so feel free to edit this to add links to data sources, modeling tools, important threads, and so forth:

Data sources

(Please include details like update frequency, format, scope, fields, etc)

covid19dashboards.com

This is a fastai related project, made with fastpages, an open source blogging platform with special features for Jupyter Notebooks. This dashboard has helpful visualizations and links to datasets, and is 100% open source. Furthermore, each dashboard is created with a Jupyter notebook so you can see where the data comes from with all of the code. Pull requests are welcome – showcase your modeling and visualization skills!

Data is updated every hour by GitHub Actions
Primary data source is the John Hopkins COVID-19 Data Repository, which is updated several times daily. However there are other datasets
There has been over 200,000 page views at the time of this writing, so great way to get visibility for your projects.
A great way to get familiar with fastpages

Canadian COVID19 cases dataset.

Source: University of Toronto School of Public Health
Updates: Daily
Format: Google docs spreadsheet (3 tabs)
Scope: Canada and 12 subregions (Provinces, Territories)
Fields: See the 2nd tab in the spreadsheet for details.

R Shiny Dashboard

Includes download handlers to download data powering all graphs as well as information about where the source is for various graphs.
https://isaac-flath.shinyapps.io/coronavirus2/
– Filter by state to look at cases, deaths, recovered, hospitalizations data by state. Some available by date, and some graphs have smoothing applied
https://isaac-flath.shinyapps.io/Covid-19/
– Coronavirus by country data including tests, cases, recovered, NPI, mobility, etc. Can filter to change to look at different countries, mobility changes over tie, NPI policy changes (link to codebook to explain what categories are).
–Minimal ability to do some graph transformation in UI such as apply smoothing, log scale, cumulative vs new daily.

Modeling

(Including tutorials, simulation models, machine learning models, etc)

Bayesian modeling of growth rate predictions, by Thomas Wiecki one of the authors of PyMC3 - Resources:
- Jupyter Notebook
- The dashboard on covid19dashboards.com

Other threads

Fill me in!

madhavajay · March 17, 2020, 12:58am

@jeremy I was speaking with @MadeUpMasters and he is working on some interesting COVID-19 data analysis stuff. One thing that occurred to me is the lack of crowd sourced data. So I quickly made this yesterday and today (even though i’m not feeling great myself).
Is this kind of data of value to anyone? I am happy to make adjustments to the form to collect more valuable data points if needed. Everything is available in a Firestore NoSQL database.

I’m not great with front end so its a little rough around the edges but works.
https://corona-census.firebaseapp.com/

hamelsmu · March 18, 2020, 2:42am

@jeremy I also want to mention https://covid19dashboards.com/

Made with fastpages! It has examples of how to get relevant data, and useful visualizations, including some Bayesian models (by one of the core contributors to PyMC3). Would love more contributions from the fastai community- there has already been a few! It’s had 150k page views in only 4 days, so lots of visibility for people. Also a good way to get familiar with fastpages!

MicPie · March 18, 2020, 8:12am

COVID-19 image data collection on GitHub: https://github.com/ieee8023/covid-chestxray-dataset

However, this still seems to be starting, so no additional data like update frequency, etc.

jeremy · March 19, 2020, 2:10am

@hamelsmu please add to the wiki post at the top

hamelsmu · March 19, 2020, 2:46am

Done!

Brad_S · March 19, 2020, 3:33pm

similar vein I suspect: https://coronavirus-resources.esri.com/

this is an amazing set of resources and maps

salman · March 19, 2020, 6:51pm

@jeremy I tried to create a topic to share an interactive tool we built with the community, but could not do it (maybe, permission issues)

The tool helps you understand how your actions can prevent the spread of coronavirus in your country.

We would love to get some feedback from the fastai community.

App: https://corona-calculator.herokuapp.com/

Blog post: https://towardsdatascience.com/should-i-go-to-brunch-an-interactive-tool-for-covid-19-curve-flattening-6ab6a914af0

If you play with the tool, you’ll see there’s a huge difference between 5 social contacts a day and 20 a day. At < 5 contacts a day, the disease doesn’t spread. In many countries, at < 10 a day, demand remains within hospitals capacity to cope.

jeremy · March 19, 2020, 7:47pm

Thanks @salman. The site checks whether you’ve read 3 posts and spent 10 mins on the site first. These are Discourse’s defaults, to try to make sure that you’ve searched before posting. Can you try creating the topic again after doing that? Sorry for the trouble!

salman · March 20, 2020, 4:27am

I tried again after searching but could not create the topic. Here is the screenshot:

jeremy · March 20, 2020, 5:20pm

OK - I’ve manually set your account’s trust level, so you can now post. Sorry I should have thought of that in the first place!

salman · March 20, 2020, 6:16pm

Done, that worked! Thank you!

madhavajay · March 21, 2020, 2:40am

Okay, so tool has been rebooted and will launch in a few days. We have US MyChart integration and some US Hospitals signed up to use it.

Please share / retweet.

amritv · March 23, 2020, 2:39am

First crowd sourcing site I’ve seen in conjunction with Harvard Medical School and Boston Childrens - sister site to ‘flu near you’ that has been used by Boston Childrens. Provides visualizations of crowd sourced data.

nareshr8 · March 24, 2020, 4:37pm

Just got to know about this dataset.

Might be helpful for exploration

Dooby · March 25, 2020, 2:20am

gautam_e · April 7, 2020, 6:57pm

Germany’s Robert Koch Institute (RKI, federal agency responsible for public health) has a similar Dashboard to that of John Hopkins University (JHU):
https://experience.arcgis.com/experience/478220a4c454480e823b17327b2bf1d4

One of the interesting things is that the number of cases (Germany-wide) are typically lower than JHU (at times I have noticed differences of 10%, as I write this post its about 5%). Articles in Germany say that, that is related to the differences in the process of reporting and collecting this data. While RKI updates its data once a day, JHU does that several times a day. Furthermore, RKI relies on data from local public health authorities (this may result in delays of up to 2 days) while JHU appears to scan data from local newspapers to arrive at their numbers. I haven’t been able to figure out how exactly JHU does it. How do they avoid counting cases twice for example. Would be great if anyone could shed light on that. Is JHU less precise but faster? Perhaps time will tell.

Anyway, there’s an interesting pattern that emerges in the visualisation of the number of Covid-19 cases per day in Germany. The numbers drop on the weekends only to rise later and surpass those of ht previous week (some people might say its obvious?). That shows that it’s easy to make the mistaken conclusion that the curve is flattening based on data of a couple of days (politicians, have made that mistake, btw!).

Might all seem obvious to many, but I thought I’d share this info. Perhaps there are similar trends in other countries?

alfred0809 · April 8, 2020, 1:10am

Hats off to this man. I’m sure he doesn’t have a lot of free time to begin with. It’s reassuring to see world leaders act as such.

barty777 · April 8, 2020, 9:18am

@jeremy

Here’s a pre-trained Pytorch model along with the training code for COVID detection from lung X-rays: https://github.com/velebit-ai/COVID-Next-Pytorch

Sleuth · April 13, 2020, 7:30pm

A radiologist I know said there were some mislabeled cases. I don’t know how bad it is though, but he recommended me not to use it.