About the Covid-19 category

Please use this category to discuss Covid-19. Focus particularly on content likely to be of interest to this community, i.e. data-driven, technical, practical, accessible. If you’re working on a project, feel free to create a topic and ask for help! Note: you need to read for 10 mins and look at 3 posts before the system lets you create a new topic.

This is a wiki post, so feel free to edit this to add links to data sources, modeling tools, important threads, and so forth:


Data sources

(Please include details like update frequency, format, scope, fields, etc)

covid19dashboards.com

This is a fastai related project, made with fastpages, an open source blogging platform with special features for Jupyter Notebooks. This dashboard has helpful visualizations and links to datasets, and is 100% open source. Furthermore, each dashboard is created with a Jupyter notebook so you can see where the data comes from with all of the code. Pull requests are welcome – showcase your modeling and visualization skills!

  • Data is updated every hour by GitHub Actions
  • Primary data source is the Johns Hopkins COVID-19 Data Repository, which is updated several times daily. Other datasets are used as well.
  • There have been over 200,000 page views at the time of this writing, so it is a great way to get visibility for your projects.
  • A great way to get familiar with fastpages

Canadian COVID19 cases dataset.

  • Source: University of Toronto School of Public Health
  • Updates: Daily
  • Format: Google docs spreadsheet (3 tabs)
  • Scope: Canada and 12 subregions (Provinces, Territories)
  • Fields: See the 2nd tab in the spreadsheet for details.

R Shiny Dashboard

  • Includes download handlers to download data powering all graphs as well as information about where the source is for various graphs.
  • https://isaac-flath.shinyapps.io/coronavirus2/
    – Filter by state to look at cases, deaths, recoveries, and hospitalizations. Some data is available by date, and some graphs have smoothing applied.
  • https://isaac-flath.shinyapps.io/Covid-19/
    – Coronavirus by country data including tests, cases, recovered, NPI, mobility, etc. Can filter to look at different countries, mobility changes over time, and NPI policy changes (with a link to the codebook explaining the categories).
    – Minimal ability to do some graph transformations in the UI, such as applying smoothing, log scale, and cumulative vs. new daily counts.
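For anyone wanting to reproduce those graph transformations outside the dashboard, here is a minimal sketch in plain Python. It is not the dashboard's code; the function names and the sample numbers are made up for illustration.

```python
import math


def daily_new(cumulative):
    """Convert a cumulative series into new-per-day counts."""
    # The first day has no predecessor, so its new count is the cumulative value itself.
    return [cumulative[0]] + [b - a for a, b in zip(cumulative, cumulative[1:])]


def moving_average(values, window=3):
    """Trailing moving average; early points average over the days available so far."""
    out = []
    for i in range(len(values)):
        chunk = values[max(0, i - window + 1): i + 1]
        out.append(sum(chunk) / len(chunk))
    return out


def log10_scale(values):
    """Log-scale transform; non-positive values map to None because log is undefined there."""
    return [math.log10(v) if v > 0 else None for v in values]


# Made-up cumulative case counts (not real data).
cumulative_cases = [1, 3, 7, 15, 31, 63]
new_cases = daily_new(cumulative_cases)
smoothed = moving_average(new_cases, window=3)
log_cases = log10_scale(cumulative_cases)
print(new_cases, smoothed, log_cases)
```

A trailing window is used so that each smoothed value only depends on data already reported; a centered window would look smoother but needs future days.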

Modeling

(Including tutorials, simulation models, machine learning models, etc)

Other threads

  • Fill me in!

@jeremy I was speaking with @MadeUpMasters and he is working on some interesting COVID-19 data analysis stuff. One thing that occurred to me is the lack of crowdsourced data. So I quickly made this yesterday and today (even though I’m not feeling great myself).
Is this kind of data of value to anyone? I am happy to make adjustments to the form to collect more valuable data points if needed. Everything is available in a Firestore NoSQL database.

I’m not great with front end, so it’s a little rough around the edges, but it works.
https://corona-census.firebaseapp.com/


@jeremy I also want to mention https://covid19dashboards.com/

Made with fastpages! It has examples of how to get relevant data, and useful visualizations, including some Bayesian models (by one of the core contributors to PyMC3). Would love more contributions from the fastai community; there have already been a few! It’s had 150k page views in only 4 days, so lots of visibility for people. Also a good way to get familiar with fastpages!


COVID-19 image data collection on GitHub: https://github.com/ieee8023/covid-chestxray-dataset

However, this project still seems to be getting started, so there are no additional details such as update frequency yet.


@hamelsmu please add to the wiki post at the top


Done!


In a similar vein, I suspect: https://coronavirus-resources.esri.com/

This is an amazing set of resources and maps.

@jeremy I tried to create a topic to share an interactive tool we built with the community, but could not do it (maybe a permissions issue).

The tool helps you understand how your actions can prevent the spread of coronavirus in your country.

We would love to get some feedback from the fastai community.

App: https://corona-calculator.herokuapp.com/

Blog post: https://towardsdatascience.com/should-i-go-to-brunch-an-interactive-tool-for-covid-19-curve-flattening-6ab6a914af0

If you play with the tool, you’ll see there’s a huge difference between 5 social contacts a day and 20 a day. At < 5 contacts a day, the disease doesn’t spread. In many countries, at < 10 a day, demand remains within hospitals’ capacity to cope.
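To give a feel for why the number of daily contacts matters so much, here is a back-of-the-envelope sketch. It is not the tool's model: the transmission probability per contact (0.02) and the infectious period (10 days) are assumptions picked purely so that the spread threshold lands at 5 contacts a day, matching the observation above.

```python
def reproduction_number(contacts_per_day, p_transmit=0.02, infectious_days=10):
    """Rough average number of people one infected person goes on to infect.

    p_transmit and infectious_days are illustrative assumptions, not values
    taken from the tool or from any epidemiological source.
    """
    return contacts_per_day * p_transmit * infectious_days


def spreads(contacts_per_day, **kwargs):
    """An outbreak grows only when each case causes more than one new case."""
    return reproduction_number(contacts_per_day, **kwargs) > 1.0


for contacts in (3, 5, 10, 20):
    r = reproduction_number(contacts)
    print(f"{contacts:>2} contacts/day -> R ~ {r:.1f}, spreads: {spreads(contacts)}")
```

Under these assumed numbers, going from 20 contacts a day to 5 cuts the reproduction number by a factor of four, which is the kind of gap the tool makes visible.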


Thanks @salman. The site checks whether you’ve read 3 posts and spent 10 mins on the site first. These are Discourse’s defaults, to try to make sure that you’ve searched before posting. Can you try creating the topic again after doing that? Sorry for the trouble!


I tried again after searching but could not create the topic. Here is the screenshot:

OK - I’ve manually set your account’s trust level, so you can now post. Sorry, I should have thought of that in the first place!


Done, that worked! Thank you! :)

Okay, so the tool has been rebooted and will launch in a few days. We have US MyChart integration and some US hospitals signed up to use it.

Please share / retweet.

First crowdsourcing site I’ve seen in conjunction with Harvard Medical School and Boston Children’s. It is a sister site to ‘flu near you’, which has been used by Boston Children’s. Provides visualizations of crowdsourced data.

Just got to know about this dataset.

Might be helpful for exploration


Germany’s Robert Koch Institute (RKI, the federal agency responsible for public health) has a dashboard similar to that of Johns Hopkins University (JHU):
https://experience.arcgis.com/experience/478220a4c454480e823b17327b2bf1d4

One of the interesting things is that the number of cases (Germany-wide) is typically lower than JHU’s (at times I have noticed differences of 10%; as I write this post it’s about 5%). Articles in Germany say that this is related to differences in the process of reporting and collecting the data. While RKI updates its data once a day, JHU does so several times a day. Furthermore, RKI relies on data from local public health authorities (this may result in delays of up to 2 days), while JHU appears to scan data from local newspapers to arrive at its numbers. I haven’t been able to figure out exactly how JHU does it. How do they avoid counting cases twice, for example? It would be great if anyone could shed light on that. Is JHU less precise but faster? Perhaps time will tell.

Anyway, there’s an interesting pattern that emerges in the visualisation of the number of Covid-19 cases per day in Germany. The numbers drop on the weekends, only to rise later and surpass those of the previous week (some people might say it’s obvious?). That shows how easy it is to mistakenly conclude that the curve is flattening based on a couple of days of data (politicians have made that mistake, btw!).

Might all seem obvious to many, but I thought I’d share this info. Perhaps there are similar trends in other countries?
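One common way to avoid being fooled by the weekend dip is to look at a 7-day rolling average, so every value covers exactly one of each weekday. Here is a minimal sketch with made-up daily counts (not RKI or JHU data) that grow week over week but are under-reported on Saturdays and Sundays.

```python
def rolling_mean(values, window=7):
    """Trailing mean over full windows only; the first window-1 days get no value."""
    return [
        sum(values[i - window + 1: i + 1]) / window
        for i in range(window - 1, len(values))
    ]


# Made-up daily counts, Monday to Sunday, for three weeks (illustration only).
daily = [
    100, 110, 110, 100, 100, 60, 50,
    120, 132, 132, 120, 120, 72, 60,
    144, 158, 158, 144, 144, 86, 72,
]

weekly_avg = rolling_mean(daily)

# Raw numbers: the first Sunday looks like a sharp drop compared with the Friday before.
print("Friday vs Sunday, week 1:", daily[4], daily[6])
# Smoothed numbers: the 7-day average keeps rising throughout.
print("7-day average:", [round(v, 1) for v in weekly_avg])
```

In this toy series the raw counts fall every weekend, yet the 7-day average never decreases, which is exactly the situation where a couple of days of data would suggest a flattening that is not there.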

Hats off to this man. I’m sure he doesn’t have a lot of free time to begin with. It’s reassuring to see world leaders act as such.

@jeremy

Here’s a pre-trained PyTorch model along with the training code for COVID detection from lung X-rays: https://github.com/velebit-ai/COVID-Next-Pytorch

A radiologist I know said there were some mislabeled cases. I don’t know how bad it is, but he recommended that I not use it.
