Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Data Cultivation #4

Open
bayoumi17m opened this issue Apr 5, 2020 · 4 comments
Open

Data Cultivation #4

bayoumi17m opened this issue Apr 5, 2020 · 4 comments
Assignees

Comments

@bayoumi17m
Copy link
Collaborator

Find more data sources that showcase the following:

  1. Number of COVID+ patients that are inpatient with sever status (hospitalized)
  2. Number of COVID+ patients that are in ICUs
  3. Number of daily hospital admissions of COVID+ patients
  4. Number of COVID+ patients on ventilators

Ideally, we can stratify by age or other basic demographics (location within a region).

@bayoumi17m
Copy link
Collaborator Author

There is a lot of data online already for the US + for Other countries if we are willing to search through foreign language sites and use some google translate. Municipal websites (usually have github links) have been a good source. Another check, surprisingly is news articles!

From Iran: https://en.radiofarda.com/a/exclusive-67-000-suspected-coronavirus-cases-hospitalized-in-iran-/30515682.html

news reports can help make sense of more 'official' data if, e.g., columns are not well labeled.. did they mean 'hospitalizations' as census or as cumulative?

For the Italian data they have, they meant cumulative - they found out by talking with the owner of their data. They believe that the Spanish data is the same. In the Wuhan/Hubei data, they mean census. It is often hard to tell. But comparing to the numbers on given days in news stories can help make sense, and also can serve as datapoints from regions that don't post official numbers).

@bayoumi17m
Copy link
Collaborator Author

Data they currently have:

@bayoumi17m
Copy link
Collaborator Author

Note, even having 1 or 2 datapoints from a country ('e.g., there were 6k patients in the hospital on March 1, and 12K on April 1'), and knowing when they shut down or did other measures, is probably enough to make the data useful. That's why putting some work into finding news stories can be useful.

@bayoumi17m
Copy link
Collaborator Author

They have been very interested in comparing timing of different interventions across countries to the slowing or stopping of exponential growth of case counts according to the John's Hopkins COVID-19 data, which are here:

https://github.com/CSSEGISandData/COVID-19

I believe that the insights into what slows / stops the curve and timing (how long does it take?) gainable from that analysis could help inform the other task as well.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants