- Provide COVID19 dataset containing detailed information on Russia.
- Maintain CSSE compatibility
- Provide some higher level APIs for accessing the data.
- Close the project after a more systematic approch is developed
Disclamer: the author doesn't have relationships with any government or commercial organisations. The data provided here are collected from unreliable sources and may be not accurate. Use it at your own risk.
Отказ от ответственности: автор не имеет отношения к государственным или коммерческим организациям. Данные, приведенные здесь, собраны из ненадежных источников и могут быть неточными. Используйте их на свой страх и риск.
- csse_covid_19_data/csse_covid_19_daily_reports contains CSV files which were released by CSSE and later amended by us. Files released after March 25 were updates with additional information on Russian regions.
- csse_covid_19_data/csse_covid_19_time_series folder contains additional auto-generated timeseries on Russian regions.
- python3 folder contains Python development tools:
- covid19ru.check module for checking certain invariants
- covid19ru.fetch Yandex data fetcher
- covid19ru.access Data accessor API
- covid19ru.plot Matplotlib plotting
- https://github.com/CSSEGISandData/COVID-19
- Upstream world data by CSSE.
- Rospotrebnadzor
- The supposedly original official data source of COVID19 data in Russia. Data is published in Russian as a plain text. The source provides daily difference per region and current total for the whole state. Example: https://www.rospotrebnadzor.ru/about/info/news/news_details.php?ELEMENT_ID=14125
- Yandex COVID19 map
- The Yandex company provides current per-region numbers.
- NovelCoronaVirusChannel at Telegram
- Random COVID19 news in Russian.
- https://стопкоронавирус.рф//#
Originally, author filled the data on Moscow and Saint Petersburg manually,
based on Rospotrebnadzor
and NovelCoronaVirusChannel
data. Starting from
March, 25 we follow the below procedure:
- Fetch hourly data from
Yandex COVID map
- Fetching is done by running
monitor
function of the fetcher script - The data is saved into
pending
folder, stamped with UTC time.
- Fetching is done by running
- Fetch daily upstream updates by using regular
git fetch
manually. - If update is available,
- Rebase repository to
upstream/master
branch usinggit rebase
- For every
csse_covid_19_data/csse_covid_19_daily_reports
file which doesn't have russian details, do the following:- Determine the update time of 'Russia' record found in the world data.
The time is supposed to be UTC. The update time is often near
23:30
(supposedly UTC time). - Find the russian details dump in
pending
folder which has the closest UTC timestamp. - Update world information file by inserting russian details manually.
- Review the format compatibility (CSV fields order, date format, etc.).
- Update RU timeline by calling
ru_timeline_dump()
of access.py. - Run the checker script.
- Update plots by running plot script.
- Commit the changes to this repository, forcebly push (due to rebase) here.
- Determine the update time of 'Russia' record found in the world data.
The time is supposed to be UTC. The update time is often near
- Rebase repository to
- https://github.com/AlexxIT/YandexCOVID
- https://github.com/klevin92/covid19_moscow_cases
- https://github.com/wolfxyx/moscow-covid-19
Visualizations:
Python code to check the correctness of CSV filesPython stub checking the validity of basic CSV structure(see ./python3/src/covid19ru/check.py )Check less-trivial invariants
Python API to access the CSV data. It should handle the CSV format change which happened around 23.03.2020Pandas API(see ./python3/src/covid19ru/access.py)Provide compatibility level for data before 23.03.2020
Semi-automated data loader from Yandex. Ideally, we want to perform the following actions:Collect(starting fromConfirmed/Death/Recovered
info for each Russian city03-25-2020.csv
)Save this information in a temporary file to handle update gapSet correct value of Longitude/Latitude for Russian regionsFigure out what does 'Active' field mean and how to get it.- Seems that it is just
Confirmed-Deaths-Recovered
. One have to update the data which miss this value.
- Seems that it is just
Make periodical dumps of rospotrebnadzor cite. Try to track possible source of data inconsistency.Auto-generate timeseries- Daily update CSSE with Russian state information
- Find data on Russian regions for pre- 25.03.2020 period.
- Got an update on Komi republic.
- Decrease of confirmed cases in Amursk oblast is detected. The number falled from 17 to 6. Local news reported about 17 confirmed cases originally: https://vostok.today/33443-koronavirus-vs-prichiny-smertnosti-v-amurskoj-oblasti-bolshe-shansov-umeret-v-dtp-ili-ot-infarkta.html
- Still no updates on Komi republic
- Added code to dump RU timeline. The up-to-date RU timeline file is available csse_covid_19_data/csse_covid_19_time_series/time_series_covid19_confirmed_RU.csv
- Also added deaths timeline csse_covid_19_data/csse_covid_19_time_series/time_series_covid19_deaths_RU.csv
- No updates on Komi republic (3rd place among Russian regions) since 10.04.2020. Checked both Yandex and Rospotrebnadzor site.
- More breaking changes from the upstream. The following daily data files have
unmatching data foramt and extra symbols in the line ends:
03-21-2020.csv
03-29-2020.csv
03-30-2020.csv
04-06-2020.csv
- Updated issue CSSEGISandData#1523
- More errors come from checker script, this time on Crimea:
That means that Yandex counters decrease their values. We can't name the reason, probably there were some corrections. One possible reason - splitting the Crimea into
Error(file='COVID-19_plus_Russia/csse_covid_19_data/csse_covid_19_daily_reports/03-31-2020.csv', text='Confirmed decreased for Republic of Crimea from 20 to 16')
Crimea
andSevastopol
.
- Number of 'recovered' decreased in Sverdlovsk oblast
- Exact text of an error:
Error(file='COVID-19_plus_Russia/csse_covid_19_data/csse_covid_19_daily_reports/03-29-2020.csv', text='Recovered decreased in Sverdlov oblast from 3 to 1 (oh no!)'),
- Conflict resolved.
23-22-2020.csv
file seemed to be damaged by the upstream admins. - CSSEGISandData#1523
- Implemented Yandex data fetcher
Upstream format change: now
,,Moscow,Russia,2020-03-24 00:00:00,55.75222,37.61556,262,1,9,,"Moscow, Russia"
,,"Saint Petersburg",Russia,2020-03-22 00:00:00,59.93863,30.31413,16,0,2,,"Saint Petersburg, Russia"
We augmented CSV files from csse_covid_19_daily_reports
folder by adding lines
like:
Moscow,Russia,2020-03-21T00:00:00,5,0,0,55.75222,37.61556
"Saint Petersburg",Russia,2020-03-21T00:00:00,4,0,2,59.93863,30.31413
Original README.md starts here
This is the data repository for the 2019 Novel Coronavirus Visual Dashboard operated by the Johns Hopkins University Center for Systems Science and Engineering (JHU CSSE). Also, Supported by ESRI Living Atlas Team and the Johns Hopkins University Applied Physics Lab (JHU APL).
Visual Dashboard (desktop):
https://www.arcgis.com/apps/opsdashboard/index.html#/bda7594740fd40299423467b48e9ecf6
Visual Dashboard (mobile):
http://www.arcgis.com/apps/opsdashboard/index.html#/85320e2ea5424dfaaa75ae62e5c06e61
Lancet Article:
An interactive web-based dashboard to track COVID-19 in real time
Provided by Johns Hopkins University Center for Systems Science and Engineering (JHU CSSE):
https://systems.jhu.edu/
Data Sources:
- World Health Organization (WHO): https://www.who.int/
- DXY.cn. Pneumonia. 2020. http://3g.dxy.cn/newh5/view/pneumonia.
- BNO News: https://bnonews.com/index.php/2020/02/the-latest-coronavirus-cases/
- National Health Commission of the People’s Republic of China (NHC):
http://www.nhc.gov.cn/xcs/yqtb/list_gzbd.shtml - China CDC (CCDC): http://weekly.chinacdc.cn/news/TrackingtheEpidemic.htm
- Hong Kong Department of Health: https://www.chp.gov.hk/en/features/102465.html
- Macau Government: https://www.ssm.gov.mo/portal/
- Taiwan CDC: https://sites.google.com/cdc.gov.tw/2019ncov/taiwan?authuser=0
- US CDC: https://www.cdc.gov/coronavirus/2019-ncov/index.html
- Government of Canada: https://www.canada.ca/en/public-health/services/diseases/coronavirus.html
- Australia Government Department of Health: https://www.health.gov.au/news/coronavirus-update-at-a-glance
- European Centre for Disease Prevention and Control (ECDC): https://www.ecdc.europa.eu/en/geographical-distribution-2019-ncov-cases
- Ministry of Health Singapore (MOH): https://www.moh.gov.sg/covid-19
- Italy Ministry of Health: http://www.salute.gov.it/nuovocoronavirus
- 1Point3Arces: https://coronavirus.1point3acres.com/en
- WorldoMeters: https://www.worldometers.info/coronavirus/
- COVID Tracking Project: https://covidtracking.com/data. (US Testing and Hospitalization Data. We use the maximum reported value from "Currently" and "Cumulative" Hospitalized for our hospitalization number report ed for each state.)
Additional Information about the Visual Dashboard:
https://systems.jhu.edu/research/public-health/ncov/
Contact Us:
- Email: [email protected]
Terms of Use:
This GitHub repo and its contents herein, including all data, mapping, and analysis, copyright 2020 Johns Hopkins University, all rights reserved, is provided to the public strictly for educational and academic research purposes. The Website relies upon publicly available data from multiple sources, that do not always agree. The Johns Hopkins University hereby disclaims any and all representations and warranties with respect to the Website, including accuracy, fitness for use, and merchantability. Reliance on the Website for medical guidance or use of the Website in commerce is strictly prohibited.