This repository contains resources for the Pittsburgh GOSSIS Datathon. GOSSIS is the Global Open Source Severity of Illness Score.
- Getting started
- Documentation
- Databases on BigQuery
- Analysing data with Google Colab
- An example in Python
- An example in R
The datasets are hosted on Google Cloud, which requires a Gmail account to manage permissions.
- Create a Gmail account, if you don't already have one. It will be used to manage your access to the resources.
- Give your gmail address to the session hosts via the form provided in the data access instructions.
We will be working with two care datasets during the event:
- WIDS datathon data (GOSSIS subset): https://physionet.org/content/widsdatathon2020/
- GOSSIS-1 Dataset (eICU only): https://physionet.org/content/gossis-1-eicu
BigQuery is a database system that makes it easy to explore data with Structured Query Language ("SQL"). There are several datasets on BigQuery available for you to explore, including widsdatathon2020
(the WIDS Datathon data) and gossis_1_eicu
(the GOSSIS-1 eICU Dataset).
-
At the top of the console, select
gossis-datathon
as the project. This indicates the account used for billing. -
"Pin" a project to the resources menu to view available datasets. In the Resources menu on the left, click "Add data", "Pin a project", then add the following project names:
physionet-data
. -
You should be able preview the data available on these projects using the graphical interface.
-
Now try running a query. For example, try counting the number of rows in the demo eICU patient table:
SELECT count(*) FROM `physionet-data.widsdatathon2020.training_v2`
Python is an popular programming language for analysing data. We will explore the data using Python notebooks, which allow code and text to be combined into executable documents. First, try opening a blank document using the link below:
Several tutorials are provided below. Requirements for these notebooks are: (1) you have a Gmail account and (2) your Gmail address has been added to the appropriate Google Group by the workshop hosts.
- Coming shortly...
If you prefer working in R, then you can connect to Google Cloud from your code in a similar way:
- Coming shortly...