Skip to content
@uc-cdis

Center for Translational Data Science

Center for Translational Data Science @ University of Chicago

Gen3 logo

Gen3

The University of Chicago, Center for Translational Data Science is the maintainer organization for the open-source Gen3 Data Platform. Some useful references are included below to help you learn more about Gen3.

Gen3 Community

Contributing code to Gen3

Gen3 software is completely open-source. Please refer to our Contributor Guidelines for how to contribute code to our repositories.

Chat with Gen3 staff and the community

If you have questions about operating or using a Gen3 system please contact us. We can be reached on slack, discussion board, or via email ([email protected]).

Events

We hold regular forums on a variety of topics to engage with and hear from the community. You can find previous and upcoming events here.

Gen3 releases

Gen3 releases typically happen every month. The lastest release notes can be found here.

User Guides

Setting up and operating a Gen3 data commons or mesh

Gen3 on Helm is the new preferred way to deploy a Gen3 commons or mesh. Using Helm, a package manager for Kubernetes, Gen3 can be installed against any cluster, including a local one such as Minikube or Rancher Desktop. Helm is a tool that streamlines the installation and management of applications on Kubernetes platforms. Helm plays a crucial role in simplifying the deployment and management of Gen3 components within your environment.

This option is suitable for all deployments. If you are ready to get started, you can follow our guide here.

Using a Gen3 system

To learn more about interacting with a Gen3 data commons or mesh including how to create a data dictionary, upload data and metadata, indexing files, and other topics please refer to our user guide found here.

Additional References

Learn More about Gen3

Check out gen3.org.

Learn More about the Center for Translational Data Science

The Center for Translational Data Science at the University of Chicago is developing the discipline of data science and its applications to problems in biology, medicine, healthcare and the environment. We develop and operate large scale data platforms to support research in topics of societal interest including cancer, cardiovascular disease, inflammatory bowel disease (IBD), birth defects, veterans’ health, pain management, opioid use disorder, and environmental science. We also develop new machine learning and AI algorithms over the data in our platforms.

Our center has been at the forefront of data sharing by continuously leveraging new approaches and new technology to enable world class science in a variety of fields. We have developed a number of important firsts: including one of the first large scale data clouds (NSF supported Open Science Data Cloud (2010-2016)), the first data cloud designed to host biomedical data and approved as a NIH Trusted Partner (Bionimbus Protected Data Cloud (2013-present)), the first large scale data commons (National Cancer Institute’s Genomic Data Commons (2016-present)), and the first set of services to create data ecosystems or meshes for biomedical data (Data Commons Frameworks Services for the Cancer Research Data Commons (2020-present)).

Today with our partners, we operate a data ecosystem comprising over 20 data commons that make over 17 PB of data available to the research community from nearly 1.7M patients. We provide access to this data via secure and compliant workspaces, while protecting patient privacy. These are all based on the open-source Gen3 data platform, that includes the Gen3 data commons, Gen3 Framework Services, and Gen3 Workspaces.

We are based in Chicago, but our work engages collaborators from across the world.

You can find out more here.

Pinned Loading

  1. fence fence Public

    AuthN/AuthZ OIDC Service

    Python 38 48

  2. indexd indexd Public

    Index service server

    Python 21 20

  3. data-portal data-portal Public

    Windmill: data portal for metadata submission and query

    JavaScript 29 44

  4. sheepdog sheepdog Public

    API for herding data submissions

    Python 11 17

  5. peregrine peregrine Public

    GraphQL search API service

    Python 12 9

  6. cloud-automation cloud-automation Public

    Automation for standing up Gen3 commons

    Shell 41 67

Repositories

Showing 10 of 213 repositories
  • uc-cdis/dcf-dataservice’s past year of commit activity
    Python 0 Apache-2.0 0 0 24 Updated Dec 4, 2024
  • gen3-frontend-framework Public

    A Frontend Framework for Gen3 data portals

    uc-cdis/gen3-frontend-framework’s past year of commit activity
    TypeScript 2 Apache-2.0 4 0 15 Updated Dec 4, 2024
  • pelican Public

    Sower jobs for PFB export & import

    uc-cdis/pelican’s past year of commit activity
    Python 3 Apache-2.0 4 0 8 Updated Dec 4, 2024
  • data-portal Public

    Windmill: data portal for metadata submission and query

    uc-cdis/data-portal’s past year of commit activity
    JavaScript 29 Apache-2.0 44 43 20 Updated Dec 4, 2024
  • cdis-manifest Public

    Manifests tracking service versions to release to each environment

    uc-cdis/cdis-manifest’s past year of commit activity
    HTML 8 Apache-2.0 7 0 185 Updated Dec 4, 2024
  • cloud-automation Public

    Automation for standing up Gen3 commons

    uc-cdis/cloud-automation’s past year of commit activity
    Shell 41 Apache-2.0 67 99 80 Updated Dec 4, 2024
  • indexd Public

    Index service server

    uc-cdis/indexd’s past year of commit activity
    Python 21 Apache-2.0 20 24 19 Updated Dec 4, 2024
  • gen3-helm Public

    Helm charts for Gen3 Deployments

    uc-cdis/gen3-helm’s past year of commit activity
    Smarty 9 Apache-2.0 23 5 13 Updated Dec 4, 2024
  • uc-cdis/gen3-terraform’s past year of commit activity
    HCL 1 Apache-2.0 0 0 5 Updated Dec 4, 2024
  • dictionaryutils Public

    python wrapper and metaschema for datadictionary

    uc-cdis/dictionaryutils’s past year of commit activity
    Python 4 Apache-2.0 7 4 9 Updated Dec 4, 2024