The Data Science and Sharing Team (DSST) within the National Institute of Mental Health's (NIMH) Intramural Research Program (IRP) in Bethesda, MD, is seeking an aspiring data scientist for an on-site position for an on-site position via the NIH Postbac training program.
With a budget of $2.5B per year, the NIMH is among the largest divisions of the NIH and is the largest funder of research on mental disorders in the world. The NIMH mission is to transform the understanding and treatment of mental illnesses through basic and clinical research, paving the way for prevention, recovery, and cure. The IRP is the largest biomedical research institution on earth, where the unique funding environment allows scientists to conduct both long-term and high-impact science that would difficult to impossible in a grant-dependent institution.
The DSST is responsible for leading and supporting data-intensive scientific projects within the NIMH IRP. We work closely with the Machine Learning Team, as well as other IRP investigators (e.g. Armin Raznahan, Peter Bandettini, Robert Innis). Special emphasis is placed on reproducible analyses of high dimensional genetic, genomic, neuroimaging, and behavioral datasets.
We also work to implement the policy guidance from the White House Office of Science Policy to ensure free, immediate, and equitable public access to federally funded research, foster greater collaboration and innovation, and strengthen public trust.
Everything we create is open source and freely distributed. A full listing of research objects from out team as of December 2023 is available in our Board of Scientific Counsellors Report.
- Building tools and pipelines for reproducible data analysis using Python
- Implementing reproducible pipelines using both High Performance Computing (HPC) and Elastic Cloud Computing (ECC)
- Writing clear documentation and scholarly publications
- Advocating for open science best practices for reproducibility (e.g. Software and Data Carpentries, BrainHack Global, FAES)
- Creating, expanding, and supporting FAIR data standards in biomedical science (e.g. BIDS, NWB OME, GO)
- A recent undergraduate degree in a STEM field or equivalent knowledge and experience
- Strong coding skills in Python
- Experience with version control and continuous integration with git and github
- Experience working with biomedical data repositories and/or biobanks (e.g. The Adolescent Brain and Cognitive Development Project (ABCD), OpenNeuro, UK Biobank, GTEX, IDR: Image Data Repository, Dandi Archive)
- Experience with distributed, high-performance computing tools such as Docker/Apptainer and batch processing systems such as SLURM
- Recent examples of open access research objects (e.g. publications, code, datasets) on which you were a major contributor
- Submit your application to the [NIH Postbac Training Program]((https://www.training.nih.gov/research-training/pb/pb/)
- Send us an email at [email protected] to let us know you're interested in working with our team
We'll then review your application in the database and get back to you soon. We are currently reviewing applicants and expect to select a candidate by Friday, January 31, 2025
We especially encourage applications from members of underrepresented groups in the data science and biomedical research community. Other inquiries are also welcome.
The National Institutes of Health is an equal opportunity employer. This position will be based at NIMH in Bethesda, MD via a third party contracting firm who is able to provide visa sponsorship.