Comp_Genomics

Repository for the final Computational Genomics project, analyzing gliomas.

The data for this project can be found on the Chinese Glioma Genome Atlas (http://www.cgga.org.cn/download.jsp - Part C). It consists of a matrix of RNA-seq gene expression counts for 24000+ genes and a separate matrix of clinical information of each of the 325 patients/samples. The clinical data as well as the processed train and test data are included in this repository.

To replicate this project, download the raw expression and clinical data sets from the Chinese Glioma Genome Atlas. Load this into the same directory as the downloaded repository. First, run the data_processing script to create test and train data and labels. Save those data in the directory as well. The remaining code can be run in any order.

The code can be run on any RNA-seq expression count data as long as each sample is given a pre-determined label, and the format follows that of the original data.

Requirements: Python with the following packages:

numpy
sklearn
seaborn
matplotlib
scipy
pandas
pygmnormalize (git+https://github.com/ficusss/PyGMNormalize.git)
skfuzzy (-U scikit-fuzzy)
umap (only for the umap code)
collections (only for supervised)
torch (only for supervised)
mord (only for supervised)

All of the above can be installed using the pip command.

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
critical_gene_analysis		critical_gene_analysis
sample_data		sample_data
supervised_methods		supervised_methods
unsupervised_methods		unsupervised_methods
README.md		README.md
data_processing.ipynb		data_processing.ipynb
umap.ipynb		umap.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Comp_Genomics

About

Releases

Packages

Contributors 2

Languages

dsikka/Comp_Genomics

Folders and files

Latest commit

History

Repository files navigation

Comp_Genomics

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages