Enrichment Analysis Use Cases. Tools to analyse plant biology knowledge graphs and find enriched traits or bioprocesses in differential gene expression data. The project uses data from Knetminer, ENSEMBL, EBI Gene Expression Atlas and others, exported as Knowledge graphs using Agrischemas/Bioschemas annotations.
This notebook is intended mainly for KnetMiner developers. The files generated are already provided in the interactive_jupyter_notebook folder on Github.
-
Run the first cell to choose the species and concepts.
-
Run the second cell to get the download link for the database csv files.
This is the main notebook to perform the enrichment analysis.
-
Run the first cell to choose the species, concept and study/list.
-
Run the second cell to perform the analysis:
- If you chose "study", you will get a list of studies to choose from for the species.
- Then you will be asked whteher you want to use all genes in the study or filter the genes according to p-value.
- If you chose to filter, you will get an interactive line graph represtation for the p-values and the corresponding number of genes.
You can also filter using the slider.
- Whether you chose list or study, the final result is two tables, which can be downloaded as csv files:
- The first table shows the chosen genes and related ontology, with the evidence and the link to the network knowledge graphs on KnetMiner.
- The second table shows the p-values for the ontology terms.
- The first table shows the chosen genes and related ontology, with the evidence and the link to the network knowledge graphs on KnetMiner.
-
Run the cells in the "View whole tables section" if you want to print the complete tables in the notebooks.
-
To filter the first table (gene-concept), run the cells in the sections:
- choosing the ontology term to display the related genes
- or choosing a gene to display the related ontology terms
For running the jupyter notebook for Enrichment Analysis using KnetMiner SPARQL endpoint,
click on launch binder:
For running the jupyter notebook for gene-concept relations from the database,
click on launch binder:
Please Note: The binders take sometime to be built and might fail a couple of times before launching successfully.
Download the zipped folder for the interactive notebooks and the required files and unzip it.
-
Open the miniconda prompt and write the following commands:
(note: change "path/to/interactive_jupyter_notebook" to actual path)conda create -n my-conda-env python=3.9.12 # create new virtual env conda activate my-conda-env # activate environment in terminal cd "path/to/interactive_jupyter_notebook" # change directory to interactive_jupyter_notebook folder pip install -r requirements.txt # install requirements jupyter nbextension enable --py widgetsnbextension # activate the widgets jupyter notebook # start server + kernel inside my-conda-env
-
Choose the notebook to run.
- python v3.9.12
- pandas v1.3.3
- numpy v1.21.2
- matplotlib v3.4.3
- scipy v1.7.3
- sparqlwrapper v1.8.5
- ipympl v0.9.2
- ipywidgets v7.6.5
- jupyter v1.0.0
- ipykernel v6.9.1
- jupyterlab