Enrichment Analysis Notebooks

Enrichment Analysis Use Cases. Tools to analyse plant biology knowledge graphs and find enriched traits or bioprocesses in differential gene expression data. The project uses data from Knetminer, ENSEMBL, EBI Gene Expression Atlas and others, exported as Knowledge graphs using Agrischemas/Bioschemas annotations.

Database Files Notebook

This notebook is intended mainly for KnetMiner developers. The files generated are already provided in the interactive_jupyter_notebook folder on Github.

There are two cells in the notebook:

Run the first cell to choose the species and concepts.
Run the second cell to get the download link for the database csv files.

Enrichment Analysis Notebook

This is the main notebook to perform the enrichment analysis.

The first two cells are mainly for the analysis:

Run the first cell to choose the species, concept and study/list.
Run the second cell to perform the analysis:
- If you chose "study", you will get a list of studies to choose from for the species.
- Then you will be asked whteher you want to use all genes in the study or filter the genes according to p-value.
- If you chose to filter, you will get an interactive line graph represtation for the p-values and the corresponding number of genes.
  You can also filter using the slider.
- Whether you chose list or study, the final result is two tables, which can be downloaded as csv files:
  - The first table shows the chosen genes and related ontology, with the evidence and the link to the network knowledge graphs on KnetMiner.
  - The second table shows the p-values for the ontology terms.

Other cells:

Run the cells in the "View whole tables section" if you want to print the complete tables in the notebooks.
To filter the first table (gene-concept), run the cells in the sections:
- choosing the ontology term to display the related genes
- or choosing a gene to display the related ontology terms

Running the notebooks using Binder

For running the jupyter notebook for Enrichment Analysis using KnetMiner SPARQL endpoint,
click on launch binder:

For running the jupyter notebook for gene-concept relations from the database,
click on launch binder:

Please Note: The binders take sometime to be built and might fail a couple of times before launching successfully.

Running the notebooks on local computer

Download the notebooks

Download the zipped folder for the interactive notebooks and the required files and unzip it.

Install conda and dependncies

Install Miniconda

Open the miniconda prompt and write the following commands:
(note: change "path/to/interactive_jupyter_notebook" to actual path)

conda create -n my-conda-env python=3.9.12          # create new virtual env
conda activate my-conda-env                         # activate environment in terminal
cd "path/to/interactive_jupyter_notebook"           # change directory to interactive_jupyter_notebook folder
pip install -r requirements.txt                     # install requirements
jupyter nbextension enable --py widgetsnbextension  # activate the widgets
jupyter notebook                                    # start server + kernel inside my-conda-env

Choose the notebook to run.

Requirements

python v3.9.12
pandas v1.3.3
numpy v1.21.2
matplotlib v3.4.3
scipy v1.7.3
sparqlwrapper v1.8.5
ipympl v0.9.2
ipywidgets v7.6.5
jupyter v1.0.0
ipykernel v6.9.1
jupyterlab

Name		Name	Last commit message	Last commit date
Latest commit History 171 Commits
NeoDash_dashboard		NeoDash_dashboard
images_for_HTML		images_for_HTML
interactive_jupyter_notebook		interactive_jupyter_notebook
working_files		working_files
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
interactive_jupyter_notebook.zip		interactive_jupyter_notebook.zip
postBuild		postBuild
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Enrichment Analysis Notebooks

Database Files Notebook

There are two cells in the notebook:

Enrichment Analysis Notebook

The first two cells are mainly for the analysis:

Other cells:

Running the notebooks using Binder

Running the notebooks on local computer

Download the notebooks

Install conda and dependncies

Requirements

About

Releases

Packages

Contributors 3

Languages

License

Rothamsted/knetgraphs-gene-traits

Folders and files

Latest commit

History

Repository files navigation

Enrichment Analysis Notebooks

Database Files Notebook

There are two cells in the notebook:

Enrichment Analysis Notebook

The first two cells are mainly for the analysis:

Other cells:

Running the notebooks using Binder

Running the notebooks on local computer

Download the notebooks

Install conda and dependncies

Requirements

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages