This code is implementation of our paper [CLARINET].
CLARINET is an attention based deep learning model that predicts whether a TF binds to a given DNA sequence.
CLARINET takes DNA sequence as one-hot vector and predicts 690 TF binding tasks.
The above picture is an overview of our model.
- pytorch
- keras
- Download data (http://deepsea.princeton.edu/media/code/deepsea_train_bundle.v0.9.tar.gz)
- Data includes
train.mat
,valid.mat
, andtest.mat
- Place that data at the
./data
folder
- Data includes
- Train model and save trained model to ./model folder
python clarinet.py train ./data ./model/[model_name]
- Evaluate trained model
python clarinet.py eval ./data ./model/[model_name]
python attn/analysis.py [] []
python snp/analysis.py [] []
ROC AUC scatter plot
Heatmap plot
ROC AUC plot