ASDmodel

Identifying associations of de novo noncoding variants with autism through integration of gene expression, sequence and sex information

Requirements

numpy==1.24.3
pandas==2.0.3
scikit_learn==1.3.0
scipy==1.12.0

Basic usage

usage: model.py [-h] [-v [VARIANTS]] [-g [GENES]] [-o [OUTPUT]]
                [-n [NEIGHBORS]] [-k [KMER]]

options:
  -h, --help            show this help message and exit
  -v [VARIANTS], --variants [VARIANTS]
                        Variants file
  -g [GENES], --genes [GENES]
                        Gene matrix file
  -o [OUTPUT], --output [OUTPUT]
                        Output file
  -n [NEIGHBORS], --neighbors [NEIGHBORS]
                        Size of each neighborhood, default 1000
  -k [KMER], --kmer [KMER]
                        Length of k-mers, default 6

The input variants file must have columns "Proband" (binary proband-sibling phenotype labels), "sequence" (DNA sequence) and "GC_100bp" (GC content of corresponding DNA sequence). Sample files can be found in the data folder. The gene matrix file must be tab-delimited, with a header and an index column. A sample gene matrix file can be found in the data folder.

Output format

The columns in the output file are:

Gene name
The local GC content Mann-Whitney U effect size, computed from dividing the U-statistic by the total number of comparisons
The local GC content Mann-Whitney U p-value
The K-mer Naive Bayes model Mann-Whitney U effect size on the testing fold
The K-mer Naive Bayes model Mann-Whitney U p-value on the testing fold
The local GC content Mann-Whitney U effect size on the testing fold
The local GC content Mann-Whitney U p-value on the testing fold All p-values reported are uncorrected.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
data		data
README.md		README.md
model.py		model.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ASDmodel

Requirements

Basic usage

Output format

About

Releases

Packages

Languages

luke0321li/ASDmodel

Folders and files

Latest commit

History

Repository files navigation

ASDmodel

Requirements

Basic usage

Output format

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages