Intelligent Recombination Experiments

This repository contains scripts and results related to structure- and homology-directed protein recombination experiments conducted with the Schreiter Lab.

SCHEMA-RASPP is a protocol for structure-directed recombination using a parent structure and multiple homologous sequences. We have forked the original CalTech repository and initialized the forked version as a submodule in this repository (origin is hbhargava/SCHEMA-RASPP). It is important to keep the submodule up to date by periodically running git submodule foreach git pull origin master to fetch the latest version.

Usage

The Python script compute_chimeras takes as input a single directory containing a subdirectory for each homolog to be recombined (each subdirectory contains a FASTA sequence file and, optionally, a PDB structure file). Several additional parameters are specified by the user in the course of the computation. The overall output is a list of chimeras with corresponding SCHEMA energies and mutation scores.

To begin a computation, run the following:

python compute_chimeras.py -i /full/path/to/input/dir -o /full/path/to/output/dir

This command will initiate the chimera generation process. The steps are outlined below.

Overview of compute_chimeras

Search input directory for potential parent sequences and structures and ask user which sequence is to be used for structure guidance and which are to be used as normal parent structures.
Build FASTA files for all selected sequences (including structure parent).
Compute sequence alignment of all parent sequences using ClustalOmega.
Build FASTA file with structure parent sequence and sequence from PDB file.
Compute sequence alignment for both parent sequences using ClustalOmega.
[In progress]: Compute the sequence identity for all possible pairs of homologs.
Copy parent PDB structure file to output folder for use by SCHEMA algorithm.
Use SCHEMA_RASPP.schemacontacts via SR_interlink to perform radial search and determine contacts present in structure.
Obtain desired number of crossovers from user and use SCHEMA_RASPP.rasppcurve via SR_interlink to compute the RASPP curve.
Obtain desired crossover locations from user and use SCHEMA_RASPP.schemaenergy via SR_interlink to compute SCHEMA energies and mutation scores for all chimeras.

Dependencies

numpy for miscellaneous computation
biopython for FASTA parsing and more
clustalomega for sequence alignment
picker for user multi-selection

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
SCHEMA_RASPP @ 4bdd1b0		SCHEMA_RASPP @ 4bdd1b0
microbial_opsin_outputs		microbial_opsin_outputs
microbial_opsins		microbial_opsins
.gitignore		.gitignore
.gitmodules		.gitmodules
README.md		README.md
SR_interlink.py		SR_interlink.py
compute_chimeras.py		compute_chimeras.py
compute_identity.py		compute_identity.py
picker.py		picker.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Intelligent Recombination Experiments

Usage

Overview of compute_chimeras

Dependencies

About

Releases

Packages

Languages

hbhargava7/intelligent-recombination-experiments

Folders and files

Latest commit

History

Repository files navigation

Intelligent Recombination Experiments

Usage

Overview of compute_chimeras

Dependencies

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages