-
Notifications
You must be signed in to change notification settings - Fork 2
/
Copy pathREADME
185 lines (144 loc) · 6.02 KB
/
README
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
###############################################################################
#
# README created 10/02/2009
#
# INSTALL and test instructions for pGen/DomTHREADER
# VERSION 8.9
# Author: [email protected] [email protected]
#
# Change in 8.9
# - addition of mtx2con
# - Added predict_contacts.pl to bin/
# - added svm probability_model to data/
#
# Changes in 8.7
# - fixes the psuedo-multialignment building scripts in BLAST+/
#
# Changes in 8.6
# - consolidation of BLAST+scripts
# - new genthreader.sh script for both domthreader and genthreader for BLAST+
#
# Changes in 8.5
# - Some minor bug fixes to the BLAST+ methods
# - Added the ability to switch off psiblast in the MSA methods when seeded
# with the complete set of homologues
#
# Changes in 8.4
# - Several bug fixes for the MSA methods
#
# Changes in 8.3
# - Addition of MSA and BLAST+ methods
#
# Changes in 8.3.1
# - Ligand contact prediction removed
#
# Changes in 8.3.2
# - Ansible installer added
#
###############################################################################
INSTALLATION VIA GITHUB AND ANSIBLE:
First ensure that ansible and PSIPRED are installed on your system, then clone
the github repo. PSIPRED is available at https://github.com/psipred/psipred.
Use the ansible installer provided for PSIPRED.
For pGenTHREADER
% pip install ansible
% git clone https://github.com/psipred/pGenTHREADER.git
% cd pGenTHREADER/ansible_installer
Next edit the the config_vars.yml to reflect where you would like pGenTHREADER and
its underlying data to be installed. The app_locations variable MUST be the same
as you used for the PSIPRED install. You can choose a version of uniref but
we recommend uniref90 for most accurate results.
You can now run ansible as per
% ansible-playbook -i hosts install.yml
You can edit the hosts file to install pGenTHREADER on one or more machines.
Manual installation can be completed with the steps below
Follow the steps:
#-----------------------------------------------------------------------------#
#
# Untar and install the fold libraries
#
#-----------------------------------------------------------------------------#
1. cd into tdb directory
prompt> cd tdb
2. unpack the fold libraries
prompt/tdb> tar -xvzf cath_tdb.tgz fold_lib.tar.gz
NOTE: At present there is no software available to create fold libraries, but
up to date libraries can be downloaded from our site. Support for user-created
fold libraries will be added in a future release.
#------------------------------------------------------------------------------
#
# Build the binaries for your system
#
#------------------------------------------------------------------------------
3. cd into the src directory
4. run the makefile
prompt/src> make
5. install the compiled binary
prompt/src> make install
#------------------------------------------------------------------------------
#
# Configure the shell script for your system
#
#
#------------------------------------------------------------------------------
6. cd back to the top level directory
7. open the shell script pGenThreader.sh in an editor
8. change the paths at the top to suit your install
9. install PSIPRED and NCBI BLAST if you haven't already (these are
NOT included in this package).
#------------------------------------------------------------------------------
#
# Run the test jobs
#
#
#------------------------------------------------------------------------------
9. prompt> sh pGenThreader.sh pgen_test test/test.fsa
10.prompt> sh pDomThreader.sh pdom_test test/test.fsa
11.prompt> sh GenThreader.sh -j test -i test/test.fsa
#------------------------------------------------------------------------------
#
# Example outputs
#
#
#------------------------------------------------------------------------------
11. Compare your ouputs with the examples found within the test directory :
test/pGT.output and test/pDT.output
#------------------------------------------------------------------------------
#
# Trouble shooting
#
#
#------------------------------------------------------------------------------
1. Check with your system administrator that you have installed the binaries
correctly and that you have configured the shell scripts appropriately for
your system
2. Send a polite Email to one of the GenThreader team: or alternatively try
[email protected] if you are desperate!
3. Confused about the BLAST database to use? We recommend any large, relatively
sequence complete non-redundant protein database. Uniref90 is a good choice.
You may also wish to download pfilt to mask low complexity and repetative
sequences in the database.
#------------------------------------------------------------------------------
#
# Experimental BLAST+, mutliple alignment and contact prediction support
#
#------------------------------------------------------------------------------
1. /BLAST+ now contains a script for running pGenTHREADER and pDomTHREADER
with BLAST+; GenThreader.sh. This requires the new chkparse code which you
should find in src/ and bin/
2. Copy then run this script from the root genTHREADER directory not from the
BLAST+/ directory.
3. GenThreader.sh will take multiple sequence alignments input. This
functionality only works with BLAST+. Takes optional command line variables
which indicate if the msa is Complete (includes all relevant close and
distant relatives); Partial (includes all relevant close relavties but no
distant relatives) or Seed (does not include all relevant close or distant
relatives)
4. Output of multiple alignments: This requires a copy of MAFFT is installed.
You must change the MAFFT_DIR global variable at the top of the
mafft_spliced_alignment.pl script to the path to your mafft bin/ directory.
5. The GenThreader.sh script can be set to output a map of the known ligand
contacts in the genthreader hits, using the -c command line parameter.
This output has the name contactcons. This will also output jalview input
files for each pairwise alignment in the genthreader results. There can be
found in jalview_inputs