forked from rlabduke/reduce
-
Notifications
You must be signed in to change notification settings - Fork 0
/
README.usingReduce.txt
236 lines (199 loc) · 11.7 KB
/
README.usingReduce.txt
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
(reduce - JMW - 6/3/03)
INTRODUCTION
This describes the pre-compiled executable for reduce, developed by
J. Michael Word. Reduce is a (UNIX/Linux/MacOSX/Windows) program
for adding hydrogens to a Protein DataBank (PDB) molecular structure
file. It was developed at Duke University in the lab of David and Jane
Richardson. Reduce is described in:
Word, et. al. (1999) Asparagine and Glutamine: Using Hydrogen Atom
Contacts in the Choice of Side-chain Amide Orientation, J. Mol. Biol.
285, 1733-1747.
Both proteins and nucleic acids can have hydrogens added. HET groups
can also be processed as long as the atom connectivity is provided.
A slightly modified version of the connectivity table provided by
the PDB is included (as a gzip compressed text file). Reduce will
attempt to look for this information, in the standard location,
"/local/reduce_het_dict.txt". If you place this file somewhere other
than /local or want to refer to a different connection file for a
particular project, use the -DB command line flag to point to the new
connection file, or set an environmental variable prior to running reduce-
setenv REDUCE_HET_DICT connectionfilename
A C++ PROGRAM
The latest version of reduce is available at
http://kinemage.biochem.duke.edu/
Version 2 of reduce is a C++ program and written on a Silicon Graphics
workatation. For low level processing of PDB file records it uses
Pdb++, our modified version of a class developed as part of Midas
by researchers at the University of California, San Francisco.
NEW FEATURES
5/31/01 SegmentID to Chain Mapping
It is now possible to process a PDBfile where multiple chains are
identified by their segment ID and the chain ID is missing (certain
xtallographic model building and refinement packages do this). A new
flag -SEGIDmap "seg,c..." allows you to define a set of segid->chainid
mappings. Reduce will then add the chainids and process PDBfile. In the
string of seg,c,seg,c mapping info, use _ to represent a blank.
INSTALLING REDUCE
1. Unpack the compressed executable and other files using the appropriate tar command for your platform. You should find in the package:
- reduce.X.XX.YYMMDD.platform - the executable
- reduce_het_dict.txt - HET library
- README.usingReduce.txt - this file
2. Place the executable wherever you keep your unix programs -
we use /usr/local/bin
3. Place the HET library in /usr/local
(but see below as you can place it where you want and tell Reduce where to find it via setting the global variable, REDUCE_HET_DICT
4. Check your file ownerships and permissions and set as appropriate for your computer's use policy. We set root as the owner and group; permissions are set at r-x for all cases.
RUNNING REDUCE
In most circumstances, the recommended command when using reduce to
add hydrogens to a PDB file and standardize the bond lengths of existing
hydrogens is
reduce -build coordfile.pdb > coordfileH.pdb
which includes the optimization of adjustable groups (OH, SH, NH3+,
Met-CH3, and Asn, Gln and His sidechain orientation). When speed is
important, the -build option can be dropped; hydrogens will still be
added, but not His side-chain NH hydrogens, and side-chains will not
be flipped. For even greater speed, but even less accuracy, adding
-nooh and -noadj will skip the OH and SH hydrogens and eliminate
optimization altogether. Input is from the specified PDB format
coordinate file and the new, updated PDB coordinates are written to
"standard output", here redirected to a file with the '>' symbol.
Disulfides, covalent modifications, and connection of the ribose-phosphate
nucleic acid backbone, are recognized and any hydrogens eliminated
by bonding are skipped. When an amino acid main-chain nitrogen is not
connected to the preceeding residue or some other group, reduce treats
it as the N-terminus and constructs an NH3+ only if the residue number
is less than or equal to an ajustable limit (1, by default). Otherwise,
it considers the residue to be the observable beginning of an
actually-connected fragment and does not protonate the nitrogen.
Reduce does not protonate carboxylates (including the C-terminus)
bacause it does not specifically consider pH, instead modeling a
neutral environment.
Hydrogens are positioned with respect to the covalently bonded neighbors
and these are identified by name. Non-standard atom names are the
primary cause of missing or misplaced hydrogens. If reduce tries to
process a file which contains hydrogens with non-standard names,
the existing hydrogens may not be recognized and may interfere
with the generation of new hydrogens. The solution may be to remove
existing hydrogens before further processing.
Hydrogens can be removed from a pdb format file with reduce.
reduce -trim 1abcH > 1abc
This can be used, for example, to update the orientation of Asn/Gln/His
side chains where the H atoms are not wanted; first build the hydrogens
and then trim them back out. Trimming can occasionally be fooled if a
hydrogen has been given a non-standard name. The most common example
of this comes from left-justified atom names: gamma hydrogens masquerade
as mercury atoms! In this case, manual editing may be required.
The following brief description of the command line flags is displayed
with the -h flag:
$ reduce -h
reduce: version 2.20 6/03/03, Copyright 1997-2003, J. Michael Word
arguments: [-flags] filename or -
Adds hydrogens to a PDB format file and writes to standard output.
(note: By default, HIS sidechain NH protons are not added. See -BUILD)
Flags:
-Trim remove (rather than add) hydrogens
-NOOH remove hydrogens on OH and SH groups
-OH add hydrogens on OH and SH groups (default)
-HIS create NH hydrogens on HIS rings
-FLIPs allow complete ASN, GLN and HIS sidechains to flip
(usually used with -HIS)
-NOHETh do not attempt to add NH proton on Het groups
-ROTNH3 allow lysine NH3 to rotate (default)
-NOROTNH3 do not allow lysine NH3 to rotate
-ROTEXist allow existing rotatable groups (OH, SH, Met-CH3) to rotate
-ROTEXOH allow existing OH & SH groups to rotate
-ALLMEthyls allow all methyl groups to rotate
-ONLYA only adjust 'A' conformations (default)
-ALLALT process adjustments for all conformations
-NOROTMET do not rotate methionine methyl groups
-NOADJust do not process any rot or flip adjustments
-BUILD add H, including His sc NH, then rotate and flip groups
(except for pre-existing methionine methyl hydrogens)
(same as: -OH -ROTEXOH -HIS -FLIP)
-Keep keep bond lengths as found
-NBonds# remove dots if cause within n bonds (default=3)
-Model# which model to process (default=1)
-Nterm# max number of nterm residue (default=1)
-DENSity#.# dot density (in dots/A^2) for VDW calculations (Real, defau
-RADius#.# probe radius (in A) for VDW calculations (Real, default=0)
-OCCcutoff#.# occupancy cutoff for adjustments (default=0.01)
-H2OOCCcutoff#.# occupancy cutoff for water atoms (default=0.66)
-H2OBcutoff# B-factor cutoff for water atoms (Integer, default=40)
-PENalty#.# fraction of std. bias towards original orientation (default
-HBREGcutoff#.# over this gap regular HBonds bump (default=0.6)
-HBCHargedcut#.# over this gap charged HBonds bump (default=0.8)
-BADBumpcut#.# at this gap a bump is 'bad' (default=0.4)
-METALBump#.# H 'bumps' metals at radius plus this (default=0.865)
-NONMETALBump#.# 'bumps' nonmetal at radius plus this (default=0.125)
-SEGIDmap "seg,c..." assign chainID based on segment identifier field
-Xplor use Xplor conventions for naming polar hydrogens
-NOCon drop conect records
-LIMIT# max num iter. for exhaustive search (default=100000)
-NOTICKs do not display the set orientation ticker during processing
-SHOWSCore display scores for each orientation considered during proce
-FIX "filename" if given, file specifies orientations for adjustable groups
-DB "filename" file to search for het info
(default="/usr/local/reduce_het_dict.txt")
note: can also redirect with unix environment variable: REDUCE_HET_DICT
-Quiet do not write extra info to the console
-REFerence display citation reference
-Help more extensive description of command line arguments
FIXING AN ORIENTATION
At times it is useful to control the flip state or rotation angle of an
adjustable group when adding hydrogens, either because the correct
orientation has already been established, allowing the optimization time
to be reduced, or because a non-optimal orientation is sought.
One of the command line flags (-fix myfile.txt) takes a file containing
information about which conformation to set for one or more adjustable
groups. The colon delimited format is similar to the orientation data
that reduce prints in the header file
action:residueID:comment
(one line for each group to be fixed) and because spacing matters in the
residue identifier string, the easiest way to produce this file is to copy
and edit USER MOD records from reduce output. The action can be one of
three kinds, depending on residue type: O to leave in the original orientation,
F to flip the orientation, and R# to rotate a dihedral to an angle of #deg.
Using either O or F with His sidechains allows the protonation state to vary;
to specify a particular orientation and protonation state use F# where #
is the number of the state (1, 2 or 3 for the original orientation with H
(1) only on NE2, (2) only on ND1, or (3) doubly protonated; 4-6 for the
corresponding three flipped states).
CLIQUES
The current version of reduce uses brute-force enumeration to optimize
the conformations of ajustable groups. If a 'clique' of adjustble groups
is too large (> ~7) this sort of search technique is inadequate--the
enumeration will be abandoned and these groups will be left in their
original conformations. The cuttoff point is based on the total number
of permutations, which the user can control with the -limit# option.
Although we are considering more pwerful search techniques for these
situations, some work-around strategies have been developed.
First check to see if distinct chainIDs are provided for each chain.
Reduce does not support files which specify chain information only
in the segID field and can get confused.
Examination of the clique may reveal that the orientations of one or
more groups are obvious; for instance, they may interact with obligate
H-bond donors or acceptors. By fixing the orientation of these groups
(as described above), the total number of permutations is reduced. This
is especially effective if it breaks the clique into smaller sum-cliques
or singletons.
An alternative way to break up cliques is to rotate all the methionine
CH3s and lysine/N-terminus NH3+s in an initial pass, then keep them
fixed in a second pass.
reduce -nooh inputfile | reduce -build -norotmet -norotnh3 - > outputfileH
The single dash towards the end of the command line tells reduce to
read data piped ('|') from the first pass rather than from a file. A
fiew NH3+ H-bonds may have inferior geometry with this two pass approach
but the result is otherwise comparable to using -build alone and can
be combined with the previous approach, if neccessary. With this
technique, unusual cliques requiring many hours to process have been
converted into several smaller problems which wer all solved in a matter
of minutes.
CONTACT
If you use reduce, I would appreciate any comments you send my way.
J. Michael Word
e-mail: [email protected]
voice: (919)483-3522
Richardson Lab
Biochemistry Department
Duke University
Durham, NC USA 27710