Skip to content

Commit

Permalink
Merge pull request #11 from choderalab/sonya_changes
Browse files Browse the repository at this point in the history
Change to using scale-up calculation
  • Loading branch information
jchodera committed Feb 2, 2016
2 parents 24b40d8 + 22aa701 commit ed4de9e
Show file tree
Hide file tree
Showing 3 changed files with 79 additions and 79 deletions.
Binary file not shown.
Binary file modified manuscript/ms.pdf
Binary file not shown.
158 changes: 79 additions & 79 deletions manuscript/ms.tex
Original file line number Diff line number Diff line change
Expand Up @@ -99,7 +99,7 @@
Human kinase dysregulation has been linked to a number of diseases, such as cancer, diabetes, and inflammation, and as a result, much of the effort in developing treatments (and perhaps 30\% of \emph{all} current drug development effort) has focused on shutting down aberrant kinases with targeted inhibitors.
While insect and mammalian expression systems are frequently utilized for the expression of human kinases, they cannot compete with the simplicity and cost-effectiveness of bacterial expression systems, which historically had found human kinases difficult to express.
Following the demonstration that phosphatase coexpression could give high yields of Src and Abl kinase domains in inexpensive bacterial expression systems~\cite{seeliger:2005:protein-sci:kinase-expression}, we have performed a large-scale expression screen to generate a library of His-tagged human kinase domain constructs that express well in a simple automated bacterial expression system where phosphatase coexpression (YopH for Tyr kinases, lambda for Ser/Thr kinases) is used.
Starting from 96 kinases with crystal structures and any reported bacterial expression, we engineered a library of human kinase domain constructs and screened their coexpression with phosphatase, finding 68 kinases with yields greater than 2 mg/mL culture.
Starting from 96 kinases with crystal structures and any reported bacterial expression, we engineered a library of human kinase domain constructs and screened their coexpression with phosphatase, finding 52 kinases with yields greater than 2 mg/L culture.
All sequences and expression data are provided online at \url{https://github.com/choderalab/kinase-ecoli-expression-panel}, and the plasmids are in the process of being made available through AddGene.
\end{abstract}

Expand Down Expand Up @@ -128,7 +128,7 @@ \section{Introduction}
The protein databank (PDB) now contains over 100 human kinases that---according to the PDB data records---were expressed in bacteria.
Since bacterial expression is often complicated by the need to tailor expression and purification protocols individually for each protein expressed, we wondered whether a simple, uniform, automatable expression and purification protocol could be used to express a large number of human kinases to produce a convenient bacterial expression library to facilitate kinase research and selective inhibitor development.
As a first step toward this goal, we developed a structural informatics pipeline to use available kinase structural data and associated metadata to select constructs from available human kinase libraries to clone into a standard set of vectors intended for phosphatase coexpression.
Automated expression screening in Rosetta2 cells found that 68 human kinase domains express with yields greater than 2 $\mu$g/mL, which should be usable for biochemical, biophysical, screening, and structural biology studies.
Automated expression screening in Rosetta2 cells found that 52 human kinase domains express with yields greater than 2 mg/L culture, which should be usable for biochemical, biophysical, screening, and structural biology studies.

All code and source files used in this project can be found at \url{https://github.com/choderalab/kinase-ecoli-expression-panel}, and a convenient sortable table of results can be viewed at \url{http://choderalab.github.io/kinome-data/kinase\_constructs-addgene\_hip\_sgc.html}.

Expand Down Expand Up @@ -268,98 +268,98 @@ \subsection{PDB mining results}
\subsection{Small-scale kinase expression test in \emph{E. coli}}

A panel containing the 96 kinase domain constructs selected through our semi-automated method, was tested for expression in \emph{E. coli}.
From this initial test, 68 kinase domains showed detectable expression (yield of more than 2 ng/$\mu$l eluate) (Table~\ref{expression_table}).
While the initial panel of 96 kinases was well-distributed across kinase families, the final most highly expressing (yield of more than 100 ng/$\mu$l eluate) were not as evenly distributed (Figure~\ref{fig:kinome_expression_tree}).
From this initial test, 52 kinase domains showed reasonable expression (yield of more than 2 ng/$\mu$L eluate, which corresponds to 2 mg/L culture) (Table~\ref{expression_table}).
While the initial panel of 96 kinases was well-distributed across kinase families, the final most highly expressing (yield of more than 12 mg/L kinase) were not as evenly distributed (Figure~\ref{fig:kinome_expression_tree}).
The 17 most highly expressing kinases showed relatively high purity after elution, though we note that eluting via TEV site cleavage results in a quantity of TEV protease in the eluate (Figure~\ref{fig:caliper_image}).

\begin{table*}[]
\centering
\caption{{\bf Expression results by kinase.} Yield (determined by Caliper GX II quantitation of the expected size band) reported in ng/$\mu$l eluate, where total eluate volume was 120 $\mu$l from 900 $\mu$L bacterial culture.}
\caption{{\bf Expression results by kinase.} Yield (determined by Caliper GX II quantitation of the expected size band) reported in mg/L culture, where total eluate volume was 120 $\mu$l from 900 $\mu$L bacterial culture.}
\label{expression_table}
\footnotesize
\begin{tabular}{p{3.5cm}p{4cm}c}
\toprule
\bf{kinase expressed} & \bf{phosphatase co-expressed} & \bf{concentration (ng/$\mu$l eluate)} \\
\bf{kinase expressed} & \bf{phosphatase co-expressed} & \bf{expected scale-up culture (mg/L)} \\
\midrule
MK14\_HUMAN\_D0 & Lambda & 530 \\
VRK3\_HUMAN\_D0 & Lambda & 506 \\
GAK\_HUMAN\_D0 & Lambda & 485 \\
CSK\_HUMAN\_D0 & Truncated YopH164 & 469 \\
VRK1\_HUMAN\_D0 & Lambda & 467 \\
KC1G3\_HUMAN\_D0 & Lambda & 422 \\
FES\_HUMAN\_D0 & Truncated YopH164 & 330 \\
PMYT1\_HUMAN\_D0 & Lambda & 285 \\
MK03\_HUMAN\_D0 & Lambda & 273 \\
STK3\_HUMAN\_D0 & Lambda & 257 \\
DYR1A\_HUMAN\_D0 & Lambda & 256 \\
KC1G1\_HUMAN\_D0 & Lambda & 256 \\
MK11\_HUMAN\_D0 & Lambda & 238 \\
MK13\_HUMAN\_D0 & Lambda & 238 \\
EPHB1\_HUMAN\_D0 & Truncated YopH164 & 217 \\
MK08\_HUMAN\_D0 & Lambda & 214 \\
CDK16\_HUMAN\_D0 & Lambda & 202 \\
EPHB2\_HUMAN\_D0 & Truncated YopH164 & 188 \\
PAK4\_HUMAN\_D0 & Lambda & 179 \\
CDKL1\_HUMAN\_D0 & Lambda & 174 \\
SRC\_HUMAN\_D0 & Truncated YopH164 & 165 \\
STK16\_HUMAN\_D0 & Lambda & 155 \\
MAPK3\_HUMAN\_D0 & Lambda & 141 \\
PAK6\_HUMAN\_D0 & Lambda & 135 \\
CSK22\_HUMAN\_D0 & Lambda & 134 \\
MERTK\_HUMAN\_D0 & Truncated YopH164 & 126 \\
PAK7\_HUMAN\_D0 & Lambda & 110 \\
CSK21\_HUMAN\_D0 & Lambda & 109 \\
EPHA3\_HUMAN\_D0 & Truncated YopH164 & 106 \\
BMPR2\_HUMAN\_D0 & Lambda & 106 \\
M3K5\_HUMAN\_D0 & Lambda & 105 \\
KCC2G\_HUMAN\_D0 & Lambda & 100 \\
E2AK2\_HUMAN\_D0 & Lambda & 87 \\
MK01\_HUMAN\_D0 & Lambda & 84 \\
CSKP\_HUMAN\_D0 & Lambda & 76 \\
CHK2\_HUMAN\_D0 & Lambda & 61 \\
KC1G2\_HUMAN\_D0 & Lambda & 57 \\
DMPK\_HUMAN\_D0 & Lambda & 57 \\
KCC2B\_HUMAN\_D0 & Lambda & 53 \\
FGFR1\_HUMAN\_D0 & Truncated YopH164 & 46 \\
KS6A1\_HUMAN\_D1 & Lambda & 43 \\
DAPK3\_HUMAN\_D0 & Lambda & 30 \\
STK10\_HUMAN\_D0 & Lambda & 28 \\
KC1D\_HUMAN\_D0 & Lambda & 28 \\
KC1E\_HUMAN\_D0 & Lambda & 26 \\
NEK1\_HUMAN\_D0 & Lambda & 25 \\
CDK2\_HUMAN\_D0 & Lambda & 23 \\
ABL1\_HUMAN\_D0 & Truncated YopH164 & 19 \\
DAPK1\_HUMAN\_D0 & Lambda & 18 \\
DYRK2\_HUMAN\_D0 & Lambda & 18 \\
HASP\_HUMAN\_D0 & Lambda & 17 \\
FGFR3\_HUMAN\_D0 & Truncated YopH164 & 17 \\
EPHB3\_HUMAN\_D0 & Truncated YopH164 & 13 \\
SLK\_HUMAN\_D0 & Lambda & 12 \\
KCC2D\_HUMAN\_D0 & Lambda & 12 \\
NEK7\_HUMAN\_D0 & Lambda & 10 \\
PHKG2\_HUMAN\_D0 & Lambda & 10 \\
VRK2\_HUMAN\_D0 & Lambda & 9 \\
AAPK2\_HUMAN\_D0 & Lambda & 8 \\
AURKA\_HUMAN\_D0 & Lambda & 8 \\
MARK3\_HUMAN\_D0 & Lambda & 8 \\
KAPCA\_HUMAN\_D0 & Lambda & 7 \\
STK24\_HUMAN\_D0 & Lambda & 6 \\
VGFR1\_HUMAN\_D0 & Truncated YopH164 & 4 \\
KCC4\_HUMAN\_D0 & Lambda & 3 \\
KCC1G\_HUMAN\_D0 & Lambda & 2 \\
KCC2A\_HUMAN\_D0 & Lambda & 2 \\
FAK2\_HUMAN\_D0 & Truncated YopH164 & 2 \\
MK14\_HUMAN\_D0 & Lambda & 70.7 \\
VRK3\_HUMAN\_D0 & Lambda & 67.5 \\
GAK\_HUMAN\_D0 & Lambda & 64.7 \\
CSK\_HUMAN\_D0 & Truncated YopH164 & 62.5 \\
VRK1\_HUMAN\_D0 & Lambda & 62.3 \\
KC1G3\_HUMAN\_D0 & Lambda & 56.3 \\
FES\_HUMAN\_D0 & Truncated YopH164 & 44.0 \\
PMYT1\_HUMAN\_D0 & Lambda & 38.0 \\
MK03\_HUMAN\_D0 & Lambda & 36.4 \\
STK3\_HUMAN\_D0 & Lambda & 34.3 \\
DYR1A\_HUMAN\_D0 & Lambda & 34.1 \\
KC1G1\_HUMAN\_D0 & Lambda & 34.1 \\
MK11\_HUMAN\_D0 & Lambda & 31.7 \\
MK13\_HUMAN\_D0 & Lambda & 31.7 \\
EPHB1\_HUMAN\_D0 & Truncated YopH164 & 28.9 \\
MK08\_HUMAN\_D0 & Lambda & 28.5 \\
CDK16\_HUMAN\_D0 & Lambda & 26.9 \\
EPHB2\_HUMAN\_D0 & Truncated YopH164 & 25.1 \\
PAK4\_HUMAN\_D0 & Lambda & 23.9 \\
CDKL1\_HUMAN\_D0 & Lambda & 23.2 \\
SRC\_HUMAN\_D0 & Truncated YopH164 & 22.0 \\
STK16\_HUMAN\_D0 & Lambda & 20.7 \\
MAPK3\_HUMAN\_D0 & Lambda & 18.8 \\
PAK6\_HUMAN\_D0 & Lambda & 18.0 \\
CSK22\_HUMAN\_D0 & Lambda & 17.9 \\
MERTK\_HUMAN\_D0 & Truncated YopH164 & 16.8 \\
PAK7\_HUMAN\_D0 & Lambda & 14.7 \\
CSK21\_HUMAN\_D0 & Lambda & 14.5 \\
EPHA3\_HUMAN\_D0 & Truncated YopH164 & 14.1 \\
BMPR2\_HUMAN\_D0 & Lambda & 14.1 \\
M3K5\_HUMAN\_D0 & Lambda & 14.0 \\
KCC2G\_HUMAN\_D0 & Lambda & 13.3 \\
E2AK2\_HUMAN\_D0 & Lambda & 11.6 \\
MK01\_HUMAN\_D0 & Lambda & 11.2 \\
CSKP\_HUMAN\_D0 & Lambda & 10.1 \\
CHK2\_HUMAN\_D0 & Lambda & 8.1 \\
KC1G2\_HUMAN\_D0 & Lambda & 7.6 \\
DMPK\_HUMAN\_D0 & Lambda & 7.6 \\
KCC2B\_HUMAN\_D0 & Lambda & 7.1 \\
FGFR1\_HUMAN\_D0 & Truncated YopH164 & 6.1 \\
KS6A1\_HUMAN\_D1 & Lambda & 5.7 \\
DAPK3\_HUMAN\_D0 & Lambda & 4.0 \\
STK10\_HUMAN\_D0 & Lambda & 3.7 \\
KC1D\_HUMAN\_D0 & Lambda & 3.7 \\
KC1E\_HUMAN\_D0 & Lambda & 3.5 \\
NEK1\_HUMAN\_D0 & Lambda & 3.3 \\
CDK2\_HUMAN\_D0 & Lambda & 3.1 \\
ABL1\_HUMAN\_D0 & Truncated YopH164 & 2.5 \\
DAPK1\_HUMAN\_D0 & Lambda & 2.4 \\
DYRK2\_HUMAN\_D0 & Lambda & 2.4 \\
HASP\_HUMAN\_D0 & Lambda & 2.3 \\
FGFR3\_HUMAN\_D0 & Truncated YopH164 & 2.3 \\
EPHB3\_HUMAN\_D0 & Truncated YopH164 & 1.7 \\
SLK\_HUMAN\_D0 & Lambda & 1.6 \\
KCC2D\_HUMAN\_D0 & Lambda & 1.6 \\
NEK7\_HUMAN\_D0 & Lambda & 1.3 \\
PHKG2\_HUMAN\_D0 & Lambda & 1.3 \\
VRK2\_HUMAN\_D0 & Lambda & 1.2 \\
AAPK2\_HUMAN\_D0 & Lambda & 1.1 \\
AURKA\_HUMAN\_D0 & Lambda & 1.1 \\
MARK3\_HUMAN\_D0 & Lambda & 1.1 \\
KAPCA\_HUMAN\_D0 & Lambda & 0.9 \\
STK24\_HUMAN\_D0 & Lambda & 0.8 \\
VGFR1\_HUMAN\_D0 & Truncated YopH164 & 0.5 \\
KCC4\_HUMAN\_D0 & Lambda & 0.4 \\
KCC1G\_HUMAN\_D0 & Lambda & 0.3 \\
KCC2A\_HUMAN\_D0 & Lambda & 0.3 \\
FAK2\_HUMAN\_D0 & Truncated YopH164 & 0.3 \\
\bottomrule
\end{tabular}
\end{table*}

\begin{figure}[tb]
\includegraphics[width=\columnwidth]{figures/kinome_expression.png}
\caption{{\bf Representation of kinase domain expression results on phylogenetic tree.}
Dark green circles represent kinases with expression above 250 $ng/ \mu l$ eluate.
Light green circles represent kinases with expression between 100 and 250 $ng/ \mu l$ eluate.
Yellow circles represent kinases with expression between 50 and 100 $ng/ \mu l$ eluate.
Yellow circles represent kinases with any expression up to 50 $ng/ \mu l$ eluate.
Dark green circles represent kinases with expression above 50 mg/L yield.
Light green circles represent kinases with expression between 50 and 12 mg/L yield.
Yellow circles represent kinases with expression between 12 and 7 mg/L yield.
Yellow circles represent kinases with any expression (even below 2 mg/L) up to 7 mg/L yield.
Image made with KinMap: \href{http://www.kinhub.org/kinmap}{http://www.kinhub.org/kinmap}.
}
\label{fig:kinome_expression_tree}
Expand All @@ -368,7 +368,7 @@ \subsection{Small-scale kinase expression test in \emph{E. coli}}
\begin{figure*}[tb]
\includegraphics[width=\columnwidth]{figures/caliper_image.png}
\caption{{\bf Synthetic gel image rendering of highest expressing kinases.}
Caliper GX II synthetic gel image rendering of kinases expressing > 200 $ng/ \mu l$ eluate from microfluidic capillary electrophoresis quantitation.
Caliper GX II synthetic gel image rendering of kinases expressing > 25 mg/L culture from microfluidic capillary electrophoresis quantitation.
}
\label{fig:caliper_image}
\end{figure*}
Expand Down

0 comments on commit ed4de9e

Please sign in to comment.