The goal of this project is to create a unified technological infrastructure for protein expression and purification that will be suitable for large-scale structural biology initiatives. Central to our approach is the use of multiple genetically engineered affinity tags. We are currently trying to determine what combination of affinity tags is most effective and how to use them with maximal efficiency. At the same time, because most affinity tags have the potential to interfere with structural studies, we are also striving to develop more reliable methods for removing them. One of the greatest technical obstacles that we face is """"""""the inclusion body problem""""""""-i.e., the tendency of proteins to accumulate in an insoluble, inactive form. Because refolding of proteins can be an arduous and time consuming undertaking, some way to circumvent the formation of inclusion bodies would be advantageous. Sometimes this can be accomplished by fusing an aggregation-prone polypeptide to a highly soluble partner. We have demonstrated that Escherichia coli maltose-binding protein (MBP) is a remarkably effective solubility enhancer, and that in many cases MBP can promote the proper folding of its fusion partners as well. This chaperone-like quality distinguishes MBP from other affinity tags and greatly enhances its value as a fusion partner. Accordingly, MBP fusion proteins have become the cornerstone of our strategy for protein expression. Additional tags are utilized within the framework of an MBP fusion protein to facilitate purification of the target protein. Affinity tags would probably be used more often if it were not so difficult to remove them. This is usually accomplished by endoproteolysis of a fusion protein at a designed site. The main difficulty with this approach stems from the intrinsic promiscuity of the proteases that are commonly used to cleave fusion proteins. This problem is compounded by the fact that it is prohibitively expensive to purchase enough of any of these reagents to cleave fusion proteins on a scale amenable for structural studies. To overcome these problems, we produce our own supply of TEV protease, the catalytic domain of the nuclear inclusion protease from tobacco etch virus. TEV protease cleaves the amino acid sequence ENLYFQG/S between Q and G or Q and S with high specificity. In contrast to factor Xa, enteropeptidase and thrombin, there have never been any reports of cleavage at noncanonical sites in fusion proteins by TEV protease. The production of TEV protease in Escherichia coli has been hampered in the past by low yield and poor solubility, but we have been able to solve both problems by making synonymous codon replacements and producing the protease in the form of an MBP fusion protein. A more troublesome shortcoming of TEV protease is that it readily cleaves itself at a specific site, generating a truncated protease with greatly diminished activity. We have been able to rectify this problem as well by introducing amino acid substitutions that prevent autoinactivation without impeding the ability of the protease to cleave canonical target sequences. A systematic analysis of the enzyme's P1' specificity revealed that, in addition to G and S, many different amino acids can be accommodated in this position with relatively little impact on the efficiency of processing. The crystal structure of catalytically inactive TEV protease in complex with a peptide substrate illuminated the structural basis of its stringent substrate specificity. A homologous protease from tobacco vein mottling virus (TVMV), a close relative of TEV protease with a distinct sequence specificity, is currently being developed as an alternative reagent.

Agency
National Institute of Health (NIH)
Institute
Division of Basic Sciences - NCI (NCI)
Type
Intramural Research (Z01)
Project #
1Z01BC010341-05
Application #
7052639
Study Section
Mammalian Cell Lines Committee (MCL)
Project Start
Project End
Budget Start
Budget End
Support Year
5
Fiscal Year
2004
Total Cost
Indirect Cost
Name
Basic Sciences
Department
Type
DUNS #
City
State
Country
United States
Zip Code
Zhang, Di; Tozser, Jozsef; Waugh, David S (2009) Molecular cloning, overproduction, purification and biochemical characterization of the p39 nsp2 protease domains encoded by three alphaviruses. Protein Expr Purif 64:89-97
Tropea, Joseph E; Cherry, Scott; Waugh, David S (2009) Expression and purification of soluble His(6)-tagged TEV protease. Methods Mol Biol 498:297-307
Austin, Brian P; Nallamsetty, Sreedevi; Waugh, David S (2009) Hexahistidine-tagged maltose-binding protein as a fusion partner for the production of soluble recombinant proteins in Escherichia coli. Methods Mol Biol 498:157-72
Gan, Jianhua; Shaw, Gary; Tropea, Joseph E et al. (2008) A stepwise model for double-stranded RNA processing by ribonuclease III. Mol Microbiol 67:143-54
Nallamsetty, Sreedevi; Waugh, David S (2007) Mutations that alter the equilibrium between open and closed conformations of Escherichia coli maltose-binding protein impede its ability to enhance the solubility of passenger proteins. Biochem Biophys Res Commun 364:639-44
Tropea, Joseph E; Cherry, Scott; Nallamsetty, Sreedevi et al. (2007) A generic method for the production of recombinant proteins in Escherichia coli using a dual hexahistidine-maltose-binding protein affinity tag. Methods Mol Biol 363:1-19
Nallamsetty, Sreedevi; Waugh, David S (2007) A generic protocol for the expression and purification of recombinant proteins in Escherichia coli using a combinatorial His6-maltose binding protein fusion tag. Nat Protoc 2:383-91
Nallamsetty, Sreedevi; Waugh, David S (2006) Solubility-enhancing proteins MBP and NusA play a passive role in the folding of their fusion partners. Protein Expr Purif 45:175-82
Tozser, Jozsef; Tropea, Joseph E; Cherry, Scott et al. (2005) Comparison of the substrate specificity of two potyvirus proteases. FEBS J 272:514-23
Nallamsetty, Sreedevi; Austin, Brian P; Penrose, Kerri J et al. (2005) Gateway vectors for the production of combinatorially-tagged His6-MBP fusion proteins in the cytoplasm and periplasm of Escherichia coli. Protein Sci 14:2964-71

Showing the most recent 10 out of 17 publications