This subproject is one of many research subprojects utilizing the resources provided by a Center grant funded by NIH/NCRR. The subproject and investigator (PI) may have received primary funding from another NIH source, and thus could be represented in other CRISP entries. The institution listed is for the Center, which is not necessarily the institution for the investigator. In the synthetic biology community, bacteria are re-engineered to carry out tasks that benefit human society, including the production of biofuels, therapeutics, plastics, and other important chemicals. With the latest advances in genetic engineering techniques, including DNA synthesis, the challenge is to identify the DNA sequence that reliably yields a desired behavior without resorting to trial-and-error procedures. Specifically, it is important to precisely control the production rate of a protein in order to design a biological system that exhibits a desired metabolic or regulatory behavior. To solve this problem, we have developed a design algorithm that generates a DNA sequence that causes a protein to be produced at a user-specified rate. A user inputs an arbitrary protein coding sequence and a target production rate and the algorithm outputs the RNA (DNA) sequence that yields the target rate. More specifically, the algorithm predicts the translation initiation rate of a protein coding sequence from a messenger RNA transcript by modeling the interactions between the ribosome binding site on the mRNA with the ribosome. Using the predictive model, a Monte Carlo optimization technique identifies the RNA (DNA) sequence that meets the user's target translation rate with the given protein coding sequence. These predictions and the design process have been experimentally validated in the bacterium Escherichia coli (a common industrial organism). The design algorithm predicts translation rates across four orders of magnitude and its accuracy is ~1 kT in terms of the Gibbs free energy. It is written in C and Python. We have created a web-accessible version of the design algorithm at http://voigtlab.ucsf.edu/software/. The computational cost of the design algorithm is moderate;a single design job can require between 5 and 30 minutes on an Intel 1.6Ghz dual core processor (using only 1 core) with 2Gb RAM. The synthetic biology community (at least 1000 PIs/postdocs/graduate students) would make frequent use of this tool. Accordingly, we request 30,000 SUs of Teragrid Roaming resources to perform the computations. Any combination of suitable resources would be appreciated, such as the Purdue Condor Pool, the IU Quarry IBM HS21 bladeserver, or the TACC Ranger. A single user may request 10-100 or more design jobs annually, yielding ~50 SU/user/year and up to ~600 users/year. From his PhD graduate studies, Howard Salis has extensive experience with developing algorithms and running jobs on the Teragrid, including parallel programming with MPI.

Agency
National Institute of Health (NIH)
Institute
National Center for Research Resources (NCRR)
Type
Biotechnology Resource Grants (P41)
Project #
5P41RR006009-20
Application #
8171880
Study Section
Special Emphasis Panel (ZRG1-BCMB-Q (40))
Project Start
2010-08-01
Project End
2013-07-31
Budget Start
2010-08-01
Budget End
2013-07-31
Support Year
20
Fiscal Year
2010
Total Cost
$1,091
Indirect Cost
Name
Carnegie-Mellon University
Department
Biostatistics & Other Math Sci
Type
Schools of Arts and Sciences
DUNS #
052184116
City
Pittsburgh
State
PA
Country
United States
Zip Code
15213
Simakov, Nikolay A; Kurnikova, Maria G (2018) Membrane Position Dependency of the pKa and Conductivity of the Protein Ion Channel. J Membr Biol 251:393-404
Yonkunas, Michael; Buddhadev, Maiti; Flores Canales, Jose C et al. (2017) Configurational Preference of the Glutamate Receptor Ligand Binding Domain Dimers. Biophys J 112:2291-2300
Hwang, Wonmuk; Lang, Matthew J; Karplus, Martin (2017) Kinesin motility is driven by subdomain dynamics. Elife 6:
Earley, Lauriel F; Powers, John M; Adachi, Kei et al. (2017) Adeno-associated Virus (AAV) Assembly-Activating Protein Is Not an Essential Requirement for Capsid Assembly of AAV Serotypes 4, 5, and 11. J Virol 91:
Subramanian, Sandeep; Chaparala, Srilakshmi; Avali, Viji et al. (2016) A pilot study on the prevalence of DNA palindromes in breast cancer genomes. BMC Med Genomics 9:73
Ramakrishnan, N; Tourdot, Richard W; Radhakrishnan, Ravi (2016) Thermodynamic free energy methods to investigate shape transitions in bilayer membranes. Int J Adv Eng Sci Appl Math 8:88-100
Zhang, Yimeng; Li, Xiong; Samonds, Jason M et al. (2016) Relating functional connectivity in V1 neural circuits and 3D natural scenes using Boltzmann machines. Vision Res 120:121-31
Lee, Wei-Chung Allen; Bonin, Vincent; Reed, Michael et al. (2016) Anatomy and function of an excitatory network in the visual cortex. Nature 532:370-4
Murty, Vishnu P; Calabro, Finnegan; Luna, Beatriz (2016) The role of experience in adolescent cognitive development: Integration of executive, memory, and mesolimbic systems. Neurosci Biobehav Rev 70:46-58
Jurkowitz, Marianne S; Patel, Aalapi; Wu, Lai-Chu et al. (2015) The YhhN protein of Legionella pneumophila is a Lysoplasmalogenase. Biochim Biophys Acta 1848:742-51

Showing the most recent 10 out of 292 publications