Transport systems provide essential functions for all living cells. Their importance to medicine cannot be overemphasized as (1) they provide a basis for multidrug resistance in pathogenic bacteria, fungi and protozoans as well as in tumor cells, (2) they are defective in numerous human genetic diseases, and (3) they are essential elements of toxin secretion systems by virtually all pathogenic organisms. We have developed a classification system for transport proteins called the Transporter Classification (TC) system. The TC system is a functional/phylogenetic system designed for the classification of all transmembrane transport proteins found in living organisms on Earth. It parallels but differs from the strictly functional EC system, developed decades ago by the Enzyme Commission of the International Union of Biochemistry and Molecular Biology (IUBMB) for the classification of enzymes. The TC system has recently been adopted by the IUBMB as the internationally acclaimed system to the classification of transporters. TCDB is a curated repository for factual information compiled from more than 10,000 references, encompassing approximately 3,000 representative transport proteins and putative transporters, classified into about 400 families. The primary goal of this proposal is to facilitate the updating, expansion and improvement of TCDB for use by the international scientific community. Our primary goals are first, to automate text classification and information extraction software so as to facilitate the continual updating of TCDB as newly published information about transporters becomes available (Specific Aims 1-3), and second, to devise software for the semiautomatic prediction of transport functions for putative transporters for which complete sequence data but little or no experimental data are available (Specific Aims 4-6).
Our specific aims are to: (1) Incorporate into TCDB a text classification system for automatically identifying published literature describing newly characterized transport systems or previously characterized systems where new information is provided or old data are corrected. (2) Design information extraction software for incorporation of pertinent information from publications identified in specific aim #1 into TCDB. (3) Automate links to the primary literature from which the information in specific aim #2 was extracted as well as links to protein and operon analysis tools. (4) Incorporate software that will allow identification of motifs, signature sequences, protein domains, phylogenetic relationships and structural features for the proteins in TCDB and their homologues for the purpose of functional prediction. (5) Incorporate software that will facilitate the extraction of operon analysis data that can similarly be used for functional prediction. (6) Develop approaches that will allow integration of the different lines of functional information revealed in specific aims #4 and 5.

Agency
National Institute of Health (NIH)
Institute
National Institute of General Medical Sciences (NIGMS)
Type
Research Project (R01)
Project #
3R01GM077402-04S1
Application #
7903633
Study Section
Biodata Management and Analysis Study Section (BDMA)
Program Officer
Remington, Karin A
Project Start
2009-08-20
Project End
2010-03-31
Budget Start
2009-08-20
Budget End
2010-03-31
Support Year
4
Fiscal Year
2009
Total Cost
$169,332
Indirect Cost
Name
University of California San Diego
Department
Biology
Type
Schools of Arts and Sciences
DUNS #
804355790
City
La Jolla
State
CA
Country
United States
Zip Code
92093
Babu, Mohan; Bundalovic-Torma, Cedoljub; Calmettes, Charles et al. (2018) Global landscape of cell envelope protein complexes in Escherichia coli. Nat Biotechnol 36:103-112
Rodionova, Irina A; Goodacre, Norman; Babu, Mohan et al. (2018) The Nitrogen Regulatory PII Protein (GlnB) and N-Acetylglucosamine 6-Phosphate Epimerase (NanE) Allosterically Activate Glucosamine 6-Phosphate Deaminase (NagB) in Escherichia coli. J Bacteriol 200:
Moreno-Hagelsieb, Gabriel; Vitug, Bennett; Medrano-Soto, Arturo et al. (2017) The Membrane Attack Complex/Perforin Superfamily. J Mol Microbiol Biotechnol 27:252-267
Lee, Justin; Ghosh, Shounak; Saier Jr, Milton H (2017) Comparative genomic analyses of transport proteins encoded within the red algae Chondrus crispus, Galdieria sulphuraria, and Cyanidioschyzon merolae11. J Phycol 53:503-521
Zhang, Zhongge; Kukita, Chika; Humayun, M Zafri et al. (2017) Environment-directed activation of the Escherichia coliflhDC operon by transposons. Microbiology 163:554-569
Saier Jr, Milton H; Kukita, Chika; Zhang, Zhongge (2017) Transposon-mediated directed mutation in bacteria and eukaryotes. Front Biosci (Landmark Ed) 22:1458-1468
Heidari Tajabadi, Fereshteh; Medrano-Soto, Arturo; Ahmadzadeh, Masoud et al. (2017) Comparative Analyses of Transport Proteins Encoded within the Genomes of Bdellovibrio bacteriovorus HD100 and Bdellovibrio exovorus JSS. J Mol Microbiol Biotechnol 27:332-349
Do, Jimmy; Zafar, Hassan; Saier Jr, Milton H (2017) Comparative genomics of transport proteins in probiotic and pathogenic Escherichia coli and Salmonella enterica strains. Microb Pathog 107:106-115
Humayun, M Zafri; Zhang, Zhongge; Butcher, Anna M et al. (2017) Hopping into a hot seat: Role of DNA structural features on IS5-mediated gene activation and inactivation under stress. PLoS One 12:e0180156
Saier Jr, Milton H; Trevors, J T (2017) Science, Innovation and the Future of Humanity. J Mol Microbiol Biotechnol 27:128-132

Showing the most recent 10 out of 92 publications