Transporters catalyze entry and exit of molecules into and out of cells and organelles. They achieve cellular homeostasis, are responsible for multidrug resistance in pathogens and tumors, and when defective, cause dozens of important human genetic diseases. Our laboratory maintains, updates and improves the Transporter Classification Database, TCDB, which houses the Transporter Classification (TC) system, adopted officially by the International Union of Biochemistry and Molecular Biology (IUBMB). TCDB is the internationally acclaimed, carefully annotated, universal standard for classifying and providing information about transporters and transport-related proteins in all major domains of life. It presents sequence, biochemical, physiological, pathological, structural and evolutionary data about these proteins and the transport systems they comprise. It uses a successful system of classification based on transporter class, subclass, family, subfamily, and individual transporter. In this competitive renewal of GM0077402, we propose to broaden and deepen our efforts to expand, update, automate and interlink TCDB. We will generate new data concerning transport proteins, design new machine learning approaches for data, and introduce procedures for making functional predictions of uncharacterized transporters. This last effort will derive reliable new biological knowledge from a variety of sources, including phylogeny, motif, domain, operon and regulon analyses.
Our Specific Aims are as follows: 1. To develop software for automatic text mining and information extraction. 2. To conduct bioinformatic analyses and molecular biological experiments for TC knowledge expansion. 3. To interconnect TCDB bidirectionally with other relevant databases, thereby creating a """"""""network"""""""" of knowledge from current """"""""island"""""""" databases. 4. To use multiple approaches to derive reliable functional predictions as guides for future research. 5. To utilize a newly formed TCDB advisory board and establish a plan for modernization and sustainability. These goals are top priorities for rendering TCDB increasingly useful to the scientific community.

Public Health Relevance

TCDB is a database providing the worldwide scientific community with systematized information about proteins that catalyze transmembrane transport of salts, nutrients, toxins, drugs and macromolecules. It is the only IUBMB approved system for classifying transport proteins. Funding of this proposal will allow the maintenance and further development of TCDB, interlinking with related databases, expansion of machine learning approaches for information acquisition, and introduction of approaches for predicting the functions of uncharacterized proteins.

National Institute of Health (NIH)
National Institute of General Medical Sciences (NIGMS)
Research Project (R01)
Project #
Application #
Study Section
Biodata Management and Analysis Study Section (BDMA)
Program Officer
Brazhnik, Paul
Project Start
Project End
Budget Start
Budget End
Support Year
Fiscal Year
Total Cost
Indirect Cost
University of California San Diego
Schools of Arts and Sciences
La Jolla
United States
Zip Code
Babu, Mohan; Bundalovic-Torma, Cedoljub; Calmettes, Charles et al. (2018) Global landscape of cell envelope protein complexes in Escherichia coli. Nat Biotechnol 36:103-112
Rodionova, Irina A; Goodacre, Norman; Babu, Mohan et al. (2018) The Nitrogen Regulatory PII Protein (GlnB) and N-Acetylglucosamine 6-Phosphate Epimerase (NanE) Allosterically Activate Glucosamine 6-Phosphate Deaminase (NagB) in Escherichia coli. J Bacteriol 200:
Vastermark, Ake; Driker, Adelle; Weng, Jingwei et al. (2017) Difference distance map data of alternative crystal forms of UlaA. Data Brief 10:198-201
Moreno-Hagelsieb, Gabriel; Vitug, Bennett; Medrano-Soto, Arturo et al. (2017) The Membrane Attack Complex/Perforin Superfamily. J Mol Microbiol Biotechnol 27:252-267
Lee, Justin; Ghosh, Shounak; Saier Jr, Milton H (2017) Comparative genomic analyses of transport proteins encoded within the red algae Chondrus crispus, Galdieria sulphuraria, and Cyanidioschyzon merolae11. J Phycol 53:503-521
Zhang, Zhongge; Kukita, Chika; Humayun, M Zafri et al. (2017) Environment-directed activation of the Escherichia coliflhDC operon by transposons. Microbiology 163:554-569
Saier Jr, Milton H; Kukita, Chika; Zhang, Zhongge (2017) Transposon-mediated directed mutation in bacteria and eukaryotes. Front Biosci (Landmark Ed) 22:1458-1468
Heidari Tajabadi, Fereshteh; Medrano-Soto, Arturo; Ahmadzadeh, Masoud et al. (2017) Comparative Analyses of Transport Proteins Encoded within the Genomes of Bdellovibrio bacteriovorus HD100 and Bdellovibrio exovorus JSS. J Mol Microbiol Biotechnol 27:332-349
Do, Jimmy; Zafar, Hassan; Saier Jr, Milton H (2017) Comparative genomics of transport proteins in probiotic and pathogenic Escherichia coli and Salmonella enterica strains. Microb Pathog 107:106-115
Humayun, M Zafri; Zhang, Zhongge; Butcher, Anna M et al. (2017) Hopping into a hot seat: Role of DNA structural features on IS5-mediated gene activation and inactivation under stress. PLoS One 12:e0180156

Showing the most recent 10 out of 92 publications