Learning Sparse Models for A Dynamic Bayesian Network Classifier of Protein Seco

Noble, William

Abstract

This subproject is one of many research subprojects utilizing the resources provided by a Center grant funded by NIH/NCRR. The subproject and investigator (PI) may have received primary funding from another NIH source, and thus could be represented in other CRISP entries. The institution listed is for the Center, which is not necessarily the institution for the investigator. Protein secondary structure prediction provides insight into protein function and is a valuable preliminary step for predicting the 3D structure of a protein. Dynamic Bayesian networks (DBNs) have been shown to provide state-of-the-art performance in secondary structure prediction. As the size of the protein database grows, it becomes feasible to use a richer model in an effort to capture subtle correlations among the amino acids and the predicted labels. In this context, it is beneficial to derive sparse models that discourage over-fitting and provide biological insight. Results: We introduce an algorithm for sparsifying the parameters of a DBN. Using this algorithm, we can automatically remove up to 80% of the parameters of a DBN while maintaining the same level of predictive accuracy. We also prove an upper bound for the test error difference between the sparse and fully dense models. Finally, we demonstrate, using simulated data, that the algorithm is able to recover true sparse structures with high accuracy, and using real data, that the sparse model identifies known correlation structure related to different classes of secondary structure elements.

Funding Agency

Agency: National Institute of Health (NIH)
Institute: National Center for Research Resources (NCRR)
Type: Biotechnology Resource Grants (P41)
Project #: 5P41RR011823-15
Application #: 8171411
Study Section: Special Emphasis Panel (ZRG1-CB-H (40))

Project Start: 2010-09-01
Project End: 2011-08-31
Budget Start: 2010-09-01
Budget End: 2011-08-31
Support Year: 15
Fiscal Year: 2010
Total Cost: $19,514
Indirect Cost

Institution

Name: University of Washington
Department: Biochemistry
Type: Schools of Medicine
DUNS #: 605799469

City: Seattle
State: WA
Country: United States
Zip Code: 98195

Related projects

Publications

Hollmann, Taylor; Kim, Tae Kwon; Tirloni, Lucas et al. (2018) Identification and characterization of proteins in the Amblyomma americanum tick cement cone. Int J Parasitol 48:211-224

Stieg, David C; Willis, Stephen D; Ganesan, Vidyaramanan et al. (2018) A complex molecular switch directs stress-induced cyclin C nuclear release through SCFGrr1-mediated degradation of Med13. Mol Biol Cell 29:363-375

Seixas, Adriana; Alzugaray, María Fernanda; Tirloni, Lucas et al. (2018) Expression profile of Rhipicephalus microplus vitellogenin receptor during oogenesis. Ticks Tick Borne Dis 9:72-81

Wang, Zheng; Wu, Catherine; Aslanian, Aaron et al. (2018) Defective RNA polymerase III is negatively regulated by the SUMO-Ubiquitin-Cdc48 pathway. Elife 7:

Xavier, Marina Amaral; Tirloni, Lucas; Pinto, Antônio F M et al. (2018) A proteomic insight into vitellogenesis during tick ovary maturation. Sci Rep 8:4698

Luhtala, Natalie; Aslanian, Aaron; Yates 3rd, John R et al. (2017) Secreted Glioblastoma Nanovesicles Contain Intracellular Signaling Proteins and Active Ras Incorporated in a Farnesylation-dependent Manner. J Biol Chem 292:611-628

Thakar, Sonal; Wang, Liqing; Yu, Ting et al. (2017) Evidence for opposing roles of Celsr3 and Vangl2 in glutamatergic synapse formation. Proc Natl Acad Sci U S A 114:E610-E618

Jin, Meiyan; Fuller, Gregory G; Han, Ting et al. (2017) Glycolytic Enzymes Coalesce in G Bodies under Hypoxic Stress. Cell Rep 20:895-908

Ogami, Koichi; Richard, Patricia; Chen, Yaqiong et al. (2017) An Mtr4/ZFC3H1 complex facilitates turnover of unstable nuclear RNAs to prevent their cytoplasmic transport and global translational repression. Genes Dev 31:1257-1271

Ju Lee, Hyun; Bartsch, Deniz; Xiao, Cally et al. (2017) A post-transcriptional program coordinated by CSDE1 prevents intrinsic neural differentiation of human embryonic stem cells. Nat Commun 8:1456

Showing the most recent 10 out of 583 publications

Comments

Be the first to comment on William Noble's grant

Recent in Grantomics:

Recently viewed grants:

Recently added grants: