Genetic data has transformed the field of evolutionary biology. The age of genomics offers the possibility of measuring molecular information in the present and using it to infer evolutionary events from the distant past. The increasing availability of genetic data and computational power offers both the challenge and the opportunity to develop improved methods of analysis that use the data fully to further our understanding of molecular evolution. Bayesian approaches to the estimation of phylogeny began in the mid-1990s and are rapidly increasing in popularity. Important advantages of the Bayesian approach include easily interpreted measures of uncertainty and computational feasibility for solutions to highly complex problems. This grant proposal describes an ambitious plan to improve many aspects of the Bayesian approach to phylogeny estimation and to build practical tools capable of analyzing very large genetic data sets.
Specific aims of the proposal are to: (1) improve models for evolution of molecular sequences; (2) improve models for genome-scale rearrangement; (3) develop models to combine information from molecular sequences and from genome arrangements; (4) develop methods to handle partial data; (5) improve methods for ancestral sequence estimation; (6) develop methods to elicit informative prior distributions; (7) develop visualization-based interfaces for exploring distributions of phylogeny; (8) develop computational algorithms to apply Bayesian methods to very large phylogenies; and (9) test the robustness of our methods. The proposed research will enhance scientific understanding in an area with potential benefit to society, for example in the analysis of rapidly evolving viruses such as HIV. We will develop powerful new research tools for phylogenetic inference and distribute these tools free of charge via the web. This proposal provides the opportunity for students in biology, computer science, human-computer interaction, statistics, and computational mathematics to work with each other and the principal investigators on a large cross-disciplinary project that will enrich the education of these students.

Agency: National Institutes of Health (NIH)
Institute: National Institute of General Medical Sciences (NIGMS)
Type: Research Project (R01)
Project #: 5R01GM068950-02
Application #: 6756436
Study Section: Special Emphasis Panel (ZGM1-MBP-1 (01))
Program Officer: Whitmarsh, John
Project Start: 2003-06-15
Project End: 2007-05-31
Budget Start: 2004-06-01
Budget End: 2005-05-31
Support Year: 2
Fiscal Year: 2004
Total Cost: $328,465
Indirect Cost:

Name: University of Wisconsin Madison
Department: Other Basic Sciences
Type: Schools of Arts and Sciences
DUNS #: 161202122
City: Madison
State: WI
Country: United States
Zip Code: 53715

Derthick, Mark (2008) Interactive visualization software for exploring phylogenetic trees and clades. Bioinformatics 24:868-9
Luo, Ruiyan; Hipp, Andrew L; Larget, Bret (2007) A Bayesian model of AFLP marker evolution and phylogenetic inference. Stat Appl Genet Mol Biol 6:Article11
Ane, Cecile; Larget, Bret; Baum, David A et al. (2007) Bayesian estimation of concordance among gene trees. Mol Biol Evol 24:412-26
Larget, Bret; Kadane, Joseph B; Simon, Donald L (2005) A Bayesian approach to the estimation of ancestral genome arrangements. Mol Phylogenet Evol 36:214-23
Holder, Mark T; Lewis, Paul O; Swofford, David L et al. (2005) Hastings ratio of the LOCAL proposal used in Bayesian phylogenetics. Syst Biol 54:961-5
Larget, Bret; Simon, Donald L; Kadane, Joseph B et al. (2005) A Bayesian analysis of metazoan mitochondrial genome arrangements. Mol Biol Evol 22:486-95
Huelsenbeck, John P; Larget, Bret; Alfaro, Michael E (2004) Bayesian phylogenetic model selection using reversible jump Markov chain Monte Carlo. Mol Biol Evol 21:1123-33