A phylogenetic approach to metagenomic analysis

Filipski, Alan

Abstract

Metagenomics is a powerful molecular approach in which an environmental sample containing genetic material from an entire community of organisms is analyzed as a whole, without requiring individual organisms to be isolated or cultured in the laboratory. This has great relevance to medical applications, as many microorganisms, both harmful and beneficial to humans, operate as closely-knit communities. Metagenomic analysis of DNA sequences obtained is complicated by a number of factors. First, the nucleotide sequences are available only in the form of """"""""reads,"""""""" which may, because of their short length, be difficult to assign to species. Second, environmental samples contain many organisms that are neither known nor previously characterized. Current metagenomic analysis programs classify sequence reads on the basis of crude measures of similarity between the unknown sequence and those currently available in the databanks. A shortcoming of the current approach is that the assignments are not based on rigorous statistical considerations, so that the assignment of unknown sequences to existing tax is largely heuristic and it is not readily possible to associate a probability to the assignment using advanced evolutionary genomics tools. The goal of the proposed research is to design and implement a new approach to metagenomic analysis based on statistical phylogenetics principles in order to generate more accurate and informative assignments. The new approach will utilize existing (and carefully assembled) multiple sequence alignments now publically available, as compared to the current system of using raw data in sequence banks. The new method will be tested using many empirical and simulated data sets for accuracy. These accuracies will be compared to those achieved by current state-of-the-art methods. Successful completion of this project will yield insights into factors responsible for successes and failures of the proposed and the existing methods, and it has a high likelihood of producing a useful method for evolutionary bioinformatics of metagenomic data.

Public Health Relevance

Metagenomic analysis has emerged as a powerful tool to analyze genetic, and thus organism, compositions of microbial communities that inhabit our planet and our bodies. The proposed statistical and computational research will result in the development of an evolutionary phylogenetic framework for an advanced analysis of the metagenomic data, which will improve the application of metagenomics to understanding microorganisms, both harmful and beneficial to humans.

Funding Agency

Agency: National Institute of Health (NIH)
Institute: National Human Genome Research Institute (NHGRI)
Type: Exploratory/Developmental Grants (R21)
Project #: 1R21HG006039-01
Application #: 8032384
Study Section: Special Emphasis Panel (ZRG1-BST-F (02))
Program Officer: Bonazzi, Vivien

Project Start: 2011-02-01
Project End: 2013-01-31
Budget Start: 2011-02-01
Budget End: 2012-01-31
Support Year: 1
Fiscal Year: 2011
Total Cost: $228,750
Indirect Cost

Institution

Name: Arizona State University-Tempe Campus
Department: Genetics
Type: Organized Research Units
DUNS #: 943360412

City: Tempe
State: AZ
Country: United States
Zip Code: 85287

Related projects


NIH 2012 R21 HG	A phylogenetic approach to metagenomic analysis Filipski, Alan / Arizona State University-Tempe Campus	$190,625
NIH 2011 R21 HG	A phylogenetic approach to metagenomic analysis Filipski, Alan / Arizona State University-Tempe Campus	$228,750

Publications

Filipski, Alan; Tamura, Koichiro; Billing-Ross, Paul et al. (2015) Phylogenetic placement of metagenomic reads using the minimum evolution principle. BMC Genomics 16 Suppl 1:S13

Tamura, Koichiro; Stecher, Glen; Peterson, Daniel et al. (2013) MEGA6: Molecular Evolutionary Genetics Analysis version 6.0. Mol Biol Evol 30:2725-9

Kumar, Sudhir; Filipski, Alan J; Battistuzzi, Fabia U et al. (2012) Statistics and truth in phylogenomics. Mol Biol Evol 29:457-72

Tamura, Koichiro; Battistuzzi, Fabia Ursula; Billing-Ross, Paul et al. (2012) Estimating divergence times in large molecular phylogenies. Proc Natl Acad Sci U S A 109:19333-8

Comments

Be the first to comment on Alan Filipski's grant

Recent in Grantomics:

Recently viewed grants:

Recently added grants: