The goal of the current proposal is to develop a new flexible framework for metagenomic assembly. This framework will provide novel algorithms for de novo and comparative assembly of metagenomic data, as well as tools for extracting and analyzing genomic variation. In addition, a novel approach will be developed for reconstructing segments of a genome containing a specific 16S rRNA sequence. At the core of this framework are two packages developed in the PIs lab: the open source package AMOS - a flexible and modular assembly package, and Bambus 2 - a genomic scaffolder specifically targeted at metagenomic data. A novel model-based testing framework will be closely integrated with the developments proposed here in order to ensure the correctness of the software being developed. Several levels of test procedures will be run every time the code is changed, on a nightly basis, as well as at regular intervals, allowing errors to be promptly discovered and corrected.

Public Health Relevance

Metagenomics studies are starting to elucidate the roles microbes play in human health and disease. This proposal will enhance future metagenomic studies by providing full featured, efficient, and robust algorithms and tools for reconstructing te genomes and metagenomes from high-throughput sequencing data.

Agency
National Institute of Health (NIH)
Institute
National Institute of Allergy and Infectious Diseases (NIAID)
Type
Research Project (R01)
Project #
1R01AI100947-01A1
Application #
8506220
Study Section
Biodata Management and Analysis Study Section (BDMA)
Program Officer
Dugan, Vivien Grace
Project Start
2013-03-21
Project End
2017-02-28
Budget Start
2013-03-21
Budget End
2014-02-28
Support Year
1
Fiscal Year
2013
Total Cost
$437,391
Indirect Cost
$132,507
Name
University of Maryland College Park
Department
Biostatistics & Other Math Sci
Type
Schools of Arts and Sciences
DUNS #
790934285
City
College Park
State
MD
Country
United States
Zip Code
20742
Nguyen, Nam-Phuong; Mirarab, Siavash; Liu, Bo et al. (2014) TIPP: taxonomic identification and phylogenetic profiling. Bioinformatics 30:3548-55
Koren, Sergey; Treangen, Todd J; Hill, Christopher M et al. (2014) Automated ensemble assembly and validation of microbial genomes. BMC Bioinformatics 15:126
Almeida, Mathieu; Hébert, Agnès; Abraham, Anne-Laure et al. (2014) Construction of a dairy microbial genome catalog opens new perspectives for the metagenomic analysis of dairy fermented products. BMC Genomics 15:1101