High-throughput sequencing has provided a tool capable of observing the human microbiome, but characterizing the biological roles and metabolic potential of these microbial communities remains a significant challenge. Increasing evidence points to the functional activity of gene products, rather than community taxonomic composition, as the most robust descriptor of the microflora's relationship with its host and as a potential point of intervention in modulating human health. Existing computational tools for exploring a newly sequenced metagenome rely heavily on sequence homology and do not yet leverage information from the thousands of publicly available functional experimental results. Likewise, no previous methods have provided genome-scale computational tools for biological hypothesis generation regarding specific molecular interactions among the microflora and with a human host.

This proposal aims to develop computational methodology to interpret the functional activity of microfloral communities:

1. Integrate functional information from taxonomic, metagenomic, and metatranscriptomic datasets. We will develop methodology to unify these three representations of microbiome composition by incorporating information from large-scale functional genomic data collections.
2. Identify genomic predictors of inter-species functional activity, including host/microflora interactions and points of community-wide regulatory feedback. We will computationally screen microbiome assays for molecular interactions and regulatory motifs spanning multiple organisms in the community.
3. Implement these technologies as publicly available, accessible, and interpretable tools. We will provide freely available, open-source, downloadable and web-based implementations of this methodology for use by the bioinformatic and biological communities.

As high-throughput sequencing becomes more widely used to study microbial communities in the human microbiome and in the environment, computational tools will be necessary to summarize their global functional activity and systems-level regulatory interactions. In the long term, by providing methodology to understand the human microbiome at the molecular level, we hope to enable its future use as a diagnostic indicator and as a point of intervention to improve human health.

Public Health Relevance

DNA sequencing technology has recently allowed us to examine the microorganisms naturally residing in and on the human body, many of which are beneficial and some of which can be harmful. Although we can now gather data on the cellular behavior of these microbes and on their interactions with human beings, computational tools are needed to interpret this information. By developing new software to study these communities of microorganisms, we hope to eventually be able to detect when they may be causing disease and modify their composition to improve human health.

National Institutes of Health (NIH)
National Human Genome Research Institute (NHGRI)
Research Project (R01)
Study Section: Special Emphasis Panel (ZRG1-GGG-N (50))
Program Officer: Proctor, Lita
Harvard University
Biostatistics & Other Math Sci
Schools of Public Health
United States
McIver, Lauren J; Abu-Ali, Galeb; Franzosa, Eric A et al. (2018) bioBakery: a meta'omic analysis environment. Bioinformatics 34:1235-1237
Sinha, Rashmi; Abu-Ali, Galeb; Vogtmann, Emily et al. (2017) Assessment of variation in microbial community amplicon sequencing by the Microbiome Quality Control (MBQC) project consortium. Nat Biotechnol 35:1077-1086
Truong, Duy Tin; Tett, Adrian; Pasolli, Edoardo et al. (2017) Microbial strain-level population structure and genetic diversity from metagenomes. Genome Res 27:626-638
Donati, Claudio; Zolfo, Moreno; Albanese, Davide et al. (2016) Uncovering oral Neisseria tropism and persistence using metagenomic sequencing. Nat Microbiol 1:16070
Morgan, Xochitl C; Kabakchiev, Boyko; Waldron, Levi et al. (2015) Associations between host gene expression, the mucosal microbiome, and clinical outcome in the pelvic pouch of patients with inflammatory bowel disease. Genome Biol 16:67
Franzosa, Eric A; Huang, Katherine; Meadow, James F et al. (2015) Identifying personal microbiomes using metagenomic codes. Proc Natl Acad Sci U S A 112:E2930-8
Kaminski, James; Gibson, Molly K; Franzosa, Eric A et al. (2015) High-Specificity Targeted Functional Profiling in Microbial Communities with ShortBRED. PLoS Comput Biol 11:e1004557
Taft, Diana H; Ambalavanan, Namasivayam; Schibler, Kurt R et al. (2015) Center Variation in Intestinal Microbiota Prior to Late-Onset Sepsis in Preterm Infants. PLoS One 10:e0130604
Yasuda, Koji; Oh, Keunyoung; Ren, Boyu et al. (2015) Biogeography of the intestinal mucosal and lumenal microbiome in the rhesus macaque. Cell Host Microbe 17:385-391
Kronenberg, Zev N; Osborne, Edward J; Cone, Kelsey R et al. (2015) Wham: Identifying Structural Variants of Biological Consequence. PLoS Comput Biol 11:e1004572

Showing the most recent 10 out of 48 publications