Whole-genome and multi-species-genome segment sequencing projects are, along with re-sequencing efforts, producing increasingly larger datasets for understanding evolutionary dynamics of mutations, genes, and genomes. The need for effective analysis of these datasets that contain a large number of genes and species has expanded the scope of evolutionary bioinformatics studies from specialist bioinformaticians to basic and applied biomedical researchers at the forefront of laboratory sciences. Therefore, we propose an integrated research and programming project with an aim to provide extensible software with facilities for (a) high-throughput application of the same data analysis for different genes, domains, genomic segments, and groups of sequences using the new IterationExpert, (b) employing sophisticated computational tools in the familiar MEGA platform by linking applications using the new AppLinker, (c) visualizing differences in natural selection among positions in a protein-structural context using the StructTracer, and (d) conducting extensive analysis in order to [i] infer evolutionary history of sequences from populations, species, and gene families; [ii] estimate the confidence intervals for times of species divergence and gene duplication events; [iii] deduce tracks of adaptive evolution in proteins, genes, and codons; [iv] test alternative evolutionary hypothesis; and [v] find the most appropriate model of molecular evolution in genes and lineages. Building on the successes of our previous software, we plan to add these new facilities for exploration and analysis of DMA and protein sequences in MEGA. In addition, we plan to tackle methodological challenges posed by the need to infer phylogenetic trees for large numbers of sequences and many genes by using theoretical and empirical data analysis with a focus on investigating the accuracy of different ways of combining data from multiple genes, assessing the performance of computationally-feasible methods under different optimality criteria for increasing number of sequences, and developing novel methods and algorithms. Outcomes from these investigations will guide the incorporation of the next set of methods and algorithms for phylogenetic inference in MEGA. These software and research developments will contribute to advances in molecular evolution, bioinformatics, functional genomics, computational biology, and basic biomedicine. As always, MEGA will be made available free of charge for all uses, including research, education, and training. ? ? ?

Agency
National Institute of Health (NIH)
Institute
National Human Genome Research Institute (NHGRI)
Type
Research Project (R01)
Project #
5R01HG002096-08
Application #
7413744
Study Section
Genomics, Computational Biology and Technology Study Section (GCAT)
Program Officer
Brooks, Lisa
Project Start
2000-01-01
Project End
2010-04-30
Budget Start
2008-05-01
Budget End
2009-04-30
Support Year
8
Fiscal Year
2008
Total Cost
$259,330
Indirect Cost
Name
Arizona State University-Tempe Campus
Department
Other Basic Sciences
Type
Schools of Arts and Sciences
DUNS #
943360412
City
Tempe
State
AZ
Country
United States
Zip Code
85287
Katsura, Yukako; Stanley Jr, Craig E; Kumar, Sudhir et al. (2017) The Reliability and Stability of an Inferred Phylogenetic Tree from Empirical Data. Mol Biol Evol 34:718-723
Karim, Sajjad; NourEldin, Hend Fakhri; Abusamra, Heba et al. (2016) e-GRASP: an integrated evolutionary and GRASP resource for exploring disease associations. BMC Genomics 17:770
Liu, Li; Tamura, Koichiro; Sanderford, Maxwell et al. (2016) A Molecular Evolutionary Reference for the Human Variome. Mol Biol Evol 33:245-54
Miura, Sayaka; Tate, Stephanie; Kumar, Sudhir (2015) Using Disease-Associated Coding Sequence Variation to Investigate Functional Compensation by Human Paralogous Proteins. Evol Bioinform Online 11:245-51
Kumar, Avishek; Butler, Brandon M; Kumar, Sudhir et al. (2015) Integration of structural dynamics and molecular evolution via protein interaction networks: a new era in genomic medicine. Curr Opin Struct Biol 35:135-42
Butler, Brandon M; Gerek, Z Nevin; Kumar, Sudhir et al. (2015) Conformational dynamics of nonsynonymous variants at protein interfaces reveals disease association. Proteins 83:428-35
Filipski, Alan; Tamura, Koichiro; Billing-Ross, Paul et al. (2015) Phylogenetic placement of metagenomic reads using the minimum evolution principle. BMC Genomics 16 Suppl 1:S13
Battistuzzi, Fabia U; Billing-Ross, Paul; Murillo, Oscar et al. (2015) A Protocol for Diagnosing the Effect of Calibration Priors on Posterior Time Estimates: A Case Study for the Cambrian Explosion of Animal Phyla. Mol Biol Evol 32:1907-12
Gerek, Nevin Z; Liu, Li; Gerold, Kristyn et al. (2015) Evolutionary Diagnosis of non-synonymous variants involved in differential drug response. BMC Med Genomics 8 Suppl 1:S6
Hedges, S Blair; Marin, Julie; Suleski, Michael et al. (2015) Tree of life reveals clock-like speciation and diversification. Mol Biol Evol 32:835-45

Showing the most recent 10 out of 48 publications