Comparative Molecular Sequence Analysis

Kumar, Sudhir

Abstract

Whole-genome and multi-species-genome segment sequencing projects are, along with re-sequencing efforts, producing increasingly larger datasets for understanding evolutionary dynamics of mutations, genes, and genomes. The need for effective analysis of these datasets that contain a large number of genes and species has expanded the scope of evolutionary bioinformatics studies from specialist bioinformaticians to basic and applied biomedical researchers at the forefront of laboratory sciences. Therefore, we propose an integrated research and programming project with an aim to provide extensible software with facilities for (a) high-throughput application of the same data analysis for different genes, domains, genomic segments, and groups of sequences using the new IterationExpert, (b) employing sophisticated computational tools in the familiar MEGA platform by linking applications using the new AppLinker, (c) visualizing differences in natural selection among positions in a protein-structural context using the StructTracer, and (d) conducting extensive analysis in order to [i] infer evolutionary history of sequences from populations, species, and gene families; [ii] estimate the confidence intervals for times of species divergence and gene duplication events; [iii] deduce tracks of adaptive evolution in proteins, genes, and codons; [iv] test alternative evolutionary hypothesis; and [v] find the most appropriate model of molecular evolution in genes and lineages. Building on the successes of our previous software, we plan to add these new facilities for exploration and analysis of DMA and protein sequences in MEGA. In addition, we plan to tackle methodological challenges posed by the need to infer phylogenetic trees for large numbers of sequences and many genes by using theoretical and empirical data analysis with a focus on investigating the accuracy of different ways of combining data from multiple genes, assessing the performance of computationally-feasible methods under different optimality criteria for increasing number of sequences, and developing novel methods and algorithms. Outcomes from these investigations will guide the incorporation of the next set of methods and algorithms for phylogenetic inference in MEGA. These software and research developments will contribute to advances in molecular evolution, bioinformatics, functional genomics, computational biology, and basic biomedicine. As always, MEGA will be made available free of charge for all uses, including research, education, and training. ? ? ?

Funding Agency

Agency: National Institute of Health (NIH)
Institute: National Human Genome Research Institute (NHGRI)
Type: Research Project (R01)
Project #: 5R01HG002096-08
Application #: 7413744
Study Section: Genomics, Computational Biology and Technology Study Section (GCAT)
Program Officer: Brooks, Lisa

Project Start: 2000-01-01
Project End: 2010-04-30
Budget Start: 2008-05-01
Budget End: 2009-04-30
Support Year: 8
Fiscal Year: 2008
Total Cost: $259,330
Indirect Cost

Institution

Name: Arizona State University-Tempe Campus
Department: Other Basic Sciences
Type: Schools of Arts and Sciences
DUNS #: 943360412

City: Tempe
State: AZ
Country: United States
Zip Code: 85287

Related projects

Publications

Katsura, Yukako; Stanley Jr, Craig E; Kumar, Sudhir et al. (2017) The Reliability and Stability of an Inferred Phylogenetic Tree from Empirical Data. Mol Biol Evol 34:718-723

Karim, Sajjad; NourEldin, Hend Fakhri; Abusamra, Heba et al. (2016) e-GRASP: an integrated evolutionary and GRASP resource for exploring disease associations. BMC Genomics 17:770

Liu, Li; Tamura, Koichiro; Sanderford, Maxwell et al. (2016) A Molecular Evolutionary Reference for the Human Variome. Mol Biol Evol 33:245-54

Miura, Sayaka; Tate, Stephanie; Kumar, Sudhir (2015) Using Disease-Associated Coding Sequence Variation to Investigate Functional Compensation by Human Paralogous Proteins. Evol Bioinform Online 11:245-51

Kumar, Avishek; Butler, Brandon M; Kumar, Sudhir et al. (2015) Integration of structural dynamics and molecular evolution via protein interaction networks: a new era in genomic medicine. Curr Opin Struct Biol 35:135-42

Butler, Brandon M; Gerek, Z Nevin; Kumar, Sudhir et al. (2015) Conformational dynamics of nonsynonymous variants at protein interfaces reveals disease association. Proteins 83:428-35

Filipski, Alan; Tamura, Koichiro; Billing-Ross, Paul et al. (2015) Phylogenetic placement of metagenomic reads using the minimum evolution principle. BMC Genomics 16 Suppl 1:S13

Battistuzzi, Fabia U; Billing-Ross, Paul; Murillo, Oscar et al. (2015) A Protocol for Diagnosing the Effect of Calibration Priors on Posterior Time Estimates: A Case Study for the Cambrian Explosion of Animal Phyla. Mol Biol Evol 32:1907-12

Gerek, Nevin Z; Liu, Li; Gerold, Kristyn et al. (2015) Evolutionary Diagnosis of non-synonymous variants involved in differential drug response. BMC Med Genomics 8 Suppl 1:S6

Hedges, S Blair; Marin, Julie; Suleski, Michael et al. (2015) Tree of life reveals clock-like speciation and diversification. Mol Biol Evol 32:835-45

Showing the most recent 10 out of 48 publications

Comments

Be the first to comment on Sudhir Kumar's grant

Recent in Grantomics:

Recently viewed grants:

Recently added grants: