This is a revised proposal to determine which evolutionary forces affect protein evolution. Four experiments are proposed; all are based on the collection of DNA sequence data from the closely related species Drosophila melanogaster and D. simulans, as well as the species pairs D. yakuba - D. teissieri and D. erecta - D. orena.

The rate of strictly neutral evolution is independent of population size, but the probability of fixation of slightly deleterious alleles is inversely proportional to population size and the probability of fixation of advantageous alleles is directly proportional to population size. Therefore, if the effective population sizes are known for different lineages, the patterns of substitutions in these lineages can be analyzed to determine whether substitutions are on average strictly neutral, slightly deleterious or advantageous.

Dr. Kreitman argues that the species effective population size of D. melanogaster is smaller than that of D. simulans, because there is less codon bias in the former than in the latter. Furthermore, genetic variation within species is reduced in genomic regions of low recombination compared to regions of high recombination. Two alternative explanations for this observation, loss of neutral variation linked to selected deleterious mutations (background selection) and loss of neutral variation linked to positively selected advantageous mutations (selective sweeps), both imply smaller effective population sizes in regions of reduced recombination. Given these a priori inferred population size contrasts, the following data will be collected to test whether evolutionary substitutions are primarily neutral (the null hypothesis), slightly deleterious or slightly advantageous.

1. Numbers of amino acid replacement substitutions will be determined for approximately 25 homologous genes in D. melanogaster, D. simulans and the outgroup, D. yakuba. Loci will be chosen that have been sequenced in D. melanogaster, that encode large proteins, that are in regions of high recombination and that are not in In(3R)a, a fixed inversion between D. simulans and D. melanogaster. The loci will be amplified from D. simulans and D. yakuba by PCR, using primers designed from the D. melanogaster sequence and from knowledge of conserved regions in other homologs, and then sequenced. The number of replacement substitutions in D. melanogaster and D. simulans will be determined by comparing each to the outgroup species. To test for systematic lineage effects (e.g., a higher mutation rate in one of the species), substitutions in intron sequences (presumed to be neutral) will be determined. Rates of amino acid replacement in the coding regions of D. melanogaster and D. simulans, adjusted for lineage effects, will then be compared to determine whether substitutions are primarily neutral (rates equal), slightly deleterious (higher rates in D. melanogaster), or slightly advantageous (higher rates in D. simulans).

2. Experiment 1 will be repeated for a sample of 25 genes in regions of low recombination in both species. Here the assumption is that the effective population sizes of the two species in regions of low recombination are nearly equal, and the prediction is that there will be no lineage differences in substitution rates of these genes.

3. In(3R)a is a fixed inversion (84F-93F) between D. melanogaster and D. simulans. D. simulans has the ancestral gene order. Recombination on chromosome 3 is suppressed near the centromere (from 81-84), so genes near the breakpoints of this inversion will have changed recombinational environments. Twelve proximal genes and twelve distal genes within the inversion will be sequenced, and rates of amino acid substitution will be compared for the same genes, which have experienced different effective population sizes in the two species. Under the slightly deleterious model, rates of substitution of genes from the proximal breakpoint in D. melanogaster will be greater than those of the same genes in D. simulans, and rates of substitution of genes from the distal breakpoint in D. melanogaster will be less than those of the same genes in D. simulans. The opposite predictions would hold for advantageous substitutions.

4. The generality of the inferences from D. melanogaster and D. simulans comparisons of rates of synonymous substitutions, relative codon bias and rates of amino acid replacement will be tested using the species pairs D. yakuba - D. teissieri and D. erecta - D. orena. The same 25 high recombination region genes sequenced in the first experiment will be sequenced in the three additional species. Rates of synonymous and amino acid substitutions will be determined for these species pairs, and examined for consistency with the patterns observed for D. melanogaster and D. simulans.
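
The population-size logic behind these predictions can be illustrated with a minimal sketch. It is not part of the proposal: it assumes Kimura's diffusion approximation for a new semidominant mutation in a diploid population, under which the substitution rate relative to the neutral rate is S / (1 - exp(-S)) with S = 4 * Ne * s. The function name, the two effective population sizes, and the selection coefficients are hypothetical values chosen only to show the direction of each effect.

```python
import math


def relative_substitution_rate(ne: float, s: float) -> float:
    """Substitution rate relative to the strictly neutral rate.

    Assumes the diffusion approximation S / (1 - exp(-S)), S = 4 * ne * s,
    which holds for |s| << 1 in a diploid population of constant size.
    """
    big_s = 4.0 * ne * s
    if abs(big_s) < 1e-12:
        return 1.0  # neutral limit: rate does not depend on ne
    if big_s < -700.0:
        return 0.0  # effectively zero; also avoids overflow in expm1
    # -expm1(-S) equals 1 - exp(-S), computed accurately for small |S|
    return big_s / -math.expm1(-big_s)


if __name__ == "__main__":
    # Hypothetical effective population sizes for a "small" and a "large" lineage.
    ne_small, ne_large = 1e5, 1e6
    for label, s in [("neutral", 0.0),
                     ("slightly deleterious", -1e-6),
                     ("slightly advantageous", 1e-6)]:
        small = relative_substitution_rate(ne_small, s)
        large = relative_substitution_rate(ne_large, s)
        print(f"{label:22s} small Ne: {small:.4f}   large Ne: {large:.4f}")
```

Under these assumptions the neutral rate is the same in both lineages, the slightly deleterious rate is higher in the smaller population, and the slightly advantageous rate is higher in the larger population. This is the pattern the experiments use to discriminate among the three models.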

Agency: National Institutes of Health (NIH)
Institute: National Institute of General Medical Sciences (NIGMS)
Type: Research Project (R01)
Project #: 5R01GM039355-11
Application #: 2734589
Study Section: Special Emphasis Panel (ZRG2-GEN (05))
Project Start: 1988-04-01
Project End: 2000-06-30
Budget Start: 1998-07-01
Budget End: 1999-06-30
Support Year: 11
Fiscal Year: 1998
Total Cost:
Indirect Cost:

Name: University of Chicago
Department: Biology
Type: Schools of Medicine
DUNS #: 225410919
City: Chicago
State: IL
Country: United States
Zip Code: 60637

Toomajian, Christopher; Ajioka, Richard S; Jorde, Lynn B et al. (2003) A method for detecting recent selection in the human genome from allele age estimates. Genetics 165:287-97
Comeron, Josep M; Kreitman, Martin (2002) Population, evolutionary and genomic consequences of interference selection. Genetics 161:389-410
Toomajian, Christopher; Kreitman, Martin (2002) Sequence variation and haplotype structure at the human HFE locus. Genetics 161:1609-23
Andolfatto, P; Kreitman, M (2000) Molecular variation at the In(2L)t proximal breakpoint site in natural populations of Drosophila melanogaster and D. simulans. Genetics 154:1681-91
Comeron, J M; Kreitman, M (2000) The correlation between intron length and recombination in Drosophila. Dynamic equilibrium between mutational and selective forces. Genetics 156:1175-90
Antezana, M A; Hudson, R R (1999) Type I error and the power of the s-test: old lessons from a new, analytically justified statistical test for phylogenies. Syst Biol 48:300-16
Comeron, J M; Kreitman, M; Aguade, M (1999) Natural selection on synonymous sites is correlated with gene length and recombination in Drosophila. Genetics 151:239-49
Comeron, J M; Kreitman, M (1998) The correlation between synonymous and nonsynonymous substitutions in Drosophila: mutation, selection or relaxed constraints? Genetics 150:767-75
Hasson, E; Wang, I N; Zeng, L W et al. (1998) Nucleotide variation in the triosephosphate isomerase (Tpi) locus of Drosophila melanogaster and Drosophila simulans. Mol Biol Evol 15:756-69
Ballard, J W; Hatzidakis, J; Karr, T L et al. (1996) Reduced variation in Drosophila simulans mitochondrial DNA. Genetics 144:1519-28

Showing the most recent 10 out of 19 publications