Comparative sequence analyses are an essential component of contemporary genomics research. We are continuing to develop and apply new methods for detecting more complex types of evolutionary sequence constraint, such as lineage-specific constrained sequences or weakly constrained sequences, as well as other types of constraint that might not be reflected in the primary nucleotide sequence. Furthermore, we are utilizing inter-species sequence analyses of functional annotations to identify sequence signatures that might confer function; such approaches are particularly useful for detecting short sequences not evolutionarily constrained in orthologous positions across multiple species, but whose relative position to genes and other features is important.? ? The Section is also developing high-throughput methods to experimentally detect and classify functional genomic sequences. This approach utilizes a flow-cytometry-based selection of GFP-reporter constructs harboring candidate enhancer sequences. Preliminary results are promising and have validated that known sequences with enhancer activity can be detected. Scaling this approach to larger regions of the genome will require the use of new-generation sequencing technologies.? ? Along these lines, the Section is utilizing a next-generation sequencing technology from Illumina/Solexa. We are working both on the wet-lab side to get the machine working optimally, as well as the informatics side, developing approaches to maximize the amount of high-quality sequence data that can be extracted from the raw images produced by the new instrument. We are pursuing a number of applications, including ChIP-Seq, RNA-Seq, and other """"""""tag"""""""" based counting experiments, as well as medically-relevant de novo genome sequencing projects that can now be pursued at a dramatically reduced cost. This work will not only enable the above-mentioned research projects in my Section, but also many others in the Institute looking to take advantage of this new technology for their own projects.

Agency
National Institute of Health (NIH)
Institute
National Human Genome Research Institute (NHGRI)
Type
Intramural Research (Z01)
Project #
1Z01HG200341-02
Application #
7734909
Study Section
Project Start
Project End
Budget Start
Budget End
Support Year
2
Fiscal Year
2008
Total Cost
$658,441
Indirect Cost
Name
National Human Genome Research Institute
Department
Type
DUNS #
City
State
Country
United States
Zip Code
Margulies, Elliott H; Birney, Ewan (2008) Approaches to comparative sequence analysis: towards a functional view of vertebrate genomes. Nat Rev Genet 9:303-13
Margulies, Elliott H (2008) Confidence in comparative genomics. Genome Res 18:199-200
Boyle, Alan P; Davis, Sean; Shulha, Hennady P et al. (2008) High-resolution mapping and characterization of open chromatin across the genome. Cell 132:311-22
Clark, Taane G; Andrew, Toby; Cooper, Gregory M et al. (2007) Functional constraint and small insertions and deletions in the ENCODE regions of the human genome. Genome Biol 8:R180
Nikolaev, Sergey; Montoya-Burgos, Juan I; Margulies, Elliott H et al. (2007) Early history of mammals is elucidated with the ENCODE multiple species sequencing data. PLoS Genet 3:e2
Margulies, Elliott H; Cooper, Gregory M; Asimenos, George et al. (2007) Analyses of deep mammalian sequence alignments and constraint predictions for 1% of the human genome. Genome Res 17:760-74
McCauley, Jacob L; Kenealy, Shannon J; Margulies, Elliott H et al. (2007) SNPs in Multi-species Conserved Sequences (MCS) as useful markers in association studies: a practical approach. BMC Genomics 8:266
(2007) Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project. Nature 447:799-816
Nikolaev, Sergey I; Montoya-Burgos, Juan I; Popadin, Konstantin et al. (2007) Life-history traits drive the evolutionary rates of mammalian coding and noncoding genomic elements. Proc Natl Acad Sci U S A 104:20443-8