Although the elucidation of the entire sequence of the human genome is a long-term goal of the Human Genome Project, the efficiency of sequencing genomic DNA is not currently such that it would allow this goal to be realized keeping track of this reason, two of the current objectives of the HGP are to sequence several megabase-sized regions of DNA with high biological interest and to develop more efficient technologies for sequencing. In Project 5, we propose to determine the sequence of about eight Mb in 4 different regions of the genome during the 5-year funding period. We plan to achieve this goal using the transposon-mediated (TM) sequencing strategy as we believe the low levels of sequence redundancy and the ease of sequence assembly that it offers will make this an efficient way to perform megabase-sequencing. We propose to establish a megabase- sequencing group at the Center and will test the transferability of the current TM-sequencing procedure, using a well-characterized cosmid clone as a model system. We propose to develop several improvements in this strategy to eliminate some rate-limiting steps. Specifically, we will develop a method to perform the transposition and sequencing steps directly in cosmids and to develop an efficient PCR approach to map the positions of transposons in these cosmids. We will then use this method to determine the DNA sequence of 4 regions of the genome, selected primarily on the basis of their biological relevance, but also because many of the required reagents are readily available to us. These regions are: a 2 Mb region of chromosome 21q22 implicated in Down syndrome; a gene-rich 2 Mb segment in the pseudoautosomal region of the sex chromosomes; a 2 Mb region around the EGF gene on chromosome 4q25-4q26, implicated in hepatocellular carcinoma and Rieger's syndrome, and a 2 Mb region on chromosome 4q11-4q12 that includes a cluster of receptor tyrosine kinase genes. At the end of the proposed 5 year funding period, we not only will have provided biologically relevant sequence information for several interesting regions of the genome, but also will have tested the efficacy of and improved upon a sequencing strategy that is an alternative to the mainstream shotgun sequencing approach. We believe several such approaches should be attempted to improve sequencing technologies so that the ultimate goal of determining the sequence of the whole genome can be realized.

Agency
National Institute of Health (NIH)
Institute
National Human Genome Research Institute (NHGRI)
Type
Specialized Center (P50)
Project #
5P50HG000206-08
Application #
6109086
Study Section
Project Start
1998-07-15
Project End
1999-06-30
Budget Start
Budget End
Support Year
8
Fiscal Year
1998
Total Cost
Indirect Cost
Name
Stanford University
Department
Type
DUNS #
800771545
City
Stanford
State
CA
Country
United States
Zip Code
94305
Stewart, E A; McKusick, K B; Aggarwal, A et al. (1997) An STS-based radiation hybrid map of the human genome. Genome Res 7:422-33
Pennacchio, L A; Lehesjoki, A E; Stone, N E et al. (1996) Mutations in the gene encoding cystatin B in progressive myoclonus epilepsy (EPM1) Science 271:1731-4
Stone, N E; Fan, J B; Willour, V et al. (1996) Construction of a 750-kb bacterial clone contig and restriction map in the region of human chromosome 21 containing the progressive myoclonus epilepsy gene. Genome Res 6:218-25
Fan, J B; DeYoung, J; Lagace, R et al. (1994) Isolation of yeast artificial chromosome clones from 54 polymorphic loci mapped with high odds on human chromosome 4. Hum Mol Genet 3:243-6
Goold, R D; diSibio, G L; Xu, H et al. (1993) The development of sequence-tagged sites for human chromosome 4. Hum Mol Genet 2:1271-88
Mathews, K D; Mills, K A; Bosch, E P et al. (1992) Linkage localization of facioscapulohumeral muscular dystrophy (FSHD) in 4q35. Am J Hum Genet 51:428-31