Genome Data Analysis - Theory and Software

Green, Philip

Abstract

I will develop improved methods for analyzing several types of genome mapping and sequencing data, which will then be incorporated into my existing software packages CRIMAP, SEGMAP and GENEFINDER and distributed to the genome analysis community. 1. I have begun work on a unified approach to analyzing, and integrating into a single physical map, the data from a variety of different mapping techniques, including STS-content mapping of YAC contigs, linkage mapping, radiation hybrid mapping, in situ hybridization, and pulsed field gel restriction mapping. This approach, called genomic segment analysis, is based on the observation that these different mapping methods can all be viewed as providing information about relationships between genomic segments of various types. A statistical approach to analysis of genomic segment data, inspired by linkage analysis methods, has been implemented in the program CRIMAP, and a combinatorial (deterministic) approach, inspired by methods for constructing STS content maps of YAC contigs, has been implemented in the program SEGMAP. These methods will be further developed, with the primary emphases being on allowing efficient joint analysis of different types (and large amounts) of data, on characterizing and representing ambiguities of map order and distance, and on detection of data errors. Simulation studies will be carried out to investigate the accuracy of maps constructed using these approaches, examining in particular the effects of data errors. 2. I will improve CRIMAP's ability to perform multilocus linkage analysis with disease loci, by extending its current efficient likelihood computation and maximization methods to handle incomplete pedigree information and more general disease locus models. 3. The program GENEFINDER uses a systematic statistical approach to identify and display probable exons in C. elegans genomic sequence. I will develop its ability to analyze other genomes, including the human. Other improvements will include the automated construction of candidate genes from their component exons, automatic identification of likely regions of sequencing errors, and extension of the display capabilities to include other types of genomic features, such as repeats, promoter sequences, and protein motifs. In addition, I will systematically compare the power of this approach with recent """"""""neural net"""""""" approaches to gene identification.

Funding Agency

Agency: National Institute of Health (NIH)
Institute: National Human Genome Research Institute (NHGRI)
Type: Research Project (R01)
Project #: 1R01HG000774-01
Application #: 3333903
Study Section: Genome Study Section (GNM)

Project Start: 1992-09-29
Project End: 1995-08-31
Budget Start: 1992-09-29
Budget End: 1993-08-31
Support Year: 1
Fiscal Year: 1992
Total Cost
Indirect Cost

Institution

Name: Washington University
Department
Type: Schools of Medicine
DUNS #: 062761671

City: Saint Louis
State: MO
Country: United States
Zip Code: 63130

Related projects


NIH 1999 R01 HG	Automated Data Processing for Genome Sequencing Green, Philip P. / University of Washington
NIH 1998 R01 HG	Automated Data Processing for Genome Sequencing Green, Philip P. / University of Washington
NIH 1997 R01 HG	Automated Data Processing for Genome Sequencing Green, Philip P. / University of Washington
NIH 1995 R01 HG	Genome Data Analysis - Theory and Software Green, Philip P. / University of Washington
NIH 1993 R01 HG	Genome Data Analysis - Theory and Software Green, Philip P. / University of Washington
NIH 1993 R01 HG	Genome Data Analysis - Theory and Software Green, Philip P. / Washington University
NIH 1992 R01 HG	Genome Data Analysis - Theory and Software Green, Philip P. / Washington University

Publications

Gordon, D; Desmarais, C; Green, P (2001) Automated finishing with autofinish. Genome Res 11:614-25

Garg, K; Green, P; Nickerson, D A (1999) Identification of candidate coding region single nucleotide polymorphisms in 165 human genes using assembled expressed sequence tags. Genome Res 9:1087-92

Green, E D; Idol, J R; Mohr-Tidwell, R M et al. (1994) Integration of physical, genetic and cytogenetic maps of human chromosome 7: isolation and analysis of yeast artificial chromosome clones for 117 mapped genetic markers. Hum Mol Genet 3:489-501

Comments

Be the first to comment on Philip Green's grant

Recent in Grantomics:

Recently viewed grants:

Recently added grants: