For many genetic loci, the rapid accumulation of sequence data from several species, along with detailed functional analyses, has produced a ponderous data set that is challenging, if not impossible, for individual investigators to fully assimilate. We propose to develop a prototype database of sequence alignments and experimental results for the beta-like globin gene cluster of mammals. This data repository is intended to help the international community of scientists studying globin genes to plan experiments, to design models and, ultimately, to understand regulation of those genes. The resource will be developed for mammalian beta-like globin gene clusters because they have been thoroughly sequenced for several species and extensive functional data are available. Moreover, the approaches and software developed for this project will be applicable to other sequence-analysis problems, and the prototype developed here will be applicable to any locus for which extensive sequence and functional data are available from several species. We recently developed a program that can simultaneously align a few very long sequences. An e-mail server was then established to deliver any requested portion of the alignment, annotated to indicate highly-conserved regions and known sequence features. This proposal seeks support to develop an interactive Globin Gene Server on the Internet to provide access to and analysis of both the alignments and experimental data. We are currently refining a data model and a collection of tools to capture and display the wealth of pertinent experimental data. Implementation of these tools will allow the user to view the results of the simultaneous alignment of sequences (conserved sequence blocks) in register with the experimental data, thus providing an integrated view of the gene cluster in a flexible, interactive format. This integrated view of both alignments and experimental data in register, which we call electronic genetic analysis, is a unique feature of this database. Future developments will include tools to search the Transcription Factor Database for matches to the consensus for conserved blocks, provide alternative alignments, produce summary views of the database in an interactive manner, support batch queries in classic database query languages, and automatically detect database inconsistencies. To best serve the needs of our potential user community, this information is being made available via World-Wide Web, which is an interactive, graphically oriented environment available over the Internet.

Agency
National Institute of Health (NIH)
Institute
National Library of Medicine (NLM)
Type
Research Project (R01)
Project #
5R01LM005773-03
Application #
2460260
Study Section
Special Emphasis Panel (ZRG2-GNM (02))
Program Officer
Bean, Carol A
Project Start
1995-08-01
Project End
2000-07-31
Budget Start
1997-08-01
Budget End
2000-07-31
Support Year
3
Fiscal Year
1997
Total Cost
Indirect Cost
Name
Pennsylvania State University
Department
Biochemistry
Type
Schools of Arts and Sciences
DUNS #
City
University Park
State
PA
Country
United States
Zip Code
16802
Elnitski, L; Li, J; Noguchi, C T et al. (2001) A negative cis-element regulates the level of enhancement by hypersensitive site 2 of the beta-globin locus control region. J Biol Chem 276:6289-98
Hardison, R C; Chui, D H; Riemer, C et al. (2001) Databases of human hemoglobin variants and other resources at the globin gene server. Hemoglobin 25:183-93
Hardison, R C (2000) Conserved noncoding sequences are reliable guides to regulatory elements. Trends Genet 16:369-72
Schwartz, S; Zhang, Z; Frazer, K A et al. (2000) PipMaker--a web server for aligning two genomic DNA sequences. Genome Res 10:577-86
Stojanovic, N; Florea, L; Riemer, C et al. (1999) Comparison of five methods for finding conserved sequences in multiple alignments of gene regulatory regions. Nucleic Acids Res 27:3899-910
Riemer, C; ElSherbini, A; Stojanovic, N et al. (1998) A database of experimental results on globin gene expression. Genomics 53:325-37
Li, J; Noguchi, C T; Miller, W et al. (1998) Multiple regulatory elements in the 5'-flanking sequence of the human epsilon-globin gene. J Biol Chem 273:10202-9
Hardison, R C; Chui, D H; Riemer, C R et al. (1998) Access to a syllabus of human hemoglobin variants (1996) via the World Wide Web. Hemoglobin 22:113-27
Hardison, R (1998) Hemoglobins from bacteria to man: evolution of different patterns of gene expression. J Exp Biol 201:1099-117
Hardison, R; Riemer, C; Chui, D H et al. (1998) Electronic access to sequence alignments, experimental results, and human mutations as an aid to studying globin gene regulation. Genomics 47:429-37

Showing the most recent 10 out of 15 publications