We will continue to develop software environments for solving two computational problems in molecular biology. The first problem is sequence alignment; the second is integrating sequence data, physical maps, and genetic maps. In each case, development of an appropriate software environment involves (1) designing and implementing algorithms, (2) constructing a graphical user interface, (3) building tools to print publication-quality graphical output, and (4) designing and constructing software to handle heterogeneous and interrelated sets of data. The software that we develop is implemented in a reasonably portable manner and is distributed free of charge. Our current software environment for sequence alignment will be enhanced to handle alignment of more than two sequences and to compute and display information about the trustworthiness or robustness of the alignments. Tools to extract information, such as the presence of coding regions, from alignments will be built. Better mechanisms for interactively recording and displaying sequence features will be developed, and a hypertext system will be designed and built for managing the resulting corrections of sequences, sequence features, and alignments. Additional alignment algorithms that strike a balance between sensitivity and efficiency will be developed and tested. Finally, if given funds for the task, we will integrate the software environment with the Software Development Kit from the National Center for Biotechnology Information to make the environment available on a wide variety of computers. Our current software environment for integrating sequence data, physical maps, and genetic maps will be given a comprehensive graphical user interface and the complex data-management questions will be addressed. We will also develop algorithms to search a genomic restriction map with fragment-length data and will continue work on algorithms to search and to compare restriction maps.

Agency
National Institute of Health (NIH)
Institute
National Library of Medicine (NLM)
Type
Research Project (R01)
Project #
5R01LM005110-07
Application #
2237695
Study Section
Biomedical Library and Informatics Review Committee (BLR)
Project Start
1989-08-01
Project End
1996-07-31
Budget Start
1995-08-01
Budget End
1996-07-31
Support Year
7
Fiscal Year
1995
Total Cost
Indirect Cost
Name
Pennsylvania State University
Department
Biostatistics & Other Math Sci
Type
Schools of Arts and Sciences
DUNS #
City
University Park
State
PA
Country
United States
Zip Code
16802
Berman, Piotr; Bertone, Paul; Dasgupta, Bhaskar et al. (2004) Fast optimal genome tiling with applications to microarray design and homology search. J Comput Biol 11:766-85
Molete, J M; Petrykowska, H; Bouhassira, E E et al. (2001) Sequences flanking hypersensitive sites of the beta-globin locus control region are required for synergistic enhancement. Mol Cell Biol 21:2969-80
Elnitski, L; Li, J; Noguchi, C T et al. (2001) A negative cis-element regulates the level of enhancement by hypersensitive site 2 of the beta-globin locus control region. J Biol Chem 276:6289-98
Hardison, R C; Chui, D H; Riemer, C et al. (2001) Databases of human hemoglobin variants and other resources at the globin gene server. Hemoglobin 25:183-93
Wilson, M D; Riemer, C; Martindale, D W et al. (2001) Comparative analysis of the gene-dense ACHE/TFR2 region on human chromosome 7q22 with the orthologous region on mouse chromosome 5. Nucleic Acids Res 29:1352-65
Hardison, R C (2000) Conserved noncoding sequences are reliable guides to regulatory elements. Trends Genet 16:369-72
Yung Yu, C; Yang, Z; Blanchong, C A et al. (2000) The human and mouse MHC class III region: a parade of 21 genes at the centromeric segment. Immunol Today 21:320-8
McClelland, M; Florea, L; Sanderson, K et al. (2000) Comparison of the Escherichia coli K-12 genome with sampled genomes of a Klebsiella pneumoniae and three salmonella enterica serovars, Typhimurium, Typhi and Paratyphi. Nucleic Acids Res 28:4974-86
Schwartz, S; Zhang, Z; Frazer, K A et al. (2000) PipMaker--a web server for aligning two genomic DNA sequences. Genome Res 10:577-86
Doyle, J L; DeSilva, U; Miller, W et al. (2000) Divergent human and mouse orthologs of a novel gene (WBSCR15/Wbscr15) reside within the genomic interval commonly deleted in Williams syndrome. Cytogenet Cell Genet 90:285-90

Showing the most recent 10 out of 66 publications