Software for Analyzing Biosequence Data

Miller, Webb

Abstract

We will continue to develop software environments for solving two computational problems in molecular biology. The first problem is sequence alignment; the second is integrating sequence data, physical maps, and genetic maps. In each case, development of an appropriate software environment involves (1) designing and implementing algorithms, (2) constructing a graphical user interface, (3) building tools to print publication-quality graphical output, and (4) designing and constructing software to handle heterogeneous and interrelated sets of data. The software that we develop is implemented in a reasonably portable manner and is distributed free of charge. Our current software environment for sequence alignment will be enhanced to handle alignment of more than two sequences and to compute and display information about the trustworthiness or robustness of the alignments. Tools to extract information, such as the presence of coding regions, from alignments will be built. Better mechanisms for interactively recording and displaying sequence features will be developed, and a hypertext system will be designed and built for managing the resulting corrections of sequences, sequence features, and alignments. Additional alignment algorithms that strike a balance between sensitivity and efficiency will be developed and tested. Finally, if given funds for the task, we will integrate the software environment with the Software Development Kit from the National Center for Biotechnology Information to make the environment available on a wide variety of computers. Our current software environment for integrating sequence data, physical maps, and genetic maps will be given a comprehensive graphical user interface and the complex data-management questions will be addressed. We will also develop algorithms to search a genomic restriction map with fragment-length data and will continue work on algorithms to search and to compare restriction maps.

Funding Agency

Agency: National Institute of Health (NIH)
Institute: National Library of Medicine (NLM)
Type: Research Project (R01)
Project #: 5R01LM005110-07
Application #: 2237695
Study Section: Biomedical Library and Informatics Review Committee (BLR)

Project Start: 1989-08-01
Project End: 1996-07-31
Budget Start: 1995-08-01
Budget End: 1996-07-31
Support Year: 7
Fiscal Year: 1995
Total Cost
Indirect Cost

Institution

Name: Pennsylvania State University
Department: Biostatistics & Other Math Sci
Type: Schools of Arts and Sciences
DUNS #

City: University Park
State: PA
Country: United States
Zip Code: 16802

Related projects

Publications

Berman, Piotr; Bertone, Paul; Dasgupta, Bhaskar et al. (2004) Fast optimal genome tiling with applications to microarray design and homology search. J Comput Biol 11:766-85

Molete, J M; Petrykowska, H; Bouhassira, E E et al. (2001) Sequences flanking hypersensitive sites of the beta-globin locus control region are required for synergistic enhancement. Mol Cell Biol 21:2969-80

Elnitski, L; Li, J; Noguchi, C T et al. (2001) A negative cis-element regulates the level of enhancement by hypersensitive site 2 of the beta-globin locus control region. J Biol Chem 276:6289-98

Hardison, R C; Chui, D H; Riemer, C et al. (2001) Databases of human hemoglobin variants and other resources at the globin gene server. Hemoglobin 25:183-93

Wilson, M D; Riemer, C; Martindale, D W et al. (2001) Comparative analysis of the gene-dense ACHE/TFR2 region on human chromosome 7q22 with the orthologous region on mouse chromosome 5. Nucleic Acids Res 29:1352-65

Schwartz, S; Zhang, Z; Frazer, K A et al. (2000) PipMaker--a web server for aligning two genomic DNA sequences. Genome Res 10:577-86

Doyle, J L; DeSilva, U; Miller, W et al. (2000) Divergent human and mouse orthologs of a novel gene (WBSCR15/Wbscr15) reside within the genomic interval commonly deleted in Williams syndrome. Cytogenet Cell Genet 90:285-90

Berman, P; Zhang, Z; Wolf, Y I et al. (2000) Winnowing sequences from a database search. J Comput Biol 7:293-302

Florea, L; Riemer, C; Schwartz, S et al. (2000) Web-based visualization tools for bacterial genome alignments. Nucleic Acids Res 28:3486-96

Ellsworth, R E; Jamison, D C; Touchman, J W et al. (2000) Comparative genomic sequence analysis of the human and mouse cystic fibrosis transmembrane conductance regulator genes. Proc Natl Acad Sci U S A 97:1172-7

Showing the most recent 10 out of 66 publications

Comments

Be the first to comment on Webb Miller's grant

Recent in Grantomics:

Recently viewed grants:

Recently added grants: