Unification of Biotechnology Information

Ostell, J

Abstract

To provide a single formal specification for information relevant to biotechnology computing, including scientific literature, nucleic acid sequences data, protein sequenced data genetic and physical maps, chromosomes, genes, the relationships of other scientific knowledge about these entities and their relationship to normal and disease conditions. To convert a number of important biological databases of diverse content and form into such unifying specification. Develop tools demonstrating use of such unified new of biological data. A large number of databases were examined, such as GenBank. EMBL, PIR, SWISSPROT, Kabat, MIM, ACEDB, Flybase, EcoSeq, MEDLINE, and others. A single modular data model was constructed which could represent almost all of the data from these sources ina consistent way, and formally specified in Abstract Syntax Notation 1, (ISO 8824, 8825). Parsers were written to read the different data formats of the sources, and software developed to map the different available data elements into the proper places in the unified ASN.1 data model. A software product (Entrez) was developed to production quality which took advantage of the unified view of some of the sources (GenBank, PIR, SWUSSPROT, and MEDLINE) to allow the scientist to explore all these data as a single integrated whole. Entrez and its associated integrated data is distributed to scientists on CDROM every two months by NCBI. A client/server version provides high speed access over Internet. New databases now allow NCBI to maintain an """"""""up to the minute"""""""" view of the diverse data sources and their relationships with each other despite differing content, data formats, and data release cycles. Additional data sources, such as 3-D protein structures, are being mapping to the common specification and new tools are being developed to use the growing web of connected data. Scientific knowledge continues to evolve, which means the data must evolve as well. This is a project which must continue as long as biomedical research does.

Funding Agency

Agency: National Institute of Health (NIH)
Institute: National Library of Medicine (NLM)
Type: Intramural Research (Z01)
Project #: 1Z01LM000033-06
Application #: 6162795
Study Section: Special Emphasis Panel (IEB)

Project Start
Project End
Budget Start
Budget End
Support Year: 6
Fiscal Year: 1997
Total Cost
Indirect Cost

Institution

Name: National Library of Medicine
Department
Type
DUNS #

City
State
Country: United States
Zip Code

Related projects


NIH 1997 Z01 LM	Unification of Biotechnology Information Ostell, J M. / National Library of Medicine
NIH 1996 Z01 LM	Unification of Biotechnology Information Ostell, J M. / National Library of Medicine
NIH 1995 Z01 LM	Unification of Biotechnology Information Ostell, J M. / National Library of Medicine
NIH 1994 Z01 LM	Unification of Biotechnology Information Ostell, J M. / National Library of Medicine
NIH 1993 Z01 LM	Unification of Biotechnolohy Information Ostell, J M. / National Library of Medicine
NIH 1992 Z01 LM	Unification of Biotechnolohy Information Ostell, J M. / National Library of Medicine

Comments

Be the first to comment on this grant

Recent in Grantomics:

Recently viewed grants:

Recently added grants:

Abstract

Funding Agency

Institution

Related projects

Comments