The long-term objective of this research is to provide computer system's that are useful for the analysis and solution of a number of problems in computational biology. The simulation tools will allow biologists to design models of protein families, compute multiple alignments, search through large data bases, and analyze DNA sequences and gene structure (exons/introns/promoters/...). Protein families under study, with clear medical interest, include immunoglobins, kinases, G-protein-coupled receptors, growth factors and retroviral proteins. The software methodology consists in taking advantage of recent progress in machine leaning algorithms, such as Hidden Markov Models (HMMs), to automatically extract pertinent information from massive amounts of biological data produced by genome and other sequencing efforts. At the same time, the research effort is aimed at optimizing the implementation of the software simulator developed during Phase I on various hardware architectures. Such Systems have widespread commercial applications in the biotechnology industry. For instance, sequence analysis often represents a key step towards the systematic detection of genetic defects responsible for hereditary forms of cancer, and other complex diseases. We intend to provide a line of fast and cost efficient computational systems tailored to the various needs of both academic and industrial organizations.

Proposed Commercial Applications

In the short term, the simulator will be licensed to biological laboratories as a tool to perform multiple alignments, motif detections, data base searches, etc. Licensing will be made available under various system hardware configurations. In a longer term, a library of high quality HMM models of protein families and genomic functional elements will be constructed and made available to clients through electronic networks.

Agency
National Institute of Health (NIH)
Institute
National Institute on Alcohol Abuse and Alcoholism (NIAAA)
Type
Small Business Innovation Research Grants (SBIR) - Phase II (R44)
Project #
5R44AA011499-03
Application #
2516867
Study Section
Special Emphasis Panel (ZRG7-SSS-2 (11))
Project Start
1994-09-20
Project End
1998-05-31
Budget Start
1997-09-01
Budget End
1998-05-31
Support Year
3
Fiscal Year
1997
Total Cost
Indirect Cost
Name
Net-ID, Inc.
Department
Type
DUNS #
City
San Francisco
State
CA
Country
United States
Zip Code
94107