Reductions in sequencing costs and increases in sequencing efficiency are quickly making high-throughput sequencing accessible to individual laboratories looking to use sequencing as a powerful tool in their research endeavors. In fact, as costs continue to decline, we can expect high-throughput sequencing to become a commonly used tool, not only in human phenotype based sequencing projects, but also as an effective tool in forward genetics applications in model organisms, and potentially for the diagnosis idiopathic disease. However, very few laboratories have the computational expertise and infrastructure to make sense of the genetic variants identified through these studies. The goal of this proposal is to make high-throughput sequencing data interpretation as accessible as data generation through expansion of the Scripps Genome Annotation and Distributed Variant Interpretation SERver (SG-ADVISER) and companion data processing and visualization tools. SG-ADVISER is a web-server based tool for holistic, in-depth, annotations and functional predictions of variants generated from high-throughput sequencing. Annotations are formed on at least four major levels: 1) annotation of the genomic element within which a variant resides;2) prediction of the functional impact of a variant on a genomic element;3) annotation of molecular and biological processes which link variants across genes and/or genomic elements with one another, and 4) annotation of known clinical characteristics of the gene or variant. The annotations currently provided by SG-ADVISER cover many of these levels of annotation, but are incomplete. Therefore, we propose to expand the capabilities of SG-ADVISER to cover as many generally interesting annotation types as possible, while also extending SG-ADVISER's capabilities to model organism studies. Moreover, we recognize a need for flexibility, and have included a plan to provide customized annotations through the SG-ADVISER web-server. Finally, we feel that truly powerful data interpretation can only be achieved through visualization of massive datasets. Therefore, we propose a plan to produce simple companion tools to process, filter, and visualize SG-ADVISER annotations through currently available genome browsers.

Public Health Relevance

Identification and interpretation of variants associated with inherited but not strongly familial disease is a crucial step in translating the investment in huma genome sequencing efforts into a truly significant impact on public health. Annotation, prioritization and grouping of variants logically will be required to bring enough statistical powe to sequencing studies so that disease causing variants can be identified.

Agency
National Institute of Health (NIH)
Institute
National Human Genome Research Institute (NHGRI)
Type
Research Project--Cooperative Agreements (U01)
Project #
5U01HG006476-03
Application #
8603252
Study Section
Special Emphasis Panel (ZHG1-HGR-M (O3))
Program Officer
Sofia, Heidi J
Project Start
2012-02-01
Project End
2015-12-31
Budget Start
2014-01-01
Budget End
2014-12-31
Support Year
3
Fiscal Year
2014
Total Cost
$186,934
Indirect Cost
$55,111
Name
Scripps Health
Department
Type
DUNS #
131185241
City
San Diego
State
CA
Country
United States
Zip Code
92121
Torkamani, Ali; Bersell, Kevin; Jorge, Benjamin S et al. (2014) De novo KCNB1 mutations in epileptic encephalopathy. Ann Neurol 76:529-40
Larman, H Benjamin; Scott, Erick R; Wogan, Megan et al. (2014) Sensitive, multiplex and direct quantification of RNA sequences using a modified RASL assay. Nucleic Acids Res 42:9146-57
Belani, R; Oliveira, G; Erikson, G A et al. (2014) ASXL1 and DNMT3A mutation in a cytogenetically normal B3 thymoma. Oncogenesis 3:e111
Chen, Ying-Zhang; Friedman, Jennifer R; Chen, Dong-Hui et al. (2014) Gain-of-function ADCY5 mutations in familial dyskinesia with facial myokymia. Ann Neurol 75:542-9