Bioinformatics Developments

The Comparative Genomics Analysis Unit continues to develop, maintain, and distribute software tools for the analysis of DNA and RNA sequence data. This year, a suite of tools for the precise detection and specification of structural variants (SVs), distributed as a package named SVanalyzer, allows users to characterize the ambiguity of SVs with respect to nearby sequence similarity (SVwiden), detect equivalent SV predictions by comparing altered sequences (SVcomp), genotype known SVs in new datasets (SVbackgenotype), and refine SV predictions using long-read assemblies (SVrefine). SVanalyzer is currently being used by the National Institute of Standards and Technology's Genome in a Bottle project to integrate SV calls from multiple calling algorithms and sequencing platforms.

Collaborative Work

An ongoing collaboration with Dr. Susan Harbison studying the genetics of selection-driven sleep duration in fruit flies (Drosophila melanogaster) has resulted in two publications (Harbison, Serrano Negron et al. 2017; Serrano Negron, Hansen et al. 2018) and was selected for the 2018 NHLBI Orloff Science Award. Our specific contribution was the analysis of 84 whole-genome sequence datasets for long- and short-sleeping Drosophila melanogaster lines resulting from an evolve-and-resequence experiment, using two aligners (BWA and novoalign), two genome builds (Dm3 and Dm6), one variant caller (LoFreq), and custom software developed to calculate allele frequencies.

In continued collaboration with Dr. Daphne Bell, we performed somatic mutation detection on 14 tumor/normal pairs (uterine carcinosarcomas) and identified fifteen genes that were somatically mutated in at least two tumors. Sanger sequencing of these fifteen genes in another 39 primary uterine carcinosarcomas newly implicated FOXA2 in tumorigenesis (Le Gallo, Rudd et al. 2018).

In a collaboration with Dr. Philip Shaw, we searched exome datasets for rare de novo variants that might be the source of ADHD discordance between monozygotic twins. However, among the eight twins consented for exome sequencing, copy number variants and rare, deleterious SNVs did not emerge as a major driver of discordance in this small sample (Chen, Sudre et al. 2018).

The assembly of the dust mite (Dermatophagoides pteronyssinus) genome and its subsequent analysis have shed new light on the class of proteins involved in allergic responses in humans. We assembled the genome from PacBio circular consensus sequence (CCS) reads, yielding a total assembled size of 52 Mb in 834 contigs with an N50 of 376 kb. Because the starting DNA originated from hundreds of individual mites collected from an inbred colony, the CCS reads also captured the genomic variation present within this colony. RNA extracted from the mites was sequenced as well, allowing the combined analysis of sequence variation and its effect on isoforms, focusing specifically on the genes encoding allergenic proteins (Randall, Mullikin et al. 2018).

In 2010, Morris Animal Foundation received a pledge of $1,000,000 from Hill's Pet Nutrition to develop a cat SNP chip. SNP discovery efforts in the domestic cat had generated a set of over 10 million SNPs to draw from for this array. The SNPs selected for the array were polymorphic in more than one breed and reasonably well distributed across the genome. The final array consisted of 62,897 variants and was used to genotype over 2,000 cats. Both the performance of this array and cat population structure analyses are reported in Gandolfi, Alhaddad et al. (2018).
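
For readers unfamiliar with the N50 statistic quoted for the dust mite assembly, the following is a minimal sketch of how it is conventionally computed: the length of the contig at which the cumulative length of contigs, sorted from longest to shortest, first reaches half of the total assembly size. The function name and the toy contig lengths are hypothetical and for illustration only; they are not taken from the assembly or from any software described in this report.

```python
def n50(lengths):
    """Return the N50 of a collection of contig lengths.

    Contigs are sorted from longest to shortest; the N50 is the length of
    the contig at which the running total first reaches half of the summed
    length of all contigs.
    """
    if not lengths:
        raise ValueError("need at least one contig length")
    total = sum(lengths)
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        # Compare 2 * running to total to avoid fractional arithmetic.
        if 2 * running >= total:
            return length


# Hypothetical toy contig lengths (not real data).
toy_contigs = [100, 80, 60, 40, 20]
toy_n50 = n50(toy_contigs)
```

With real data, the list of lengths would typically be read from the assembly's FASTA file or its index rather than typed in by hand.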

Support Year: 14
Fiscal Year: 2018
Name: Human Genome Research
Chen, Y-C; Sudre, G; Sharp, W et al. (2018) Neuroanatomic, epigenetic and genetic differences in monozygotic twins discordant for attention deficit hyperactivity disorder. Mol Psychiatry 23:683-690
Randall, Thomas A; Mullikin, James C; Mueller, Geoffrey A (2018) The Draft Genome Assembly of Dermatophagoides pteronyssinus Supports Identification of Novel Allergen Isoforms in Dermatophagoides Species. Int Arch Allergy Immunol 175:136-146
Gandolfi, Barbara; Alhaddad, Hasan; Abdi, Mona et al. (2018) Applications and efficiencies of the first cat 63K DNA array. Sci Rep 8:7024
Serrano Negron, Yazmin L; Hansen, Nancy F; Harbison, Susan T (2018) The Sleep Inbred Panel, a Collection of Inbred Drosophila melanogaster with Extreme Long and Short Sleep Duration. G3 (Bethesda) 8:2865-2873
Le Gallo, Matthieu; Rudd, Meghan L; Urick, Mary Ellen et al. (2018) The FOXA2 transcription factor is frequently somatically mutated in uterine carcinosarcomas and carcinomas. Cancer 124:65-73
Le Gallo, Matthieu; Rudd, Meghan L; Urick, Mary Ellen et al. (2017) Somatic mutation profiles of clear cell endometrial tumors revealed by whole exome and targeted gene sequencing. Cancer 123:3261-3268
Kwon, Erika M; Connelly, John P; Hansen, Nancy F et al. (2017) iPSCs and fibroblast subclones from the same fibroblast population contain comparable levels of sequence variations. Proc Natl Acad Sci U S A 114:1964-1969
Dewan, Ramita; Pemov, Alexander; Dutra, Amalia S et al. (2017) First insight into the somatic mutation burden of neurofibromatosis type 2-associated grade I and grade II meningiomas: a case report comprehensive genomic study of two cranial meningiomas with vastly different clinical presentation. BMC Cancer 17:127
Ng, David; Hong, Celine S; Singh, Larry N et al. (2017) Assessing the capability of massively parallel sequencing for opportunistic pharmacogenetic screening. Genet Med 19:357-361
Pemov, A; Li, H; Patidar, R et al. (2017) The primacy of NF1 loss as the driver of tumorigenesis in neurofibromatosis type 1-associated plexiform neurofibromas. Oncogene 36:3168-3177

Showing the most recent 10 out of 141 publications