Dr. Gonzalez-Delgado set up a semi-automated pipeline using the alignment program HMMer and the Pfam protein domain database, to identify the functionality of viral proteins. The pipeline takes the genome of a human virus, extracts the annotated genes, and runs HMMer to find the domains composing a viral protein. Once statistically significant Pfam domains have been found, Pfam is again queried, to discover which of the proteins contributing to a given domain are human. The human proteins become candidates for a previous horizontal gene transfer between the virus and its human host. To inform the project with questions relevant to experimental virologists, Dr. Gonzalez-Delgado consulted with Drs. Devico and Lewis, human immunodeficiency virus (HIV) experts at the Institute of Human Virology. The project has resulted in a submitted manuscript examining human herpes virus 8 (HHV-8) as a model virus, where HGT with the human genome are relatively well characterized. HHV-8 shares at least 36% of its genes with the human host. To compare the frequency of HGT across viruses, the manuscript surveyed 10 other viruses impacting human health, concluding that HGT probably occurs between humans and both DNA and RNA viruses, in viral genomes of differing sizes, regardless of DNA transcription strategies. Of special note are the Human T-lymphotropic viruses, where genes involved in HGT possibly have a frequency as high as 73%. Dr. Gonzalez-Delgado is now analyzing sequences of the simian immunodeficiency virus, which infects macaques. The SIV-macaque system provides an animal model for HIV infection, to answer a question posed by Drs. Lewis and Devico: are the sequences of the infecting and non-infecting HIV particles in a viral inoculum statistically distinguishable? The founder viruses of the SIV infection appear to differ significantly from the typical virions in the stock inoculum, and the statistical differences have been assigned to specific residues in the viral attachment protein.

Project Start
Project End
Budget Start
Budget End
Support Year
1
Fiscal Year
2013
Total Cost
$226,009
Indirect Cost
Name
National Library of Medicine
Department
Type
DUNS #
City
State
Country
Zip Code
Mengistu, Meron; Tang, Ai-Hui; Foulke Jr, James S et al. (2017) Patterns of conserved gp120 epitope presentation on attached HIV-1 virions. Proc Natl Acad Sci U S A 114:E9893-E9902
Gonzalez, Mileidy; DeVico, Anthony L; Spouge, John L (2017) Conserved signatures indicate HIV-1 transmission is under strong selection and thus is not a ""stochastic"" process. Retrovirology 14:13
Manzourolajdad, Amirhossein; Gonzalez, Mileidy; Spouge, John L (2016) Changes in the Plasticity of HIV-1 Nef RNA during the Evolution of the North American Epidemic. PLoS One 11:e0163688
Acevedo-Luna, Natalia; Mariño-Ramírez, Leonardo; Halbert, Armand et al. (2016) Most of the tight positional conservation of transcription factor binding sites near the transcription start site reflects their co-localization within regulatory modules. BMC Bioinformatics 17:479
Gonzalez, Mileidy W; DeVico, Anthony L; Lewis, George K et al. (2015) Conserved molecular signatures in gp120 are associated with the genetic bottleneck during simian immunodeficiency virus (SIV), SIV-human immunodeficiency virus (SHIV), and HIV type 1 (HIV-1) transmission. J Virol 89:3619-29
Gonzalez, Mileidy W; Spouge, John L (2013) Domain analysis of symbionts and hosts (DASH) in a genome-wide survey of pathogenic human viruses. BMC Res Notes 6:209