The primary goal of the proposed Center of Excellence is to build a powerful and scalable Knowledge Engine for Genomics, KnowEnG. KnowEnG will transform the way biomedical researchers analyze their genome-wide data by integrating multiple analytical methods derived from the most advanced data mining and machine learning research to use the full breadth of existing knowledge about the relationships between genes as background, and providing an intuitive and professionally designed user interface. In order to achieve these goals, the project includes the following components: (1) gathering and integrating existing knowledgebases documenting connections between genes and their functions into a single Knowledge Network;(2) developing computational methods for analyzing genome-wide user datasets in the context of this pre-existing knowledge;(3) implementing these methods into scalable software components that can be deployed in a public or private cloud;(4) designing and implementing a Web-based user interface, based on the HUBZero toolkit, that enables the interactive analysis of user-supplied datasets in a graphics-driven and intuitive fashion;(5) thoroughly testing the functionality and usefulness of the KnowEnG environment in three large scale projects in the clinical sciences (pharmacogenomics of breast cancer), behavioral sciences (identification of gene regulatory modules underlying behavioral patterns) and drug discovery (genome-based prediction of the capacity of microorganisms to synthesize novel biologically active compounds). The KnowEng environment will be deployed in a cloud infrastructure and fully available to the community, as will be the software developed by the Center. The proposed Center is a collaboration between the University of Illinois (UIUC), a recognized world leader in computational science and engineering, and the Mayo Clinic, one of the leading clinical care and research organizations in the worid, and will be based at the UIUC Institute for Genomic Biology, which has state-of-the-art facilities and a nationally recognized program of multidisciplinary team-based genomic research.
Physicians and biologists are now routinely producing very large, genome-wide datasets. These data need to be analyzed in the context of an even larger corpus of publically available data, in a manner that is approachable to non-specialist doctors and scientists. The proposed Center will leverage the latest computational techniques used to mine corporate or Internet data to enable the intuitive analysis and exploration of biomedical Big Data.
|Cairns, Junmei; Ung, Choong Yong; da Rocha, Edroaldo Lummertz et al. (2016) A network-based phenotype mapping approach to identify genes that modulate drug response phenotypes. Sci Rep 6:37003|
|Hanson, C; Cairns, J; Wang, L et al. (2016) Computational discovery of transcription factors associated with drug response. Pharmacogenomics J 16:573-582|
|Wang, Sheng; Qu, Meng; Peng, Jian (2016) PROSNET: INTEGRATING HOMOLOGY WITH MOLECULAR NETWORKS FOR PROTEIN FUNCTION PREDICTION. Pac Symp Biocomput 22:27-38|
|Cohen, Nadia R; Ross, Christian A; Jain, Saloni et al. (2016) A role for the bacterial GATC methylome in antibiotic stress survival. Nat Genet 48:581-6|
|Bendre, Mangesh; Sun, Bofan; Zhang, Ding et al. (2015) DataSpread: Unifying Databases and Spreadsheets. Proceedings VLDB Endowment 8:2000-2003|
|Tao, Fangbo; Zhao, Bo; Fuxman, Ariel et al. (2015) Leveraging Pattern Semantics for Extracting Entities in Enterprises. Proc Int World Wide Web Conf 2015:1078-1088|
|Kordjamshidi, Parisa; Roth, Dan; Moens, Marie-Francine (2015) Structured learning for spatial information extraction from biomedical text: bacteria biotopes. BMC Bioinformatics 16:129|
|Ren, Xiang; El-Kishky, Ahmed; Wang, Chi et al. (2015) Automatic Entity Recognition and Typing from Massive Text Corpora: A Phrase and Network Mining Approach. KDD 2015:2319-2320|
|Stephens, Zachary D; Lee, Skylar Y; Faghri, Faraz et al. (2015) Big Data: Astronomical or Genomical? PLoS Biol 13:e1002195|
|Kuck, Jonathan; Zhuang, Honglei; Yan, Xifeng et al. (2015) Query-Based Outlier Detection in Heterogeneous Information Networks. Adv Database Technol 2015:325-336|
Showing the most recent 10 out of 33 publications