MicroRNAs (miRNAs) perform critical roles in biological processes by regulating respective target genes. Thus, miRNAs are closely associated with human cancers. The manual integration of information on miRNAs and their target genes is challenging: labor-intensive, error-prone, and subject to biologists'prior knowledge - because it involves an extremely large amount of heterogeneous data sources to be explored. Our objective is to develop OmniSearch, a semantic search tool to assist cancer biologists in unraveling critical roles of miRNAs in human cancers in an automated and highly efficient manner.
AIM 1 : We will develop ontology for microRNA target (OMIT), the first miRNA domain ontologies. OMIT will formally define miRNA knowledge and will provide a global metadata model (i.e., data exchange standards and common data elements) as the foundation for automated knowledge acquisition. We will work within established standards and contribute new terminology to a wide range of bio-ontology groups.
AIM 2 : According to the OMIT metadata model, we will develop OmniSearch, including an automated semantic data annotation &integration tool and a user-friendly semantic search interface. The resultant knowledgebase will contain completely machine-readable data integrated from heterogeneous sources: miRNA target prediction databases, Gene Ontology, PubMed, and KEGG PATHWAY. OmniSearch will present unified knowledge, at the semantic level, that is most relevant to what cancer biologists are seeking.
AIM 3 : Finally, we will design use cases followed by a set of evaluating queries. OmniSearch will be thoroughly and iteratively evaluated in the knowledge accuracy, the efficiency of construction methods (the reduction of human labor), and the system friendliness and usability. On a regular basis and in a structured manner, we will solicit feedback from the community and incorporate domain experts'opinions to further improve the system. Feedback mechanisms will take place throughout the entire project lifetime. This project will handle critical needs recognized by the NCI ITCR Initiative: establishing data exchange standards and common data elements;sustained effort to promote data sharing;enhanced support of community-based, research-driven informatics technology development;improved mechanisms to support software development. Expected deliverables include OMIT ontologies as miRNA data exchange standards and common data elements;OmniSearch for miRNA data sharing and automated knowledge acquisition;a unified miRNA knowledgebase;and use cases and evaluating queries. OmniSearch can be used to obtain unified miRNA knowledge and bring insights into the regulation and control of cancer disease processes. By providing a deeper understanding of miRNAs'functions, OmniSearch will also assist miRNA bio-curation and new biological experiment design. Thus, the project can significantly accelerate cancer biology research. And, OmniSearch is by its nature extensible and can be readily generalized to other biomedical areas.

Public Health Relevance

miRNAs have been identified to be closely associated with development, diagnosis, and prognosis for human cancers but miRNA knowledge acquisition remains challenging. We will develop OmniSearch, a semantic search tool, to assist cancer biologists in unraveling critical roles of miRNAs in human cancers in an automated and highly efficient manner. We will handle the significant challenge of data sharing, data integration, and effective search in miRNA/microgenomics research in oncology.

National Institute of Health (NIH)
National Cancer Institute (NCI)
Research Project--Cooperative Agreements (U01)
Project #
Application #
Study Section
Special Emphasis Panel (ZCA1-SRLB-4 (M1))
Program Officer
Li, Jerry
Project Start
Project End
Budget Start
Budget End
Support Year
Fiscal Year
Total Cost
Indirect Cost
University of South Alabama
Other Domestic Higher Education
United States
Zip Code
Wu, Bin; Huang, Jingshan; Fukuo, Keisuke et al. (2018) Different Associations of Trunk and Lower-Body Fat Mass Distribution with Cardiometabolic Risk Factors between Healthy Middle-Aged Men and Women. Int J Endocrinol 2018:1289485
Chen, Huiqin; Zhang, Dihua; Zhang, Guoping et al. (2018) A semantics-oriented computational approach to investigate microRNA regulation on glucocorticoid resistance in pediatric acute lymphoblastic leukemia. BMC Med Inform Decis Mak 18:57
Wu, Bin; Huang, Jingshan; Zhang, Lihua et al. (2018) An integrative approach to investigate the association among high-sensitive C-reactive protein, body fat mass distribution, and other cardiometabolic risk factors in young healthy women. Methods 145:60-66
Zhang, Lihua; Li, Rong; He, Junyi et al. (2017) Co-expression analysis among microRNAs, long non-coding RNAs, and messenger RNAs to understand the pathogenesis and progression of diabetic kidney disease at the genetic level. Methods 124:46-56
Huang, Jingshan; Eilbeck, Karen; Smith, Barry et al. (2016) The development of non-coding RNA ontology. Int J Data Min Bioinform 15:214-232
Liu, Zixing; Smith, Kelly R; Khong, Hung T et al. (2016) miR-125b regulates differentiation and metabolic reprogramming of T cell acute lymphoblastic leukemia by directly targeting A20. Oncotarget 7:78667-78679
Huang, Jingshan; Eilbeck, Karen; Smith, Barry et al. (2016) The Non-Coding RNA Ontology (NCRO): a comprehensive resource for the unification of non-coding RNA biology. J Biomed Semantics 7:24
Huang, Jingshan; Gutierrez, Fernando; Strachan, Harrison J et al. (2016) OmniSearch: a semantic search system based on the Ontology for MIcroRNA Target (OMIT) for microRNA-target gene interaction data. J Biomed Semantics 7:25
Huang, Jingshan; Dang, Jiangbo; Borchert, Glen M et al. (2014) OMIT: dynamic, semi-automated ontology development for the microRNA domain. PLoS One 9:e100855