The rapid and continuing proliferation of on-line biomedical information sources has necessitated the creation of software tools for automatically navigating these networks, identifying relevant documents, and extracting the desired information from them. Designing, implementing, and evaluating such a software system is the primary objective of this project. More specifically, a central aim is to create a system that can be personalized to the particular and changing interests of individual biomedical scientists. Machine leaming methods play a central role in our software assistant allowing the system to adapt to individual users. A flexible language with which scientists can communicate their interests to the machine leaming algorithms will be developed. Machine leaming methods typically have the weakness of requiring user-provided training examples, but creating such examples is usually a burdensome task. Hence, another central aim is to greatly reduce the need for human-labeled training data by creating techniques that obtain these examples by indirect methods. The system being developed will be field-tested in the laboratories of several biomedical researchers.
Chasman, Deborah; Gancarz, Brandi; Hao, Linhui et al. (2014) Inferring host gene subnetworks involved in viral replication. PLoS Comput Biol 10:e1003626 |
Vlachos, Andreas; Craven, Mark (2012) Biomedical event extraction from abstracts and full papers using search-based structured prediction. BMC Bioinformatics 13 Suppl 11:S5 |
Kawaler, Emily; Cobian, Alexander; Peissig, Peggy et al. (2012) Learning to predict post-hospitalization VTE risk from EHR data. AMIA Annu Symp Proc 2012:436-45 |
Andrzejewski, David; Zhu, Xiaojin; Craven, Mark (2009) Incorporating Domain Knowledge into Topic Modeling via Dirichlet Forest Priors. Proc Int Conf Mach Learn 382:25-32 |
Smith, Adam A; Vollrath, Aaron; Bradfield, Christopher A et al. (2009) Clustered alignments of gene-expression time series data. Bioinformatics 25:i119-27 |
Smith, Adam A; Craven, Mark (2008) Fast multisegment alignments for temporal expression profiles. Comput Syst Bioinformatics Conf 7:315-26 |
Smith, Adam A; Vollrath, Aaron; Bradfield, Christopher A et al. (2008) Similarity queries for temporal toxicogenomic expression profiles. PLoS Comput Biol 4:e1000116 |
Noto, Keith; Craven, Mark (2006) A specialized learner for inferring structured cis-regulatory modules. BMC Bioinformatics 7:528 |
Settles, Burr (2005) ABNER: an open source tool for automatically tagging genes, proteins and other entity names in text. Bioinformatics 21:3191-2 |
Ray, Soumya; Craven, Mark (2005) Learning statistical models for annotating proteins with function information using biomedical text. BMC Bioinformatics 6 Suppl 1:S18 |
Showing the most recent 10 out of 14 publications