Medical reports contain a great deal of information that characterizes a patient's medical condition. A large percent of this information, however, is unstructured in the form of free text. In this form, the information is difficult to search, sort, analyze, summarize and present. We present a new method of natural language processing which attempts to automatically extract the important concepts from medical free text reports. The sentence parsing algorithm is based on a """"""""field theory"""""""" that is motivated by how a multi-particle system forms attachments as explained in physics. This view of natural language processing attempts to view words as active entities rather than passive data elements. A model for how and why word attachments occur is presented. The model presents explanations in terms of both energy fields that characterize the stability of a word at a given state of the sentence parse and in terms of a signal processing model involving a resonance phenomenon. Our hypotheses are that: 1) the system can extract information from medical reports; 2) the results can be represented in a canonical form with respect to meaning; 3) the system is extendable to many domains of medicine. The testing and evaluation of our system will be performed on reports from the specific domains of radiology. We will explore the adaptability of the NLP system to the domains of pediatric pathology and urology. In addition to technical measures, NLP will be evaluated from the end-user perspective including the selection of appropriate imaging protocols, SNOMED coding, and information retrieval from radiology teaching cases.

Agency
National Institute of Health (NIH)
Institute
National Institute of Biomedical Imaging and Bioengineering (NIBIB)
Type
Research Program Projects (P01)
Project #
8P01EB000216-11
Application #
7305961
Study Section
Subcommittee G - Education (NCI)
Project Start
Project End
Budget Start
Budget End
Support Year
11
Fiscal Year
2002
Total Cost
$231,603
Indirect Cost
Name
University of California Los Angeles
Department
Type
DUNS #
119132785
City
Los Angeles
State
CA
Country
United States
Zip Code
90095
Bashyam, Vijayaraghavan; Hsu, William; Watt, Emily et al. (2009) Problem-centric organization and visualization of patient imaging and clinical data. Radiographics 29:331-43
Taira, Ricky K; Taira, Ricky; Bui, Alex et al. (2008) A tool for improving the longitudinal imaging characterization for neuro-oncology cases. AMIA Annu Symp Proc :712-6
Son, Roderick Y; Taira, Ricky K; Kangarloo, Hooshang et al. (2008) Context-sensitive correlation of implicitly related data: an episode creation methodology. IEEE Trans Inf Technol Biomed 12:549-60
Bui, Alex A T; Morioka, Craig; Dionisio, John David N et al. (2007) OpenSourcePACS: an extensible infrastructure for medical image management. IEEE Trans Inf Technol Biomed 11:94-109
Taira, Ricky K; Bashyam, Vijayaraghavan; Kangarloo, Hooshang (2007) A field theoretical approach to medical natural language processing. IEEE Trans Inf Technol Biomed 11:364-75
Bui, Alex A T; Aberle, Denise R; Kangarloo, Hooshang (2007) TimeLine: visualizing integrated patient records. IEEE Trans Inf Technol Biomed 11:462-73
Ardekani, Siamak; Kumar, Anand; Bartzokis, George et al. (2007) Exploratory voxel-based analysis of diffusion indices and hemispheric asymmetry in normal aging. Magn Reson Imaging 25:154-67
Ardekani, Siamak; Selva, Luis; Sayre, James et al. (2006) Quantitative metrics for evaluating parallel acquisition techniques in diffusion tensor imaging at 3 Tesla. Invest Radiol 41:806-14
Ardekani, Siamak; Sinha, Usha (2006) Statistical representation of mean diffusivity and fractional anisotropy brain maps of normal subjects. J Magn Reson Imaging 24:1243-51
Ardekani, Siamak; Sinha, Usha (2005) Geometric distortion correction of high-resolution 3 T diffusion tensor brain images. Magn Reson Med 54:1163-71

Showing the most recent 10 out of 21 publications