This research investigates schemes for describing documents that can achieve much higher recall and precision than existing techniques. The schemes use term frequency in an optimal way and are significantly more effective than the binary independence model. The work is significant because the explosively growing number of documents available to scientists, engineers, and researchers in all fields can now be handled effectively only by computer systems. Current indexing systems operate rapidly but must be improved so that they accurately retrieve the most relevant documents while retrieving very few irrelevant ones for any given query.

Agency: National Science Foundation (NSF)
Institute: Division of Information and Intelligent Systems (IIS)
Type: Standard Grant (Standard)
Application #: 8702177
Program Officer: MICHELE R. JOHNSON
Project Start:
Project End:
Budget Start: 1987-08-01
Budget End: 1990-01-31
Support Year:
Fiscal Year: 1987
Total Cost: $94,361
Indirect Cost:

Name: University of Illinois at Chicago
Department:
Type:
DUNS #:
City: Chicago
State: IL
Country: United States
Zip Code: 60612