We have begun work on how to use a large corpus of literature on a topic to most efficiently extend the set of documents in such a database by related material. This is fundamentally different than the problem of how to find a document closely related to a single document. One is given much more data. Thus far we have used a logodds approach with some success. Another approach which we hope to try in the future is the pooling of neighborhood data based on single documents in the database. Such pooling may give a good idea of what is likely to be useful material. This whole problem of extension can also be studied from the point of view of the initial construction of a literature. We hope to look at this problem also and propose to measure the success of a method by the number documents that must be examined in order to find a given number of relevant documents.

Agency
National Institute of Health (NIH)
Institute
National Library of Medicine (NLM)
Type
Intramural Research (Z01)
Project #
1Z01LM000075-01
Application #
2452895
Study Section
Special Emphasis Panel (CBB)
Project Start
Project End
Budget Start
Budget End
Support Year
1
Fiscal Year
1996
Total Cost
Indirect Cost
Name
National Library of Medicine
Department
Type
DUNS #
City
State
Country
United States
Zip Code