Lung cancer is the leading cause of cancer death worldwide, both in men and women, with an estimate of over 164,000 new cases and over 156,000 deaths in 2000 in the United States alone. A principal reason for this high mortality is that lung cancer typically is first detected at an advanced stage where the prospects for cure are quite low. However, in those cases where it is found in an early stage, the prospects for cure are quite high. Recognition of these facts is a primary driver behind the development of improved screening and diagnostic tools. We propose to form a collaborative group of institutions to develop a large, high-quality internet-accessible spiral computed tomography (CT) image database of pulmonary nodules. This will serve as an important resource for researchers interested in developing improved methods for early detection and screening for lung cancer. Specifically this proposal plans to I) develop the criteria for inclusion of nodules within the database, 2) develop ground truth or pathologic diagnosis of each nodule, 3) populate the database with the appropriate nodule candidates as described above, 4) develop common data elements (CDEs) to classify each case, 5) develop Criteria for measuring performance standards of various CADs, and 6) develop an overall management plan for the consortium. The database developed in this consortium will be an important resource for research and teaching purposes. It will represent a standard that can be used for testing new CAD systems. With the rapid advances in computer science and engineering, a high-quality database that is continually evolving will be an invaluable resource. The design of this research proposal is somewhat novel in that we aim not only to collaborate with others on the design and content of the image database, but intend also to attach demographic and pathologic data to each case so that a broad community of research can be served. Our overall management plan seeks to aggressively identify collaborative partners from a variety of sources including similar or related industry , for example the Visible Human Project. Working groups will include radiology, CAD development, and informatics and our outreach efforts will include patient advocacy and early users of the database.

National Institute of Health (NIH)
National Cancer Institute (NCI)
Research Project--Cooperative Agreements (U01)
Project #
Application #
Study Section
Special Emphasis Panel (ZCA1-SRRB-Y (J3))
Program Officer
Croft, Barbara
Project Start
Project End
Budget Start
Budget End
Support Year
Fiscal Year
Total Cost
Indirect Cost
Weill Medical College of Cornell University
Schools of Medicine
New York
United States
Zip Code
Mulshine, James L; Avila, Rick; Yankelevitz, David et al. (2015) Lung Cancer Workshop XI: Tobacco-Induced Disease: Advances in Policy, Early Detection and Management. J Thorac Oncol 10:762-7
Armato 3rd, Samuel G; McLennan, Geoffrey; Bidaut, Luc et al. (2011) The Lung Image Database Consortium (LIDC) and Image Database Resource Initiative (IDRI): a completed reference database of lung nodules on CT scans. Med Phys 38:915-31
Armato 3rd, Samuel G; Roberts, Rachael Y; Kocherginsky, Masha et al. (2009) Assessment of radiologist performance in the detection of lung nodules: dependence on the definition of ""truth"". Acad Radiol 16:28-38
Reeves, Anthony P; Biancardi, Alberto M; Apanasovich, Tatiyana V et al. (2007) The Lung Image Database Consortium (LIDC): a comparison of different size metrics for pulmonary nodule measurements. Acad Radiol 14:1475-85
McNitt-Gray, Michael F; Armato 3rd, Samuel G; Meyer, Charles R et al. (2007) The Lung Image Database Consortium (LIDC) data collection process for nodule detection and annotation. Acad Radiol 14:1464-74
Armato 3rd, Samuel G; Roberts, Rachael Y; McNitt-Gray, Michael F et al. (2007) The Lung Image Database Consortium (LIDC): ensuring the integrity of expert-defined ""truth"". Acad Radiol 14:1455-63
Armato 3rd, Samuel G; McNitt-Gray, Michael F; Reeves, Anthony P et al. (2007) The Lung Image Database Consortium (LIDC): an evaluation of radiologist variability in the identification of lung nodules on CT scans. Acad Radiol 14:1409-21
Meyer, Charles R; Johnson, Timothy D; McLennan, Geoffrey et al. (2006) Evaluation of lung MDCT nodule annotation across radiologists and methods. Acad Radiol 13:1254-65
Dodd, Lori E; Wagner, Robert F; Armato 3rd, Samuel G et al. (2004) Assessment methodologies and statistical issues for computer-aided diagnosis of lung nodules in computed tomography: contemporary research topics relevant to the lung image database consortium. Acad Radiol 11:462-75
Armato 3rd, Samuel G; McLennan, Geoffrey; McNitt-Gray, Michael F et al. (2004) Lung image database consortium: developing a resource for the medical imaging research community. Radiology 232:739-48