Missing data often plague data analysts' attempts to interpret clinical and social data. We propose to produce a software toolkit, S+MissingData, that enables medical researchers to apply principled methods of handling missing data without wasting either data or human labor resources. S+MissingData will implement missing data procedures that are applicable more or less routinely to a wide variety of missing data problems. The proposed research rests on several foundations: (1) a particular model based approach (2) a variety of recently developed computational tools and (3) implementation in a modern statistical computing environment. In Phase II, we will implement tools for creating and managing objects used in statistical analysis with missing data. Graphics and a graphical user interface will make the software easy to learn and use. Research will extend current methods to handle data arising from two extremely important sources: longitudinal and complex survey designs. S+MissingData will enable medical researchers to earn a greater return on their investment of collecting data: maximally extracting information and achieving reliable inferences, despite missing data.

Proposed Commercial Applications

This research will produce an add-on software module for S-PLUS called S+MissingData, which will offer a scientifically and cost-effective way to handle missing data. We expect a wide market. This module will appeal to existing S-Plus users, and attract new users - in disciplines as diverse as biology, medicine, sociology, marketing, and economics. This research will also lead to short courses, books, videos and other educational material.

Agency
National Institute of Health (NIH)
Institute
National Cancer Institute (NCI)
Type
Small Business Innovation Research Grants (SBIR) - Phase II (R44)
Project #
5R44CA065147-03
Application #
2429834
Study Section
Special Emphasis Panel (ZRG7-SSS-9 (01))
Program Officer
Choudhry, Jawahar
Project Start
1994-06-08
Project End
1999-05-31
Budget Start
1997-07-25
Budget End
1999-05-31
Support Year
3
Fiscal Year
1997
Total Cost
Indirect Cost
Name
Insightful Corporation
Department
Type
DUNS #
150683779
City
Seattle
State
WA
Country
United States
Zip Code
98109
Schafer, J L (1999) Multiple imputation: a primer. Stat Methods Med Res 8:3-15