Several algorithms have been developed to calculate multivariate risk of CVD based on characteristics associated with the disease. Framingham Heart Study data were used to develop the original algorithms, along with later models, using different mathematical forms, outcomes, and characteristics. Researchers then began to investigate the issue of generalizability, whether these risk estimates could be applied to new populations. For these algorithms to have general application, they must be able to rank risk correctly. And, when Framingham models were compared to new models developed for other studies, resulting orderings of risk were, in fact, similar. The ability to order risk correctly, however, does not imply that estimated probabilities are right in terms of predicting disease for individuals. We need methods to assess individual risk to make treatment decisions, do cost-benefit analyses, and quantify benefits. These methods must be based on the patient's absolute risk, and existing equations may be incapable of establishing absolute risk across populations. Earlier comparisons of multivariate risk among studies have made comparison populations as homogenous as possible before analysis. However, if multivariate risk estimates are to be truly useful, they must be applicable to the general population, and to be applicable, estimates must be based on comparisons of cohorts that include women and ethnic minorities. Also, in statistical terms, estimates must be robust enough to allow for minor shifts in methodologies for data collection and endpoint definition. We propose to examine the heterogeneity of multivariate risk in different populations based on data from studies representing national samples, cohort studies, and clinical trials. We will conduct an analysis of these studies that include both sexes, various risk profiles, and representatives from several nationalities and ethnic groups. The pooled sample will involve 20 studies, 233,833 participants, and over 47,000 deaths. Based on a common statistical approach, we will develop proportional hazards models for each study to relate a set of essential characteristics to the prediction of CVD mortality. The characteristics include body mass index, age, blood pressure, serum cholesterol, smoking, and diabetes status. We will then compare the models in terms of their ability to predict absolute risk of mortality across studies We will conduct secondary analyses to discover factors associated with inaccurate prediction and study characteristics associated with particular findings, such as interaction terms. We will conduct an empirical examination of methods for adding newly discovered risk factors to existing prediction equations.

National Institute of Health (NIH)
National Heart, Lung, and Blood Institute (NHLBI)
Research Project (R01)
Project #
Application #
Study Section
Special Emphasis Panel (ZRG1-SNEM-4 (05))
Project Start
Project End
Budget Start
Budget End
Support Year
Fiscal Year
Total Cost
Indirect Cost
Medical University of South Carolina
Public Health & Prev Medicine
Other Domestic Higher Education
United States
Zip Code
Kitsantas, Panagiota; Hollander, Myles; Li, Lei (2006) Using classification trees to assess low birth weight outcomes. Artif Intell Med 38:275-89
Natarajan, Sundar; Liao, Youlian; Sinha, Debajyoti et al. (2005) Sex differences in the effect of diabetes duration on coronary heart disease mortality. Arch Intern Med 165:430-5
Diverse Populations Collaboration (2004) Smoking, body weight, and CHD mortality in diverse populations. Prev Med 38:834-40
Natarajan, Sundar; Liao, Youlian; Cao, Guichan et al. (2003) Sex differences in risk for coronary heart disease mortality associated with diabetes and established coronary heart disease. Arch Intern Med 163:1735-40