This project will enhance and develop the only source of data that provides core information about work, education, income, and migration for the entire U.S. population. This unique resource allows us to observe today's late-life population when they were young. This allows a prospective view of the impact of early-life environments and socioeconomic status on health and well-being in later life. The massive database describes the characteristics of all 133 million persons who resided in the United States in 1940. The data have been available for only a brief period, but are already having a profound impact on scientific research. They are the primary data source for at least 19 sponsored projects, including nine funded by NIH and seven funded by NSF. Through May 2017, 104 investigators had produced 95 papers based on the data, including 22 articles, four books, three PhD dissertations, and 66 working papers. This project will improve the quality and usability of the database by correcting transcription errors; improving the coding, editing, and allocation of key variables; and introducing new data dissemination tools designed to simplify access to the data. The project will undertake three major activities to meet these objectives. 1. Incorporate new verified census information on name, age, sex, family relationship, race, marital status, birthplace, and residence five years ago. 2. Improve coding, editing, and allocation for geographic variables, migration, occupation, and numerically- coded variables such as income. 3. Apply new technologies to democratize access to the data through new data access tools and a virtual data enclave for a restricted version of the data that contains names. This project is a highly cost-effective use of scarce resources to develop shared infrastructure for research, education, and policy-making on health and aging. The proposed improvement of data quality is urgent, and will provide a resource of unprecedented power for understanding the effects of public policies, social institutions, and environmental conditions on the health, well-being, and functioning of people, both over the life course and in their later years.

Public Health Relevance

This project will develop data for prospective analysis of the impact of early-life circumstances and environment on health outcomes in late life, including Alzheimer's disease. The proposed improvement of data quality is urgent, and will provide a resource of unprecedented power for understanding the effects of public policies, social institutions, and environmental conditions on the health, well-being, and functioning of people, both over the life course and in their later years. The proposed work is directly relevant to the core mission of the Population and Social Processes branch of NIA, since the data will allow us to observe today's late-life population when they were young.

Agency
National Institute of Health (NIH)
Institute
National Institute on Aging (NIA)
Type
Research Project (R01)
Project #
5R01AG041831-07
Application #
9750600
Study Section
Social Sciences and Population Studies A Study Section (SSPA)
Program Officer
Karraker, Amelia Wilkes
Project Start
2012-04-01
Project End
2023-03-31
Budget Start
2019-04-01
Budget End
2020-03-31
Support Year
7
Fiscal Year
2019
Total Cost
Indirect Cost
Name
University of Minnesota Twin Cities
Department
Miscellaneous
Type
Schools of Arts and Sciences
DUNS #
555917996
City
Minneapolis
State
MN
Country
United States
Zip Code
55455
Kugler, Tracy A; Fitch, Catherine A (2018) Interoperable and accessible census and survey data from IPUMS. Sci Data 5:180007
Ruggles, Steven; Fitch, Catherine; Roberts, Evan (2018) Historical Census Record Linkage. Annu Rev Sociol 44:19-37
Roberts, Evan; Warren, John Robert (2017) Family structure and childhood anthropometry in Saint Paul, Minnesota in 1918. Hist Fam 22:258-290
Ruggles, Steven (2014) Big microdata for population research. Demography 51:287-97
Sobek, Matthew; Cleveland, Lara; Flood, Sarah et al. (2011) Big Data: Large-Scale Historical Infrastructure from the Minnesota Population Center. Hist Methods 44:61-68