A vast archive of raw census microdata covering Eurasia in the period since 1960 survives in machine-readable form. Over the past five years, this project has made a substantial portion of these data available to researchers for the first time. This proposal seeks continued funding to preserve, integrate, and freely disseminate large-scale Eurasian population microdata samples. The first phase of the project, focusing on European data, is on schedule to accomplish all the goals described in our original application. The primary goal was to process data and documentation to create publicly accessible, large-scale census microdata samples from multiple decades for a wide range of European countries, and to disseminate those data to researchers. This work involved cleaning data, drawing samples, implementing confidentiality protections, creating integrated variables, developing comprehensive metadata, and disseminating the data and metadata through a sophisticated web-based access system.
This competing continuation project will extend the geographic scope eastward to cover Asia, prepare 40 additional census microdata samples for release to the research community, improve the geographic variables in the database, and recover data at risk of destruction. In addition to adding new partner countries, the expansion of the database will add new samples from the 2010 round of censuses for countries already incorporated into the integrated data series, and will increase the size of several existing samples.
The project leverages previous federal investments in social science infrastructure. Grants from the National Institutes of Health and the National Science Foundation laid the groundwork for the project by funding many of the fixed costs. Those projects underwrote the development of data cleaning and sampling procedures, metadata systems, data conversion and dissemination software, and design protocols for data and documentation. Raw microdata files, internal documentation, and redistribution agreements for most censuses to be processed have already been obtained. As a result, this project is highly cost-effective.
The integrated database provides fundamental infrastructure for scientific research, education, and policy-making. The new data will allow social scientists to make comparisons across Europe and Asia during five decades of transformative change, and will result in a substantial body of new research on population health, economic development, fertility and mortality decline, population aging, migration, and the reshaping of families. By opening new avenues for investigating the causes and consequences of population dynamics, this infrastructure directly addresses the priorities of the Demographic and Behavioral Sciences Branch of NICHD.

Public Health Relevance

The proposed database is directly relevant to the central mission of the National Institutes of Health. By adding dozens of new Eurasian census samples to a global, integrated database of census microdata, this infrastructure will advance fundamental knowledge about the nature of human population dynamics and will spark new health-related research. The data series will result in a substantial body of new scientific and policy-relevant health-related research on key priority areas of the Demographic and Behavioral Sciences Branch of NICHD, including family change, population health, and migration. By opening access to a vast collection of microdata, including material from the 2010 census round, the project will allow social science and health researchers to address fundamental questions about the impact of the extraordinary social and economic transformations that have reshaped the Eurasian continent during the past half century.

National Institutes of Health (NIH)
Eunice Kennedy Shriver National Institute of Child Health & Human Development (NICHD)
Research Project (R01)
Study Section
Social Sciences and Population Studies Study Section (SSPS)
Program Officer
Bures, Regina M
University of Minnesota Twin Cities
Schools of Arts and Sciences
United States
Kugler, Tracy A; Fitch, Catherine A (2018) Interoperable and accessible census and survey data from IPUMS. Sci Data 5:180007
Jeffers, Kristen; King, Miriam; Cleveland, Lara et al. (2017) Data Resource Profile: IPUMS-International. Int J Epidemiol 46:390-391
MacDonald, Alphonse L (2016) IPUMS International: A review and future prospects of a unique global statistical cooperation programme. Stat J IAOS 32:715-727
Ruggles, Steven (2015) Patriarchy, Power, and Pay: The Transformation of American Families, 1800-2015. Demography 52:1797-823
Ruggles, Steven; McCaa, Robert; Sobek, Matthew et al. (2015) The IPUMS Collaboration: Integrating and Disseminating the World's Population Microdata. J Demogr Economics 81:203-216
McCaa, Robert; Cleveland, Lara; Kelly-Hall, Patricia et al. (2015) Statistical coherence of primary schooling in IPUMS-International integrated population samples for China, India, Vietnam, and ten other Asia-Pacific countries. Chin J Sociol 1:333-355
Kennedy, Sheela; Ruggles, Steven (2014) Breaking up is hard to count: the rise of divorce in the United States, 1980-2010. Demography 51:587-98
Ruggles, Steven (2014) Big microdata for population research. Demography 51:287-97
McCaa, Robert (2013) Thanks to 70 years of Inter American Statistical cooperation, the world's largest integrated census microdata dissemination site www.ipums.org/international. Estadística 65:31-45
McCaa, Robert (2013) The Big Census Data Revolution: IPUMS-International. Trans-Border Access to Decades of Census Samples for Three-Fourths of the World and more. Rev Demogr Hist 30:69-88
