Combining Data from Multiple Sources: Potential and Challenges of Data Linkages

The second webinar in the NCAER Seminar series on Data Collection Methodology, organised by the NCAER National Data Innovation Centre, was held virtually on June 24, 2021. The webinar is part of a series of thought-provoking discussions on research methodologies in which distinguished speakers in the field share their views and one or more discussants reflect on them from an Indian perspective. This talk was delivered by Frauke Kreuter, Professor of Statistics and Data Science for the Social Sciences and Humanities at the Ludwig-Maximilians-University of Munich (Germany) and Professor at the Joint Program in Survey Methodology at the University of Maryland. K.S. James from the International Institute for Population Sciences and Soumya Bhadury from the Reserve Bank of India (RBI) were the discussants.

Combining data from different sources has become essential for social scientists and policy makers to take full advantage of the data deluge in an increasingly digitalised society. While many attempts at using a single approach (big data sources) have produced mixed results, the most exciting projects rely on a combination of different data, some of which are still collected through traditional modes. In this talk, Professor Kreuter highlighted a few approaches and provided a framework enabling researchers to think about creating new data products.

Professor Kreuter used several examples from economic research, with a specific focus on the IAB-SMART research project, to discuss privacy issues and the approaches deployed to create high-quality combined data sources. The IAB-SMART study uses innovative data sources, such as administrative records, surveys, and digital traces from smartphones, to measure the effects of long-term unemployment on social integration and social activity. Using case studies from different countries, the talk demonstrated how to handle potential coverage bias and biases due to non-response and measurement errors, while being cognisant of privacy norms.

Professor Kreuter co-founded and co-directs the Data Science Centers at the Universities of Maryland (USA) and Mannheim (Germany). She is an elected fellow of the American Statistical Association and the 2020 recipient of the Warren Mitofsky Innovators Award of the American Association for Public Opinion Research. She is also the founder of the International Program for Survey and Data Science, developed in response to the increasing demand from researchers and practitioners for appropriate methods and the right tools to face a changing data environment.

K.S. James is the Director and Senior Professor at the International Institute for Population Sciences (IIPS), Mumbai. Prior to joining IIPS, he was Professor of Demography at Jawaharlal Nehru University, New Delhi. He works extensively on demographic change, with a focus on population and development and on ageing issues. He has published widely on the demographic transition and demographic dividend in India.

Soumya Bhadury is a macroeconomist currently working with the Strategic Research Unit at the RBI. His research interests include understanding macro-financial linkages in emerging markets. Before joining RBI, he worked as an economist at NCAER.

    24 June 2021
