Annual Bilateral Migration Data
Description
The study of the patterns and evolution of international migration often requires high-frequency data on migration flows on a global scale. However, the presently existing databases force a researcher to choose between the frequency of the data and its geographical scale. Yearly data exist but only for a small subset of countries, while most others are only covered every 5 to 10 years. To fill in the gaps in the coverage, the vast majority of databases use some imputation method. Gaps in the stock of migrants are often filled by combining information on migrants based on their country of birth with data based on nationality or using ‘model’ countries and propensity methods. Gaps in the data on the flow of migrants, on the other hand, are often filled by taking the difference in the stock, which the ’demographic accounting’ methods then adjust for demographic evolutions. This database aims to fill this gap by providing a global, yearly, bilateral database on the stock of migrants according to their country of birth. This database contains close to 2.9 million observations on over 56,000 country pairs from 1960 to 2020, a tenfold increase relative to the second-largest database. In addition, it also produces an estimate of the net flow of migrants. For a subset of countries –over 8,000 country pairs and half a million observations– we also have lower-bound estimates of the gross in- and outflow. This database was constructed using a novel approach to estimating the most likely values of missing migration stocks and flows. Specifically, we use a Bayesian state-space model to combine the information from multiple datasets on both stocks and flows into a single estimate. Like the demographic accounting technique, the state-space model is built on the demographic relationship between migrant stocks, flows, births and deaths. The most crucial difference is that the state-space model combines the information from multiple databases, including those covering migrant stocks, net flows, and gross flows. More details on the construction can currently be found in the UNU-CRIS working paper: Standaert, Samuel and Rayp, Glenn (2022) "Where Did They Come From, Where Did They Go? Bridging the Gaps in Migration Data" UNU-CRIS working paper 22.04. Bruges. https://cris.unu.edu/where-did-they-come-where-did-they-go-bridging-gaps-migration-data
Files
Steps to reproduce
To use this file, first, unzip the document and open it using any database software (excel, R, Stata, ...). If necessary, you can separate the text into different columns using the comma (",") as a separator (e.g., using the text-to-columns function in excel). The imputed migration data file contains the following information: iso_or: ISO code origin country origin: Origin Country iso_des: ISO code destination country destination: Destination Country year: Year stock: Stock of migrants born in the origin country, living in the destination country flow: Net flows of migrants born in the origin country, moving to the destination country inflow: Gross inflow of migrants born in the origin country, moving to the destination country outflow: Gross outflow of migrants born in the origin country, moving out of the destination country A small note on those ISO codes, they originate from the correlates of war and differ quite a bit from those used by the UN or World Bank (e.g. France is FRN, rather than FRA).