G2TT
来源类型Working Paper
规范类型报告
DOI10.3386/w24324
来源IDWorking Paper 24324
Linking Individuals Across Historical Sources: a Fully Automated Approach
Ran Abramitzky; Roy Mill; Santiago Pérez
发表日期2018-02-19
出版年2018
语种英语
摘要Linking individuals across historical datasets relies on information such as name and age that is both non-unique and prone to enumeration and transcription errors. These errors make it impossible to find the correct match with certainty. In the first part of the paper, we suggest a fully automated probabilistic method for linking historical datasets that enables researchers to create samples at the frontier of minimizing type I (false positives) and type II (false negatives) errors. The first step guides researchers in the choice of which variables to use for linking. The second step uses the Expectation-Maximization (EM) algorithm, a standard tool in statistics, to compute the probability that each two records correspond to the same individual. The third step suggests how to use these estimated probabilities to choose which records to use in the analysis. In the second part of the paper, we apply the method to link historical population censuses in the US and Norway, and use these samples to estimate measures of intergenerational occupational mobility. The estimates using our method are remarkably similar to the ones using IPUMS’, which relies on hand linking to create a training sample. We created an R code and a Stata command that implement this method.
主题Econometrics ; Estimation Methods ; Labor Economics ; Demography and Aging ; History
URLhttps://www.nber.org/papers/w24324
来源智库National Bureau of Economic Research (United States)
引用统计
资源类型智库出版物
条目标识符http://119.78.100.153/handle/2XGU8XDN/581997
推荐引用方式
GB/T 7714
Ran Abramitzky,Roy Mill,Santiago Pérez. Linking Individuals Across Historical Sources: a Fully Automated Approach. 2018.
条目包含的文件
文件名称/大小 资源类型 版本类型 开放类型 使用许可
w24324.pdf(671KB)智库出版物 限制开放CC BY-NC-SA浏览
个性服务
推荐该条目
保存到收藏夹
导出为Endnote文件
谷歌学术
谷歌学术中相似的文章
[Ran Abramitzky]的文章
[Roy Mill]的文章
[Santiago Pérez]的文章
百度学术
百度学术中相似的文章
[Ran Abramitzky]的文章
[Roy Mill]的文章
[Santiago Pérez]的文章
必应学术
必应学术中相似的文章
[Ran Abramitzky]的文章
[Roy Mill]的文章
[Santiago Pérez]的文章
相关权益政策
暂无数据
收藏/分享
文件名: w24324.pdf
格式: Adobe PDF
此文件暂不支持浏览

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。