fastLink: Fast Probabilistic Record Linkage with Missing Data

Implements a Fellegi-Sunter probabilistic record linkage model that allows for missing data and the inclusion of auxiliary information. This includes functionality to merge two datasets under the Fellegi-Sunter model using the Expectation-Maximization algorithm. In addition, tools for preparing, adjusting, and summarizing data merges are included. The package implements methods described in Enamorado, Fifield, and Imai (2019) "Using a Probabilistic Model to Assist Merging of Large-scale Administrative Records" <doi:10.1017/S0003055418000783> and is available at <>.
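
To make the description above concrete, here is a minimal conceptual sketch of fitting a Fellegi-Sunter model by Expectation-Maximization on binary agreement patterns. It is not the package's implementation or API: the function name `fellegi_sunter_em`, the helper `clip`, the starting values, and the toy data are all hypothetical, and the sketch assumes that fields are conditionally independent given match status and that a missing comparison (`None`) can simply be dropped from the likelihood.

```python
def clip(p, eps=1e-6):
    """Keep a probability strictly inside (0, 1) to avoid degenerate updates."""
    return min(max(p, eps), 1.0 - eps)


def fellegi_sunter_em(patterns, n_iter=200, lam=0.1, m_init=0.9, u_init=0.1):
    """Illustrative EM for a two-class Fellegi-Sunter model (hypothetical helper).

    patterns: one tuple per record pair; each entry is 1 (fields agree),
              0 (fields disagree), or None (comparison missing).
    Returns (lam, m, u, post): the match share, per-field agreement
    probabilities among matches (m) and non-matches (u), and the match
    posteriors from the final E-step.
    """
    k = len(patterns[0])
    m = [m_init] * k
    u = [u_init] * k
    post = []
    for _ in range(n_iter):
        # E-step: posterior match probability for every pair.
        post = []
        for g in patterns:
            pm, pu = lam, 1.0 - lam
            for j, gj in enumerate(g):
                if gj is None:
                    continue  # assumption: missing fields are skipped
                pm *= m[j] if gj == 1 else 1.0 - m[j]
                pu *= u[j] if gj == 1 else 1.0 - u[j]
            post.append(pm / (pm + pu))

        # M-step: re-estimate parameters from posterior-weighted counts.
        lam = clip(sum(post) / len(post))
        for j in range(k):
            obs = [(g[j], w) for g, w in zip(patterns, post) if g[j] is not None]
            den_m = sum(w for _, w in obs)
            den_u = sum(1.0 - w for _, w in obs)
            if den_m > 0:
                m[j] = clip(sum(w for gj, w in obs if gj == 1) / den_m)
            if den_u > 0:
                u[j] = clip(sum(1.0 - w for gj, w in obs if gj == 1) / den_u)
    return lam, m, u, post


# Toy agreement patterns over three fields (made-up data for illustration).
pairs = (
    [(1, 1, 1)] * 10       # agree on every field
    + [(1, 1, None)] * 2   # third comparison missing
    + [(0, 0, 0)] * 80     # disagree on every field
    + [(1, 0, 0)] * 5
    + [(0, 1, 0)] * 5
    + [(0, 0, 1)] * 3
)
lam, m, u, post = fellegi_sunter_em(pairs)
print("estimated match share:", round(lam, 3))
```

Pairs whose posterior exceeds a chosen threshold would then be declared matches; the choice of threshold is left to the user in this sketch.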

Version: 0.6.1
Depends: R (≥ 2.14.0)
Imports: Matrix, parallel, foreach, doParallel, gtools, data.table, stringdist, stringr, stringi, Rcpp (≥ 0.12.7), adagio, dplyr, plotrix, grDevices, graphics, methods
LinkingTo: RcppArmadillo, Rcpp, RcppEigen
Suggests: testthat
Published: 2023-11-17
DOI: 10.32614/CRAN.package.fastLink
Author: Ted Enamorado [aut, cre], Ben Fifield [aut], Kosuke Imai [aut]
Maintainer: Ted Enamorado <ted.enamorado at>
License: GPL (≥ 3)
NeedsCompilation: yes
In views: MissingData, OfficialStatistics
