Treffer: Efficient and Scalable Bipartite Matching with Fast Beta Linkage (fabl).

Title:
Efficient and Scalable Bipartite Matching with Fast Beta Linkage (fabl).
Source:
Bayesian Analysis; Sep2025, Vol. 20 Issue 3, p949-972, 24p
Database:
Complementary Index

Weitere Informationen

Within the field of record linkage, Bayesian methods have the crucial advantage of quantifying uncertainty from imperfect linkages. However, current implementations of Bayesian Fellegi-Sunter models are computationally intensive, making them challenging to use on larger-scale record linkage tasks. To address these computational difficulties, we propose fast beta linkage (fabl), an extension to the Beta Record Linkage (BRL) method of Sadinle (2017). Specifically, we use independent prior distributions over the matching space, allowing us to use hashing techniques that reduce computational overhead. This also allows us to complete pairwise record comparisons over large data files through parallel computing and to reduce memory costs through a new technique called storage efficient indexing. Through simulations and two case studies, we show that fabl can have markedly increased speed with minimal loss of accuracy when compared to BRL. [ABSTRACT FROM AUTHOR]

Copyright of Bayesian Analysis is the property of International Society for Bayesian Analysis and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)