Fairness and Cost Constrained Privacy-Aware Record Linkage

by   Nan Wu, et al.

Record linkage algorithms match and link records from different databases that refer to the same real-world entity based on direct and/or quasi-identifiers, such as name, address, age, and gender, available in the records. Since these identifiers generally contain personal identifiable information (PII) about the entities, record linkage algorithms need to be developed with privacy constraints. Known as privacy-preserving record linkage (PPRL), many research studies have been conducted to perform the linkage on encoded and/or encrypted identifiers. Differential privacy (DP) combined with computationally efficient encoding methods, e.g. Bloom filter encoding, has been used to develop PPRL with provable privacy guarantees. The standard DP notion does not however address other constraints, among which the most important ones are fairness-bias and cost of linkage in terms of number of record pairs to be compared. In this work, we propose new notions of fairness-constrained DP and fairness and cost-constrained DP for PPRL and develop a framework for PPRL with these new notions of DP combined with Bloom filter encoding. We provide theoretical proofs for the new DP notions for fairness and cost-constrained PPRL and experimentally evaluate them on two datasets containing person-specific data. Our experimental results show that with these new notions of DP, PPRL with better performance (compared to the standard DP notion for PPRL) can be achieved with regard to privacy, cost and fairness constraints.


page 1

page 14


Continual Learning with Differential Privacy

In this paper, we focus on preserving differential privacy (DP) in conti...

Provable Membership Inference Privacy

In applications involving sensitive data, such as finance and healthcare...

Relations among different privacy notions

We present a comprehensive view of the relations among several privacy n...

DP-Sync: Hiding Update Patterns in Secure Outsourced Databases with Differential Privacy

In this paper, we have introduced a new type of leakage associated with ...

Privacy-Preserving Record Linkage

Given several databases containing person-specific data held by differen...

On the (Im)Possibility of Estimating Various Notions of Differential Privacy

We analyze to what extent final users can infer information about the le...

Fairness-aware Differentially Private Collaborative Filtering

Recently, there has been an increasing adoption of differential privacy ...

Please sign up or login with your details

Forgot password? Click here to reset