Local Generalization and Bucketization Technique for Personalized Privacy Preservation

by   Boyu Li, et al.

Anonymization technique has been extensively studied and widely applied for privacy-preserving data publishing. In most previous approaches, a microdata table consists of three categories of attribute: explicit-identifier, quasi-identifier (QI), and sensitive attribute. Actually, different individuals may have different view on the sensitivity of different attributes. Therefore, there is another type of attribute that contains both QI values and sensitive values, namely, semi-sensitive attribute. Based on such observation, we propose a new anonymization technique, called local generalization and bucketization, to prevent identity disclosure and protect the sensitive values on each semi-sensitive attribute and sensitive attribute. The rationale is to use local generalization and local bucketization to divide the tuples into local equivalence groups and partition the sensitive values into local buckets, respectively. The protections of local generalization and local bucketization are independent, so that they can be implemented by appropriate algorithms without weakening other protection, respectively. Besides, the protection of local bucketization for each semi-sensitive attribute and sensitive attribute is also independent. Consequently, local bucketization can comply with various principles in different attributes according to the actual requirements of anonymization. The conducted extensive experiments illustrate the effectiveness of the proposed approach.


page 9

page 10

page 11


Privacy-Preserving Data Publishing via Mutual Cover

We study anonymization techniques for preserving privacy in the publicat...

Dikaios: Privacy Auditing of Algorithmic Fairness via Attribute Inference Attacks

Machine learning (ML) models have been deployed for high-stakes applicat...

Topology of Privacy: Lattice Structures and Information Bubbles for Inference and Obfuscation

Information has intrinsic geometric and topological structure, arising f...

A Determination Scheme for Quasi-Identifiers Using Uniqueness and Influence for De-Identification of Clinical Data

Objectives; The accumulation and usefulness of clinical data have increa...

Agnostic data debiasing through a local sanitizer learnt from an adversarial network approach

The widespread use of automated decision processes in many areas of our ...

A Novel Microdata Privacy Disclosure Risk Measure

A tremendous amount of individual-level data is generated each day, of u...

Distribution-Preserving k-Anonymity

Preserving the privacy of individuals by protecting their sensitive attr...

Please sign up or login with your details

Forgot password? Click here to reset