Authors: Salheddine Kabou, Imad eddine Kimi, Arbia Boudaouad
Keywords: Data anonymization, Big data, Apache Spark, k-concealment model, Bottom-Up.
Abstract
In the context of big data, balancing individual privacy with the need for data utility remains a critical challenge. This research presents the Bottom-Up k-Concealment (BU-KC) framework, a pioneering solution for Privacy-Preserving Big Data Publishing (PPBDP). At its core, the k-concealment model addresses the limitations of traditional k-anonymity by minimizing excessive generalizations, ensuring stronger privacy protection while preserving data utility. Combined with the Bottom-Up Generalization (BUG) strategy, BU-KC demonstrates superior performance compared to Top-Down Specialization (TDS) in terms of computational efficiency, scalability, and privacy preservation, while concurrently sustaining data utility. Furthermore, the integration of Apache Spark’s distributed computing paradigm enables us to effectively mitigate scalability constraints and processing bottlenecks commonly observed in the anonymization of large-scale datasets.
Salheddine Kabou
ORCID: https://orcid.org/0000-0002-1423-7215
Higher Normal School of Bechar
Bechar city, Algeria
E-mail:
Imad eddine Kimi
ORCID: https://orcid.org/0009-0005-6997-9194
Higher Normal School of Bechar
Bechar city, Algeria
E-mail:
Arbia Boudaouad
ORCID: https://orcid.org/0009-0007-3735-4477
Ahmed Draia University
Adrar city, Algeria
E-mail:
DOI
https://doi.org/10.56415/csjm.v34.07
Fulltext

–
0.76 Mb