T10-AT-015HIGH

Anonymization Reversal

T10 · Integrity & Confidentiality Breach →
Risk score225
RatingHigh
Procedures10
Severity
Mechanism

Anonymization reversal de-anonymizes data supposedly made safe through k-anonymity, l-diversity, t-closeness, pseudonymization, or generalization. The fundamental vulnerability is that anonymization assumes a static threat model — a fixed amount of auxiliary information available to the attacker. In practice, attackers accumulate auxiliary data over time from breaches, public records, social media, and other model outputs.

Detection
  • Re-identification risk scoring: proactively test anonymized datasets against known auxiliary sources
  • Query pattern analysis: detect systematic probing correlating anonymized data with quasi-identifiers
  • Multi-attribute query monitoring for de-anonymization-typical combinations
  • Behavioral anomaly: sequential queries systematically narrowing an anonymity set
Mitigation
Differential privacy (formal guarantee)HIGH
k-Anonymity + l-Diversity + t-Closeness (layered)MEDIUM
Synthetic data generationHIGH
Data minimizationHIGH
Chaining

De-anonymization transforms "safe" datasets into PII-rich sources feeding all T10 extraction and inference techniques. In the LLM context, de-anonymized training data enables T10-AT-001 (Training Data Extraction) to target specific individuals.

Framework mapping
OWASP LLMLLM02
MITRE ATLASAML.T0024.000
Open in the technique browser →