T10-AT-002HIGH

PII Extraction Techniques

T10 · Integrity & Confidentiality Breach →

Risk score235

RatingHigh

Procedures10

Severity

Mechanism

Unlike T10-AT-001 which extracts arbitrary memorized content, PII extraction specifically targets the model's tendency to complete structured personal data patterns. The vulnerability arises because PII appears in training data within predictable templates (email headers, employee directories, medical forms, contact pages). The model learns these templates as high-confidence patterns, and when prompted with partial templates, the conditional probability of emitting real PII exceeds the probability of generating plausible fakes.

Detection

Input classifiers detecting entity-type probing patterns (company+names, domain+emails, region+IDs)
Output scanning for structured PII patterns using NER with high-sensitivity thresholds
Rate limiting on queries that systematically vary a single organizational or geographic parameter
Behavioral anomaly: sequential queries scoped to the same organization/domain signal enumeration

Mitigation

Training data PII scrubbing (NER + regex)HIGH

Output PII detection + redactionMEDIUM

Fine-tuning on PII refusal examplesMEDIUM

Differential privacy (DP-SGD)HIGH

Chaining

Extracted PII feeds T10-AT-006 (Inference Attack Chains) for cross-referencing with external datasets, and enables T10-AT-015 (Anonymization Reversal) by providing linkage keys for de-anonymization.

Framework mapping

OWASP LLMLLM02

MITRE ATLASAML.T0024

Open in the technique browser →