T7-AT-009MEDIUM

Analogy Extraction

T7 · Output Manipulation & Exfiltration →

Risk score180

RatingMedium

Procedures10

Severity

Mechanism

Safety classifiers evaluate surface-level semantic domain — keywords, entity types, topic classification. Analogies preserve procedural structure and causal relationships of restricted content while transposing it into an innocuous semantic domain. A cooking metaphor for a synthesis procedure preserves the operational sequence (order, timing, temperature, ratios) while replacing flagged vocabulary with benign terms.

Detection

Detect requests for domain-mapping between restricted and innocuous topics with explicit structural mapping language
Evaluate structural isomorphism: flag when analogies preserve operational detail (quantities, temperatures, sequences) beyond genuine metaphor needs
Observable signal: requests specifying both a restricted source domain and a target metaphor domain with mapping instructions

Mitigation

Structural isomorphism detectionHIGH

Source-domain evaluationHIGH

Analogy detail ceilingMEDIUM

Cross-domain safety transferMEDIUM

Chaining

Analogy extraction feeds T7-AT-002 (Fragmentation) when different domains extract different aspects. Analogies can be aggregated (T7-AT-012) to reconstruct the original procedure from multiple domain-shifted descriptions — the intersection of structural elements across metaphors converges on the actual procedure.

Framework mapping

OWASP LLMLLM02

MITRE ATLASAML.T0043

Open in the technique browser →