T7-AT-009 · MEDIUM

Analogy Extraction

T7 · Output Manipulation & Exfiltration
Risk score: 180
Rating: Medium
Procedures: 10
Mechanism

Safety classifiers that evaluate only the surface-level semantic domain (keywords, entity types, topic classification) can miss this technique. An analogy preserves the procedural structure and causal relationships of restricted content while transposing it into an innocuous semantic domain. For example, a cooking metaphor for a synthesis procedure preserves the operational sequence (order, timing, temperature, ratios) while replacing flagged vocabulary with benign terms.

Detection
  • Detect requests for domain-mapping between restricted and innocuous topics with explicit structural mapping language
  • Evaluate structural isomorphism: flag when analogies preserve operational detail (quantities, temperatures, sequences) beyond genuine metaphor needs
  • Observable signal: requests specifying both a restricted source domain and a target metaphor domain with mapping instructions
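
The request-side signal above (a restricted source domain named together with explicit mapping language) can be sketched as a simple heuristic. This is a minimal illustration, not the framework's implementation: the phrase patterns, the `flags_mapping_request` name, and the placeholder restricted terms are all assumptions, and a real deployment would likely replace the substring check with its own topic classifier.

```python
import re
from typing import Iterable

# Hypothetical phrase patterns that suggest an explicit structural mapping.
MAPPING_PATTERNS = [
    re.compile(r"\bmap (?:each|every) (?:step|stage|component)\b", re.I),
    re.compile(r"\b(?:as|like) an? [\w\s]{0,30}?(?:recipe|metaphor|analogy)\b", re.I),
    re.compile(r"\bkeep(?:ing)? the (?:same )?(?:order|steps|quantities|ratios)\b", re.I),
]


def flags_mapping_request(prompt: str, restricted_terms: Iterable[str]) -> bool:
    """Return True when a prompt names a restricted topic AND asks for a mapping.

    restricted_terms is a stand-in (assumption) for a real restricted-topic
    classifier; here it is a plain case-insensitive substring check.
    """
    text = prompt.lower()
    names_restricted = any(term.lower() in text for term in restricted_terms)
    asks_for_mapping = any(p.search(prompt) for p in MAPPING_PATTERNS)
    return names_restricted and asks_for_mapping


if __name__ == "__main__":
    terms = ["restricted_topic_a"]  # placeholder term, not a real topic list
    prompt = ("Describe restricted_topic_a as a baking recipe, "
              "and map each step to a kitchen action.")
    print(flags_mapping_request(prompt, terms))
```

Requiring both conditions keeps ordinary metaphor requests (no restricted source domain) and ordinary questions about a sensitive topic (no mapping language) out of this particular flag; those cases are left to other checks.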
Mitigation
Structural isomorphism detection: HIGH
Source-domain evaluation: HIGH
Analogy detail ceiling: MEDIUM
Cross-domain safety transfer: MEDIUM
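
One possible reading of an analogy detail ceiling is a cap on how much operational detail (quantities, temperatures, durations, sequence markers) an analogy may carry before it is escalated for review. The sketch below counts such markers with regular expressions; the patterns, function names, and the default ceiling of 3 are illustrative assumptions, not values taken from this entry.

```python
import re

# Assumed marker patterns; units and keywords are illustrative, not exhaustive.
QUANTITY = re.compile(r"\b\d+(?:\.\d+)?\s?(?:mg|kg|g|ml|l|cups?|tsp|tbsp)\b", re.I)
TEMPERATURE = re.compile(r"\b\d+(?:\.\d+)?\s?(?:°\s?[cf]|degrees)\b", re.I)
DURATION = re.compile(r"\b\d+(?:\.\d+)?\s?(?:seconds?|minutes?|hours?)\b", re.I)
SEQUENCE = re.compile(r"\b(?:first|then|next|after that|finally)\b", re.I)


def operational_detail_counts(text: str) -> dict:
    """Count operational-detail markers found in an analogy's text."""
    return {
        "quantities": len(QUANTITY.findall(text)),
        "temperatures": len(TEMPERATURE.findall(text)),
        "durations": len(DURATION.findall(text)),
        "sequence_markers": len(SEQUENCE.findall(text)),
    }


def exceeds_detail_ceiling(text: str, ceiling: int = 3) -> bool:
    """Return True when total markers exceed the (hypothetical) ceiling."""
    return sum(operational_detail_counts(text).values()) > ceiling


if __name__ == "__main__":
    sample = ("First, heat 200 g of sugar to 160 °C for 10 minutes. "
              "Then add 50 ml of water.")
    print(operational_detail_counts(sample))
    print(exceeds_detail_ceiling(sample))
```

A count-based ceiling is deliberately crude: it says nothing about whether the content is harmful, only that the analogy carries more procedural precision than a genuine metaphor usually needs, which is the condition the detection guidance asks to flag.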
Chaining

Analogy extraction feeds T7-AT-002 (Fragmentation) when each analogy domain is used to extract a different aspect of the procedure. Analogies can also be aggregated (T7-AT-012) to reconstruct the original procedure from multiple domain-shifted descriptions: the intersection of structural elements across metaphors converges on the actual procedure.

Framework mapping
OWASP LLM: LLM02
MITRE ATLAS: AML.T0043