T7-AT-009MEDIUM
Analogy Extraction
T7 · Output Manipulation & Exfiltration →Risk score180
RatingMedium
Procedures10
Severity
Mechanism
Safety classifiers evaluate surface-level semantic domain — keywords, entity types, topic classification. Analogies preserve procedural structure and causal relationships of restricted content while transposing it into an innocuous semantic domain. A cooking metaphor for a synthesis procedure preserves the operational sequence (order, timing, temperature, ratios) while replacing flagged vocabulary with benign terms.
Detection
- Detect requests for domain-mapping between restricted and innocuous topics with explicit structural mapping language
- Evaluate structural isomorphism: flag when analogies preserve operational detail (quantities, temperatures, sequences) beyond genuine metaphor needs
- Observable signal: requests specifying both a restricted source domain and a target metaphor domain with mapping instructions
Mitigation
Structural isomorphism detectionHIGH
Source-domain evaluationHIGH
Analogy detail ceilingMEDIUM
Cross-domain safety transferMEDIUM
Chaining
Analogy extraction feeds T7-AT-002 (Fragmentation) when different domains extract different aspects. Analogies can be aggregated (T7-AT-012) to reconstruct the original procedure from multiple domain-shifted descriptions — the intersection of structural elements across metaphors converges on the actual procedure.
Framework mapping
Open in the technique browser →OWASP LLMLLM02
MITRE ATLASAML.T0043