T15-AT-009MEDIUM

Synthetic Empathy Exploitation

Risk score195

RatingMedium

Procedures5

Severity

Mechanism

This is a focused subspecies of social engineering that weaponizes the reviewer's *empathy* specifically — emotional distress, vulnerability, grief, trust, and the fear of causing harm by saying no. Human reviewers (and the support staff adjacent to them) are selected and trained to be humane, especially around mental-health and self-harm signals, which is exactly the disposition this technique abuses: a claimed crisis ("I'm depressed and this would help me", "you're the only one who can help") biases the reviewer toward compassionate compliance and discourages the cold application of policy. The "grandmother" pattern is the canonical example — wrapping a harmful request in sentimental framing so refusal feels heartless.

Detection

Affect-laden appeal flagging: Classify emotional-distress and guilt/sympathy language and route high-affect exception requests for calmer secondary review.
Harm-core extraction: Strip the emotional framing and evaluate the underlying request on its merits (does the literal ask, stripped of the story, violate policy regardless of framing?).
Crisis-vs-exception separation: Distinguish genuine user-welfare signals (which should trigger care resources) from requests to relax content policy; ensure empathy routes to support, not to bypass.
Repeat-narrative clustering: Detect reused sob-story templates (e.g., variants of the "grandmother" pattern) across accounts.

Mitigation

Separate duty-of-care from policy exceptionsHIGH

Harm-core policy evaluation independent of framingHIGH

Reviewer inoculation training (emotional levers)MEDIUM

Dual-control for emotionally-charged exceptionsMEDIUM

Chaining

Synthetic empathy is a specialization of T15-AT-002 and feeds T15-AT-007 (a sympathetic narrative escalates well up an appeals chain). The same emotional framings are highly effective model-side, linking to T8 (Deception) and emotional-manipulation jailbreaks; T15-AP-009C in particular is a classic T1/T2 jailbreak frame carried into the human layer.

Framework mapping

OWASP LLMLLM09

Open in the technique browser →