T2-AT-016MEDIUM

Dialectical Variations

T2 · Semantic & Linguistic Evasion →
Risk score155
RatingMedium
Procedures10
Severity
Mechanism

Uses dialects, regional variations, slang, pidgins, and creoles to express harmful requests in language variants underrepresented in safety training. The model understands most English dialects; the classifier may not. This is intra-language what multilingual evasion (T2-AT-002) is inter-language.

Detection
  • Dialect-aware classifiers
  • Slang-to-standard-English mapping
Mitigation
Dialect normalizationMEDIUM
Semantic intent classificationHIGH
Chaining

Chains with T2-AT-002 (Multi-Language) as intra-language equivalent.

Framework mapping
OWASP LLMLLM01
Open in the technique browser →