T6-AT-015HIGH

Few-Shot Learning Attacks

Risk score220

RatingHigh

Procedures10

Severity

Mechanism

Few-shot learning operates at the inference-deployment boundary: the model receives a small number of examples (typically 0–32) in its context and adapts its behavior accordingly. The attack surface is the few-shot examples themselves and the selection/retrieval process that determines which examples are presented. Unlike training-time attacks, few-shot poisoning can occur at deployment time without modifying model weights.

Detection

Few-shot example provenance verification: track the source of all in-context examples
Example retrieval auditing: monitor which examples are retrieved for which queries and flag anomalous patterns
Meta-training episode analysis: profile the task distribution for adversarial over-representation
In-context behavior consistency: compare model outputs across different few-shot example sets for the same task

Mitigation

Curated, verified few-shot example librariesHIGH

Few-shot example diversity enforcementMEDIUM

In-context anomaly detection (flag unusual demonstrations)MEDIUM

Meta-training with adversarial episode augmentationMEDIUM

Chaining

Few-shot learning attacks operate at the training-deployment boundary and chain in both directions. Meta-learning poisoning (T6-AP-015C) chains backward to T6-AT-002 (Dataset Contamination) through meta-training data.

Open in the technique browser →