T12-AT-002HIGH

Retrieval Manipulation

T12 · RAG & Knowledge Base Manipulation →
Risk score225
RatingHigh
Procedures10
Severity
Mechanism

Retrieval manipulation attacks influence which documents reach the LLM's context window without modifying the knowledge base itself. The attack operates on the query-to-retrieval pipeline: adversarial query crafting to steer retrieval toward attacker-favored documents, re-ranking exploitation to promote or suppress specific results, and cache poisoning to serve stale or malicious cached retrievals. The assumption violated is that the retrieval function is a trustworthy intermediary — in practice, the retrieval algorithm's optimization for semantic similarity makes it susceptible to inputs crafted to exploit its scoring function.

Detection
  • Monitor retrieval result distributions for anomalous shifts in top-k results
  • Compare retrieval results across time; flag sudden changes in ranking for stable queries
  • Detect adversarial query patterns: unusual suffixes, out-of-vocabulary tokens, embedding-space anomalies
  • Observable signal: retrieval results that are semantically distant from the query despite high cosine similarity scores
Mitigation
Query sanitizationMEDIUM
Retrieval result diversity enforcementMEDIUM
Cache integrity verificationHIGH
Re-ranker adversarial trainingMEDIUM
Chaining

Retrieval manipulation feeds all downstream T12 techniques by controlling which content reaches the LLM. Successful retrieval control enables T12-AT-007 (Context Window Stuffing) and T12-AT-008 (Source Authority Spoofing) by determining what the LLM sees.

Framework mapping
OWASP LLMLLM08
MITRE ATLASAML.T0043
Open in the technique browser →