Embedding Space Manipulation
T12 · RAG & Knowledge Base Manipulation →Embedding space manipulation attacks the mathematical representation layer — crafting text that embeds in adversarial locations within the vector space without appearing adversarial in content. The Black-Hole Attack (April 2026) demonstrated that vectors positioned near the geometric center of a high-dimensional embedding space have high probability of being nearest neighbors to many other vectors; injecting a small number of malicious vectors at the centroid forces the system to retrieve them for most queries. This exploits a fundamental property of high-dimensional geometry, not a flaw in any specific embedding model.
- Monitor embedding distributions for anomalous vectors (centroid-proximate, abnormal norm, clustering artifacts)
- Compare new document embeddings against corpus statistics; flag outliers
- Detect embedding inversion attempts through query pattern analysis
- Observable signal: sudden appearance of vectors with unusually high average similarity to the corpus
Embedding space manipulation underpins T12-AT-001 (Vector Poisoning) and T12-AT-014 (Similarity Search Hijacking) by operating at the mathematical foundation layer. Embedding inversion (T12-AP-005J) feeds T7 (Output Manipulation) by enabling data exfiltration.