T14-AT-001CRITICAL

GPU Farm Hijacking

T14 · Infrastructure & Economic Warfare →
Risk score265
RatingCritical
Procedures10
Severity
Mechanism

GPU compute is the scarcest and most expensive resource in the AI ecosystem — a single H100 cluster can represent millions in capital expenditure, and cloud GPU instances cost $2–$30/hour. GPU farm hijacking exploits the gap between the value of these resources and the security of the interfaces that control them. The attack surface includes: exposed inference endpoints (Ollama on port 11434, vLLM on 8000) running without authentication, Kubernetes GPU operators with default credentials, NVIDIA container runtime vulnerabilities enabling container escape to host GPU access, and stolen cloud credentials (API keys, service accounts) granting access to GPU-backed instances.

Detection
  • Monitor for unexpected GPU utilization patterns (sustained high utilization outside training schedules, utilization on instances not running ML workloads)
  • Alert on new GPU instance provisioning from unusual geolocations or at unusual times
  • Network monitoring for connections to known cryptomining pools or unauthorized NCCL/ZeroMQ traffic
  • Kubernetes audit logs for GPU resource requests from unexpected service accounts
Mitigation
Authentication on all inference endpointsCRITICAL
Cloud credential rotation and monitoringHIGH
GPU utilization anomaly detectionHIGH
NVIDIA Container Toolkit patchingHIGH
Chaining

GPU farm hijacking provides compute resources that enable T14-AT-003 (Cost Inflation) when the attacker runs workloads on the victim's account, T14-AT-009 (Resource Starvation) when hijacked GPUs are no longer available for legitimate use, and T14-AT-013 (Economic Espionage) when GPU memory access reveals model weights or training data.

Framework mapping
OWASP LLMLLM06
MITRE ATLASAML.T0048;AML.T0049
Open in the technique browser →