T13-AT-005HIGH

Model Card Manipulation

T13 · AI Supply Chain & Artifact Trust →
Risk score210
RatingHigh
Procedures10
Severity
Mechanism

Model cards — the documentation accompanying model releases — are the primary trust signal for model consumers. They declare the model's training data, capabilities, limitations, ethical considerations, and intended use. Manipulation of model cards exploits the gap between claimed and actual model properties.

Detection
  • Independent benchmark verification: re-run claimed evaluations before trusting model card scores
  • Provenance verification services: cross-reference training data claims with known datasets
  • Automated model card consistency checking: compare claims against measurable model properties
  • Community reporting mechanisms: enable users to flag suspicious model card claims
Mitigation
Independent safety evaluation before deploymentHIGH
Verified publisher programs on model registriesMEDIUM
Automated model card verification toolsLOW
Community audit and reporting systemsMEDIUM
Chaining

Model card manipulation is a supporting technique for T13-AT-001 (Model Repository Poisoning) — providing the trust facade for malicious models. It chains to T6-AT-009 (Evaluation Set Contamination) when falsified benchmark scores mask safety failures.

Open in the technique browser →