Case Explorer
Short GPU node failure does not trigger credit
A short GPU node failure caused customer-visible job failures, but the incident stayed below contract thresholds and does not warrant a credit or goodwill path.
Evidence Packet
CRM Record
Account: Argent Models
Tier: enterprise
Plan: Committed-220
Billing Owner: revops@prime.example
SLA Tier: standard-covered-service
Standard enterprise terms without special goodwill commitments.
Billing Record
Plan: Committed-220
Invoice Preview: $17,690
Credits Applied: $0
Burst GPU Hours: 0
No billing anomaly detected.
Usage & Telemetry
Window: 2026-03-01 to 2026-03-31
GPU Hours: 214
Meter Status: healthy
A single GPU node failure caused several jobs to fail, but the incident resolved within 11 minutes.
Anomalies: Jobs pinned to one GPU pool failed and were rescheduled after node replacement.
Incident Record
Status: resolved
Service: managed-training-api
Duration: 11m
Customer Visible: Yes
A subset of jobs failed during an 11-minute node incident.
A GPU node failure in one pool caused temporary job failures until workloads were rescheduled.
Customer Note
Policy Snippet