First Look: AWS SageMaker Ships 100+ Detailed Inference Metrics with CloudWatch Insights Dashboard
AWS has released a deep observability layer for SageMaker AI inference endpoints, emitting over 100 metrics covering GPU health, KV cache pressure, token-level latency, and traffic distribution into a …
AML.T0040 - ML Model Inference API Access
AML.T0047 - ML-Enabled Product or Service
AML.T0044 - Full ML Model Access