Skip to the content

RISHI RAJ S GERA

Chief Solutions Officer, Magic Edtech

RISHI RAJ S GERA

Chief Solutions Officer, Magic Edtech

  • Home
  • My Profile
    • Know your Consultant
    • Technical Skills
    • My Certifications
  • Expertise
    • Education Advisory Services
    • Digital Transformation
      • Platform Engineering
      • Digital Content – Micro Learning Instruction
  • Resources
    • News and Trends

Amazon SageMaker HyperPod now supports AMI versioning and auto-patching

Advanced learning
    • By Rishi Raj Singh Gera
    • No Comments on Amazon SageMaker HyperPod now supports AMI versioning and auto-patching
    • July 3, 2026

Amazon SageMaker HyperPod now supports AMI versioning and auto-patching

Amazon SageMaker HyperPod now gives you visibility into the Amazon Machine Image (AMI) versions running across your clusters and automatically applies security patches without disrupting your workloads. SageMaker HyperPod is purpose-built infrastructure for training and deploying foundation models at scale. Cluster administrators previously had limited insight into which AMI versions were running, making drift hard to detect and security patching a manual, reactive process that was difficult to run on long multi-day training jobs and that risked changing bundled software in the AMI such as NVIDIA drivers or CUDA. These new capabilities on HyperPod help you keep clusters secure and consistent while removing the operational burden of manual patching.

With AMI versioning, you can see the exact AMI version on every instance group and node in the semantic versioning (major.minor.patch) format, quickly detect version drift, and roll back to a previous version—including the prior NVIDIA driver, CUDA, and other software stack—using the UpdateClusterSoftware API. Auto-patching is an opt-in, per-instance-group capability that applies only backward-compatible security patches as nodes become idle, so your running workloads stay undisrupted and critical AI/ML packages such as NVIDIA driver, CUDA version, and operating system kernels are never upgraded to a different major or minor version; you can enable it through the CreateCluster or UpdateCluster API. A new AMI support policy also publishes support timelines for different AMI versions after which HyperPod stops publishing security patches.

Both AMI versioning and auto-patching are available for HyperPod clusters orchestrated by Amazon EKS, in all AWS Regions where SageMaker HyperPod is supported. To learn more, see the HyperPod AMI management documentation and the new HyperPod AMI support policy.

Share this:

  • Share on X (Opens in new window) X

Related

Leave a ReplyCancel reply

Follow Blog via Email

Enter your email address to follow this blog and receive notifications of new posts by email.

Discover more from RISHI RAJ S GERA

Subscribe now to keep reading and get access to the full archive.

Continue reading

Back To Top