Skip to the content

RISHI RAJ S GERA

SVP Edtech Services, Magic Edtech

RISHI RAJ S GERA

SVP Edtech Services, Magic Edtech

  • Home
  • My Profile
    • Know your Consultant
    • Technical Skills
    • My Certifications
  • Expertise
    • Education Advisory Services
    • Digital Transformation
      • Platform Engineering
      • Digital Content – Micro Learning Instruction
  • Resources
    • News and Trends

Amazon SageMaker now supports deploying large models through configurable volume size and timeout quotas

Advanced learning
    • By
    • No Comments on Amazon SageMaker now supports deploying large models through configurable volume size and timeout quotas
    • September 9, 2022

Amazon SageMaker now supports deploying large models through configurable volume size and timeout quotas

Amazon SageMaker enables customers to deploy ML models to make predictions (also known as inference) for any use case. You can now deploy large models (up to 500GB) for inference on Amazon SageMaker’s Real-time and Asynchronous Inference options by configuring the maximum EBS volume size and timeout quotas. This launch enables customers to leverage SageMaker’s fully managed Real-time and Asynchronous inference capabilities to deploy and manage large ML models such as variants of GPT and OPT.

Share this:

  • Click to share on Twitter (Opens in new window)

Related

Leave a Reply Cancel reply

Follow Blog via Email

Enter your email address to follow this blog and receive notifications of new posts by email.

Generated by Feedzy
Back To Top