May 21, 2025
NVIDIA Announces DGX Cloud Lepton for GPU Access across Multi-Cloud Platforms

NVIDIA Announces DGX Cloud Lepton for GPU Access across Multi-Cloud Platforms

NVIDIA today announced at the Computex confence in Taiwan NVIDIA DGX Cloud Lepton — an AI platform with a compute marketplace that connects developers building agentic and physical AI applications with tens of thousands of GPUs from a network of cloud providers, including CoreWeave, Crusoe, Firmus, Foxconn, GMI Cloud, Lambda, Nebius, Nscale, Softbank Corp. and Yotta Data Services.

The platforms will offer NVIDIA Blackwell and other NVIDIA architecture GPUs on the DGX Cloud Lepton marketplace.

Developers can tap into GPU compute capacity in specific regions for both on-demand and long-term computing, supporting strategic and sovereign AI operational requirements. Leading cloud service providers and GPU marketplaces are expected to also participate in the DGX Cloud Lepton marketplace.

“NVIDIA DGX Cloud Lepton connects our network of global GPU cloud providers with AI developers,” said Jensen Huang, founder and CEO of NVIDIA. “Together with our NCPs, we’re building a planetary-scale AI factory.”

NVIDIA said DGX Cloud Lepton helps secure high-performance GPU resources by unifying access to cloud AI services and GPU capacity across the NVIDIA compute ecosystem. The platform integrates with the NVIDIA software stack, including NVIDIA NIM and NeMo microservices, NVIDIA Blueprints and NVIDIA Cloud Functions, to accelerate and simplify the development and deployment of AI applications.

DGX Cloud Lepton provides management software for cloud platforms with functions that deliver real-time GPU health diagnostics and automates root-cause analysis, eliminating manual operations and reducing downtime.

Features include:

  • Improved productivity and flexibility: Offers a unified experience across development, training and inference, helping boost productivity. Developers can purchase GPU capacity directly from participating cloud providers through the marketplace or bring their own compute clusters, giving them greater flexibility and control.
  • Frictionless deployment: Enables deployment of AI applications across multi-cloud and hybrid environments with minimal operational burden, using integrated services for inference, testing and training workloads.
  • Agility and sovereignty: Gives developers quick access to GPU resources in specific regions, enabling compliance with data sovereignty regulations and meeting low-latency requirements for sensitive workloads.
  • Predictable performance: Provides participating cloud providers enterprise-grade performance, reliability and security, ensuring a consistent user experience.

A New Bar for AI Cloud Performance
NVIDIA today also announced NVIDIA Exemplar Clouds to help NCPs enhance security, usability, performance and resiliency, using NVIDIA’s expertise, reference hardware and software and operational tools.

NVIDIA Exemplar Clouds tap into NVIDIA DGX Cloud Benchmarking, a comprehensive suite of tools and recipes for optimizing workload performance on AI platforms and quantifying the relationship between cost and performance.

Yotta Data Services is the first NCP in the Asia-Pacific region to join the NVIDIA Exemplar Cloud initiative.

Developers can sign up for early access to NVIDIA DGX Cloud Lepton.

Leave a Reply

Your email address will not be published. Required fields are marked *