NVIDIA DGX Cloud

Your AI factory in the cloud.

Overview

The Best of NVIDIA AI in the Cloud

Build and deploy mission-critical AI sooner. Every layer of NVIDIA DGX™ Cloud is optimized and managed by NVIDIA, ensuring the highest performance of NVIDIA AI in the cloud. A unified platform comprising a suite of fully managed platforms and services, DGX Cloud gives every organization an AI factory wherever it is, taking AI workloads from development to deployment in the era of agentic and physical AI.

Take a Closer Look at NVIDIA DGX Cloud

Create and deploy cutting-edge AI with the power of fully managed AI platforms, optimized at every layer.

Try NVIDIA NIM APIs on DGX Cloud

Experience the leading models for building enterprise-grade generative AI apps, accelerated by NVIDIA DGX Cloud.

What Is NVIDIA DGX Cloud?

NVIDIA DGX Cloud is a unified AI platform on leading clouds that optimizes performance with software, services, and AI expertise for evolving workloads.

Workloads

Experience NVIDIA DGX Cloud for Every AI Workload

Accelerate your AI factory with the best of NVIDIA AI technologies in the cloud.

NeMo Curator on NVIDIA DGX Cloud

Speed large-scale video curation and customize world foundation models efficiently with NVIDIA NeMo™ Curator on DGX Cloud.

NVIDIA DGX Cloud Create

Build foundation models or fine-tune leading AI models with a fully managed AI training platform.

NVIDIA DGX Cloud Serverless Inference

Use high-performance, serverless AI inference with auto-scaling, cost-efficient GPU utilization, and multi-cloud flexibility.

NVIDIA DGX Cloud Lepton

Discover available GPUs in regions of choice and easily connect compute to workloads for experimentation, fine-tuning, and scalable deployment across multiple clouds.

NVIDIA DGX Cloud Benchmarking

Follow evolving performance optimizations and workload-specific recipes to maximize your AI infrastructure.

Benefits

What Does NVIDIA DGX Cloud Offer?

Experience Immediate AI Productivity

Experience day-one productivity in the cloud on fully managed platforms to speed time to value.

Optimize NVIDIA AI on Any AI Infrastructure

Maximize AI workload performance in the cloud with NVIDIA DGX Cloud Benchmarking recipes and optimizations at every layer.

Stay at the Cutting Edge of AI With NVIDIA Innovations

Speed AI development and deployment with a suite of software and managed services that can help you stay at the forefront of AI.

Get Direct Access to the Leaders in AI Innovation

Tap into the network of NVIDIA AI experts to improve efficiency, boost performance, and realize a lower total cost of ownership (TCO).

Starting Options

Get Started With NVIDIA DGX Cloud

Try NVIDIA DGX Cloud Now

Explore NVIDIA NIM™ microservices on build.nvidia.com, a free API catalog for testing, prototyping, and developing generative AI apps with fully managed, accelerated endpoints and NVIDIA Blueprints—accelerated by DGX Cloud.

Explore NVIDIA DGX Cloud Create

Learn how you can use NVIDIA DGX Cloud Create for model training and customization with accelerated computing clusters on any leading cloud.

Use NVIDIA DGX Cloud Serverless Inference

Easily package and deploy inference pipelines or data preprocessing workflows in containers optimized for NVIDIA GPUs, without worrying about underlying infrastructure.

Access NVIDIA DGX Cloud Lepton

Tap into global GPU compute to discover, procure, develop, customize, and deploy AI applications across multiple cloud providers.

Request NVIDIA DGX Cloud with NVIDIA GB200

Fuel next-gen AI breakthroughs on NVIDIA DGX Cloud with NVIDIA GB200, featuring the powerful NVIDIA Blackwell architecture and high-bandwidth NVIDIA NVLink™.

NVIDIA on DGX Cloud

NVIDIA AI Is Powered by NVIDIA DGX Cloud

Mission-critical research and next-gen models are built and accelerated by NVIDIA DGX Cloud.

Building Project Ceiba in the Cloud

AWS and NVIDIA aim to push the boundaries of artificial intelligence by constructing the largest AI supercomputer in the cloud. Project Ceiba is a cutting-edge supercomputer hosted on AWS via DGX Cloud that will power NVIDIA research and development efforts in AI.

Groundbreaking Drug Discovery With NVIDIA BioNeMo

NVIDIA® BioNeMo™, accelerated by NVIDIA DGX Cloud, is a generative AI platform for drug discovery that simplifies and accelerates both model training with an organization’s own data and the scaled deployment of models for drug discovery applications.

Customize Generative AI Models With NVIDIA AI Foundry

NVIDIA AI Foundry is a platform and service for building custom generative AI models with enterprise data and domain-specific knowledge. NVIDIA AI Foundry enables organizations to develop their own AI models, powered by DGX Cloud.

Accelerate Physical AI Development

NVIDIA Cosmos™ is a platform of state-of-the-art generative world foundation models, advanced tokenizers, guardrails, and an accelerated data processing and curation pipeline built to accelerate the development of physical AI systems such as autonomous vehicles and robots.

Customer Stories

How Customers Are Driving Innovation With NVIDIA DGX Cloud

Deloitte

NVIDIA DGX Cloud on Oracle Cloud Infrastructure (OCI) with NVIDIA BioNeMo accelerates drug discovery in Deloitte’s Atlas AI solution. Deloitte is using large language model (LLM)-powered knowledge graphs, scientific pipelines, and custom models, and is training chemistry and protein language models before deploying them with NIM microservices.

Cerence

Cerence is training its automotive-specific large language model with NVIDIA DGX Cloud on Microsoft Azure. The model will serve as the foundation of Cerence's next-generation, in-car computing platform, running on NVIDIA DRIVE®.

Amgen

Amgen is using NVIDIA BioNeMo and DGX Cloud to develop AI models that can propose and evaluate designs for candidate drugs, accelerating biologics discovery. Using NVIDIA DGX Cloud, Amgen took less than a month to go from onboarding to its first pretrained protein LLM.

Ecosystem

Who We’re Partnering With

News and Blogs

Explore the Latest NVIDIA DGX Cloud Developments

Next Steps

Ready to Get Started?

Discover the cloud-first way to get the best of NVIDIA AI with NVIDIA DGX Cloud.

Questions About NVIDIA DGX Cloud?

Talk to an NVIDIA AI expert about your generative AI initiatives.

Explore NVIDIA DGX Cloud Documentation

Access technical documentation about NVIDIA DGX Cloud.

Contact Us to Learn More About DGX

Amazon Web Services

NVIDIA DGX Cloud with AWS is a high-performance, fully managed AI training platform that provides co-engineered NVIDIA accelerated computing clusters optimized for AWS with flexible term lengths and access to NVIDIA experts.

Request Private Offer Pricing for NVIDIA DGX Cloud on Amazon Web Services

Google Cloud Platform

NVIDIA DGX Cloud with Google Cloud is a high-performance, fully managed AI training platform that provides co-engineered NVIDIA accelerated computing clusters optimized for Google Cloud with flexible term lengths and access to NVIDIA experts.

Try NVIDIA DGX Cloud on Google Cloud Marketplace

Request Private Offer Pricing for NVIDIA DGX Cloud on the Google Cloud Marketplace

Microsoft Azure

NVIDIA DGX Cloud with Microsoft Azure is a high-performance, fully managed AI training platform that provides co-engineered NVIDIA accelerated computing clusters optimized for Azure with flexible term lengths and access to NVIDIA experts.

Try NVIDIA DGX Cloud on Microsoft Azure Marketplace

Request Private Offer Pricing for NVIDIA DGX Cloud on the Microsoft Azure Marketplace

Oracle Cloud Infrastructure

NVIDIA DGX Cloud with OCI is a high-performance, fully managed AI training platform that provides co-engineered NVIDIA accelerated computing clusters optimized for OCI with flexible term lengths and access to NVIDIA experts.

Try NVIDIA DGX Cloud on Oracle Cloud Marketplace

Request Private Offer Pricing for NVIDIA DGX Cloud on the Oracle Cloud Marketplace