Development & Optimization

Jun 06, 2025

How NVIDIA GB200 NVL72 and NVIDIA Dynamo Boost Inference Performance for MoE Models

The latest wave of open source large language models (LLMs), like DeepSeek R1, Llama 4, and Qwen3, have embraced Mixture of Experts (MoE) architectures. Unlike...

12 MIN READ

Jun 04, 2025

Maximizing OpenMM Molecular Dynamics Throughput with NVIDIA Multi-Process Service

Molecular dynamics (MD) simulations model atomic interactions over time and require significant computational power. However, many simulations have small...

7 MIN READ

Jun 03, 2025

NVIDIA Base Command Manager Offers Free Kickstart for AI Cluster Management

As AI and high-performance computing (HPC) workloads continue to become more common and complex, system administrators and cluster managers are at the heart of...

3 MIN READ

May 27, 2025

Upcoming Webinar: Supercharge Agentic AI with Scalable Data Flywheels

Join our live webinar on June 18 to see how NVIDIA NeMo microservices speed AI agent development.

1 MIN READ

May 27, 2025

Advanced Optimization Strategies for LLM Training on NVIDIA Grace Hopper

In the previous post, Profiling LLM Training Workflows on NVIDIA Grace Hopper, we explored the importance of profiling large language model (LLM) training...

10 MIN READ

May 27, 2025

Profiling LLM Training Workflows on NVIDIA Grace Hopper

The rapid advancements in AI have resulted in an era of exponential growth in model sizes, particularly in the domain of large language models (LLMs). These...

12 MIN READ

May 22, 2025

Spotlight: Infleqtion Optimizes Portfolios Using Q-CHOP and NVIDIA CUDA-Q Dynamics

Computing is an essential tool for the modern financial services industry. Profits are won and lost based on the speed and accuracy of algorithms guiding...

9 MIN READ

May 15, 2025

Predicting Performance on Apache Spark with GPUs

The world of big data analytics is constantly seeking ways to accelerate processing and reduce infrastructure costs. Apache Spark has become a leading platform...

9 MIN READ

May 14, 2025

Get Trained and Certified at GTC Paris at VivaTech 2025

Join us at GTC Paris on June 10th and choose from six full-day, instructor-led workshops.

1 MIN READ

May 09, 2025

CUDA C++ Compiler Updates Impacting ELF Visibility and Linkage

In the next CUDA major release, CUDA 13.0, NVIDIA is introducing two significant changes to the NVIDIA CUDA Compiler Driver (NVCC) that will impact ELF...

11 MIN READ

May 05, 2025

Just Released: CUDA 12.9

New features include enhancements to confidential computing and family-specific features and targets supported by NVCC.

1 MIN READ

May 02, 2025

An Even Easier Introduction to CUDA (Updated)

Note: This blog post was originally published on Jan 25, 2017, but has been edited to reflect new updates. This post is a super simple introduction to CUDA, the...

16 MIN READ

May 02, 2025

HackAI Challenge Winners Announced

Explore the groundbreaking projects and real-world impacts of the HackAI Challenge powered by NVIDIA AI Workbench and Dell Precision.

1 MIN READ

May 01, 2025

NVIDIA Blackwell and NVIDIA CUDA 12.9 Introduce Family-Specific Architecture Features

One of the earliest architectural design decisions that went into the CUDA platform for NVIDIA GPUs was support for backward compatibility of GPU code. This...

14 MIN READ

May 01, 2025

Stacking Generalization with HPO: Maximize Accuracy in 15 Minutes with NVIDIA cuML

Stacking generalization is a widely used technique among machine learning (ML) engineers, where multiple models are combined to boost overall predictive...

7 MIN READ

A fireside chat with Kaggle Grandmasters.

Apr 29, 2025

Kaggle Grandmasters Unveil Winning Strategies for Data Science Superpowers

Kaggle Grandmasters David Austin and Chris Deotte from NVIDIA and Ruchi Bhatia from HP joined Brenda Flynn from Kaggle at this year’s Google Cloud Next...

9 MIN READ