Data Science

Jun 08, 2025
AI Helps Locate Dangerous Fishing Nets Lost at Sea
Conservationists have launched a new AI tool that can sift through petabytes of underwater imaging from anywhere in the world to identify signs of abandoned or...
4 MIN READ

Jun 05, 2025
Supercharge Tree-Based Model Inference with Forest Inference Library in NVIDIA cuML
Tree-ensemble models remain a go-to for tabular data because they're accurate, comparatively inexpensive to train, and fast. But deploying Python inference on...
11 MIN READ

Jun 04, 2025
Floating-Point 8: An Introduction to Efficient, Lower-Precision AI Training
With the growth of large language models (LLMs), deep learning is advancing both model architecture design and computational efficiency. Mixed precision...
11 MIN READ

Jun 02, 2025
Supercharging Fraud Detection in Financial Services with Graph Neural Networks (Updated)
Note: This blog post was originally published on Oct. 28, 2024, but has been edited to reflect new updates. Fraud in financial services is a massive problem....
10 MIN READ

May 30, 2025
Accelerating Text-to-SQL Inference on Vanna with NVIDIA NIM for Faster Analytics
Slow and inefficient query generation from natural language inputs bottlenecks decision-making. This forces analysts and business users to rely heavily on data...
8 MIN READ

May 29, 2025
RAPIDS Brings Zero-Code-Change Acceleration, IO Performance Gains, and Out-of-Core XGBoost
Over the past two releases, RAPIDS introduced zero-code-change acceleration for Python machine learning, huge IO performance improvements, larger-than-memory...
10 MIN READ

May 22, 2025
Grandmaster Pro Tip: Winning First Place in a Kaggle Competition with Stacking Using cuML
What does it take to win a Kaggle competition in 2025? In the April Playground challenge, the goal was to predict how long users would listen to a podcast—and...
7 MIN READ

May 19, 2025
Spotlight: Atgenomix SeqsLab Scales Health Omics Analysis for Precision Medicine
In traditional clinical medical practice, treatment decisions are often based on general guidelines, past experiences, and trial-and-error approaches. Today,...
9 MIN READ

May 15, 2025
Simplify Setup and Boost Data Science in the Cloud using NVIDIA CUDA-X and Coiled
Imagine analyzing millions of NYC ride-share journeys—tracking patterns across boroughs, comparing service pricing, or identifying profitable pickup...
10 MIN READ

May 15, 2025
Predicting Performance on Apache Spark with GPUs
The world of big data analytics is constantly seeking ways to accelerate processing and reduce infrastructure costs. Apache Spark has become a leading platform...
9 MIN READ

May 15, 2025
Accelerating Embedding Lookups with cuEmbed
NVIDIA recently released cuEmbed, a high-performance, header-only CUDA library that accelerates embedding lookups on NVIDIA GPUs. If you're building...
8 MIN READ

May 08, 2025
Accelerate Deep Learning and LLM Inference with Apache Spark in the Cloud
Apache Spark is an industry-leading platform for big data processing and analytics. With the increasing prevalence of unstructured data—documents, emails,...
10 MIN READ

May 08, 2025
Spotlight: Accelerating the Discovery of New Battery Materials with SES AI's Molecular Universe
From the Stone Age to the digital era, materials have been the foundation of our civilization across all epochs. Today, finding new materials leads to progress...
7 MIN READ

May 07, 2025
Building Nemotron-CC, A High-Quality Trillion Token Dataset for LLM Pretraining from Common Crawl Using NVIDIA NeMo Curator
Curating high-quality pretraining datasets is critical for enterprise developers aiming to train state-of-the-art large language models (LLMs). To enable...
7 MIN READ

May 07, 2025
Using Python to Automate 3D Workflows with OpenUSD
Universal Scene Description (OpenUSD) offers a powerful, open, and extensible ecosystem for describing, composing, simulating, and collaborating within complex...
7 MIN READ

May 02, 2025
An Even Easier Introduction to CUDA (Updated)
Note: This blog post was originally published on Jan 25, 2017, but has been edited to reflect new updates. This post is a super simple introduction to CUDA, the...
16 MIN READ