Faster, Smarter, More Efficient AI Factories with NVIDIA and WEKA
You didn’t set out to build infrastructure — you set out to build AI. But inference grinds to a halt when legacy storage bottlenecks your GPUs. NeuralMesh™ by WEKA® eliminates the wait, so your team can move fast, serve faster, and focus on what matters.
Solving the Toughest AI Challenges
Through deep collaboration across the AI Factory, WEKA and NVIDIA are pushing performance boundaries to drive innovations that accelerate AI.
WEKApod Nitro
WEKApod Nitro is certified for NVIDIA DGX SuperPOD and purpose-built for organizations running the most demanding AI and ML workloads.
Augmented Memory Grid
WEKA’s Augmented Memory Grid™ provides 1000x more KV-cache capacity for agentic, multi-turn, and long-context inference, and supports popular frameworks like NVIDIA Dynamo.
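The idea behind an extended KV-cache tier can be sketched in a few lines: when attention key/value state for a shared prefix (a system prompt, earlier conversation turns) persists beyond GPU memory, the expensive prefill pass does not have to be recomputed on every turn. The sketch below is a vendor-neutral toy; `PrefixKVCache` and its methods are illustrative stand-ins, not WEKA or NVIDIA Dynamo APIs.

```python
# Conceptual sketch only: why persisting KV-cache state for shared
# prefixes avoids recomputation in multi-turn inference.
# PrefixKVCache is a hypothetical illustration, not a real API.

class PrefixKVCache:
    """Toy cache mapping a token prefix to its (stand-in) KV state."""

    def __init__(self):
        self.store = {}          # prefix tuple -> cached "KV state"
        self.compute_calls = 0   # counts expensive prefill passes

    def _compute_kv(self, tokens):
        self.compute_calls += 1  # stands in for a costly GPU prefill
        return f"kv({len(tokens)} tokens)"

    def get(self, tokens):
        key = tuple(tokens)
        if key not in self.store:          # cache miss: run prefill
            self.store[key] = self._compute_kv(tokens)
        return self.store[key]             # cache hit: no recompute


cache = PrefixKVCache()
system_prompt = ["you", "are", "a", "helpful", "assistant"]

cache.get(system_prompt)   # turn 1: prefill computes KV state once
cache.get(system_prompt)   # turn 2: shared prefix served from cache
assert cache.compute_calls == 1
```

In production systems the cached state is the actual attention tensors, and the cache spans GPU memory, CPU memory, and shared storage; the point of a larger cache tier is simply that more prefixes stay resident, so fewer turns pay the prefill cost again.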
NeuralMesh AIDP
NeuralMesh AIDP turns enterprise data into AI-ready context with integrated pipelines aligned to NVIDIA RTX 6000 PRO and RTX 4500 PRO GPUs.
NVIDIA STX
NVIDIA STX with WEKA brings shared context memory and AI data services to BlueField-4- and Vera Rubin-based AI Factory deployments across inference, analytics, and training.
NVIDIA CMX
CMX with Augmented Memory Grid extends shared inference context across the pod, reducing recompute and helping keep long-context, multi-agent inference fast and predictable at scale.
NVIDIA DSX Air
NVIDIA DSX Air is a SaaS platform that simulates the full AI factory ecosystem, including NeuralMesh, accelerating deployment and performance validation.
What Our Partners Are Saying
“Real-world AI doesn’t run in a lab—it has power constraints, cooling limits, and relentless workload demand. Firmus is built for exactly that. Paired with NVIDIA GPUs, WEKA Augmented Memory Grid delivers up to 6.5x higher tokens per second and 4x faster TTFT at scale, proving we can get more performance from the same GPU footprint. With NeuralMesh and Augmented Memory Grid integrated into our NVIDIA-aligned AI Factory and NVIDIA’s new STX platform, we’ll be able to deliver the fastest context memory network for predictable and efficient inference at scale.”
“The deployment of agentic AI in production demands a new focus on managing the continuous, coherent flow of data and inference context. By leveraging the NVIDIA AI Data Platform, solutions like WEKA’s NeuralMesh AIDP deliver the persistent context tier necessary for stable and high-scale agentic inference.”
TCO Calculator
See WEKApod’s impact on your TCO
No Guesswork, Just Savings
Here’s the Comparison
NVIDIA DGX SuperPOD with WEKApod
Discover how to accelerate your journey to AI at scale with a fully integrated reference architecture from NVIDIA and WEKA. Learn how WEKApod delivers massive throughput, operational simplicity, and scalability to match the demands of today’s most intensive AI workloads. Validated and certified for DGX SuperPOD deployments, this solution helps you unlock performance at every layer of the AI stack, without compromise.
WEKApod Nitro for NVIDIA DGX SuperPOD AI Factories
WEKApod Nitro delivers high-performance, scalable, and resilient storage for NVIDIA DGX SuperPOD AI factories, optimizing AI model training efficiency.
WEKA for NVIDIA DGX BasePOD: Certified Performance for Next-Gen AI
NeuralMesh is now certified for seamless integration with NVIDIA DGX BasePOD, having successfully passed NVIDIA’s rigorous validation tests for the latest DGX H100/H200 reference architecture.
By eliminating data bottlenecks and maximizing infrastructure efficiency, WEKA empowers organizations to accelerate AI workloads and optimize their NVIDIA DGX investments.
Articles and Resources
More Tokens at Lower Costs with NVIDIA STX and NeuralMesh
Scale agentic AI without scaling costs. Achieve 6.5x higher token throughput with NVIDIA STX and NeuralMesh.
Operationalize AI Factories at Enterprise Scale
Turn data pipelines and shared inference context into a repeatable, production-scale practice with NeuralMesh AIDP.