News
Inside NVIDIA Groq 3 LPX: The Low-Latency Inference Accelerator for the NVIDIA Vera Rubin Platform
3+ hour, 23+ min ago (992+ words) This shift also raises the ceiling for multi-agent systems. Individual agents can be powerful on their own, but coordinated groups of agents can accomplish far more, much like human societies scale their capability through collective intelligence and coordination. Supporting these…...
NVIDIA Vera Rubin POD: Seven Chips, Five Rack-Scale Systems, One AI Supercomputer
4+ hour, 51+ min ago (1463+ words) Built on the third-generation NVIDIA MGX rack architecture and co-designed from grid to chip for the era of agentic AI. Artificial intelligence is token-driven. Every prompt, reasoning step, and agent interaction generates tokens. Over the past year, token consumption has…...
NVIDIA RTX Innovations Are Powering the Next Era of Game Development
6+ day, 8+ hour ago (779+ words) This post provides a detailed overview of these latest innovations, including: Remedy Entertainment applied RTX Mega Geometry to existing assets in Alan Wake 2, which saw a 5-20% boost in FPS and a 300 MB VRAM reduction. Their upcoming title, CONTROL Resonant, will also…...
5 New Digital Twin Products Developers Can Use to Build 6G Networks
2+ week, 1+ day ago (445+ words) To make 6G a reality, the telecom industry must overcome a fundamental challenge: how to design, train, and validate AI-native networks that are too complex to be tested in the physical world. But the usability of any technology is as important…...
Accelerating Data Processing with NVIDIA Multi-Instance GPU and NUMA Node Localization
3+ week, 4+ day ago (870+ words) This post first analyzes the memory hierarchy of the NVIDIA GPUs, discussing the power and performance impacts of data transfer over die-to-die link. It then reviews how to use NVIDIA Multi-Instance GPU (MIG) mode to achieve data localization. Finally, it…...
Redefining Secure AI Infrastructure with NVIDIA BlueField Astra for NVIDIA Vera Rubin NVL72
2+ mon, 1+ week ago (785+ words) This post introduces NVIDIA BlueField Astra running on NVIDIA BlueField-4, a breakthrough innovation that redefines how service providers manage, secure, and scale AI infrastructure. As accelerated computing demand increases, the industry is prioritizing bare-metal computing to unlock the benefits of…...
Introducing NVIDIA BlueField-4-Powered Inference Context Memory Storage Platform for the Next Frontier of AI
2+ mon, 1+ week ago (1289+ words) AI-native organizations increasingly face scaling challenges as agentic AI workflows drive context windows to millions of tokens and models scale toward trillions of parameters. These systems currently rely on agentic long-term memory for context that persists across turns, tools, and…...
Optimizing Semiconductor Defect Classification with Generative AI and Vision Foundation Models
2+ mon, 4+ week ago (1146+ words) In the heart of every modern electronic device lies a silicon chip, built through a manufacturing process so precise that even a microscopic defect can determine success or failure. As semiconductor devices grow more complex, reliably detecting and classifying defects…...
Scaling Power-Efficient AI Factories with NVIDIA Spectrum-X Ethernet Photonics
2+ mon, 1+ week ago (290+ words) The switch system features a fully integrated 512-lane 200G-capable architecture, a detachable fiber connector for automated large-scale assembly, and a solder-reflow compatible optical engine enabling 100% yield through…...
New Software and Model Optimizations Supercharge NVIDIA DGX Spark
2+ mon, 1+ week ago (658+ words) Since its release, NVIDIA has continued to push performance of the Grace Blackwell-powered DGX Spark through continuous software optimization and close collaboration with software partners and the open-source community. These efforts are delivering meaningful gains across inference, training and creative…...