The model runs 25 percent faster than earlier versions, thanks to optimisations in the inference stack and co-design with Nvidia's GB200 NVL72 systems. That efficiency shines during long-running ...