Nemotron 3 Ultra now available on AI Gateway

  • Thread starter: Walter Korman, Rohan Taneja, Jerilyn Zheng
Nemotron 3 Ultra from Nvidia is now available on Vercel AI Gateway.

Nemotron 3 Ultra is an open Mixture-of-Experts reasoning model built for orchestrating long-running agent workflows, with a 1M token context window. The model targets multi-turn agent workflows: planning, tool use, sub-agent delegation, and error recovery. Throughput reaches up to 350 tokens per second, with up to 30% lower cost on agentic tasks.

To use Nemotron 3 Ultra, set model to nvidia/nemotron-3-ultra-550b-a55b in the AI SDK.
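As a rough illustration of that step, the sketch below builds the options object with the model ID from this announcement. The `buildRequest` helper and `TextRequest` type are hypothetical names introduced only for this example, and the commented-out call assumes the `ai` package is installed and AI Gateway credentials (for instance an `AI_GATEWAY_API_KEY` environment variable) are configured.

```typescript
// Model ID taken from the announcement; everything else is illustrative.
const MODEL_ID = "nvidia/nemotron-3-ultra-550b-a55b";

// Hypothetical shape for the options passed to an AI SDK text call.
interface TextRequest {
  model: string;
  prompt: string;
}

// Hypothetical helper: pairs a prompt with the Nemotron 3 Ultra model ID.
function buildRequest(prompt: string): TextRequest {
  if (prompt.trim() === "") {
    throw new Error("prompt must not be empty");
  }
  return { model: MODEL_ID, prompt };
}

// Assuming the `ai` package and gateway credentials are available,
// the request could then be streamed roughly like this:
//
//   import { streamText } from "ai";
//
//   const result = streamText(buildRequest("Plan the migration steps."));
//   for await (const chunk of result.textStream) {
//     process.stdout.write(chunk);
//   }
```

Keeping the model ID in one constant means switching models later is a single-line change.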


AI Gateway provides a unified API for calling models, tracking usage and cost, and configuring retries, failover, and performance optimizations for higher-than-provider uptime. It includes built-in custom reporting, Zero Data Retention support, dynamic provider sorting by latency and cost, and more. AI Gateway reflects provider pricing with no markup and does not charge a platform fee on inference, including on Bring Your Own Key (BYOK) requests.

Learn more about AI Gateway, view the AI Gateway model leaderboard or try it in our model playground.
