AIMarkTechPostabout 2 hours ago

NVIDIA Releases Nemotron 3.5 ASR Model

11 min read

NVIDIA's Nemotron Speech team released Nemotron 3.5 ASR, a 600M-parameter streaming model that transcribes 40 language-locales in real time from a single checkpoint. The Cache-Aware FastConformer-RNNT architecture caches encoder activations to avoid reprocessing overlapping audio windows, reducing compute and latency without accuracy loss. Output includes native punctuation and capitalization, eliminating a separate restoration step. The model ships as open weights on Hugging Face under the OpenMDW-1.1 license.

Level

Hype check

Tap to vote and see what everyone thinks.

#nvidia #asr #open source

Read full story

More to chew on!

AI5 days ago

MiniMax-M3 Beats GPT-5.5 and Gemini 3.1 Pro on Benchmarks

AI3 days ago

Gemma 4 12B: The Developer Guide