AIMarkTechPostabout 4 hours ago

NVIDIA Canary-1B-v2 ASR and Translation Tutorial

6 min read

A tutorial demonstrates building a speech recognition and translation pipeline using NVIDIA Canary-1B-v2 in Python. The workflow covers audio preparation, English ASR, multilingual translation, timestamp generation, and SRT subtitle export. The pipeline supports long-form transcription, batch processing, and inference benchmarking on GPU-enabled runtimes.

Level

Hype check

Tap to vote and see what everyone thinks.

In this storyNvidia

#nvidia

NVIDIA Canary-1B-v2 ASR and Translation Tutorial

More to chew on!

More to chew on!