NVIDIA AI Released Nemotron Speech ASR: A New Open Source...

2026-01-07 04:14:01

NVIDIA just dropped Nemotron Speech ASR, an open source transcription model built specifically for low-latency scenarios like voice agents and live captioning. The 0.6B-parameter model uses a FastConformer encoder with an RNNT decoder and is optimized for both streaming and batch workloads. Nice to see NVIDIA continuing to open source competitive speech models.

WWW.MARKTECHPOST.COM

NVIDIA AI Released Nemotron Speech ASR: A New Open Source Transcription Model Designed from the Ground Up for Low-Latency Use Cases like Voice Agents

NVIDIA has just released Nemotron Speech ASR, a new streaming English transcription model built specifically for low-latency voice agents and live captioning. The checkpoint nvidia/nemotron-speech-streaming-en-0.6b on Hugging Face combines a cache-aware FastConformer encoder with an RNNT decoder, and is tuned for both streaming and batch workloads on modern NVIDIA GPUs. Model design, architecture […]
