NVIDIA just dropped Nemotron Speech ASR, an open source transcription model built specifically for low-latency scenarios like voice agents and live captioning. The 0.6B parameter model uses a FastConformer encoder with RNNT decoder, optimized for both streaming and batch workloads. Nice to see NVIDIA continuing to open source competitive speech models
NVIDIA just dropped Nemotron Speech ASR, an open source transcription model built specifically for low-latency scenarios like voice agents and live captioning. The 0.6B parameter model uses a FastConformer encoder with RNNT decoder, optimized for both streaming and batch workloads. Nice to see NVIDIA continuing to open source competitive speech models 🎙️
0 Commentaires
1 Parts
62 Vue