NVIDIA just dropped Nemotron Speech ASR, an open-source transcription model built specifically for low-latency scenarios like voice agents and live captioning. The 0.6B-parameter model uses a FastConformer encoder with an RNN-T decoder and is optimized for both streaming and batch workloads. Nice to see NVIDIA continuing to open-source competitive speech models.