Since Streaming Speech-to-Text models are built for low latency, they are only able to use limited historical context when making predictions. This is compared to our Async models, which can look at complete historical and future context to make predictions (since they have access to the entire audio file). As a result, Streaming models are usually a few percentage points less accurate than Async models (~2-3% absolute). Overall, the results are still quite good!Documentation Index
Fetch the complete documentation index at: https://assemblyai.com/docs/llms.txt
Use this file to discover all available pages before exploring further.
Streaming audio
How accurate is your Streaming transcription compared to Async transcription?
Can I use speaker diarization with Streaming Speech-to-Text?
Previous
How does automatically scaling concurrency for Streaming STT work?
Next