diff --git a/server/services/stt/elevenlabs.mdx b/server/services/stt/elevenlabs.mdx index 7a2d08c4..9fe0215f 100644 --- a/server/services/stt/elevenlabs.mdx +++ b/server/services/stt/elevenlabs.mdx @@ -221,7 +221,7 @@ stt = ElevenLabsRealtimeSTTService( ## Notes - **HTTP vs Realtime**: The HTTP service (`ElevenLabsSTTService`) uploads complete audio segments and is best for VAD-segmented transcription. The Realtime service (`ElevenLabsRealtimeSTTService`) streams audio over WebSocket for lower latency and provides interim transcripts. -- **Commit strategies**: The Realtime service defaults to `manual` commit strategy, where Pipecat's VAD controls when transcription segments are committed. Set `commit_strategy=CommitStrategy.VAD` to let ElevenLabs handle segment boundaries. +- **Commit strategies**: The Realtime service defaults to `manual` commit strategy, where Pipecat's VAD controls when transcription segments are committed. Set `commit_strategy=CommitStrategy.VAD` to let ElevenLabs handle segment boundaries. When using `MANUAL` commit strategy, transcription frames are marked as finalized (`TranscriptionFrame.finalized=True`). - **Keepalive**: The Realtime service sends silent audio chunks as keepalive to prevent idle disconnections (keepalive interval: 5s, timeout: 10s). - **Auto-reconnect**: The Realtime service automatically reconnects if the WebSocket connection is closed when new audio arrives.