Fix batch inference #511

Open
makaveli10 wants to merge 2 commits into collabora:main from makaveli10:fix-batch-inference

@makaveli10

Collaborator

The batched inference path in _process_multi decoded only at temperature=0.0 with no quality gating, which caused 224-token hallucination loops on short or silent inputs.
This change mirrors the per-item fallback in faster_whisper.transcribe(): failed items are re-decoded at higher temperatures, with failure judged against the compression_ratio, avg_logprob, and no_speech_prob thresholds. Items with a high no_speech_prob and a low avg_logprob are treated as silence and accepted with an empty result.
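
For reviewers, the fallback described above can be sketched for a single item roughly as follows. This is an illustration, not the code in this PR: the `Decoded` type, the `decode_with_fallback` helper, and the `decode` callback are hypothetical names, and the default thresholds are assumed to match faster_whisper's transcribe() defaults. In the batched path the same check would presumably run per item, with only the failed items re-decoded at the next temperature.

```python
import zlib
from dataclasses import dataclass
from typing import Callable, Optional, Sequence


@dataclass
class Decoded:
    """Hypothetical container for one decoding attempt."""
    text: str
    avg_logprob: float
    no_speech_prob: float


def compression_ratio(text: str) -> float:
    # Repetitive (looping) text compresses well, so its ratio is high.
    data = text.encode("utf-8")
    return len(data) / len(zlib.compress(data))


def decode_with_fallback(
    decode: Callable[[float], Decoded],  # hypothetical: decodes one item at a temperature
    temperatures: Sequence[float] = (0.0, 0.2, 0.4, 0.6, 0.8, 1.0),
    compression_ratio_threshold: float = 2.4,   # assumed default
    log_prob_threshold: float = -1.0,           # assumed default
    no_speech_threshold: float = 0.6,           # assumed default
) -> Decoded:
    best: Optional[Decoded] = None
    for temperature in temperatures:
        result = decode(temperature)

        # High no-speech probability plus low confidence: accept as silence.
        if (result.no_speech_prob > no_speech_threshold
                and result.avg_logprob < log_prob_threshold):
            return Decoded("", result.avg_logprob, result.no_speech_prob)

        too_repetitive = compression_ratio(result.text) > compression_ratio_threshold
        too_unlikely = result.avg_logprob < log_prob_threshold
        if not (too_repetitive or too_unlikely):
            return result  # passed the quality gate

        # Remember the most confident failed attempt as a last resort.
        if best is None or result.avg_logprob > best.avg_logprob:
            best = result

    if best is None:
        raise ValueError("temperatures must not be empty")
    return best
```

With only temperature 0.0 in `temperatures`, the loop degenerates to a single gated attempt, which makes it easy to compare against the previous ungated behaviour.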

Signed-off-by: makaveli10 <vineet.suryan@collabora.com>