Sped_up_audios_wtimestamps

WhisperX: Automatic Speech Recognition with Word ... - GitHub

This 2024 paper improves timestamp precision for OpenAI's Whisper model. It addresses "unsharp" timestamps caused by pauses or rapid speech by adjusting the model's tokenizer and using cross-attention scores for alignment.

This paper explores the effectiveness of combining transcripts with pitch-normalized, time-compressed speech. It specifically looks at how playback speed affects user comprehension and the accuracy of machine-generated text alignments.
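When word timestamps are produced on time-compressed audio, they usually need to be mapped back to the original timeline before they can be compared with the source recording. The sketch below is not taken from either paper. It assumes a single uniform speed factor (for example, 2.0 for audio played at double speed), and the names `Word` and `rescale_timestamps` are hypothetical, introduced only for illustration.

```python
from dataclasses import dataclass


@dataclass
class Word:
    """A transcribed word with start/end times in seconds (hypothetical structure)."""
    text: str
    start: float
    end: float


def rescale_timestamps(words, speed):
    """Map timestamps measured on sped-up audio back to the original timeline.

    Assumes the audio was compressed uniformly by `speed`, so a moment at
    time t in the fast audio corresponds to t * speed in the original.
    """
    if speed <= 0:
        raise ValueError("speed must be positive")
    return [Word(w.text, w.start * speed, w.end * speed) for w in words]


# Timestamps measured on audio played at 2x speed.
fast_words = [Word("hello", 0.0, 0.2), Word("world", 0.25, 0.5)]
original_words = rescale_timestamps(fast_words, speed=2.0)
```

A uniform factor is the simplest case. If the speed-up removes or shortens pauses non-uniformly, a single multiplier no longer holds and a piecewise mapping would be needed instead.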