Can the accuracy of the timestamp be improved? #255

czkoko · 2022-12-10T17:45:01Z

The timestamp of whisper is not very accurate.
The following is the comparison between Microsoft Cognitive Services Speech and whisper.

1                                    
00:00:00,120 --> 00:00:01,379 (Microsoft)    
[00:00:00.000 --> 00:00:02.000] (whisper)
2
00:00:02,120 --> 00:00:06,320 (Microsoft)  
[00:00:02.000 --> 00:00:07.500] (whisper)

The text was updated successfully, but these errors were encountered:

misutoneko · 2022-12-11T00:50:29Z

Yes, this would be much appreciated, I'm not sure how much can be done without retraining the model(s) though.
I suppose you are using the large model?
I've found the smaller models to be less accurate.

Btw for the original whisper there's the stable-ts fork, maybe that can provide some inspiration. See here:
openai/whisper#435

ggerganov · 2022-12-11T18:41:21Z

The timestamp precision is a limitation of the model. You would need some sort of pre/post-processing to improve the timestamps. But at the moment it is not clear what is the best approach.

pneyrinck · 2022-12-12T17:54:54Z

Apparently, this work has been done to improve time stamps. https://github.com/jianfch/stable-ts

ggerganov added the question Further information is requested label Dec 11, 2022

ggerganov mentioned this issue Dec 18, 2022

Improve decoding #291

Merged

FlakM mentioned this issue Dec 30, 2022

Transcriptions JupiterBroadcasting/jupiterbroadcasting.com#301

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Can the accuracy of the timestamp be improved? #255

Can the accuracy of the timestamp be improved? #255

czkoko commented Dec 10, 2022

misutoneko commented Dec 11, 2022 •

edited

Loading

ggerganov commented Dec 11, 2022

pneyrinck commented Dec 12, 2022

Can the accuracy of the timestamp be improved? #255

Can the accuracy of the timestamp be improved? #255

Comments

czkoko commented Dec 10, 2022

misutoneko commented Dec 11, 2022 • edited Loading

ggerganov commented Dec 11, 2022

pneyrinck commented Dec 12, 2022

misutoneko commented Dec 11, 2022 •

edited

Loading