Add searchable text captions with speech to text. #14468

bagobones · 2024-10-21T03:44:30Z


Using faster-whisper or something similar, process complete video segments and either add a caption track (probably easier to play back later) or generate a caption file that can then be indexed by Frigate search.

It might be possible to leverage existing projects to add this capability.

https://github.com/McCloudS/subgen
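To make the caption-file idea concrete, here is a minimal sketch, not a Frigate implementation. It assumes the faster-whisper Python package is installed and that it can decode the audio track of the recording directly. The helper names (`format_timestamp`, `segments_to_vtt`, `transcribe_to_vtt`) and the WebVTT output format are my own choices for illustration.

```python
from typing import Iterable, Tuple

# (start seconds, end seconds, text); a hypothetical shape chosen for this sketch
Segment = Tuple[float, float, str]


def format_timestamp(seconds: float) -> str:
    """Format a time offset in seconds as a WebVTT timestamp (HH:MM:SS.mmm)."""
    millis = int(round(seconds * 1000))
    hours, millis = divmod(millis, 3_600_000)
    minutes, millis = divmod(millis, 60_000)
    secs, millis = divmod(millis, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d}.{millis:03d}"


def segments_to_vtt(segments: Iterable[Segment]) -> str:
    """Render transcribed segments as the text of a WebVTT caption file."""
    lines = ["WEBVTT", ""]
    for start, end, text in segments:
        lines.append(f"{format_timestamp(start)} --> {format_timestamp(end)}")
        lines.append(text.strip())
        lines.append("")  # blank line terminates each cue
    return "\n".join(lines)


def transcribe_to_vtt(video_path: str, model_size: str = "small") -> str:
    """Transcribe one recording and return WebVTT text.

    Assumption: faster-whisper is installed; the model size, device and
    compute type below are illustrative defaults, not recommendations.
    """
    # Imported lazily so the formatting helpers work without the dependency.
    from faster_whisper import WhisperModel

    model = WhisperModel(model_size, device="cpu", compute_type="int8")
    segments, _info = model.transcribe(video_path)
    return segments_to_vtt((s.start, s.end, s.text) for s in segments)
```

The resulting text could be saved next to the recording as a `.vtt` file for playback, and the same cue text could be fed into whatever search index is used.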

bagobones · 2024-11-05T06:35:36Z

Author

This could be combined with the existing "speech" audio detection label to efficiently determine which clips should have speech-to-text applied.
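A rough sketch of that filtering step, assuming events are available as dictionaries. The field names `id`, `label`, `has_clip` and `transcript` are hypothetical here and may not match what Frigate actually stores.

```python
from typing import Dict, List


def clips_needing_transcription(events: List[Dict]) -> List[str]:
    """Return ids of events that are labeled "speech", have a clip,
    and do not have a transcript yet (all field names are assumptions)."""
    return [
        event["id"]
        for event in events
        if event.get("label") == "speech"
        and event.get("has_clip")
        and not event.get("transcript")
    ]
```

Only the returned ids would then be passed to the speech-to-text step, so clips without detected speech are never transcribed.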


sotiris-bos · 2024-12-17T17:57:26Z


This could also be helpful: https://github.com/Carleslc/AudioToText

I would really like this feature!


bagobones · 2024-12-19T01:59:03Z

Author

Moonshine (linked below) looks small and fast, and it is available as ONNX models. I wonder whether real-time word triggers could work as part of audio detection?

https://www.reddit.com/r/LocalLLaMA/comments/1hh5y87/moonshine_web_realtime_inbrowser_speech/
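As a sketch of what a word trigger might look like once some model is producing text in near real time: the function below only does the matching step on an already transcribed chunk. The name `find_trigger_words` and the idea of a configured trigger list are assumptions for illustration, not an existing Frigate feature.

```python
import re
from typing import Iterable, List, Set


def find_trigger_words(transcript: str, triggers: Iterable[str]) -> List[str]:
    """Return configured trigger words found in a transcript chunk,
    matched case-insensitively on whole words, in order of first appearance."""
    wanted: Set[str] = {t.lower() for t in triggers}
    found: List[str] = []
    for word in re.findall(r"[a-z']+", transcript.lower()):
        if word in wanted and word not in found:
            found.append(word)
    return found
```

Each non-empty result could then raise an event the same way an audio label does today.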

