Offline test improvements #150

palana · 2024-08-07T11:35:33Z

These changes make the offline test deterministic in my testing, i.e. the same input file produces the same segment_labels in segments.json

this should mostly not make a difference, but feels semantically more correct

There are two issues here: 1. `line_size` may contain padding (didn't happen in my tests) 2. from: https://git.ffmpeg.org/gitweb/ffmpeg.git/blob/2b5f000d3f6f9e737e918a5438e6c881f65e70e2:/libavutil/frame.h#l405 > For audio, only linesize[0] may be set. For planar audio, each > channel plane must be the same size.

This kind of behaves like libobs, where each chunk of audio is inspected individually by VAD/whisper, until processing of either takes longer than the window length, in which case audio continues to stream in

palana · 2024-08-07T12:01:03Z

src/tests/localvocal-offline-test.cpp

+
+					// sleep up to window size in case whisper is processing, so the buffer builds up similar to OBS
+					auto now = std::chrono::system_clock::now();
+					if (false && now > max_wait)


kind of undecided if this false should be a parameter, or whether the max wait should be removed

for running tests deterministically between machines you currently want to always only feed a single chunk of audio into gf->input_buffers

honoring max_wait gives you something that is closer to the "plugin within obs experience", i.e. while whisper inference (etc) is running (which takes longer than the wait time on my machine), audio buffers continue to fill up, so on the next run of the whisper loop a bigger chunk of audio is fed into VAD

royshil

looks great!

royshil · 2024-08-08T15:55:44Z

src/tests/localvocal-offline-test.cpp

 	std::time_t now_time_t = std::chrono::system_clock::to_time_t(now);
 	std::tm now_tm = *std::localtime(&now_time_t);


think we don't need this anymore right?

I kind of like having both timestamps available, local time to orient myself on when a particular run happened and "running time" to compare relative timing within a run

palana added 6 commits August 7, 2024 13:40

look at the front of the whisper buffer instead of the back

a9120a6

this should mostly not make a difference, but feels semantically more correct

Initialize resampled_buffer for offline tests

12db51e

log running time in addition to local time

befdd65

Run whisper test "as fast as possible"

407ac47

This kind of behaves like libobs, where each chunk of audio is inspected individually by VAD/whisper, until processing of either takes longer than the window length, in which case audio continues to stream in

Only ever send a single chunk of audio

2c42001

palana force-pushed the offline-test-improvements branch from 59800a1 to 6e5a8af Compare August 7, 2024 11:40

Add additional files to tests copy command

e9581f3

palana force-pushed the offline-test-improvements branch from 6e5a8af to e9581f3 Compare August 7, 2024 11:48

palana commented Aug 7, 2024

View reviewed changes

palana added 2 commits August 8, 2024 14:02

Use condition variable to signal input thread if available

b5f994f

Only wait in whisper thread if input buffers are empty

bb73bcf

palana force-pushed the offline-test-improvements branch from 6414214 to bb73bcf Compare August 8, 2024 12:02

royshil approved these changes Aug 8, 2024

View reviewed changes

royshil merged commit 6cc88b1 into locaal-ai:master Aug 9, 2024
9 checks passed

palana deleted the offline-test-improvements branch August 13, 2024 12:04

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Offline test improvements #150

Offline test improvements #150

palana commented Aug 7, 2024

palana Aug 7, 2024

royshil left a comment

royshil Aug 8, 2024

palana Aug 9, 2024

		std::time_t now_time_t = std::chrono::system_clock::to_time_t(now);
		std::tm now_tm = *std::localtime(&now_time_t);

Offline test improvements #150

Offline test improvements #150

Conversation

palana commented Aug 7, 2024

palana Aug 7, 2024

Choose a reason for hiding this comment

royshil left a comment

Choose a reason for hiding this comment

royshil Aug 8, 2024

Choose a reason for hiding this comment

palana Aug 9, 2024

Choose a reason for hiding this comment