-
Notifications
You must be signed in to change notification settings - Fork 3.9k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Run talk example failed #782
Comments
Apple M1 Pro |
I'll fix these in a few days |
The same is happening to me when using q2 whisper version. mr. @ggerganov I've also tried switching to 4-bit branch and it still didn't work. Besides usual "./main -m models/ggml-model-whisper-large-q4_0.bin -f file.mp4", is there anything else to be done to run quantized model? |
The same error to me when running the quantized file version: [1] 37589 segmentation fault (core dumped) ./main -m models/ggml-model-whisper-large-q4_0.bin -f |
I've tried a wav file instead of mp4 file and it didn't work. Running ggml-large.bin works but it's not the quantized one, unfortunately. |
Should be fixed now |
I am still getting the same error: (py310-whisper) whisper.cpp % ./talk -p Santa main: processing, 4 threads, lang = en, task = transcribe, timestamps = 0 ... init: found 4 capture devices: |
Hi all. The same problem on Win10 when started python script from example. I test with all win builds from 1.5.4 to 1.6.2 system_info: n_threads = 4 / 12 | AVX = 1 | AVX2 = 1 | AVX512 = 0 | FMA = 1 | NEON = 0 | ARM_FMA = 0 | METAL = 0 | F16C = 1 | FP16_VA = 0 | WASM_SIMD = 0 | BLAS = 0 | SSE3 = 1 | SSSE3 = 1 | VSX = 0 | CUDA = 0 | COREML = 0 | OPENVINO = 0 main: processing '.\jfk.wav' (176000 samples, 11.0 sec), 4 threads, 1 processors, 5 beams + best of 5, lang = en, task = transcribe, timestamps = 1 ... whisper_print_timings: load time = 324.05 ms But if run the command directly it works normal `PS C:\Users\123\CODE\test2> ..\src\utils\whisper.cpp_win\main.exe -m ggml-base.bin -f .\jfk.wav system_info: n_threads = 4 / 12 | AVX = 1 | AVX2 = 1 | AVX512 = 0 | FMA = 1 | NEON = 0 | ARM_FMA = 0 | METAL = 0 | F16C = 1 | FP16_VA = 0 | WASM_SIMD = 0 | BLAS = 0 | SSE3 = 1 | SSSE3 = 1 | VSX = 0 | CUDA = 0 | COREML = 0 | OPENVINO = 0 main: processing '.\jfk.wav' (176000 samples, 11.0 sec), 4 threads, 1 processors, 5 beams + best of 5, lang = en, task = transcribe, timestamps = 1 ... [00:00:00.000 --> 00:00:08.000] And so, my fellow Americans, ask not what your country can do for you, whisper_print_timings: load time = 366.80 ms Any suggestions? Thx |
main: processing, 4 threads, lang = en, task = transcribe, timestamps = 0 ...
init: found 1 capture devices:
init: - Capture device #0: 'MacBook Pro麦克风'
init: attempt to open default capture device ...
init: obtained spec for input device (SDL Id = 2):
init: - sample rate: 16000
init: - format: 33056 (required: 33056)
init: - channels: 1 (required: 1)
init: - samples per frame: 1024
[1] 68264 segmentation fault ./talk -p santa
➜ whisper.cpp git:(master) ✗ shasum -a 256 ./models/ggml-gpt-2-117M.bin
b457d5fcc7f2f71e727bee74298d42d80610619e02af16beca53d44a71d5f607 ./models/ggml-gpt-2-117M.bin
The text was updated successfully, but these errors were encountered: