-
Notifications
You must be signed in to change notification settings - Fork 1.9k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[S2T] Whisper ASR Model excution got ValueError #2818
Comments
目前模型不能支持太长的语音数据,不建议手动修改self.max_len参数。如果语音过长建议使用vad工具对语音进行切分。 |
引起这个错误的是切割后的短语音, 长度在 8 秒左右 |
方便提供一下可以复现的音频吗? |
好的 这里是复现音频的链接 https://huggingface.co/datasets/cxumol/temp-data/resolve/main/000-0.950-8.850.wav import paddlespeech.cli.whisper
import paddle
audio_file = '000-0.950-8.850.wav'
whisper_executor = paddlespeech.cli.whisper.WhisperExecutor()
result = whisper_executor(
model='whisper',
task='transcribe',
size='large',
sample_rate=16000,
config=None, # Set `config` and `ckpt_path` to None to use pretrained model.
ckpt_path=None,
audio_file=audio_file,
language='ja',
device=paddle.get_device()) |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Describe the bug
A clear and concise description of what the bug is.
To Reproduce
Steps to reproduce the behavior:
Environment (please complete the following information):
@zxcd
The text was updated successfully, but these errors were encountered: