-
Notifications
You must be signed in to change notification settings - Fork 27.7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Whisper v3 dependency issue #28156
Comments
Hi @lionsheep0724, thanks for raising this issue! The most recent version of transformers is compatible with tokenizers==0.15. Could you try reinstalling transformers?
For the error message, could you share the full traceback? |
Hi @amyeroberts, sorry for late response, I was in year-end vacation.
But same result as follows(full traceback):
And refer to my test code:
|
@lionsheep0724 Could you confirm the versions of transformers and tokenizers in your environment?
And in the python environment:
|
@amyeroberts |
Let me share my troubleshooting result. |
Another finding : ubuntu 18.04 version(pytorch/pytorch:1.13.1-cuda11.6-cudnn8-runtime image) also has same issue. |
Thanks for updating @lionsheep0724! Across different platforms - when working and not working - do you see the same versions of |
Yes, I installed libraries using same method and the versions were same. |
@lionsheep0724 Hmmmm - I honestly have no idea what's happening here. I am to run without issue on my ubuntu machine and mac. My best guess is that the version of transformers being run in the python environment isn't the same as the one being installed by pip. The version restrictions seen in the warning message were changed with #23909 and have been part of the library since v4.34. You can check which version is being run using the python command I posted above. If you're running in an ipython environment, you'll need to make sure you're using the same libraries installed by pip. Running: import x
print(x.__version__) in the python environment should confirm if this is what's happening. |
@amyeroberts |
@lionsheep0724 Thanks for the update! |
This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread. Please note that issues that do not follow the contributing guidelines are likely to be ignored. |
Hi, I faced the same error "Wrong index found for <|0.02|>: should be None but found 50366" Tried this: |
I solve the same problem by upgrade transformers to latest version : transformers==4.41.2 |
System Info
pip install --upgrade git+https://github.com/huggingface/transformers.git accelerate datasets[audio]
, which instructed in hereWho can help?
No response
Information
Tasks
examples
folder (such as GLUE/SQuAD, ...)Reproduction
Expected behavior
ImportError: tokenizers>=0.11.1,!=0.11.3,<0.14 is required for a normal functioning of this module, but found tokenizers==0.15.0.
ValueError: Non-consecutive added token ‘<|0.02|>’ found. Should have index 50365 but has index 50366 in saved vocabulary.
error occurrs.The text was updated successfully, but these errors were encountered: