Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[BUG] 404 Client Error: Not Found for url: https://bpemb.h-its.org/multi/multi.wiki.bpe.vs100000.model #217

Closed
ajndkr opened this issue Mar 15, 2024 · 4 comments
Labels
bug Something isn't working

Comments

@ajndkr
Copy link
Contributor

ajndkr commented Mar 15, 2024

Describe the bug

AddressParser fails due to incorrect model path: https://bpemb.h-its.org/multi/multi.wiki.bpe.vs100000.model

To Reproduce

from deepparse.parser import AddressParser
address_parser = AddressParser(model_type="bpemb", device=0)

Expected behavior

I believe the class needs to download the model from the following url: https://bpemb.h-its.org/multi/multi/multi.wiki.bpe.vs1000000.model

@ajndkr ajndkr added the bug Something isn't working label Mar 15, 2024
@davebulaval
Copy link
Collaborator

Yeah, it appears they have changed the URL from https://bpemb.h-its.org/multi/multi.wiki.bpe.vs100000.model to https://bpemb.h-its.org/multi/multi/multi.wiki.bpe.vs1000000.model. The problem is not ours but BPEmb library.

It seems like an involuntary error on their side. I have opened an issue on the project repo here.

@akseljoonas
Copy link

akseljoonas commented Mar 19, 2024

Is there a fix or is the AddressParser with bpemb unusable for now?

@ajndkr
Copy link
Contributor Author

ajndkr commented Mar 19, 2024

@akseljoonas for the time being, until it is resolved on bpemb side, you can use this workaround.

mkdir -p ~/.cache/deepparse/multi/ && \
wget -O ~/.cache/deepparse/multi/multi.wiki.bpe.vs100000.model https://bpemb.h-its.org/multi/multi/multi.wiki.bpe.vs100000.model && \
wget -O multi.wiki.bpe.vs100000.d300.w2v.bin.tar.gz https://bpemb.h-its.org/multi/multi/multi.wiki.bpe.vs100000.d300.w2v.bin.tar.gz && \
tar -xzf multi.wiki.bpe.vs100000.d300.w2v.bin.tar.gz -C ~/.cache/deepparse/multi/ && rm multi.wiki.bpe.vs100000.d300.w2v.bin.tar.gz

I download these files into the cache folder before initialising AddressParser.

@davebulaval
Copy link
Collaborator

Is there a fix or is the AddressParser with bpemb unusable for now?

No, except for the @ajndkr solution.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

3 participants