[Bugfix] Gracefully handle huggingface hub http error #12571

ywang96 · 2025-01-30T06:43:44Z

Currently, if the Hugging Face Hub is down, users cannot start the engine even when they have all the other files available locally, because the following code always checks the hub to see whether the specified model has an encoder config.

vllm/vllm/config.py

Line 299 in f17f1d4

self.encoder_config = self._get_encoder_config()

vllm/vllm/config.py

Lines 417 to 419 in f17f1d4

    
    def _get_encoder_config(self):
        return get_sentence_transformer_tokenizer_config(
            self.model, self.revision)

This PR gracefully handles the HTTP error from the Hugging Face Hub and logs a warning if there is indeed a connection issue.
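A minimal sketch of the handling described above, not vLLM's actual code: the lookup is wrapped so that a hub HTTP failure logs a warning and yields `None` rather than aborting startup. The commits in this PR name `HfHubHTTPError` as the exception being checked; the `HubHTTPError` class and the `load_optional_config` helper below are hypothetical stand-ins so that the snippet runs without `huggingface_hub` installed.

```python
import logging
from typing import Callable, Optional

logger = logging.getLogger(__name__)


class HubHTTPError(Exception):
    """Hypothetical stand-in for the hub client's HTTP error type."""


def load_optional_config(fetch: Callable[[], dict]) -> Optional[dict]:
    """Call `fetch`; on a hub HTTP error, warn and return None.

    `fetch` is an assumed zero-argument callable that queries the hub.
    """
    try:
        return fetch()
    except HubHTTPError as err:
        # The config is optional, so a hub outage should not block startup.
        logger.warning("Could not fetch optional config from the hub: %s", err)
        return None
```

Catching only the hub's HTTP error type, rather than a bare `Exception`, keeps unrelated bugs visible while still tolerating an outage.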

IMO a proper fix for this issue would make the hub check in _get_encoder_config optional (though I'm not sure there's a good way to do so). cc @flaviabeo @maxdebayser
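One possible shape for such an optional check, purely as an illustration and not something this PR implements: skip the hub query when the model argument is a local directory or when the user has asked for offline mode. The `should_query_hub` helper is hypothetical, and treating only the value `"1"` of the `HF_HUB_OFFLINE` environment variable as "offline" is a simplifying assumption of this sketch.

```python
import os
from typing import Mapping, Optional


def should_query_hub(model: str,
                     env: Optional[Mapping[str, str]] = None) -> bool:
    """Hypothetical policy for deciding whether to contact the hub.

    Returns False when offline mode is requested or when `model` points
    at a local directory, and True otherwise.
    """
    env = os.environ if env is None else env
    # Assumption: only the literal value "1" counts as offline here.
    if env.get("HF_HUB_OFFLINE") == "1":
        return False
    # A local directory already holds whatever files exist for the model.
    return not os.path.isdir(model)
```

A caller could then run the encoder-config lookup only when this returns `True` and fall back to `None` otherwise.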

Signed-off-by: Roger Wang <[email protected]>

github-actions · 2025-01-30T06:43:56Z

👋 Hi! Thank you for contributing to the vLLM project.
Just a reminder: PRs do not trigger a full CI run by default. Instead, only the fastcheck CI runs, which covers a small, essential subset of CI tests to quickly catch errors. You can run other CI tests on top of those by going to your fastcheck build in the Buildkite UI (linked in the PR checks section) and unblocking them. If you do not have permission to unblock, ping simon-mo or khluu to add you to our Buildkite org.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can do one of these:

Add ready label to the PR
Enable auto-merge.

🚀

ywang96 · 2025-01-30T06:44:05Z

Thanks @charlesfrye for finding out the root cause!

simon-mo · 2025-01-30T07:18:05Z

Non-blocking: can we figure out a way to test this? Maybe edit /etc/hosts to block huggingface.co and see whether offline mode truly works.
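As a lighter-weight alternative to editing /etc/hosts, a unit test could simulate the outage by mocking the hub call. The sketch below swaps in `unittest.mock` for real network blocking, so it exercises only the error-handling path and not true offline mode. `HubHTTPError`, `get_encoder_config`, and the `fetch_encoder_config` client method are hypothetical names used for illustration.

```python
from unittest import mock


class HubHTTPError(Exception):
    """Hypothetical stand-in for the hub client's HTTP error type."""


def get_encoder_config(client, model):
    """Hypothetical lookup that tolerates hub HTTP failures."""
    try:
        return client.fetch_encoder_config(model)
    except HubHTTPError:
        return None


def test_hub_outage_is_tolerated():
    # Simulate an unreachable hub: every fetch raises the HTTP error.
    client = mock.Mock()
    client.fetch_encoder_config.side_effect = HubHTTPError("hub unreachable")

    assert get_encoder_config(client, "some-org/some-model") is None
    # The lookup was attempted exactly once before falling back.
    client.fetch_encoder_config.assert_called_once_with("some-org/some-model")
```

A test of this kind can run in CI without network access, whereas blocking the hostname would still be needed to confirm end-to-end offline behavior.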

maxdebayser

Thanks for fixing this!

tlrmchlsmth

🙏 🙏 🙏

ywang96 added 2 commits January 29, 2025 22:35

check for HfHubHTTPError

71c973b

Signed-off-by: Roger Wang <[email protected]>

use warning

0a394d3

Signed-off-by: Roger Wang <[email protected]>

simon-mo approved these changes Jan 30, 2025

maxdebayser approved these changes Jan 30, 2025

mgoin approved these changes Jan 30, 2025

return None

75fef94

Signed-off-by: Roger Wang <[email protected]>

comaniac approved these changes Jan 30, 2025

Review comment on vllm/transformers_utils/config.py (outdated, resolved)

comaniac enabled auto-merge (squash) January 30, 2025 16:25

github-actions bot added the ready label (ONLY add when PR is ready to merge/full CI is needed) Jan 30, 2025

typo

6da3036

Signed-off-by: Roger Wang <[email protected]>

tlrmchlsmth approved these changes Jan 30, 2025

comaniac merged commit 7a8987d into main Jan 31, 2025
46 checks passed

comaniac deleted the fix-file-download branch January 31, 2025 08:19

Isotr0py pushed a commit to Isotr0py/vllm that referenced this pull request Feb 2, 2025

[Bugfix] Gracefully handle huggingface hub http error (vllm-project#1…

4f6d675

…2571) Signed-off-by: Isotr0py <[email protected]>

youngkent pushed a commit to youngkent/vllm that referenced this pull request Feb 3, 2025

[Bugfix] Gracefully handle huggingface hub http error (vllm-project#1…

b698922

…2571)

srikanthsrnvs pushed a commit to srikanthsrnvs/vllm that referenced this pull request Feb 3, 2025

[Bugfix] Gracefully handle huggingface hub http error (vllm-project#1…

c4795ce

…2571) Signed-off-by: Srikanth Srinivas <[email protected]>

NickLucche pushed a commit to NickLucche/vllm that referenced this pull request Feb 7, 2025

[Bugfix] Gracefully handle huggingface hub http error (vllm-project#1…

ff90cdd

…2571)

ShangmingCai pushed a commit to ShangmingCai/vllm that referenced this pull request Feb 10, 2025

[Bugfix] Gracefully handle huggingface hub http error (vllm-project#1…

b7e6a68

…2571)

GWS0428 pushed a commit to GWS0428/VARserve that referenced this pull request Feb 12, 2025

[Bugfix] Gracefully handle huggingface hub http error (vllm-project#1…

bfc3c48

…2571)

panf2333 pushed a commit to yottalabsai/vllm that referenced this pull request Feb 18, 2025

[Bugfix] Gracefully handle huggingface hub http error (vllm-project#1…

ec04d58

…2571)
