fix: replace torch.device("cuda") with torch.device("cuda:0") in devices initialization #3184
Conversation
wdyt @sjrl ? This is what I had in mind ☝️
haystack/modeling/utils.py
Outdated
@@ -96,6 +96,12 @@ def initialize_device_settings(
n_gpu = 1
# Initializes the distributed backend which will take care of sychronizing nodes/GPUs
torch.distributed.init_process_group(backend="nccl")

# HF transformers v4.21.2 pipeline object doesn't accept torch.device("cuda"), it has to be indexed cuda device
Small typo: add "an", so it reads "to be an indexed ...".
# HF transformers v4.21.2 pipeline object doesn't accept torch.device("cuda"), it has to be indexed cuda device
# TODO eventually remove once the limitation is fixed in HF transformers
Can you make this TODO an issue, so we can keep track of it in GitHub?
Looks good! Just two small comments.
Corrected comment and opened #3185
Related Issues
Proposed Changes:
As of version 4.21.2, HF transformers pipelines only accept indexed CUDA devices. Therefore, we need to replace instances of torch.device("cuda") with torch.device("cuda:0") in the device initialization util function.
How did you test it?
The fix is trivial; I tried a small test in an interpreter but didn't add any additional unit tests.
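For illustration, here is a minimal sketch of the kind of interpreter check described above, together with the device normalization this PR applies; the helper below is hypothetical and not the actual patch:

import torch

def normalize_cuda_device(device: torch.device) -> torch.device:
    # Hypothetical helper sketching the normalization done in
    # initialize_device_settings: HF transformers v4.21.2 pipelines
    # reject a bare torch.device("cuda"), so map it to an indexed device.
    if device.type == "cuda" and device.index is None:
        return torch.device("cuda:0")
    return device

# Quick interpreter-style checks (constructing CUDA devices does not require a GPU):
assert torch.device("cuda").index is None      # un-indexed device
assert torch.device("cuda:0").index == 0       # indexed device
assert normalize_cuda_device(torch.device("cuda")) == torch.device("cuda:0")
assert normalize_cuda_device(torch.device("cpu")) == torch.device("cpu")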
Notes for the reviewer
Think of a potential scenario where the proposed fix might break.