FEAT: MultiGPU for golang bindings #417

jeremyfelder · 2024-03-04T15:08:16Z

Describe the changes

This PR adds multi-GPU support to the Golang bindings.

The main changes are to DeviceSlice, which now includes a deviceId attribute specifying which device the underlying data resides on. Any operation that uses a DeviceSlice now checks that its deviceId matches the current device.
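As a rough illustration of the check described above, here is a minimal sketch that runs without a GPU. The type `deviceSliceSketch`, its fields, and the `checkDevice` method are hypothetical names for illustration only; the actual DeviceSlice in the bindings may differ, and the current device is passed in explicitly here rather than queried from the CUDA runtime.

```go
package main

import "fmt"

// deviceSliceSketch is a hypothetical stand-in for DeviceSlice.
// It records which device the underlying data is assumed to live on.
type deviceSliceSketch struct {
	deviceID int // device that owns the allocation (assumption for this sketch)
	length   int // number of elements
}

// checkDevice reports an error when the slice's device differs from the
// device that is currently set. A real binding would query the current
// device from the CUDA runtime; here it is passed in so the sketch runs
// without any GPU.
func (d deviceSliceSketch) checkDevice(currentDevice int) error {
	if d.deviceID != currentDevice {
		return fmt.Errorf("slice resides on device %d but current device is %d",
			d.deviceID, currentDevice)
	}
	return nil
}
```

Returning an error is a choice made for this sketch; a binding could equally panic on a mismatch.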

In Go, most concurrency is done via goroutines (often described as lightweight threads; in reality, the runtime acts more like a thread-pool manager). However, there is no guarantee that a goroutine stays on a specific host thread. Therefore, a function RunOnDevice was added to the cuda_runtime package. It locks a goroutine to a specific host thread, sets the current GPU device, runs a provided function, and unlocks the goroutine from the host thread after the provided function finishes. While the goroutine is locked to the host thread, the Go runtime will not assign other goroutines to that host thread.
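The lock, set device, run, unlock sequence described above can be sketched as follows. This is not the RunOnDevice added in this PR: `runOnDeviceSketch`, its signature, and the `setDevice` stub are hypothetical, and `setDevice` only validates the id so the example runs without CUDA. The calls `runtime.LockOSThread` and `runtime.UnlockOSThread` are the standard library functions for pinning a goroutine to its OS thread.

```go
package main

import (
	"fmt"
	"runtime"
)

// setDevice is a hypothetical stand-in for a binding around cudaSetDevice.
// It only validates the id so that this sketch runs without a GPU.
func setDevice(deviceID, deviceCount int) error {
	if deviceID < 0 || deviceID >= deviceCount {
		return fmt.Errorf("invalid device id %d (have %d devices)", deviceID, deviceCount)
	}
	return nil
}

// runOnDeviceSketch pins the calling goroutine to its OS thread, selects
// the device, runs fn, and releases the thread when fn returns.
// The current CUDA device is per-host-thread state, which is why the
// goroutine must not migrate between threads while fn runs.
func runOnDeviceSketch(deviceID, deviceCount int, fn func(deviceID int)) error {
	runtime.LockOSThread()
	defer runtime.UnlockOSThread()

	if err := setDevice(deviceID, deviceCount); err != nil {
		return err
	}
	fn(deviceID)
	return nil
}
```

A caller would typically start one goroutine per device and call a function like this inside each, so that every device's work stays on its own pinned host thread.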

DmytroTym · 2024-03-05T13:13:31Z

@jeremyfelder one thing I found really useful when designing the Rust-side device slices is the ability to look up the device id of any device pointer using the CUDA runtime's cudaPointerGetAttributes function. This avoids having to store the device id explicitly alongside the actual data.

wrappers/golang/core/slice.go

wrappers/golang/cuda_runtime/device_context.go

ImmanuelSegol

It's the same design as the Rust version of multi-GPU, so no comments on design.

The code looks good. No blocking comments, so I just added some suggestions.

ImmanuelSegol · 2024-03-12T12:32:00Z

wrappers/golang/core/slice.go

+}
+
+// CheckDevice is used to ensure that the DeviceSlice about to be used resides on the currently set device
+func (d DeviceSlice) CheckDevice() {


Honestly, I think the name is confusing.

ImmanuelSegol · 2024-03-12T16:23:58Z

wrappers/golang/internal/generator/templates/curve.go.tmpl

 	return convert{{if .IsG2}}G2{{end}}ProjectivePointsMontgomery(points, true)
 }

 func {{if .IsG2}}G2{{end}}ProjectiveFromMontgomery(points *core.DeviceSlice) cr.CudaError {
+	points.CheckDevice()
 	return convert{{if .IsG2}}G2{{end}}ProjectivePointsMontgomery(points, false)
 }
 {{end}}


missing extra line

ImmanuelSegol · 2024-03-12T16:24:11Z

wrappers/golang/internal/generator/templates/scalar_field.go.tmpl

 	return convertScalarsMontgomery(scalars, true)
 }

 func FromMontgomery(scalars *core.DeviceSlice) cr.CudaError {
+	scalars.CheckDevice()
 	return convertScalarsMontgomery(scalars, false)
 }{{- end}}


jeremyfelder added 4 commits March 4, 2024 10:36

Add multigpu support in golang bindings

9be750d

Fix locking to device for thread

9bc2b00

Template multigpu changes

12218af

golang formatting

6806620

jeremyfelder requested review from ImmanuelSegol, bigsky77 and LeonHibnik March 4, 2024 15:22

Fix g2 tests

a4eb950

jeremyfelder added 2 commits March 6, 2024 13:02

Use pointerAttribute to track DeviceSlice's associated device. Make all device association managed implicitly

7e35f31

formatting

afa9617

jeremyfelder force-pushed the feat/golang/multigpu branch from 139fcab to afa9617 Compare March 6, 2024 11:08

ImmanuelSegol suggested changes Mar 6, 2024

wrappers/golang/core/slice.go

wrappers/golang/cuda_runtime/device_context.go

jeremyfelder added 2 commits March 6, 2024 16:54

Add getter for device id on DeviceContext

ae69f5b

Add comment and print err code

1f05c1c

jeremyfelder requested a review from ImmanuelSegol March 7, 2024 07:30

ImmanuelSegol approved these changes Mar 12, 2024

LeonHibnik approved these changes Mar 13, 2024

LeonHibnik merged commit 89082fb into main Mar 13, 2024
21 checks passed

LeonHibnik deleted the feat/golang/multigpu branch March 13, 2024 14:19
