icicle

AtHeartEngineer/icicle

Fork 0

mirror of https://github.com/pseXperiments/icicle.git synced 2026-01-09 15:37:58 -05:00

Commit Graph

Author	SHA1	Message	Date
Jeremy Felder	2b07513310	[FEAT]: Golang Bindings for pinned host memory (#519 ) ## Describe the changes This PR adds the capability to pin host memory in golang bindings allowing data transfers to be quicker. Memory can be pinned once for multiple devices by passing the flag `cuda_runtime.CudaHostRegisterPortable` or `cuda_runtime.CudaHostAllocPortable` depending on how pinned memory is called	2024-06-24 14:03:44 +03:00
Jeremy Felder	89082fb561	FEAT: MultiGPU for golang bindings (#417 ) ## Describe the changes This PR adds multi gpu support in the golang bindings. Tha main changes are to DeviceSlice which now includes a `deviceId` attribute specifying which device the underlying data resides on and checks for correct deviceId and current device when using DeviceSlices in any operation. In Go, most concurrency can be done via Goroutines (described as lightweight threads - in reality, more of a threadpool manager), however, there is no guarantee that a goroutine stays on a specific host thread. Therefore, a function `RunOnDevice` was added to the cuda_runtime package which locks a goroutine into a specific host thread, sets a current GPU device, runs a provided function, and unlocks the goroutine from the host thread after the provided function finishes. While the goroutine is locked to the hsot thread, the Go runtime will not assign other goroutines to that host thread	2024-03-13 16:19:45 +02:00
Jeremy Felder	e8cd2d7a98	GoLang bindings for v1.x (#386 )	2024-02-22 20:52:48 +02:00

Author

SHA1

Message

Date

Jeremy Felder

2b07513310

[FEAT]: Golang Bindings for pinned host memory (#519 )

## Describe the changes

This PR adds the capability to pin host memory in golang bindings
allowing data transfers to be quicker. Memory can be pinned once for
multiple devices by passing the flag
`cuda_runtime.CudaHostRegisterPortable` or
`cuda_runtime.CudaHostAllocPortable` depending on how pinned memory is
called

2024-06-24 14:03:44 +03:00

Jeremy Felder

89082fb561

FEAT: MultiGPU for golang bindings (#417 )

## Describe the changes

This PR adds multi gpu support in the golang bindings.

Tha main changes are to DeviceSlice which now includes a `deviceId`
attribute specifying which device the underlying data resides on and
checks for correct deviceId and current device when using DeviceSlices
in any operation.

In Go, most concurrency can be done via Goroutines (described as
lightweight threads - in reality, more of a threadpool manager),
however, there is no guarantee that a goroutine stays on a specific host
thread. Therefore, a function `RunOnDevice` was added to the
cuda_runtime package which locks a goroutine into a specific host
thread, sets a current GPU device, runs a provided function, and unlocks
the goroutine from the host thread after the provided function finishes.
While the goroutine is locked to the hsot thread, the Go runtime will
not assign other goroutines to that host thread

2024-03-13 16:19:45 +02:00

Jeremy Felder

e8cd2d7a98

GoLang bindings for v1.x (#386 )

2024-02-22 20:52:48 +02:00

3 Commits