CUDA pinned host memory and OS-level shared memory

Ziqi · October 22, 2021, 3:08pm

We are developing an application in which data has to be shared from a parent process to child processes through OS-level shared memory (not CUDA shared memory). Each child process allocates CUDA pinned memory and copies the data shared by its parent into it for subsequent kernel launches. Is there a way to create the OS-level shared memory directly as CUDA pinned memory? That would make the copy from OS shared memory into pinned memory unnecessary, and we might get some performance improvement.

striker159 · October 22, 2021, 4:01pm

Does cudaHostRegister work? See CUDA Runtime API :: CUDA Toolkit Documentation.
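To make the suggestion concrete, here is a minimal sketch of what it could look like, assuming Linux with POSIX shared memory (shm_open/mmap). The helper names map_and_pin_shm and unpin_and_unmap_shm and the object name used in the usage note are made up for illustration; only the POSIX calls, cudaHostRegister and cudaHostUnregister are real APIs. Each process maps the same shared memory object and then page-locks its own mapping, so the intermediate copy into a separate pinned buffer is no longer needed. I have not measured whether this is actually faster in your setup.

```cpp
#include <cuda_runtime.h>
#include <fcntl.h>      // O_CREAT, O_RDWR
#include <sys/mman.h>   // shm_open, mmap, munmap
#include <sys/types.h>  // off_t
#include <unistd.h>     // ftruncate, close
#include <cstddef>
#include <cstdio>

// Hypothetical helper: maps a POSIX shared memory object of `bytes` bytes
// and page-locks the mapping with cudaHostRegister.
// Returns the host pointer, or nullptr on failure.
void* map_and_pin_shm(const char* name, size_t bytes) {
    int fd = shm_open(name, O_CREAT | O_RDWR, 0600);
    if (fd < 0) { perror("shm_open"); return nullptr; }

    // Size the object; harmless if another process already did so.
    if (ftruncate(fd, static_cast<off_t>(bytes)) != 0) {
        perror("ftruncate");
        close(fd);
        return nullptr;
    }

    void* p = mmap(nullptr, bytes, PROT_READ | PROT_WRITE, MAP_SHARED, fd, 0);
    close(fd);  // the mapping stays valid after the descriptor is closed
    if (p == MAP_FAILED) { perror("mmap"); return nullptr; }

    // Page-lock the existing mapping so CUDA can treat it as pinned memory.
    cudaError_t err = cudaHostRegister(p, bytes, cudaHostRegisterDefault);
    if (err != cudaSuccess) {
        fprintf(stderr, "cudaHostRegister: %s\n", cudaGetErrorString(err));
        munmap(p, bytes);
        return nullptr;
    }
    return p;
}

// Hypothetical helper: undoes map_and_pin_shm for this process.
void unpin_and_unmap_shm(void* p, size_t bytes) {
    cudaHostUnregister(p);
    munmap(p, bytes);
}
```

A child process would call something like map_and_pin_shm("/my_shm", size) once at startup and then pass the returned pointer straight to cudaMemcpyAsync. As far as I know, the registration applies only to the process that made the call, so each child has to register its own mapping, and if you use fork() it is safer to do this (and any other CUDA call) in the child after the fork.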
