Using cudaHostAlloc

wilsonbt · May 9, 2011, 4:30pm

I am performing a simulation where I am concerned with how a system evolves in time. This entails that I make several memory copies from the GPU to the CPU. I was reading Cuda by Example by Sanders and Kandrot and noticed that page-locked memory might be of some use help speed up these memory copies. However, I have run into problems. Here is how I currently have the memory allocated:

x = (double*)malloc(n*sizeof(double));
y = (double*)malloc(n*sizeof(double));
z = (double*)malloc(n*sizeof(double));

cudaSetDevice(0);

cudaHostAlloc((void**)&x, n*sizeof(double), cudaHostAllocDefault);
cudaHostAlloc((void**)&y, n*sizeof(double), cudaHostAllocDefault);
cudaHostAlloc((void**)&y, n*sizeof(double), cudaHostAllocDefault);


    cudaHostGetDevicePointer (&xd, x, 0);
cudaHostGetDevicePointer (&yd, y, 0);
cudaHostGetDevicePointer (&zd, z, 0);

I initially had the malloc command commented out; however, when I tried to run the code, it segfaulted immediately. My system has 2 GPU’s, a Quardro 295 and a GTX 570 (which is device “0”), hence the cudaSetDevice. I was wondering if I am using these allocations correctly, and if so, why am I not noticing a speedup?

Thank you in advance.

Topic		Replies	Views
Low performance for CPU accessing page-locked memory? CUDA Programming and Performance	3	610	March 7, 2019
Is cudaHostAlloc() fast? CUDA Programming and Performance	5	625	March 28, 2024
malloc() + cuMemHostRegister() faster than cuMemAllocHost() CUDA Programming and Performance	0	1083	October 9, 2013
Why is cudaMallocHost() so slow? CUDA Programming and Performance	7	8871	November 17, 2021
cudaMalloc() CUDA Programming and Performance	0	838	October 9, 2013
CUDA multiple gpus page-locked memory malloc and free CUDA Programming and Performance cuda	0	308	August 14, 2020
a question about cudaMallocManaged（） CUDA Programming and Performance	4	539	November 17, 2018
CPU operation is very slow on memory allocated by cudaMallocHost CUDA Programming and Performance	0	382	October 9, 2018
CPU operation is very slow on memory allocated by cudaMallocHost TensorRT	1	830	October 8, 2018
cudaHostAlloc performance is slow CUDA Programming and Performance	1	1150	June 26, 2012

Using cudaHostAlloc

Related topics