How to create a dynamically sized array on the device?

Hi all,

I have a question about how to create a dynamically sized array inside a kernel function in device-side code. What I want is to create an array in thread-local memory.

I tried to use cudaMalloc(…), but the compiler says:
“calling a host function from a device/global function is only allowed in device emulation mode.”

I wonder if I can achieve the above thread-local memory allocation in device-side code.
If I can't, is there any way I can allocate thread-local memory from host-side code?

Thanks for looking!

-timothy

In short: no, you can't.

Only global memory is accessible from the host, and the amount of local/shared memory used by a thread is fixed. See the programming guide for details on the different types of memory and what you can and cannot do with each.

You can dynamically allocate shared memory (that is, at kernel call time, not from the kernel itself). See sections 4.2.2.3 and 4.2.3 in the Programming Guide (1.1).
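A minimal sketch of what that looks like, assuming an illustrative kernel (`scale`, `tile`, and the sizes are my own names, not from the guide): the third argument of the `<<<...>>>` launch configuration sets the size in bytes of the dynamically allocated shared memory region, which the kernel sees through an `extern __shared__` declaration.

```cuda
#include <cstdio>

__global__ void scale(float *data, float factor)
{
    // The size of this array is not known at compile time; it is
    // whatever the third <<<...>>> launch argument specifies.
    extern __shared__ float tile[];

    int i = blockIdx.x * blockDim.x + threadIdx.x;
    tile[threadIdx.x] = data[i] * factor;
    __syncthreads();
    data[i] = tile[threadIdx.x];
}

int main()
{
    const int threads = 256;
    float *d_data;
    cudaMalloc((void **)&d_data, threads * sizeof(float));

    // Request threads * sizeof(float) bytes of shared memory per
    // block, decided here at kernel call time:
    scale<<<1, threads, threads * sizeof(float)>>>(d_data, 2.0f);

    cudaFree(d_data);
    return 0;
}
```

Note that there is only one such `extern __shared__` region per kernel launch; if you need several arrays, you have to carve them out of that one region yourself with pointer arithmetic.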

Thanks, guys. It seems I can't do that per thread, though — even shared memory is accessible to all threads in a block, according to the specification.

It's trivial to allocate local memory on the device or host; you just have to do it yourself :) E.g., you can allocate a bunch of local memory upfront and then write a simple allocator to use inside the kernel. Or you can allocate a dynamic amount of global memory from the host and then index into it by threadIdx (you can even use C++ operator overloading to make it all transparent).
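Here is a sketch of the second idea (the names `SCRATCH_PER_THREAD`, `scratch`, and the kernel are illustrative, not from the post): the host allocates one big global-memory buffer, and each thread slices off its own private piece by thread index, which then behaves like a per-thread array.

```cuda
// Illustrative per-thread scratch size (elements, not bytes).
#define SCRATCH_PER_THREAD 16

__global__ void use_scratch(float *scratch)
{
    // Each thread carves out its own slice of the global allocation,
    // giving it a private array without any device-side malloc.
    int tid = blockIdx.x * blockDim.x + threadIdx.x;
    float *mine = scratch + tid * SCRATCH_PER_THREAD;

    for (int i = 0; i < SCRATCH_PER_THREAD; ++i)
        mine[i] = (float)(tid + i);   // use it like a local array
}

int main()
{
    const int blocks = 4, threads = 128;
    float *d_scratch;

    // One slice per thread, allocated from the host before launch.
    cudaMalloc((void **)&d_scratch,
               blocks * threads * SCRATCH_PER_THREAD * sizeof(float));

    use_scratch<<<blocks, threads>>>(d_scratch);

    cudaFree(d_scratch);
    return 0;
}
```

The "amount" of per-thread memory here is only dynamic in the sense that the host picks the slice size at allocation time; within one launch every thread gets the same slice.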

So, how do you allocate local memory upfront?

I meant statically. Just grab a big ol' chunk. Unfortunately, as I wrote in the other thread, that can kill performance if you get into the kilobytes for some unfathomable reason.
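For completeness, the static version is just a fixed-size array declared inside the kernel (a sketch; `N` and the kernel body are illustrative). Small arrays indexed with compile-time-constant indices can live in registers; large ones spill to local memory, which resides in slow off-chip memory — that is the kilobyte-scale performance cliff mentioned above.

```cuda
#define N 8   // illustrative fixed size, known at compile time

__global__ void static_local(float *out)
{
    // Statically allocated per-thread array. If N grows into the
    // kilobytes, this spills to local memory and gets slow.
    float buf[N];

    for (int i = 0; i < N; ++i)
        buf[i] = (float)(threadIdx.x * N + i);

    float sum = 0.0f;
    for (int i = 0; i < N; ++i)
        sum += buf[i];

    out[blockIdx.x * blockDim.x + threadIdx.x] = sum;
}
```
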