Is there any way to get more specific info when Error code=2 (cudaErrorMemoryAllocation)?
e.g. something like an OpenGL debug callback that provides more details on the error.
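Right now, about the only context I can get is the error string and the current free/total device memory, e.g. something like the sketch below (only cudaGetErrorString and cudaMemGetInfo are real API calls; the helper name and the deliberately oversized request are just for illustration):

```cpp
#include <cstdio>
#include <cuda_runtime.h>

// Hypothetical helper: report the error string plus current free/total device memory.
static void reportAllocFailure(cudaError_t err, size_t requestedBytes)
{
    size_t freeBytes = 0, totalBytes = 0;
    cudaMemGetInfo(&freeBytes, &totalBytes);    // free/total device memory in bytes
    fprintf(stderr, "allocation of %zu bytes failed: %s (free=%zu, total=%zu)\n",
            requestedBytes, cudaGetErrorString(err), freeBytes, totalBytes);
}

int main()
{
    const size_t bytes = 1ULL << 35;            // deliberately oversized request (32 GiB)
    void *p = nullptr;
    cudaError_t err = cudaMalloc(&p, bytes);
    if (err != cudaSuccess) {
        reportAllocFailure(err, bytes);
        return 1;
    }
    cudaFree(p);
    return 0;
}
```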
What kind of “more specific info” are you thinking of?
Here is a typical scenario: The last-level memory allocator gets a request to allocate a block of a particular size (and maybe some additional required properties), walks its list of free blocks, and cannot find any free block that satisfies the allocation request. At this point it either returns with “allocation failed” or it may call into a lower-level allocator to increase the amount of memory it can parcel out going forward.
If the lower-level allocator has no memory available, the last-level allocator returns “allocation failed”, otherwise it satisfies the current allocation request and adds the balance of the memory just made available to it (if any) to its freelist.
In this scenario, what additional information would be useful to an application? What would the application do differently based on this information? Keep in mind that this information would be based on internal implementation artifacts of the allocator that could change at any time.
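To make the scenario concrete, here is a rough, purely illustrative sketch of such a last-level allocator walking its free list and falling back to a lower-level allocator. None of the names, sizes, or policies reflect CUDA's actual implementation:

```cpp
#include <cstddef>
#include <list>

struct Block { void *ptr; std::size_t size; };

struct LastLevelAllocator {
    std::list<Block> freeList;

    // Lower-level allocator (e.g. asking the OS/driver for more memory).
    // Returns nullptr when it has nothing left to hand out.
    void *(*lowerLevelAlloc)(std::size_t);

    void *allocate(std::size_t size) {
        // 1. Walk the free list looking for a block that satisfies the request.
        for (auto it = freeList.begin(); it != freeList.end(); ++it) {
            if (it->size >= size) {
                void *p = it->ptr;
                // Return the remainder (if any) to the free list.
                if (it->size > size)
                    freeList.push_back({static_cast<char *>(p) + size,
                                        it->size - size});
                freeList.erase(it);
                return p;
            }
        }
        // 2. No suitable free block: ask the lower-level allocator for more.
        const std::size_t kMinChunk = 1 << 20;          // made-up growth granularity
        const std::size_t chunk = (size > kMinChunk) ? size : kMinChunk;
        void *p = lowerLevelAlloc(chunk);
        if (!p)
            return nullptr;                             // "allocation failed"
        if (chunk > size)                               // keep the balance on the free list
            freeList.push_back({static_cast<char *>(p) + size, chunk - size});
        return p;
    }
};
```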
Is there a maximum number of graphics driver allocations of GPU memory in CUDA? (there is a limit in Vulkan, OpenGL)
I just wonder whether there is a MAX number of allocations; if so, after reaching that limit, CUDA might return cudaErrorMemoryAllocation even though there is actually enough contiguous GPU memory available.
Thanks! And is there any good article about the memory allocator mechanism and the different levels inside CUDA?
Generally speaking, NVIDIA does not publish internal implementation details of their software.
I have written a few simple memory allocators myself when, for some reason or other, the system-provided allocators were not to my liking. Usually this happened when the performance was lower than desired. System-provided memory allocators have no knowledge of the usage patterns of a particular app, so better performance can often be achieved when the application grabs a huge chunk of memory from the system allocator at the start, and then uses that for memory pools, buffer rings, slab allocators, etc. custom-tailored to the needs of the application. For example, an application may only need to allocate blocks of a few different sizes.
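As a minimal sketch of that approach (all names and sizes are made up, and a real pool would need alignment handling, error reporting, thread safety, etc.): a fixed-size block pool carved out of a single upfront cudaMalloc, so acquiring and releasing blocks never touches the driver again:

```cpp
#include <cstddef>
#include <vector>
#include <cuda_runtime.h>

class DeviceBlockPool {
    char               *base_ = nullptr;   // one big chunk from the system allocator
    std::size_t         blockSize_;
    std::vector<void *> free_;             // blocks currently available
public:
    DeviceBlockPool(std::size_t blockSize, std::size_t blockCount)
        : blockSize_(blockSize)
    {
        if (cudaMalloc(reinterpret_cast<void **>(&base_),
                       blockSize * blockCount) != cudaSuccess)
            return;                                   // pool stays empty on failure
        for (std::size_t i = 0; i < blockCount; ++i)
            free_.push_back(base_ + i * blockSize);
    }
    ~DeviceBlockPool() { cudaFree(base_); }

    void *acquire() {                                 // O(1), no driver call
        if (free_.empty()) return nullptr;
        void *p = free_.back();
        free_.pop_back();
        return p;
    }
    void release(void *p) { free_.push_back(p); }     // O(1), no driver call
};
```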
Not sure, but some projects such as PyTorch have their own GPU memory management, which suggests that the raw CUDA API alone is not sufficient for their needs.
It does not matter whether the memory allocator is for a CPU-based or a GPU-based programming platform: the generic allocators provided by a system are usually a grand compromise and therefore rarely optimal for any particular purpose, which is why app-specific custom allocators are quite common where performance is important.
Even when custom allocators are used, one should try to minimize allocation and de-allocation of memory inside performance-critical code sections and re-use already allocated buffers as much as possible.
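For example (the kernel and sizes below are hypothetical, just to illustrate the pattern): allocate a scratch buffer once outside the hot loop and re-use it every iteration, instead of paying for a cudaMalloc/cudaFree pair per iteration:

```cpp
#include <cuda_runtime.h>

__global__ void processKernel(float *buf, int n)    // stand-in for real work
{
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) buf[i] *= 2.0f;
}

void processMany(int iterations, int n)
{
    float *scratch = nullptr;
    cudaMalloc(&scratch, n * sizeof(float));         // one allocation up front

    for (int i = 0; i < iterations; ++i) {
        // re-use the same buffer every iteration; no cudaMalloc/cudaFree here
        processKernel<<<(n + 255) / 256, 256>>>(scratch, n);
    }
    cudaDeviceSynchronize();
    cudaFree(scratch);                               // one de-allocation at the end
}
```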