Why does access cudaMallocManaged memory throw exception?

/example
│-- CMakeLists.txt
│-- main.cu

CMakeLists.txt:

cmake_minimum_required(VERSION 3.12)
project(SimpleCudaProject CUDA)
set(CMAKE_CUDA_STANDARD 17)
add_executable(SimpleCudaProject main.cu)

main.cu:

#include <cassert>
#include <cstdio>
#include <cstdlib>
#define CHECK(call)                                                     \
    do {                                                                \
        const cudaError_t error_code = call;                            \
        if (error_code != cudaSuccess) {                                \
            printf("CUDA Error:\n");                                    \
            printf("File: %s\n", __FILE__);                             \
            printf("Line: %d\n", __LINE__);                             \
            printf("Error code: %d\n", error_code);                     \
            printf("Error text: %s\n", cudaGetErrorString(error_code)); \
            assert(0);                                                  \
            exit(1);                                                    \
        }                                                               \
    } while (0)

__global__ void warmup() {}
int main() {
    for (size_t i = 0; i < 3; i++) {
        warmup<<<1, 1>>>();
        float* buffer;
        CHECK(cudaMallocManaged(&buffer, sizeof(float) * 48));
        buffer[0] = 1;
        cudaFree(buffer);
    }
    return 0;
}

This is so confusing. Can anyone help me?
Hardware: RTX 5070 Ti laptop
Software: latest Windows 11 and latest Visual Studio 2022

nvcc -V
nvcc: NVIDIA (R) Cuda compiler driver
Copyright (c) 2005-2025 NVIDIA Corporation
Built on Tue_May_27_02:24:01_Pacific_Daylight_Time_2025
Cuda compilation tools, release 12.9, V12.9.86
Build cuda_12.9.r12.9/compiler.36037853_0

nvidia-smi
Tue Jun 17 14:54:47 2025
+-----------------------------------------------------------------------------------------+
| NVIDIA-SMI 576.57                 Driver Version: 576.57         CUDA Version: 12.9     |
|-----------------------------------------+------------------------+----------------------+
| GPU  Name                  Driver-Model | Bus-Id          Disp.A | Volatile Uncorr. ECC |
| Fan  Temp   Perf          Pwr:Usage/Cap |           Memory-Usage | GPU-Util  Compute M. |
|                                         |                        |               MIG M. |
|=========================================+========================+======================|
|   0  NVIDIA GeForce RTX 5070 …     WDDM |   00000000:01:00.0  On |                  N/A |
| N/A   50C    P8              5W / 140W  |     616MiB / 12227MiB  |      0%      Default |
|                                         |                        |                  N/A |
+-----------------------------------------+------------------------+----------------------+

+-----------------------------------------------------------------------------------------+
| Processes:                                                                               |
|  GPU   GI   CI              PID   Type   Process name                        GPU Memory |
|        ID   ID                                                               Usage      |
|=========================================================================================|
|    0   N/A  N/A            2276    C+G   …iceHub.ThreadedWaitDialog.exe           N/A   |
|    0   N/A  N/A           12412    C+G   C:\Windows\explorer.exe                  N/A   |
|    0   N/A  N/A           29160    C     …d\Debug\SimpleCudaProject.exe           N/A   |
|    0   N/A  N/A           29220    C+G   …munity\Common7\IDE\devenv.exe           N/A   |
+-----------------------------------------------------------------------------------------+

Managed memory on Windows has limitations, which are explained in the CUDA C++ Programming Guide. On Windows (WDDM) devices concurrentManagedAccess is 0, which means the CPU must not touch any managed allocation while a kernel is running on the GPU (even a kernel that never uses that allocation); the host may access managed memory only after the device has been synchronized.
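
A quick way to confirm this on a given machine is to query the concurrentManagedAccess device attribute. A minimal sketch (assuming device 0 is the GPU in question):

#include <cstdio>
#include <cuda_runtime.h>

int main() {
    int concurrent = 0;
    // Expected to report 0 on Windows/WDDM: the host must not access managed
    // memory while the GPU is (or could still be) executing work.
    cudaDeviceGetAttribute(&concurrent, cudaDevAttrConcurrentManagedAccess, 0);
    printf("concurrentManagedAccess: %d\n", concurrent);
    return 0;
}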

Does it work if you call cudaDeviceSynchronize() before the allocation?
Does it work if you allocate outside of the loop?

This works.

int main() {
    for (size_t i = 0; i < 3; i++) {
        warmup<<<1, 1>>>();
        cudaDeviceSynchronize();
        float* buffer;
        CHECK(cudaMallocManaged(&buffer, sizeof(float) * 48));
        buffer[0] = 1;
        cudaFree(buffer);
    }
    return 0;
}

In my test, I need to allocate inside the loop.
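
For reference, a variant of the same loop (under the same assumption of non-concurrent managed access on WDDM) that also routes the kernel-launch result and the synchronize through the CHECK macro, so a failure is reported where it happens rather than as a later access violation:

int main() {
    for (size_t i = 0; i < 3; i++) {
        warmup<<<1, 1>>>();
        CHECK(cudaGetLastError());       // surface kernel-launch errors immediately
        CHECK(cudaDeviceSynchronize());  // on WDDM the host may touch managed memory only after this
        float* buffer;
        CHECK(cudaMallocManaged(&buffer, sizeof(float) * 48));
        buffer[0] = 1;
        CHECK(cudaFree(buffer));
    }
    return 0;
}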