YOLOv8 Python Script has really high inference time due unused GPU Memory

dikuerec · March 18, 2024, 10:20am

I am using YOLOv8 inference on Jetson Orin NX 16GB running Python. The GPU however does not take the load and instead the CPU is still utilized.

Screenshot from 2024-03-18 10-24-35

I imagine CUDA compatibility is an issue here.
Jetpack 6.0, but I manually installed CUDA12.2, cudnn9.0, PyTorch 2.2.1, Torchvision 0.17, CudaCompat-12.2
As per the instruction provided by NVIDIA and PyTorch.

wget https://developer.download.nvidia.com/compute/cudnn/9.0.0/local_installers/cudnn-local-tegra-repo-ubuntu2204-9.0.0_1.0-1_arm64.debsudo dpkg -i cudnn-local-tegra-repo-ubuntu2204-9.0.0_1.0-1_arm64.debsudo cp /var/cudnn-local-tegra-repo-ubuntu2204-9.0.0/cudnn-*-keyring.gpg /usr/share/keyrings/sudo apt-get updatesudo apt-get -y install cudnn
sudo apt-get -y install cudnn-cuda-12

The initilization in the script reads:

import cv2
import pandas as pd
import torch
from ultralytics import YOLO
from tracker import*
import paho.mqtt.client as mqtt
import json
import cuda

tracker = Tracker() # tracks objects and assigns IDs to them

model = YOLO(‘trial_29_yolov8_s_best.pt’)

I have read about specifically setting a target device for cuda but no torch.cuda device was detected.

torch.cuda.set_device(0)

Sample scripts I use to confirm the utilization of CUDA lead to negative results.

import torch

def print_gpu_properties():
if torch.cuda.is_available():
num_devices = torch.cuda.device_count()
print(f"Number of CUDA devices: {num_devices}“)
for i in range(num_devices):
device = torch.device(f"cuda:{i}”)
properties = torch.cuda.get_device_properties(device)
print(f"Device {i} properties:“)
print(f”\tName: {properties.name}“)
print(f”\tTotal memory: {properties.total_memory / (1024 ** 3):.2f} GB")
print(f"\tCompute capability: {properties.major}.{properties.minor}")
else:
print(“CUDA is not available on this system.”)

print_gpu_properties()

I could not verify that Jetson Orin NX was even able to use CUDA12 so that might be the most banal answer. The product documentation specifically mentions CUDA10 cores but I have a Ubuntu 22.4 Hostmachine and I dread setting up a new one for Jetpack 5.0

AastaLLL · March 19, 2024, 5:50am

Hi,

Please install our prebuilt PyTorch package instead:

The standard way to setup PyTorch on Jetson is:

Setup your system with the same JetPack (including OS, CUDA, cuDNN, … )
Install the PyTorch with the above doc, please select the package according to your JetPack version.

Thanks.

1075471989 · March 20, 2024, 1:12am

Hi AsataLLL,
I see this Pytorch is just torch but no torchvision and torchaudio. How to install them for jetson platform?

AastaLLL · March 20, 2024, 7:24am

Hi,

TorchVision and TorchAudio need to be built from the source.
You can find the building instructions below:

Or you can use our container which has all pre-installed.

Thanks.

system · April 10, 2024, 5:55am

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
CUDA Not Detected in PyTorch - Unable to Use GPU for YOLO Training Jetson Orin NX cuda , pytorch	8	1539	December 13, 2024
Unable to Connect CUDA with YOLOv8 on Jetson Xavier AGX Jetson AGX Xavier cuda , yolo	6	189	November 18, 2024
Pytorch compatibility issues (torch 2.0.0+nv23.5 && torchvision 0.15.1) Jetson Orin NX pytorch	10	16764	June 13, 2023
Unable to use cuda cores for yolo. tried everything found on internet for orin nano dev kit Jetson Orin Nano cuda , jetson	6	81	February 7, 2025
YOLOv7(8,9) on Jetson Orin Nano 4GB. RuntimeError: CUDA error: CUBLAS_STATUS_NOT_INITIALIZED when calling `cublasCreate(handle)` Jetson Orin Nano cuda	3	644	May 8, 2024
JetPack 6.0 – Missing PyTorch + CUDA Support for Jetson Inference (Need Official Wheel or Build Instructions) Jetson Orin Nano pytorch	15	130	May 26, 2025
How to develop yolov7 with Jetson AGX ORIN? Jetson AGX Orin yolo	9	1652	July 24, 2023
OSError: libcufft.so.10: cannot open shared object file: No such file or directory Jetson Orin NX cuda , pytorch	4	1387	October 14, 2024
Installing PyTorch for CUDA 10.2 on Jetson Xavier NX for YOLOv5 Jetson Xavier NX cuda , yolo	19	8846	September 21, 2021
AssertionError: CUDA unavailable, invalid device 0 requested on jetson Nano Jetson Nano cuda , yolo	8	9623	September 5, 2021

YOLOv8 Python Script has really high inference time due unused GPU Memory

Related topics