Getting an error installing vLLM on Jetson Orin NX with JetPack 6.2

Jetson Orin NX 16GB


$ nvcc -V

nvcc: NVIDIA (R) Cuda compiler driver

Copyright (c) 2005-2024 NVIDIA Corporation

Built on Wed_Aug_14_10:14:07_PDT_2024

Cuda compilation tools, release 12.6, V12.6.68

Build cuda_12.6.r12.6/compiler.34714021_0

$ python

Python 3.10.0 | packaged by conda-forge | (default, Nov 20 2021, 02:50:31) [GCC 9.4.0] on linux

Type "help", "copyright", "credits" or "license" for more information.

>>> import torch

>>> print(torch.__version__)

2.6.0+cu126

>>> print(torch.cuda.is_available())

True

When I install vLLM from https://pypi.jetson-ai-lab.dev/jp6/cu126, I get the following error:


pip install vllm-0.8.6+cu126-cp310-cp310-linux_aarch64.whl

Processing ./vllm-0.8.6+cu126-cp310-cp310-linux_aarch64.whl

ERROR: Wheel 'vllm' located at /home/ygsj/dependency/vllm-0.8.6+cu126-cp310-cp310-linux_aarch64.whl is invalid.
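For what it's worth, pip reports "Wheel ... is invalid" most often when the .whl download was truncated or corrupted, since a wheel is just a ZIP archive. A minimal sketch (filename taken from the post; re-download the wheel if this reports False) to check whether the archive itself is intact:

```python
import os
import zipfile

def wheel_is_intact(path: str) -> bool:
    """Return True if the wheel file is a readable ZIP archive with no corrupt members."""
    if not zipfile.is_zipfile(path):
        return False
    with zipfile.ZipFile(path) as whl:
        # testzip() returns the name of the first corrupt member, or None if all CRCs check out
        return whl.testzip() is None

# Filename from the post; only checked if it exists in the current directory
wheel = "vllm-0.8.6+cu126-cp310-cp310-linux_aarch64.whl"
if os.path.exists(wheel):
    print(wheel_is_intact(wheel))
```

If the archive checks out but pip still rejects it, the wheel may genuinely not match the platform/Python tags, in which case the container route below avoids the problem entirely.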

Hi,

Is Docker an option for you?
We provide a vLLM container, so you don't need to install it manually.

https://hub.docker.com/r/dustynv/vllm/tags

Thanks.

Hello, your suggestion inspired me, but I couldn't get vLLM running with Docker. I encountered a

RuntimeError: operator torchvision::nms does not exist.

But vLLM did run successfully when launched through jetson-containers, via: jetson-containers run $(autotag vllm)
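For context, the "operator torchvision::nms does not exist" error usually means the torchvision build does not match the installed torch build (common when wheels come from different sources), so torchvision's custom C++ ops never get registered. A small sketch to report what is actually installed inside a container, assuming only the standard library plus whatever torch/torchvision happen to be present:

```python
import importlib
import importlib.util

def report_versions(names=("torch", "torchvision")):
    """Return 'name: version' (or 'name: not installed') for each package, without crashing."""
    lines = []
    for name in names:
        if importlib.util.find_spec(name) is None:
            lines.append(f"{name}: not installed")
        else:
            mod = importlib.import_module(name)
            lines.append(f"{name}: {getattr(mod, '__version__', 'unknown')}")
    return lines

for line in report_versions():
    print(line)
```

Comparing the two reported versions against the torch/torchvision compatibility matrix is a quick way to confirm a mismatch before digging into Docker settings.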

Hi,

jetson-containers also uses Docker to launch the container, just with some Jetson-specific settings.
Have you tried running the same command that jetson-containers generates (it can be found in the console log) to see if it works?

Thanks.

Actually, I don’t really understand Docker, but I can show you the output of my run and the versions of each package. When installing vLLM with jetson-containers, I noticed that the installed vLLM version is 0.6.6.post1+cu126.
The other package versions are: Python 3.10, torch 2.5.0, torchvision 0.20.0, transformers 4.52.0.dev0, JetPack 6.2.
This may be useful for you.

jetson-containers run -v /home/ygsj/workspace/Qwen/Qwen2___5-3B-Instruct:/model/Qwen/Qwen2___5-3B-Instruct/ dustynv/vllm:0.6.6.post1-r36.4.0
V4L2_DEVICES:

ARM64 architecture detected

Jetson Detected

SYSTEM_ARCH=tegra-aarch64

  • sudo docker run --runtime nvidia --env NVIDIA_DRIVER_CAPABILITIES=compute,utility,graphics \
      -it --rm --network host --shm-size=8g \
      --volume /tmp/argus_socket:/tmp/argus_socket \
      --volume /etc/enctune.conf:/etc/enctune.conf \
      --volume /etc/nv_tegra_release:/etc/nv_tegra_release \
      --volume /tmp/nv_jetson_model:/tmp/nv_jetson_model \
      --volume /var/run/dbus:/var/run/dbus \
      --volume /var/run/avahi-daemon/socket:/var/run/avahi-daemon/socket \
      --volume /var/run/docker.sock:/var/run/docker.sock \
      --volume /home/ygsj/dependency/jetson-containers/data:/data \
      -v /etc/localtime:/etc/localtime:ro -v /etc/timezone:/etc/timezone:ro \
      --device /dev/snd -e PULSE_SERVER=unix:/run/user/1000/pulse/native \
      -v /run/user/1000/pulse:/run/user/1000/pulse \
      --device /dev/bus/usb --device /dev/i2c-0 --device /dev/i2c-1 --device /dev/i2c-2 \
      --device /dev/i2c-4 --device /dev/i2c-5 --device /dev/i2c-7 --device /dev/i2c-9 \
      -v /run/jtop.sock:/run/jtop.sock \
      --name jetson_container_20250515_143735 \
      -v /home/ygsj/workspace/Qwen/Qwen2___5-3B-Instruct:/model/Qwen/Qwen2___5-3B-Instruct/ \
      dustynv/vllm:0.6.6.post1-r36.4.0

Hi,

Do you mind testing our latest vLLM container to see if it works?

jetson-containers run dustynv/vllm:0.8.6-r36.4-cu128-24.04

Thanks.

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.