I want to serve DeepSeek-v4 on an H100 with driver version 535.129.03, so I installed vLLM with uv pip install vllm --torch-backend=cu129, as described in the v0.20.0 release notes.
After installing, I tried to start the server with uv run vllm serve ~~, and the following error occurred:
Traceback (most recent call last):
  File "/user-volume/latest_serve/.venv/bin/vllm", line 10, in <module>
    sys.exit(main())
             ^^^^^^
  File "/user-volume/latest_serve/.venv/lib/python3.12/site-packages/vllm/entrypoints/cli/main.py", line 18, in main
    import vllm.entrypoints.cli.benchmark.main
  File "/user-volume/latest_serve/.venv/lib/python3.12/site-packages/vllm/entrypoints/cli/benchmark/main.py", line 10, in <module>
    from vllm.entrypoints.utils import VLLM_SUBCMD_PARSER_EPILOG
  File "/user-volume/latest_serve/.venv/lib/python3.12/site-packages/vllm/entrypoints/utils.py", line 19, in <module>
    from vllm.engine.arg_utils import EngineArgs
  File "/user-volume/latest_serve/.venv/lib/python3.12/site-packages/vllm/engine/arg_utils.py", line 35, in <module>
    from vllm.config import (
  File "/user-volume/latest_serve/.venv/lib/python3.12/site-packages/vllm/config/__init__.py", line 6, in <module>
    from vllm.config.compilation import (
  File "/user-volume/latest_serve/.venv/lib/python3.12/site-packages/vllm/config/compilation.py", line 22, in <module>
    from vllm.platforms import current_platform
  File "/user-volume/latest_serve/.venv/lib/python3.12/site-packages/vllm/platforms/__init__.py", line 278, in __getattr__
    _current_platform = resolve_obj_by_qualname(platform_cls_qualname)()
                        ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/user-volume/latest_serve/.venv/lib/python3.12/site-packages/vllm/utils/import_utils.py", line 109, in resolve_obj_by_qualname
    module = importlib.import_module(module_name)
             ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/local/lib/python3.12/importlib/__init__.py", line 90, in import_module
    return _bootstrap._gcd_import(name[level:], package, level)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/user-volume/latest_serve/.venv/lib/python3.12/site-packages/vllm/platforms/cuda.py", line 21, in <module>
    import vllm._C # noqa
    ^^^^^^^^^^^^^^
ImportError: libcudart.so.13: cannot open shared object file: No such file or directory
The same error occurred with --torch-backend=cu128 and other backends. How can I solve it? Using the latest CUDA version raises the error below, and I can't upgrade the driver right now:
RuntimeError: The NVIDIA driver on your system is too old (found version 12020). Please update your GPU driver by downloading and installing a new version from the URL: http://www.nvidia.com/Download/index.aspx Alternatively, go to: https://pytorch.org to install a PyTorch version that has been compiled with your version of the CUDA driver.