Vllm0.11.0适配的llmcompressor 版本是什么

sxcnmslll · January 5, 2026, 5:05am

我使用下面这个镜像执行Qwen3vl模型的量化，请问我应该安装哪个版本的llmcompressor

modelscope-registry.cn-hangzhou.cr.aliyuncs.com/modelscope-repo/modelscope:ubuntu22.04-cuda12.9.1-py311-torch2.8.0-vllm0.11.0-modelscope1.32.0-swift3.11.1

RunLLM · January 5, 2026, 5:06am

看起来你使用的镜像（modelscope:ubuntu22.04-cuda12.9.1-py311-torch2.8.0-vllm0.11.0-modelscope1.32.0-swift3.11.1）已经集成了vLLM 0.11.0和modelscope 1.32.0，但关于llmcompressor的具体版本，数据源中没有明确说明Qwen3-VL量化所需llmcompressor的推荐版本。modelscope仓库通常会有llmcompressor的最新兼容版本，但建议优先选择与modelscope 1.32.0兼容的llmcompressor版本。

我的理解是，建议你在容器内通过pip安装llmcompressor，并优先选择modelscope官方文档或仓库中与1.32.0兼容的最新稳定版。如果有modelscope官方的requirements.txt或示例代码，按其指定的llmcompressor版本安装最为保险。如需进一步确认具体版本号，建议查阅modelscope官方文档或相关Qwen3-VL量化指南。目前数据源未直接给出llmcompressor的推荐版本号。如需详细操作步骤，请告知。

Sources:

Qwen3-VL Usage Guide - vLLM Recipes

Hint: Mention @RunLLM in the post for followups.

Topic		Replies	Views
A bit of frustration with Quantization Quantization	5	559	October 14, 2025
Gemma 3 Quantization General	5	1342	June 21, 2025
RuntimeError: Int8 not supported on SM120. Use FP8 quantization instead, or run on older arch (SM < 100) NVIDIA GPU Support	3	103	November 27, 2025
Can Support Qwen3-VL or Qwen2.5 VL 72B on Vllm-ascend 0.11.0? Ascend Support	2	177	November 20, 2025
求救，vllm怎么安装cuda13.0 General	3	585	November 11, 2025

Vllm0.11.0适配的llmcompressor 版本是什么

Related topics