--max-num-seqs parameter not recognized

when running:
docker run -it --gpus all --max-num-seqs 64 --ipc=host --network host -p 8000:8000 -v /home/ubuntu/vllm_models:/root/.cache/huggingface -e HUGGING_FACE_HUB_TOKEN="removed" --name vllm_download_and_serve9 vllm/vllm-openai:latest --model google/gemma-3-12b-it --served-model-name vllm/gemma-3 --max-model-len 4096 --num-scheduler-steps 10

Why are --num-scheduler-steps 10 and --max-num-seqs 64 not recognized? The error is:

unknown flag: --max-num-seqs

How do I set them?

The --max-num-seqs and --num-scheduler-steps flags are not recognized because they are vLLM server arguments, not Docker flags. In `docker run`, everything before the image name is parsed by Docker itself, and everything after the image name is passed to the container's entrypoint (here, the vLLM OpenAI-compatible server). Since --max-num-seqs appears before the image name in your command, Docker tries to interpret it and fails. Move all vLLM arguments after the image name:

docker run -it --gpus all --ipc=host --network host -p 8000:8000 \
  -v /home/ubuntu/vllm_models:/root/.cache/huggingface \
  -e HUGGING_FACE_HUB_TOKEN="removed" \
  --name vllm_download_and_serve9 \
  vllm/vllm-openai:latest \
  --model google/gemma-3-12b-it \
  --served-model-name vllm/gemma-3 \
  --max-model-len 4096 \
  --max-num-seqs 64 \
  --num-scheduler-steps 10

This way, --max-num-seqs and --num-scheduler-steps are passed to the vLLM server rather than to Docker, which matches the documented usage in the vLLM CLI guide and Docker deployment docs. Would you like more detail on the available vLLM CLI arguments or troubleshooting tips?
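As a quick sanity check, you can ask the image itself which server arguments it accepts. This assumes (as is the case for `vllm/vllm-openai`) that the image's entrypoint forwards its arguments to the vLLM OpenAI server, so `--help` is handled by vLLM rather than by Docker:

```shell
# Print the vLLM server's argument list, then exit and remove the container.
# Anything after the image name goes to the entrypoint, so --help here is
# a vLLM flag, not a Docker flag.
docker run --rm vllm/vllm-openai:latest --help

# To confirm a specific flag exists in your image's vLLM version, filter the output:
docker run --rm vllm/vllm-openai:latest --help | grep -E "max-num-seqs|num-scheduler-steps"
```

If a flag does not appear in the help output, it is not supported by the vLLM version baked into that image tag (flag availability can vary across vLLM releases).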
