What is the Perf Difference between V0 Engine (--num-scheduler-steps) vs V1 Engine?

tjtanaa · May 29, 2025, 2:54pm

Has anyone ever tried comparing the perf of V0 --num-scheduler-steps vs V1 ? Do we expect V1 is better than V0 --num-scheduler-steps?

RunLLM · May 29, 2025, 2:54pm

Yes, several users have compared vLLM V1 to V0 with --num-scheduler-steps (multi-step scheduling). Results show that V0 with multi-step scheduling can be faster for single requests, but V1 generally offers better performance for high concurrency and long-context scenarios. For example, in one benchmark, V0 with multi-step scheduling had lower latency for a single request, but V1 outperformed V0 in concurrent throughput and efficiency, especially with long prompts or many parallel requests. However, the exact performance difference can depend on workload and environment specifics. See detailed benchmarks and discussion in this issue and this issue.

Would you like a step-by-step breakdown of the benchmark results or more details on specific scenarios?

Sources:

Topic		Replies	Views
V1 has lower end-to-end performance than V0(--num-scheduler-steps=8) General	1	70	June 11, 2025
V1 has lower end-to-end performance than V0(--num-scheduler-steps=8) V1 Feedback	6	291	June 12, 2025
vllm的V1为什么删除了multi step特性 General	3	235	June 11, 2025
Performance degradation report (0.9.0.1 vs 0.10.0) General	9	188	August 18, 2025
Scheduler in vllm Features	1	150	June 26, 2025

What is the Perf Difference between V0 Engine (--num-scheduler-steps) vs V1 Engine?

Related topics