Features Speculative Decoding
| Topic | Replies | Views | Activity | |
|---|---|---|---|---|
|
About the Speculative Decoding category
|
|
0 | 50 | March 20, 2025 |
|
Qwen3.5-27B-FP8 Speculative Decoding
|
|
2 | 75 | March 12, 2026 |
|
Distributed Speculative Decoding using Ray
|
|
3 | 45 | February 11, 2026 |
|
Standalone draft model spec decode support in v0.x and v1
|
|
3 | 91 | January 20, 2026 |
|
Does vllm support draft model use tp>1 when I use speculative decoding
|
|
1 | 134 | July 29, 2025 |
|
Improving Speculative Decoding for Beginning Tokens & Structured Output
|
|
1 | 133 | July 16, 2025 |
|
Question: Specifying Medusa Choice Tree in vllm
|
|
1 | 89 | July 11, 2025 |
|
How to use speculative decoding?
|
|
3 | 766 | May 1, 2025 |
|
Spec decode with eagle get very low Draft acceptance rate
|
|
1 | 320 | April 25, 2025 |
|
Goodput Guided Speculative Decoding
|
|
2 | 217 | April 19, 2025 |
|
Why remove bonus token of requset in draft model?
|
|
0 | 54 | March 30, 2025 |