Is guided decoding used during automatic function calling (--enable-auto-tool-choice)?

heyzude · May 27, 2025, 1:43pm

Is guided decoding used during automatic function calling (–enable-auto-tool-choice)?
As I read vllm doc (Tool Calling - vLLM), it seems that guided decoding is enabled by default only when named function calling or required function calling is used.

RunLLM · May 27, 2025, 1:44pm

Guided decoding is not used during automatic function calling (--enable-auto-tool-choice). Guided decoding is only enabled by default for named function calling (when you specify a function in tool_choice) and for required function calling (tool_choice="required"). For automatic tool choice (tool_choice="auto"), vLLM relies on the model’s own output and does not enforce the schema via guided decoding, so the output may not always strictly match the tool schema. This is explicitly stated in the official documentation and tool calling docs.

Would you like more detail or code examples?

Sources:

https://github.com/vllm-project/vllm/blob/main/docs/features/tool_calling.md

Topic		Replies	Views
Tool calling using Offline Inference? Tool Calling	1	38	April 14, 2025
Specifying special tokens Structured Outputs	5	64	May 8, 2025
Goodput Guided Speculative Decoding Speculative Decoding	2	102	April 19, 2025
How to use speculative decoding? Speculative Decoding	3	77	May 1, 2025
Data Parallel Attention Cuda Graph General	1	20	May 26, 2025

Is guided decoding used during automatic function calling (--enable-auto-tool-choice)?

Related topics