The official vLLM guide recommends serving with --chat-template examples/tool_chat_template_gemma4.jinja for reasoning and tool use (Gemma 4 Usage Guide - vLLM Recipes). However, I've noticed a bug: when this template is applied, reasoning is omitted during tool calls, and removing the option fixes the issue (v0.20.0). Is it safe to serve without this specific chat template, or is it strictly required for correct model behavior?
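For reference, here is a minimal sketch of the two invocations I am comparing; the model name below is a placeholder, not the exact checkpoint I'm serving:

```sh
# With the recipe's template: reasoning is dropped from tool-call turns.
vllm serve google/gemma-4 \
  --chat-template examples/tool_chat_template_gemma4.jinja

# Without --chat-template: vLLM falls back to the tokenizer's bundled chat
# template, and reasoning shows up again during tool calls.
vllm serve google/gemma-4
```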
The official vLLM documentation recommends using --chat-template examples/tool_chat_template_gemma4.jinja for Gemma 4 because the template is written to match vLLM's reasoning and tool call parsers. Serving without it may work in simple cases, but it can lead to subtle bugs or degraded tool use: the default HuggingFace template does not necessarily mark reasoning and tool call boundaries the way vLLM's parsers expect, especially in multi-turn or streaming scenarios. See Gemma 4 vLLM Recipe and Gemma 4 Reasoning/Tool Use.
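If you want to see exactly what the server falls back to when the flag is omitted, you can print the tokenizer's bundled chat template; the model name is again a placeholder:

```sh
# Print the chat template vLLM uses when --chat-template is not passed.
python -c "from transformers import AutoTokenizer; print(AutoTokenizer.from_pretrained('google/gemma-4').chat_template)"
```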
If you observe reasoning being omitted during tool calls with the recommended template, this is a known issue that has been discussed and patched in recent vLLM/Gemma 4 updates (PR #39027, issue #39885). Upgrading to the latest vLLM main branch and making sure your model's chat template is up to date is advised. The template itself is not strictly required, but serving without it may result in unpredictable or unsupported behavior, especially for advanced features. Would you like a step-by-step breakdown of the bug and template interactions?
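As a quick check on your build, you can send a tool call request directly to the OpenAI-compatible endpoint and inspect whether the reasoning field comes back alongside the tool call. A minimal sketch, assuming the server listens on localhost:8000 and using a made-up get_weather tool:

```sh
# Request a completion that should trigger a tool call; get_weather is a
# hypothetical example tool, not part of vLLM or Gemma 4.
curl -s http://localhost:8000/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
    "model": "google/gemma-4",
    "messages": [{"role": "user", "content": "What is the weather in Paris?"}],
    "tools": [{
      "type": "function",
      "function": {
        "name": "get_weather",
        "description": "Get the current weather for a city",
        "parameters": {
          "type": "object",
          "properties": {"city": {"type": "string"}},
          "required": ["city"]
        }
      }
    }]
  }'
```

In the JSON response, look at choices[0].message: with the bug present, tool_calls is populated but the reasoning field (reasoning_content when a reasoning parser is enabled) comes back empty or missing.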