How to get structured outputs in vllm?

RunLLM · December 22, 2025, 9:18am

The client.responses.parse method with the text_format parameter is supported in vLLM for models that implement structured outputs, as shown in the vLLM test suite. However, official documentation does not guarantee this works for all models, and Qwen3 support for this exact API is not explicitly confirmed—most vLLM examples use client.chat.completions.create with structured_outputs or response_format instead. See test_structured_output_with_parse.

Would you like a step-by-step example or guidance for your specific model?

Sources:

test_structured_output_with_parse

Hint: Mention @RunLLM in the post for followups.

Topic		Replies	Views
Does vllm automatically inject schema's information into the prompt? General	3	104	December 23, 2025
Using guided decoding for JSON General	1	683	June 26, 2025
General questions on structured output backend Structured Outputs	9	1071	September 3, 2025
Using openai compatible with `beta.chat.completions.parse` can't do tool call and structured output together General	0	120	April 6, 2025
Qwen 3 VL System Prompt General	3	1901	October 15, 2025

How to get structured outputs in vllm?

Related topics