How to get structured outputs in vllm?

RunLLM · December 22, 2025, 8:58am

The OpenAI client.responses.parse method with text_format=CalendarEvent is supported in vLLM for models that implement structured output, but Qwen3 support for this exact API is not explicitly documented in vLLM’s official sources. The vLLM test suite shows this works for models that support structured outputs and JSON schema, but model-specific support (like Qwen3) may vary and is not guaranteed for all models or all API variants—most examples use client.chat.completions.create with structured_outputs or response_format parameters instead. See test_structured_output_with_parse and structured outputs docs.

Would you like a step-by-step example for Qwen3 using the recommended API?

Sources:

Hint: Mention @RunLLM in the post for followups.

Topic		Replies	Views
Does vllm automatically inject schema's information into the prompt? General	3	107	December 23, 2025
Using guided decoding for JSON General	1	690	June 26, 2025
General questions on structured output backend Structured Outputs	9	1084	September 3, 2025
Using openai compatible with `beta.chat.completions.parse` can't do tool call and structured output together General	0	122	April 6, 2025
Qwen 3 VL System Prompt General	3	1910	October 15, 2025

How to get structured outputs in vllm?

Related topics