I have been trying out ASR models like Voxtral and Whisper, but changing the prompt parameter does not seem to have any effect on the transcription. OpenAI provides a prompting guide for the Whisper model (the Whisper prompting guide). Is this supported in vLLM, and does Voxtral support the same?
For Whisper, vLLM supports the prompt parameter: the prompt is included in the decoder prompt for Whisper models, so you can use it to provide prior context or steer transcription style, as described in the OpenAI Whisper prompting guide and in vLLM's own examples and code.
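As a concrete starting point, here is a minimal sketch of passing prompt through vLLM's OpenAI-compatible /v1/audio/transcriptions endpoint. The server URL, model name (openai/whisper-large-v3), and file name are assumptions; adjust them to your setup.

```python
# Minimal sketch, assuming a vLLM server started with something like
#   vllm serve openai/whisper-large-v3
# and a local audio file sample.wav (both names are assumptions).
from openai import OpenAI

client = OpenAI(base_url="http://localhost:8000/v1", api_key="EMPTY")

with open("sample.wav", "rb") as audio:
    result = client.audio.transcriptions.create(
        model="openai/whisper-large-v3",  # must match the served model name
        file=audio,
        # Prior context / style steering, per the OpenAI Whisper prompting
        # guide: spell out rare names and demonstrate the punctuation style
        # you want the transcript to follow.
        prompt="Glossary: vLLM, Voxtral, ASR. Hello, welcome to the lecture.",
        language="en",
    )

print(result.text)
```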
For Voxtral, the prompt parameter is also accepted, but its effect may differ from Whisper's, since the architectures and training data differ. There is no explicit documentation stating that Voxtral's prompt handling in vLLM is identical to Whisper's, so results may vary and you should test prompt effectiveness for your use case (see the sketch below).
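One way to run that test is a simple A/B comparison: transcribe the same file with and without a prompt and check whether the output changes. This is a hedged sketch, assuming Voxtral is served locally (the model name mistralai/Voxtral-Mini-3B-2507, server flags, and sample.wav are all assumptions for illustration).

```python
# Hedged sketch of an A/B test for prompt effectiveness with Voxtral.
# Assumes a server started with something like
#   vllm serve mistralai/Voxtral-Mini-3B-2507
# (exact model name and serving flags depend on your setup).
from openai import OpenAI

client = OpenAI(base_url="http://localhost:8000/v1", api_key="EMPTY")
MODEL = "mistralai/Voxtral-Mini-3B-2507"  # assumption: your served model name

def transcribe(prompt: str | None = None) -> str:
    with open("sample.wav", "rb") as audio:
        kwargs = {"model": MODEL, "file": audio, "temperature": 0.0}
        if prompt is not None:
            kwargs["prompt"] = prompt
        return client.audio.transcriptions.create(**kwargs).text

baseline = transcribe()
prompted = transcribe("Glossary: vLLM, Voxtral, ASR. Use full punctuation.")

print("baseline:", baseline)
print("prompted:", prompted)
print("prompt had an effect:", baseline != prompted)
```

Setting temperature to 0 makes the comparison more meaningful: with greedy decoding, any difference between the two outputs is attributable to the prompt rather than sampling noise.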