Audio Transcription Auto Detection of Language

In the current version, when I POST to /v1/audio/transcriptions without the language parameter, the response is nonsense. I'm automating speech-to-text and don't know in advance what language each audio file is in. Could you suggest an approach for this?

Currently, vLLM's /v1/audio/transcriptions endpoint does not support automatic language detection for Whisper and similar models. If you omit the language parameter, the model defaults to English, which can produce incorrect or "nonsense" transcriptions for non-English audio. The vLLM team has acknowledged this limitation and advises always specifying the language explicitly for accurate results. There is ongoing discussion and partial work toward adding language detection, but it is not yet available in a main release.
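For example, here is a minimal sketch of an explicit-language request. It assumes a vLLM OpenAI-compatible server at http://localhost:8000 serving openai/whisper-large-v3; adjust the base_url and model name to match your deployment:

```python
from openai import OpenAI

# Assumed: vLLM server started with an OpenAI-compatible endpoint,
# e.g. `vllm serve openai/whisper-large-v3`; adjust to your setup.
client = OpenAI(base_url="http://localhost:8000/v1", api_key="EMPTY")

with open("audio.mp3", "rb") as f:
    result = client.audio.transcriptions.create(
        model="openai/whisper-large-v3",  # whatever model your server loads
        file=f,
        language="de",  # ISO 639-1 code of the spoken language
    )

print(result.text)
```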

As a workaround, run an external language detection step on each audio file before sending it to vLLM, then pass the detected language as the language parameter in your transcription request. This is currently the recommended approach to avoid incorrect transcriptions and operational issues. A sketch of that detection step follows below.
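One way to do the detection locally is with a small Whisper model via the openai-whisper package (`pip install openai-whisper`); this is just one option, and any detector that returns an ISO 639-1 code would slot into the workflow the same way:

```python
import whisper

# A small model is sufficient for language identification.
detector = whisper.load_model("tiny")

def detect_language(path: str) -> str:
    # Whisper identifies the language from the first 30 seconds of audio.
    audio = whisper.pad_or_trim(whisper.load_audio(path))
    mel = whisper.log_mel_spectrogram(audio).to(detector.device)
    _, probs = detector.detect_language(mel)
    # Return the most probable language code, e.g. "de", "tr", "en".
    return max(probs, key=probs.get)

lang = detect_language("audio.mp3")
# ...then send the transcription request with language=lang,
# as in the request example shown above.
```

Running the tiny model adds some latency and a local GPU/CPU cost per file, but it is typically much cheaper than transcribing twice or fixing garbled output downstream.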
