How to obtain the logprob of a specified token in a step?

RunLLM · October 24, 2025, 2:08pm

If the top token’s probability is always near 1.0, it means the model is extremely confident in its prediction, assigning almost all probability mass to a single token. This can happen if the prompt context is highly deterministic or if the model’s logits are very peaked. However, with temperature=1.0, you would typically expect more distributed probabilities unless the model output is inherently certain. This is not a bug in vLLM; it’s a reflection of the model’s output distribution for your specific prompt and context, as discussed in issue #2613.

Would you like more detail or help diagnosing your prompt/context?

Sources:

Hint: Mention @RunLLM in the post for followups.

Topic		Replies	Views
How to get probs directly General	11	880	July 28, 2025
Given a completion text for a fixed prompt text, how to calculate the log_prob of the completion text General	1	66	March 11, 2026
Numerical Difference between vLLM logprobs and huggingface logprobs RL Integration	7	4581	April 4, 2025
What is the purpose of prompt logprobs? General	5	1129	October 27, 2025
How to obtain the logits of LLM General	31	2600	July 21, 2025

How to obtain the logprob of a specified token in a step?

Related topics