I want to set "\n" as an end token, like <|im_end|> and <|endoftext|>. What should I do?
To set a custom stop string (e.g., "\n") in vLLM, add it to the `stop` list in your `SamplingParams` or API request. For example, use `stop=["\n", "<|im_end|>", "<|endoftext|>"]` in `SamplingParams`, or include it in the `stop` field of your API payload. Generation then halts as soon as any of these strings is produced, with no need to modify the tokenizer or model files. This approach is confirmed in multiple vLLM discussions about custom and special stop tokens, e.g., for Qwen and Llama models.
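For intuition, stop strings behave roughly like the post-hoc truncation below: output ends at the first occurrence of any stop string, and by default the stop string itself is not included in the returned text. This is a simplified sketch of the semantics, not vLLM's internals; `truncate_at_stop` is a made-up helper name.

```python
def truncate_at_stop(text: str, stops: list[str]) -> str:
    # Cut the text at the earliest occurrence of any stop string,
    # excluding the stop string itself from the result.
    cut = len(text)
    for s in stops:
        i = text.find(s)
        if i != -1 and i < cut:
            cut = i
    return text[:cut]

print(truncate_at_stop("The answer is 42.\nNext line", ["\n", "<|im_end|>"]))
# -> The answer is 42.
```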
Example:

```python
from vllm import SamplingParams

sampling_params = SamplingParams(
    temperature=0.8,
    top_p=0.95,
    max_tokens=2000,
    stop=["\n", "<|im_end|>", "<|endoftext|>"],
)
```
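When serving the model with the OpenAI-compatible server, the same stop strings can be passed per request in the `stop` field of the payload. A sketch, assuming a server started with `vllm serve` listening on `localhost:8000` and serving a model named `Qwen/Qwen2.5-7B-Instruct` (substitute your own model name):

```shell
# Assumes: vllm serve Qwen/Qwen2.5-7B-Instruct   (listening on localhost:8000)
curl http://localhost:8000/v1/completions \
  -H "Content-Type: application/json" \
  -d '{
    "model": "Qwen/Qwen2.5-7B-Instruct",
    "prompt": "Q: What is the capital of France?\nA:",
    "max_tokens": 100,
    "stop": ["\n", "<|im_end|>", "<|endoftext|>"]
  }'
```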
Would you like more detail or a full code example?
Is there any way to set that in the vllm serve CLI?