How to start embedding models

dugutianxue · March 20, 2025, 3:55am

提个问题 vllm如何跑 embadding 用正常跑模型的参数貌似无法跑embadding模型求大佬解答
A question: How to run embadding with vllm? I can’t seem to run an embadding model with the normal model parameters, so I’m looking for an answer.

youkaichao · March 20, 2025, 5:39am

you can check Pooling Models — vLLM

dugutianxue · March 20, 2025, 5:49am

这个没有看明白用途，有使用案例可以参考么！！

youkaichao · March 20, 2025, 5:56am

better to use english in this forum.

dugutianxue · March 20, 2025, 6:09am

Didn’t see the use for this one, is there a use case to look at!

Robin · March 20, 2025, 6:14am

It is possible to calculate similarity based on the hidden state embedding for recall.

DarkLight1337 · March 20, 2025, 6:25am

You can follow the example at vllm/examples/offline_inference/basic/embed.py at main · vllm-project/vllm · GitHub

DarkLight1337 · March 20, 2025, 6:27am

It is possible to calculate similarity based on the hidden state embedding for recall.

vLLM supports LLM.score, please check out this example: vllm/examples/offline_inference/basic/score.py at main · vllm-project/vllm · GitHub

Our OpenAI-compatible server also has a Score API that does the same thing.

Topic		Replies	Views
Welcome to vLLM Forums! :wave: General	1	364	March 24, 2025
How to load the model successfully through multi-card in vllm? General	5	79	April 3, 2025
Does VLLM support BERT model General	2	46	April 7, 2025
Tool calling using Offline Inference? Tool Calling	1	13	April 14, 2025
Numerical Difference between vLLM logprobs and huggingface logprobs RL Integration	7	183	April 4, 2025

How to start embedding models

Related topics