提个问题 vllm如何跑 embadding 用正常跑模型的参数 貌似无法跑embadding模型 求大佬解答
A question: How to run embadding with vllm? I can’t seem to run an embadding model with the normal model parameters, so I’m looking for an answer.
you can check Pooling Models — vLLM
这个没有看明白用途,有使用案例可以参考么!!
better to use english in this forum.
Didn’t see the use for this one, is there a use case to look at!
It is possible to calculate similarity based on the hidden state embedding for recall.
You can follow the example at vllm/examples/offline_inference/basic/embed.py at main · vllm-project/vllm · GitHub
1 Like
It is possible to calculate similarity based on the hidden state embedding for recall.
vLLM supports LLM.score
, please check out this example: vllm/examples/offline_inference/basic/score.py at main · vllm-project/vllm · GitHub
Our OpenAI-compatible server also has a Score API that does the same thing.
1 Like