You can check the Pooling Models page in the vLLM documentation.