What type of model you want to deploy? Which modality?
Related topics
| Topic | Replies | Views | Activity | |
|---|---|---|---|---|
| How to serve a transformed Pytorch model | 1 | 95 | September 2, 2025 | |
| Support for Deploying 4-bit Fine-Tuned Model with LoRA on vLLM | 13 | 1052 | July 30, 2025 | |
| Using vLLM on a HF model architecture modified locally | 1 | 242 | July 7, 2025 | |
| RunBot's math-to-text on NVIDIA NeMo Framework AutoModel | 11 | 161 | May 19, 2025 | |
| Mukti-GPUs on vLLM using a custom network | 5 | 128 | September 5, 2025 |