What is the best way to support vLLM on custom hardware?

Hello,

I have seen that custom hardware can be supported via a platform plugin. If the hardware already has a working torch backend, can we avoid implementing a platform plugin?

No, you cannot avoid implementing a platform plugin, even if your custom hardware already supports the torch backend. vLLM requires a platform plugin to register and manage hardware-specific components (e.g., the executor, worker, model runner, attention backend, communicator, and custom ops), regardless of torch compatibility. The plugin system is how vLLM recognizes, initializes, and dispatches to your hardware in a modular and maintainable way, as detailed in the official plugin documentation and the related RFCs. Without a platform plugin, vLLM will not detect or use your hardware backend automatically, even if torch operations run on it natively. See vLLM's Plugin System docs and the "Hardware pluggable" RFC.
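As a rough sketch of what the registration side of such a plugin looks like (the package name `my_vllm_plugin`, the SDK `my_hardware_sdk`, and the class `MyHardwarePlatform` are placeholders, not real projects): vLLM discovers platform plugins through the `vllm.platform_plugins` entry-point group, and the registered function returns the fully qualified name of the platform class, or `None` when the hardware is not available:

```python
# Minimal platform-plugin registration sketch (hypothetical names).
# The plugin package declares the entry point in its pyproject.toml:
#
#   [project.entry-points."vllm.platform_plugins"]
#   my_hardware = "my_vllm_plugin:register"

def register():
    """Called by vLLM at startup.

    Return the fully qualified name of the Platform subclass when the
    hardware is usable, or None so vLLM falls back to other platforms.
    """
    try:
        import my_hardware_sdk  # hypothetical vendor SDK  # noqa: F401
    except ImportError:
        # SDK/hardware not present on this machine; decline to register.
        return None
    return "my_vllm_plugin.platform.MyHardwarePlatform"
```

The returned class (here `MyHardwarePlatform`) would subclass vLLM's `Platform` interface and is where the worker, attention backend, and communicator for the device get wired in.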

Would you like more detail on how to implement a platform plugin for your hardware?
