The provided sources document no environment variable that fully disables torch.compile in vLLM; disable it through vLLM’s CLI flags or config instead (e.g., --enforce-eager and -cc.mode=0). Setting VLLM_DISABLE_COMPILE_CACHE=1 disables only the compile cache, not torch.compile itself, and the provided sources do not document an env var such as TORCH_COMPILE_DISABLE for this purpose.
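As a rough sketch of the launch described above, the snippet below assembles a `vllm serve` command line with the two flags named in this answer, plus an environment that sets VLLM_DISABLE_COMPILE_CACHE. The helper name `build_eager_launch` and the model ID are hypothetical placeholders, and whether your installed vLLM version accepts both flags together is an assumption to verify against its `--help` output.

```python
import os
import shlex


def build_eager_launch(model: str) -> tuple[list[str], dict[str, str]]:
    """Return (argv, env) for a `vllm serve` launch with compilation turned off.

    `build_eager_launch` is a hypothetical helper for illustration; it only
    builds the command and environment, it does not start vLLM.
    """
    # Flags taken from the answer above; support may vary by vLLM version.
    argv = ["vllm", "serve", model, "--enforce-eager", "-cc.mode=0"]

    env = dict(os.environ)
    # This variable disables only the compile cache, not torch.compile itself.
    env["VLLM_DISABLE_COMPILE_CACHE"] = "1"
    return argv, env


if __name__ == "__main__":
    # "my-org/my-model" is a placeholder model ID, not a real checkpoint.
    argv, _env = build_eager_launch("my-org/my-model")
    print(shlex.join(argv))
```

To actually launch the server, the pair could be passed to `subprocess.run(argv, env=env)`; the sketch stops short of that so it can be inspected without vLLM installed.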
If torch.compile is still active despite these flags, it is likely a bug or regression in vLLM, not a missing environment variable. Would you like more detail on how vLLM parses these flags internally?