This might be a naive question but does anyone know why does phi3 implementation in vLLM inherit from llama?
Inference-only Phi3 model code inherit from Llama.py
Why does vLLM not have an implementation of Phi3Model?
I wanted to extend the phi3 class with custom code but unable to get it working.
The models are very similar architecture. If you need to extend Phi, I would suggest extending the Llama definition