I want to get the logits generated by the model instead of logprobs. Which parts of the code do I need to modify? Do I also need to modify the request- and scheduling-related code?