Is is possible to initialize an AsyncLLMEngine inside the LLM object?

SparkJiao · April 7, 2025, 5:10am

The background is that we are using vllm as the backend for rollout in veRL for RL training. During rollout stage, we need to invoke several function calls inside each trajectory. Currently most implementations simply use multi-round form, where each form contains exactly one function call. This is will cause large latency when some trajectories are finished earlier and they have to wait the others before the next round generation.

My question is that if we could initialize the AsyncLLMEngine inside the LLM object so that we can use async method _add_request to decouple the external agent workflow management and internal generation process. Or if there is any other recommended workflow for this?

hmellor · April 8, 2025, 10:45pm

In V1 we have the AsyncLLM class you could use vllm/vllm/v1/engine/async_llm.py at main · vllm-project/vllm · GitHub. Does that suit your needs?

SparkJiao · April 9, 2025, 2:29am

Hi, Hmellor, I think this is good.

One followup question is that does current AsyncLLM support similar usage with this that we can access the handler of the model executor so that we can update parameters? I remember for AsyncLLMEngine the model executor is launched at the background so we cannot directly access it.

Thank you!

hmellor · April 9, 2025, 8:21am

You can access the Engine at AsyncLLM.engine_core (which will be a AsyncMPClient(MPClient) class). I’m not sure how the model is accessed from these client classes though.

youkaichao · April 12, 2025, 4:58pm

We are discussing with verl on how to support agent / multi-turn / toolcalling in RL, please follow the discussion in [Question] Is vLLMRollout.generate_sequences the right place to implement tool calling? · Issue #176 · volcengine/verl · GitHub .

Topic		Replies	Views
Dose vllm V1 support asynchronous scheduling? V1 Feedback	1	85	April 14, 2025
Async version of LLM.chat()? General	0	44	March 26, 2025
Intergate async-llm to openrlfh, when tensor_parallel_size > 1 will cause ray-error OpenRLHF	1	38	April 27, 2025
Is there a newly example to show how to add a new LLM into vLLM? General	3	11	June 10, 2025
Welcome to vLLM Forums! :wave: General	1	481	March 24, 2025

Is is possible to initialize an AsyncLLMEngine inside the LLM object?

Related topics