RL Integration
verl OpenRLHF TRL
Topic | Replies | Views | Activity | |
---|---|---|---|---|
About the RL Integration category |
![]() |
0 | 11 | March 26, 2025 |
Is is possible to initialize an AsyncLLMEngine inside the LLM object? |
![]() ![]() ![]() |
4 | 58 | April 12, 2025 |
No HIP GPUs are available for VeRL |
![]() ![]() ![]() ![]() |
4 | 53 | April 4, 2025 |
Numerical Difference between vLLM logprobs and huggingface logprobs |
![]() ![]() ![]() ![]() |
7 | 184 | April 4, 2025 |
RL Training with vLLM Rollout: How to Mitigate Load Imbalance from Variable Response Lengths |
![]() ![]() ![]() ![]() |
4 | 144 | April 1, 2025 |