-
I have the genRM on my local machine and have similar
My shell params associated with the reward model are as follows: Please help me figure out the solution and make genRM work. |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment
-
|
For local LLMs, use the below helper function To extract the responses, do Thank you VERL community! |
Beta Was this translation helpful? Give feedback.
For local LLMs, use the below helper function