Skip to content

Commit b1ba0e2

Browse files
author
swarna
committed
Fixing typo in comment
1 parent d14fb90 commit b1ba0e2

File tree

2 files changed

+2
-2
lines changed

2 files changed

+2
-2
lines changed

src/fairseq2/recipes/lm/_online_finetune/_grpo.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -194,7 +194,7 @@ def validate_reward(
194194
) in self._config.loss_config.validation_vllm_sampling_params.items():
195195
policy_sampling_params.__setattr__(k, v)
196196

197-
# For a pairwise RM, need to sample at least two judgments
197+
# For a pairwise RM, need to sample at least two rollouts
198198
policy_sampling_params.n = (
199199
2 if self._reward.reward_name == "generative_pairwise_verifier" else 1
200200
)

src/fairseq2/recipes/lm/_online_finetune/_online_dpo.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -140,7 +140,7 @@ def validate_reward(
140140
) in self._config.loss_config.validation_vllm_sampling_params.items():
141141
policy_sampling_params.__setattr__(k, v)
142142

143-
# For a pairwise RM, need to sample at least two judgments
143+
# For a pairwise RM, need to sample at least two rollouts
144144
policy_sampling_params.n = (
145145
2 if self._reward.reward_name == "generative_pairwise_verifier" else 1
146146
)

0 commit comments

Comments
 (0)