[feat] add config option to repeat samples from loadgen #2402
Modern LLM evaluation techniques repeat each sample multiple times to judge model accuracy. This PR introduces a new config parameter, `repeats_per_sample`, which sends each sample multiple times independently.

In `PerformanceOnly` mode, it is asserted that `repeats_per_sample == 1`. In `AccuracyOnly` mode, the results are logged to `mlperf_log_accuracy.json` as follows (with `repeats_per_sample=5`):

    [
      { "seq_id": 1, "qsl_idx": 0, "repeat_idx": 0, "data": "...", "token_count": 100 },
      { "seq_id": 2, "qsl_idx": 0, "repeat_idx": 1, "data": "...", "token_count": 105 },
      { "seq_id": 3, "qsl_idx": 0, "repeat_idx": 2, "data": "...", "token_count": 98 },
      { "seq_id": 4, "qsl_idx": 0, "repeat_idx": 3, "data": "...", "token_count": 110 },
      { "seq_id": 5, "qsl_idx": 0, "repeat_idx": 4, "data": "...", "token_count": 102 },
      { "seq_id": 6, "qsl_idx": 1, "repeat_idx": 0, "data": "...", "token_count": 150 },
      ...
    ]

To use it, add a line to `user.conf`. The default is 1 (no repeats).
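For anyone consuming the new log format, here is a minimal sketch of how the repeated entries might be grouped per sample. It assumes only the fields shown in the example above (`qsl_idx`, `repeat_idx`, `token_count`); the helper name `group_by_sample` is hypothetical and not part of loadgen, and the inline entries stand in for a real `mlperf_log_accuracy.json`.

```python
import json
from collections import defaultdict


def group_by_sample(entries):
    """Group accuracy-log entries by qsl_idx, ordered by repeat_idx.

    Hypothetical helper for illustration; not part of loadgen.
    """
    grouped = defaultdict(list)
    for entry in entries:
        grouped[entry["qsl_idx"]].append(entry)
    for repeats in grouped.values():
        repeats.sort(key=lambda e: e["repeat_idx"])
    return dict(grouped)


# Stand-in for the contents of mlperf_log_accuracy.json, copied from the
# example in this description ("data" is left as a placeholder).
log_text = """
[
  {"seq_id": 1, "qsl_idx": 0, "repeat_idx": 0, "data": "...", "token_count": 100},
  {"seq_id": 2, "qsl_idx": 0, "repeat_idx": 1, "data": "...", "token_count": 105},
  {"seq_id": 3, "qsl_idx": 0, "repeat_idx": 2, "data": "...", "token_count": 98},
  {"seq_id": 4, "qsl_idx": 0, "repeat_idx": 3, "data": "...", "token_count": 110},
  {"seq_id": 5, "qsl_idx": 0, "repeat_idx": 4, "data": "...", "token_count": 102},
  {"seq_id": 6, "qsl_idx": 1, "repeat_idx": 0, "data": "...", "token_count": 150}
]
"""

entries = json.loads(log_text)
grouped = group_by_sample(entries)

# Average token count across the repeats of each sample.
mean_tokens = {
    qsl_idx: sum(e["token_count"] for e in repeats) / len(repeats)
    for qsl_idx, repeats in grouped.items()
}

for qsl_idx, repeats in grouped.items():
    print(f"qsl_idx={qsl_idx}: {len(repeats)} repeat(s), "
          f"mean token_count={mean_tokens[qsl_idx]:.1f}")
```

Grouping on `qsl_idx` rather than `seq_id` is the point here: `seq_id` is unique per logged response, while `qsl_idx` identifies the underlying sample that was repeated.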