Pull requests: kubernetes-sigs/inference-perf

Pull requests list

feat(openai_client): request token usage stats for streamed responses
Labels: cncf-cla: yes, size/M
#365 opened Mar 17, 2026 by adelsam
Update coverage check script
Labels: approved, cncf-cla: yes, lgtm, size/M
#362 opened Mar 12, 2026 by jjk-g
Fix saturation detection and harden load generator
Labels: cncf-cla: yes, size/XL
#360 opened Mar 2, 2026 by Bslabe123
fix: handle ShareGPT dataset exhaustion by reinitializing iterator
Labels: cncf-cla: yes, size/L
#359 opened Feb 27, 2026 by DebuggingMax
[WIP] Add raw time series metric output.
Labels: approved, cncf-cla: yes, do-not-merge/work-in-progress, size/L
#356 opened Feb 25, 2026 by jjk-g
Multi report comparitive analysis feature
Labels: approved, cncf-cla: yes, lgtm, needs-rebase, size/XXL
#355 opened Feb 25, 2026 by SachinVarghese
fix: preserve configured tokenizer when using MockModelServerClient
Labels: cncf-cla: yes, kind/bug, lgtm, size/XS
#353 opened Feb 24, 2026 by alonh
Fix ShareGPT StopIteration error on dataset exhaustion
Labels: cncf-cla: no, do-not-merge/invalid-commit-message, size/M
#341 opened Feb 4, 2026 by loganionian
feat: add structured output support for vLLM backend
Labels: cncf-cla: yes, kind/feature, size/L
#339 opened Feb 3, 2026 by dhxshop (2 tasks)
fix(config): substitute timestamp in storage paths
Labels: approved, cncf-cla: yes, lgtm, size/M
#330 opened Jan 29, 2026 by yangligt2
feat: Add Chat Completion API support to SharedPrefixDataGenerator
Labels: cncf-cla: yes, size/L
#287 opened Nov 19, 2025 by bongwoobak
Support setting custom y-axis limits optionally
Labels: cncf-cla: yes, kind/failing-test, needs-rebase, size/S
#268 opened Nov 3, 2025 by Shuwen-Fang
refactor: Make base client concrete and usable
Labels: cncf-cla: yes, kind/failing-test, size/L
#246 opened Oct 7, 2025 by LukeAVanDrie