
Conversation

@dcampora (Contributor) commented Oct 27, 2025

Purpose

This PR allows top_k_per_row to work for arbitrary logit values. Even if logits differ only in their least significant bits, top-k is now guaranteed to return a correct answer.

Solves #26554

Test Plan

The test test_top_k_per_row has been extended to cover these cases as well. They failed before this change and pass now.
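For context, a minimal sketch (plain Python, not the actual vLLM test) of the kind of case being added: logits that differ by only one ulp, where a reference top-k using exact comparisons must still rank them correctly.

```python
import math

# Hypothetical row of logits whose values differ only in the least
# significant bit of their float representation.
base = 1.0
row = [base, math.nextafter(base, 2.0), math.nextafter(base, 0.0), base]

k = 2
# Reference top-k: exact comparison on the full float values, with a
# stable tie-break on index, so 1-ulp differences are ranked correctly.
ref = sorted(range(len(row)), key=lambda i: (-row[i], i))[:k]
print(ref)  # [1, 0]: the 1-ulp-larger value first, then the first exact 1.0
```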

Test Result


Essential Elements of an Effective PR Description Checklist
  • The purpose of the PR, such as "Fix some issue (link existing issues this PR will resolve)".
  • The test plan, such as providing test command.
  • The test results, such as pasting the results comparison before and after, or e2e results
  • (Optional) The necessary documentation update, such as updating supported_models.md and examples for a new model.
  • (Optional) Release notes update. If your change is user facing, please update the release notes draft in the Google Doc.

@mergify mergify bot added the deepseek Related to DeepSeek models label Oct 27, 2025

@gemini-code-assist gemini-code-assist bot left a comment


Code Review

This pull request refactors the top-k kernel to correctly handle logit values that differ only in their least significant bits, by using a multi-pass histogram approach on the full 32-bit float representation. The changes are extensive and introduce a more complex but precise algorithm.
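As a rough illustration (not the PR's CUDA kernel), histogram/radix selection on floats typically relies on a monotone mapping from the float32 bit pattern to an unsigned integer, so that bucketing by bits preserves numeric order down to the last ulp. A hypothetical Python sketch of that mapping:

```python
import struct

def f32_to_ordered_u32(x: float) -> int:
    """Reinterpret a float32 as a uint32 whose unsigned order matches
    the float's numeric order: negatives have all bits flipped, others
    get the sign bit set. Histogram passes over these bits can then
    select top-k exactly, even for values that differ by one ulp."""
    bits = struct.unpack("<I", struct.pack("<f", x))[0]
    return bits ^ 0xFFFFFFFF if bits & 0x80000000 else bits | 0x80000000

# Bit-level key order agrees with numeric order across signs and zeros.
vals = [-2.0, -0.0, 0.0, 1.0, 1.0000001]
keys = [f32_to_ordered_u32(v) for v in vals]
assert keys == sorted(keys)
```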

My review has identified several critical issues and areas for improvement:

  • There are critical bugs related to memory access and incorrect output indices when rowStart is not zero. The tests should be expanded to cover this case.
  • The logic for selecting the sorting algorithm in the host-side launcher functions appears to be flawed, potentially leading to significant performance degradation.
  • A fallback sorting algorithm within the kernel is misleadingly commented and has quadratic complexity, which can be a performance bottleneck.
  • There is some code duplication that should be addressed to improve maintainability.

I have provided specific comments with code suggestions to address these points. Overall, the direction is good, but these issues need to be fixed before merging.


@chatgpt-codex-connector chatgpt-codex-connector bot left a comment


💡 Codex Review

Here are some automated review suggestions for this pull request.

ℹ️ About Codex in GitHub

Codex has been enabled to automatically review pull requests in this repo. Reviews are triggered when you

  • Open a pull request for review
  • Mark a draft as ready
  • Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

When you sign up for Codex through ChatGPT, Codex can also answer questions or update the PR, like "@codex address that feedback".


Signed-off-by: Daniel Campora <[email protected]>
