-
Notifications
You must be signed in to change notification settings - Fork 3.5k
feat: implement ngram tokenizer with token_chars and custom_token_chars #45040
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
Signed-off-by: SpadeA <[email protected]>
|
[ci-v2-notice]
To rerun ci-v2 checks, comment with:
If you have any questions or requests, please contact @zhikunyao. |
Codecov Report✅ All modified and coverable lines are covered by tests. Additional details and impacted files@@ Coverage Diff @@
## master #45040 +/- ##
==========================================
- Coverage 76.97% 76.93% -0.04%
==========================================
Files 1838 1838
Lines 285526 285503 -23
==========================================
- Hits 219775 219648 -127
- Misses 58555 58647 +92
- Partials 7196 7208 +12
🚀 New features to boost your workflow:
|
|
please cp 2.6 |
|
/ci-rerun-ut-go |
done #45046 |
|
/ci-rerun-ut-go |
|
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: SpadeA-Tang, zhengbuqian The full list of commands accepted by this bot can be found here. The pull request process is described here
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing |
|
/lgtm |
…rs [2.6] (#45046) pr: #45040 issue: #45039 Signed-off-by: SpadeA <[email protected]>
issue: #45039