Update Sage to current OpenAI, Claude, Gemini, Z.AI, and Kimi defaults by majiayu000 · Pull Request #2 · majiayu000/sage

majiayu000 · 2026-04-22T00:10:41Z

Summary

move built-in OpenAI, Anthropic, Google, GLM/Zhipu defaults to current official model families
add first-class Z.AI and Moonshot/Kimi provider support on top of the existing OpenAI-compatible transport
update embedded provider catalogs, onboarding, model discovery, context heuristics, and sample configs/docs together

Verification

cargo test -q -p sage-core -p sage-cli

Notes

Google defaults use stable production Gemini 2.5 IDs rather than newer preview-only model families
Z.AI and Moonshot/Kimi are wired into provider discovery and /model selection instead of being left as custom-only endpoints

Sage still shipped GPT-4, Gemini 1.5, and older Claude defaults across runtime config, provider catalogs, and model discovery. This change moves the built-in defaults to current official OpenAI, Anthropic, Google, Z.AI, and Moonshot/Kimi families and wires Z.AI and Moonshot in as first-class OpenAI-compatible providers so the newer models are actually selectable and configurable. Constraint: Keep the existing provider architecture and reuse the OpenAI-compatible transport for Z.AI and Moonshot Rejected: Expose Z.AI and Kimi only through custom_* providers | users would lose registry, onboarding, and model-select support Confidence: medium Scope-risk: moderate Reversibility: clean Directive: When bumping model families again, update runtime defaults, embedded catalogs, context heuristics, and slash-command model discovery together Tested: cargo test -q -p sage-core -p sage-cli Not-tested: Live requests against vendor APIs

The latest-model refresh changed several match arms and long string lists in provider/config code. This follow-up applies the repository rustfmt style so CI can evaluate the actual behavioral diff instead of failing early on formatting noise. Constraint: CI format gate runs before most validation jobs Rejected: Leave formatting to reviewers or CI reruns | blocks signal from downstream checks Confidence: high Scope-risk: narrow Reversibility: clean Directive: After large provider-catalog edits, run rustfmt before pushing so CI reaches behavioral checks Tested: cargo fmt Not-tested: Full test suite rerun after formatting

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 893e07bcad

ℹ️ About Codex in GitHub

Codex has been enabled to automatically review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

When you sign up for Codex through ChatGPT, Codex can also answer questions or update the PR, like "@codex address that feedback".

chatgpt-codex-connector · 2026-04-22T00:19:38Z

+            ("anthropic", m)
+                if m.contains("claude-opus-4") || m.contains("claude-sonnet-4") =>
+            {


Reorder Anthropic context matching for Opus 4

The new claude-opus-4/claude-sonnet-4 branch is placed after the existing m.contains("claude-opus") branch, so claude-opus-4-7 never reaches the 1M-token path and is treated as a 200K model instead. With the commit’s new default Anthropic model (claude-opus-4-7), this causes premature context truncation in normal runs; the broader claude-opus rule should be narrowed or moved below the v4-specific rule.

Useful? React with 👍 / 👎.

chatgpt-codex-connector · 2026-04-22T00:19:38Z

+                if model.contains("claude-opus-4") || model.contains("claude-sonnet-4") {
+                    (1_000_000, 32_000)
+                } else if model.contains("3.5") || model.contains("3-5") {
                    (200_000, 13_000) // Claude 3.5: 200K context, 13K reserved (like Claude Code)
                } else {


Include Claude Haiku 4.5 in Anthropic auto-compact limits

This Anthropic branch only special-cases Opus/Sonnet v4 and otherwise falls back to much smaller defaults, so the newly added claude-haiku-4-5 model gets 100K context behavior despite being configured elsewhere as a 200K-context model. That mismatch triggers compaction/truncation earlier than necessary for Haiku 4.5 sessions and can degrade response quality under longer chats.

Useful? React with 👍 / 👎.

The model-update PR surfaced two unrelated repository-wide gates: strict Clippy warnings and a RustSec finding in Cargo.lock. This commit resolves the mechanical lint failures that Clippy reported and updates rustls-webpki in the lockfile so the security audit can move past the current known vulnerability. Constraint: CI runs Clippy and cargo-audit repo-wide, not only on touched modules Rejected: Ignore repo-wide warnings and security findings as unrelated | the PR would remain non-mergeable Confidence: medium Scope-risk: moderate Reversibility: clean Directive: Keep CI gates green before treating provider-model updates as review-ready Tested: cargo fmt Not-tested: Full clippy/audit rerun pending CI

The provider-model update itself was viable, but CI surfaced a mix of repo-wide style debt and one compile regression from the follow-up lint cleanup. This commit finishes the mechanical Clippy cleanups, fixes the background task sort key compile error, and keeps the lockfile on the audited rustls-webpki release. Constraint: Keep the PR mergeable under the repository's existing CI gates Rejected: Scope the PR to model changes only | the branch would stay blocked on existing CI failures Confidence: medium Scope-risk: moderate Reversibility: clean Directive: Treat repo-wide CI failures as part of delivery when a PR cannot advance without them Tested: cargo fmt; cargo check --all-targets --all-features (in progress locally at commit time) Not-tested: Final CI rerun after this follow-up

Automated review caught two real heuristic mismatches in the latest-model update: Opus 4 was still being shadowed by the older claude-opus branch, and Haiku 4.5 was falling back to overly aggressive auto-compact limits. This commit reorders the Anthropic matching rules and gives Haiku 4.5 the same 200K path used elsewhere in the repo. Constraint: Keep context management aligned with the newly selected Anthropic defaults Rejected: Leave heuristics inconsistent and rely on provider-side limits | would still trigger premature truncation/compaction in long sessions Confidence: high Scope-risk: narrow Reversibility: clean Directive: When adding a new model family, verify both ContextConfig and AutoCompactConfig match the intended context window Tested: cargo test -q -p sage-core context::config -- --nocapture; cargo test -q -p sage-core context::auto_compact -- --nocapture Not-tested: End-to-end long-context Anthropic session

majiayu000 added 2 commits April 22, 2026 07:54

chatgpt-codex-connector Bot reviewed Apr 22, 2026

View reviewed changes

majiayu000 added 3 commits April 22, 2026 08:21

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update Sage to current OpenAI, Claude, Gemini, Z.AI, and Kimi defaults#2

Update Sage to current OpenAI, Claude, Gemini, Z.AI, and Kimi defaults#2
majiayu000 wants to merge 5 commits intomainfrom
codex/latest-model-updates

majiayu000 commented Apr 22, 2026

Uh oh!

chatgpt-codex-connector Bot left a comment

Uh oh!

chatgpt-codex-connector Bot Apr 22, 2026

Uh oh!

chatgpt-codex-connector Bot Apr 22, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

majiayu000 commented Apr 22, 2026

Summary

Verification

Notes

Uh oh!

chatgpt-codex-connector Bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector Bot Apr 22, 2026

Choose a reason for hiding this comment

Uh oh!

chatgpt-codex-connector Bot Apr 22, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant