Skip to content

Add on-device LLM models to dev and prod environments#2175

Merged
gary149 merged 1 commit intomainfrom
claude/update-model-skill-dQVIz
Mar 12, 2026
Merged

Add on-device LLM models to dev and prod environments#2175
gary149 merged 1 commit intomainfrom
claude/update-model-skill-dQVIz

Conversation

@gary149
Copy link
Collaborator

@gary149 gary149 commented Mar 12, 2026

Summary

This PR adds five new lightweight language models to both development and production environments, expanding support for on-device inference with multilingual and regional capabilities.

Key Changes

  • Added Qwen/Qwen3.5-9B: Dense multimodal model optimized for on-device reasoning with 262K context window
  • Added CohereLabs/tiny-aya-global: Multilingual assistant supporting 70+ languages for on-device deployment
  • Added CohereLabs/tiny-aya-earth: Regional variant optimized for African languages with cultural tuning
  • Added CohereLabs/tiny-aya-fire: Regional variant optimized for South Asian languages with cultural tuning
  • Added CohereLabs/tiny-aya-water: Regional variant optimized for Asia-Pacific and European languages

Implementation Details

  • Updated both chart/env/dev.yaml and chart/env/prod.yaml to maintain consistency across environments
  • New models are positioned at the beginning of the MODELS list, making them available as primary options
  • Each model includes descriptive metadata highlighting their specific use cases and capabilities
  • All additions focus on lightweight, on-device inference capabilities complementing the existing larger models

https://claude.ai/code/session_01Cv3ZqNWtFJ9dSbeobd1zLX

Add Qwen3.5-9B, tiny-aya-global, tiny-aya-earth, tiny-aya-fire,
and tiny-aya-water to prod.yaml and dev.yaml MODELS arrays.

https://claude.ai/code/session_01Cv3ZqNWtFJ9dSbeobd1zLX
@gary149 gary149 merged commit 6c460f7 into main Mar 12, 2026
5 of 6 checks passed
@gary149 gary149 deleted the claude/update-model-skill-dQVIz branch March 12, 2026 18:31
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants