Skip to content

Version v0.4.0

Latest

Choose a tag to compare

@MHindermann MHindermann released this 09 Dec 12:42
· 5 commits to main since this release

Changelog

All notable changes to this project will be documented in this file.

The format is based on Keep a Changelog,
and this project adheres to Semantic Versioning.

v0.4.0 - 2025-12-09

Added

  • 4 new models: gpt-5.1 (OpenAI), gemini-3-pro-preview (GenAI), magistral-medium-2509 (Mistral), mistral-small-2506 (Mistral)
  • 42 new benchmark test configurations (T0403-T0444) across all benchmarks for new models
  • Pricing data for 2025-11-24 with updated model prices and source URLs
  • Cohere provider support with 5 models: command-r-08-2024, command-r-plus-08-2024, command-r7b-12-2024, command-a-03-2025, command-a-vision-07-2025
  • book_advert_xml benchmark for correcting malformed XML from 18th century book advertisements
  • 43 new benchmark test configurations (T0445-T0487) for book_advert_xml across all providers
  • Pricing data for Cohere models (2025-12-09)

Changed

Removed

  • All tests with claude-3-5-sonnet-20241022 (now legacy)
  • All renders and related docs (now handled by dedicated frontend)

Full Changelog: v0.3.1...v0.4.0