v1.0.8

@AlbumenJ released this 26 Jan 03:13

This release introduces multimodal capabilities through Qwen TTS integration, expands the RAG ecosystem with PostgreSQL and Elasticsearch support, and gives agents secure code execution through SkillBox.

🌟 Key Highlights

1. Qwen TTS Integration

We have officially expanded the multimodal capabilities of the framework by integrating Qwen (Tongyi Qianwen) Text-to-Speech.

  • Voice Generation: Agents can now generate high-quality speech output using Qwen's TTS models, enabling richer, voice-based interactions for your applications (#541).

2. Expanded RAG & Vector Storage

We have greatly broadened the storage options for Retrieval-Augmented Generation (RAG), allowing for more flexible and production-ready deployments.

  • PostgreSQL & pgvector: Added native support for using PostgreSQL with pgvector as a vector store (#525).
  • Elasticsearch: Introduced Elasticsearch as a backend for RAG storage, offering powerful full-text search and vector capabilities (#503).
  • Documentation: Added documentation for the Tika reader to assist with diverse document parsing (#654).
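To make the pgvector item above more concrete, here is a minimal sketch of the two pieces any pgvector-backed store needs: a vector literal in pgvector's text form and a nearest-neighbour query using its cosine distance operator (`<=>`). The class name, table name, and column names are assumptions for illustration and are not taken from the framework's API.

```java
import java.util.StringJoiner;

// Hypothetical helper for illustration; not part of the framework's API.
final class PgVectorSketch {

    // pgvector accepts vectors in the text form "[x1,x2,...]".
    static String toVectorLiteral(float[] embedding) {
        StringJoiner joiner = new StringJoiner(",", "[", "]");
        for (float value : embedding) {
            joiner.add(Float.toString(value));
        }
        return joiner.toString();
    }

    // Builds a nearest-neighbour query ordered by cosine distance (<=>).
    // The "id", "content" and "embedding" columns are assumed names.
    // The table name is concatenated, so it must come from trusted configuration.
    static String nearestNeighbourSql(String table, int limit) {
        return "SELECT id, content FROM " + table
                + " ORDER BY embedding <=> ?::vector LIMIT " + limit;
    }

    public static void main(String[] args) {
        // The literal would be bound to the "?" placeholder of a PreparedStatement.
        System.out.println(toVectorLiteral(new float[] {0.25f, -1.5f, 3.0f}));
        System.out.println(nearestNeighbourSql("rag_chunks", 5));
    }
}
```

In a real deployment the query would run through JDBC against a database where the `vector` extension is enabled; the sketch only shows the string shapes involved.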

3. Secure Code Execution (SkillBox)

The Skill system now supports secure code execution, allowing agents to generate and run code dynamically.

  • Code Execution: Added support for executing code within the SkillBox environment (#614).
  • Developer Experience: Refactored the code execution API with a Builder pattern for easier configuration and usage (#646), and simplified the skill lifecycle (#615).
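For readers unfamiliar with the Builder pattern mentioned above, the following sketch shows the general shape of builder-style configuration: sensible defaults, fluent setters, and validation before an immutable object is built. Every name here (`ExecutionConfigSketch`, `language`, `timeout`, `networkEnabled`) is hypothetical and is not the SkillBox API; consult #646 for the actual interface.

```java
import java.time.Duration;

// Hypothetical configuration type; the names are NOT the framework's actual API.
final class ExecutionConfigSketch {
    private final String language;
    private final Duration timeout;
    private final boolean networkEnabled;

    private ExecutionConfigSketch(Builder builder) {
        this.language = builder.language;
        this.timeout = builder.timeout;
        this.networkEnabled = builder.networkEnabled;
    }

    static Builder builder() {
        return new Builder();
    }

    String language() { return language; }
    Duration timeout() { return timeout; }
    boolean networkEnabled() { return networkEnabled; }

    static final class Builder {
        // Assumed defaults: callers override only what they need.
        private String language = "python";
        private Duration timeout = Duration.ofSeconds(30);
        private boolean networkEnabled = false;

        Builder language(String language) {
            this.language = language;
            return this;
        }

        Builder timeout(Duration timeout) {
            // Reject zero or negative timeouts as soon as they are supplied.
            if (timeout.isZero() || timeout.isNegative()) {
                throw new IllegalArgumentException("timeout must be positive");
            }
            this.timeout = timeout;
            return this;
        }

        Builder networkEnabled(boolean networkEnabled) {
            this.networkEnabled = networkEnabled;
            return this;
        }

        ExecutionConfigSketch build() {
            return new ExecutionConfigSketch(this);
        }
    }

    public static void main(String[] args) {
        ExecutionConfigSketch config = ExecutionConfigSketch.builder()
                .timeout(Duration.ofSeconds(5))
                .build();
        System.out.println(config.language() + " / " + config.timeout());
    }
}
```

The appeal of this style is that optional settings can be added later without breaking existing call sites, which is one plausible reason for such a refactoring.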

🚀 New Features

  • Model & AG-UI:

    • Thinking Mode: Added support for visualizing "Thinking mode" outputs (e.g., from reasoning models) in AG-UI (#574).
    • DashScope: Added support for the response_format parameter (#564).
    • Custom Events: AG-UI now supports custom events for more flexible frontend interactions (#605).
    • Spring Boot: Improved auto-configuration for AG-UI (#653).
  • Agent Core & ReAct:

    • Hooks: Added summary phase hook support for ReActAgent (#577).
    • Planning: Enforced strict maxSubtasks limits in PlanNotebook and ReActAgent to prevent infinite loops (#645, #656).
  • Memory:

    • Mem0: Added support for metadata recording and filtering in Mem0 storage (#563).
    • Structured Output: Optimized structured output to preserve ThinkingBlock content after memory compression (#655).
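The maxSubtasks enforcement described under Planning can be pictured as a hard cap on plan growth: once the limit is reached, further subtasks are refused, so a planner that keeps proposing work cannot loop forever. The sketch below illustrates that idea only; `BoundedPlanSketch` and its methods are hypothetical and do not mirror PlanNotebook's real interface.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration of a capped plan; not the PlanNotebook API.
final class BoundedPlanSketch {
    private final int maxSubtasks;
    private final List<String> subtasks = new ArrayList<>();

    BoundedPlanSketch(int maxSubtasks) {
        if (maxSubtasks < 1) {
            throw new IllegalArgumentException("maxSubtasks must be at least 1");
        }
        this.maxSubtasks = maxSubtasks;
    }

    // Returns false, and leaves the plan unchanged, once the cap is reached.
    boolean addSubtask(String description) {
        if (subtasks.size() >= maxSubtasks) {
            return false;
        }
        subtasks.add(description);
        return true;
    }

    List<String> subtasks() {
        return List.copyOf(subtasks);
    }

    public static void main(String[] args) {
        BoundedPlanSketch plan = new BoundedPlanSketch(3);
        int step = 0;
        // Simulates a planner that always proposes one more step;
        // the loop ends only because the cap refuses the fourth subtask.
        while (plan.addSubtask("step " + (++step))) {
            // keep planning
        }
        System.out.println(plan.subtasks());
    }
}
```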

🛠️ Refactoring & Fixes

  • Concurrency & Stability:

    • Shell Tool: Fixed a pipe buffer deadlock issue in ShellCommandTool (#619).
    • Tool Execution: Switched to mergeSequential so that the results of parallel tool calls are emitted strictly in their original call order (#652).
    • Non-blocking: Standardized non-blocking behavior for all ChatModelBase implementations (#642).
  • Compatibility:

    • JDK 25: Excluded lombok dependency to ensure forward compatibility with JDK 25 (#604).
    • WebFlux: Fixed Jakarta Servlet compatibility issues in A2A communication (#587).
  • Bug Fixes:

    • ReAct: Fixed the resume logic so that agents execute only the tool calls that do not yet have results (#650).
    • Ollama: Handled edge cases where tool calls have empty parameters (#572).
    • Observability: Fixed callTool pipeline observability and tracing (#576).
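As background for the ShellCommandTool fix, a pipe buffer deadlock typically arises when a parent process calls `waitFor()` before reading the child's output: once the child fills the OS pipe buffer it blocks on write, and the parent blocks waiting for it to exit. The sketch below shows one common remedy using only the JDK, draining stdout and stderr on separate threads before waiting. It is an assumption about the class of bug, not the actual patch in #619, and `PipeDrainSketch` is a hypothetical name.

```java
import java.io.IOException;
import java.io.InputStream;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.Executor;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;

// Hypothetical helper for illustration; not the ShellCommandTool implementation.
final class PipeDrainSketch {

    static final class Result {
        final int exitCode;
        final String stdout;
        final String stderr;

        Result(int exitCode, String stdout, String stderr) {
            this.exitCode = exitCode;
            this.stdout = stdout;
            this.stderr = stderr;
        }
    }

    // Reads the stream to EOF on the given executor so the caller never blocks on it.
    static CompletableFuture<String> drainAsync(InputStream in, Executor executor) {
        return CompletableFuture.supplyAsync(() -> {
            try (InputStream stream = in) {
                return new String(stream.readAllBytes(), StandardCharsets.UTF_8);
            } catch (IOException e) {
                throw new UncheckedIOException(e);
            }
        }, executor);
    }

    static Result run(String... command) throws IOException, InterruptedException {
        // One thread per pipe, so neither stream can stall the other.
        ExecutorService pool = Executors.newFixedThreadPool(2);
        try {
            Process process = new ProcessBuilder(command).start();
            process.getOutputStream().close(); // the child sees EOF on stdin
            // Start draining BEFORE waitFor(); waiting first is what risks the deadlock.
            CompletableFuture<String> stdout = drainAsync(process.getInputStream(), pool);
            CompletableFuture<String> stderr = drainAsync(process.getErrorStream(), pool);
            int exitCode = process.waitFor();
            return new Result(exitCode, stdout.join(), stderr.join());
        } finally {
            pool.shutdown();
        }
    }

    public static void main(String[] args) throws Exception {
        // Re-uses the current JVM binary when available, falling back to "java" on PATH.
        String java = ProcessHandle.current().info().command().orElse("java");
        Result result = run(java, "-version");
        System.out.println("exit=" + result.exitCode);
        System.out.println(result.stderr);
    }
}
```

Merging stderr into stdout with `ProcessBuilder.redirectErrorStream(true)` is a simpler alternative when the two streams do not need to be kept apart.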

❤️ New Contributors

Full Changelog: v1.0.7...v1.0.8