Release v1.0.8 · agentscope-ai/agentscope-java

This release introduces multimodal capabilities with Qwen TTS integration, significantly expands the RAG ecosystem with PostgreSQL and Elasticsearch support, and empowers agents with robust Code Execution capabilities.

🌟 Key Highlights

1. Qwen TTS Integration

We have officially expanded the multimodal capabilities of the framework by integrating Qwen (Tongyi Qianwen) Text-to-Speech.

Voice Generation: Agents can now generate high-quality speech output using Qwen's TTS models, enabling richer, voice-based interactions for your applications (#541).

2. Expanded RAG & Vector Storage

We have greatly broadened the storage options for Retrieval-Augmented Generation (RAG), allowing for more flexible and production-ready deployments.

PostgreSQL & pgvector: Added native support for using PostgreSQL with pgvector as a vector store (#525).
Elasticsearch: Introduced Elasticsearch as a backend for RAG storage, offering powerful full-text search and vector capabilities (#503).
Documentation: Added documentation for the Tika reader to assist with diverse document parsing (#654).

3. Secure Code Execution (SkillBox)

The Skill system has been empowered with secure code execution capabilities, allowing agents to generate and run code dynamically.

Code Execution: Added support for executing code within the SkillBox environment (#614).
Developer Experience: Refactored the code execution API with a Builder pattern for easier configuration and usage (#646), and simplified the skill lifecycle (#615).

🚀 New Features

Model & AG-UI:
- Thinking Mode: Added support for visualizing "Thinking mode" outputs (e.g., from reasoning models) in AG-UI (#574).
- DashScope: Added support for the response_format parameter (#564).
- Custom Events: AG-UI now supports custom events for more flexible frontend interactions (#605).
- Spring Boot: Improved auto-configuration for AG-UI (#653).
Agent Core & ReAct:
- Hooks: Added summary phase hook support for ReActAgent (#577).
- Planning: Enforced strict maxSubtasks limits in PlanNotebook and ReActAgent to prevent infinite loops (#645, #656).
Memory:
- Mem0: Added support for metadata recording and filtering in Mem0 storage (#563).
- Structured Output: Optimized structured output to preserve ThinkingBlock content after memory compression (#655).

🛠️ Refactoring & Fixes

Concurrency & Stability:
- Shell Tool: Fixed a pipe buffer deadlock issue in ShellCommandTool (#619).
- Tool Execution: Used mergeSequential to strictly preserve the order of parallel tool calls (#652).
- Non-blocking: Standardized non-blocking behavior for all ChatModelBase implementations (#642).
Compatibility:
- JDK 25: Excluded lombok dependency to ensure forward compatibility with JDK 25 (#604).
- WebFlux: Fixed Jakarta Servlet compatibility issues in A2A communication (#587).
Bug Fixes:
- ReAct: Fixed logic ensuring agents only execute tools that lack results when resuming (#650).
- Ollama: Handled edge cases where tool calls have empty parameters (#572).
- Observability: Fixed callTool pipeline observability and tracing (#576).

❤️ New Contributors

@fangxiu-wf made their first contribution in #576
@xingmengxian made their first contribution in #560
@B18150228NJUPT made their first contribution in #587
@ShenJunkun made their first contribution in #503
@feelshana made their first contribution in #606
@mengnankkkk made their first contribution in #591
@Hipple made their first contribution in #652
@tcsp made their first contribution in #645

Full Changelog: v1.0.7...v1.0.8

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

v1.0.8

Choose a tag to compare

Sorry, something went wrong.