Release DualCodec and DualCodec-VALLE. Remove previous valle_v2 model in favor of new models. #443

Merged: jiaqili3 merged 9 commits into open-mmlab:main from jiaqili3:main on May 26, 2025
jiaqili3 (Collaborator) commented May 26, 2025

✨ Description

This PR releases DualCodec.
DualCodec is a low-frame-rate (12.5 Hz or 25 Hz), semantically enhanced (with SSL features) neural audio codec designed to extract discrete tokens for efficient speech generation.
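
As a rough illustration of what the low frame rate means for sequence length, the sketch below counts codec frames for a given audio duration at the two rates mentioned above. The helper `num_frames` is hypothetical and not part of the DualCodec code; rounding up to a whole frame is an assumption, and the number of tokens per frame depends on the codec configuration, which is not stated here.

```python
import math


def num_frames(duration_s: float, frame_rate_hz: float) -> int:
    """Hypothetical helper: frames needed to cover duration_s seconds.

    Assumes a partial trailing frame is rounded up to a whole frame.
    """
    return math.ceil(duration_s * frame_rate_hz)


# Compare the two frame rates named in the description for 10 s of audio.
for rate in (12.5, 25.0):
    print(f"{rate} Hz: {num_frames(10.0, rate)} frames for 10 s of audio")
```

Halving the frame rate halves the number of frames a downstream generation model has to produce for the same audio length.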

You can check out its demo page. arXiv: http://arxiv.org/abs/2505.13000

Meanwhile, the valle_v2 models and folders are removed. We recommend using the latest models instead.

jiaqili3 requested review from RMSnow and yuantuo666 on May 26, 2025 08:46
jiaqili3 requested a review from viewfinder-annn on May 26, 2025 08:51

viewfinder-annn (Collaborator) left a comment

LGTM

jiaqili3 merged commit c532b04 into open-mmlab:main on May 26, 2025
1 check passed
KhryptorGraphics pushed a commit to KhryptorGraphics/Amphion that referenced this pull request on Feb 2, 2026
… in favor of new models. (open-mmlab#443)

* add dualcodec code

* update amphion readme of dualcodec

* update amphion readme

* readme

* remove valle_v2 infavor of dualcodec-valle

* readme

* format codes

* remove chinese characters in dualcodec

* add amphion license