Skip to content

Commit 412ff2a

Browse files
authored
Merge pull request #2929 from pipecat-ai/mb/cartesia-sonic-3
Update Cartesia's default model to sonic-3
2 parents 82ccc16 + 9ef60bd commit 412ff2a

File tree

2 files changed

+7
-4
lines changed

2 files changed

+7
-4
lines changed

CHANGELOG.md

Lines changed: 3 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -9,6 +9,9 @@ and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0
99

1010
### Changed
1111

12+
- Updated the default model to `sonic-3` for `CartesiaTTSService` and
13+
`CartesiaHttpTTSService`.
14+
1215
- `FunctionFilter` now has a `filter_system_frames` arg, which controls whether
1316
or not SystemFrames are filtered.
1417

src/pipecat/services/cartesia/tts.py

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -119,7 +119,7 @@ def __init__(
119119
voice_id: str,
120120
cartesia_version: str = "2025-04-16",
121121
url: str = "wss://api.cartesia.ai/tts/websocket",
122-
model: str = "sonic-2",
122+
model: str = "sonic-3",
123123
sample_rate: Optional[int] = None,
124124
encoding: str = "pcm_s16le",
125125
container: str = "raw",
@@ -135,7 +135,7 @@ def __init__(
135135
voice_id: ID of the voice to use for synthesis.
136136
cartesia_version: API version string for Cartesia service.
137137
url: WebSocket URL for Cartesia TTS API.
138-
model: TTS model to use (e.g., "sonic-2").
138+
model: TTS model to use (e.g., "sonic-3").
139139
sample_rate: Audio sample rate. If None, uses default.
140140
encoding: Audio encoding format.
141141
container: Audio container format.
@@ -498,7 +498,7 @@ def __init__(
498498
*,
499499
api_key: str,
500500
voice_id: str,
501-
model: str = "sonic-2",
501+
model: str = "sonic-3",
502502
base_url: str = "https://api.cartesia.ai",
503503
cartesia_version: str = "2024-11-13",
504504
sample_rate: Optional[int] = None,
@@ -512,7 +512,7 @@ def __init__(
512512
Args:
513513
api_key: Cartesia API key for authentication.
514514
voice_id: ID of the voice to use for synthesis.
515-
model: TTS model to use (e.g., "sonic-2").
515+
model: TTS model to use (e.g., "sonic-3").
516516
base_url: Base URL for Cartesia HTTP API.
517517
cartesia_version: API version string for Cartesia service.
518518
sample_rate: Audio sample rate. If None, uses default.

0 commit comments

Comments
 (0)