Skip to content

Commit 0b925f3

Browse files
committed
feat: update links in tech report
1 parent 01eaf0d commit 0b925f3

File tree

1 file changed

+12
-8
lines changed

1 file changed

+12
-8
lines changed

tts_tech_report/index.html

Lines changed: 12 additions & 8 deletions
Original file line numberDiff line numberDiff line change
@@ -28,9 +28,11 @@ <h4 style="font-size: 1.3rem; line-height: 1; text-align: center;">Intrinsic Zer
2828
Encoder</h4>
2929
<p class="author">
3030
MiniMax Team <span class="date">May 2025</span><br />
31-
<a style="font-size: 1.1rem;" target="_blank"
32-
href="https://minimax-ai.github.io/tts_tech_report/MiniMax_Speech.pdf">[Tech
31+
<a style="font-size: 1.1rem;" target="_blank" href="https://arxiv.org/abs/2505.07916">[Tech
3332
Report]</a>
33+
<a style="font-size: 1.1rem; margin-left: 1rem;" target="_blank"
34+
href="https://huggingface.co/datasets/MiniMaxAI/TTS-Multilingual-Test-Set">[Multilingual Test Set]</a>
35+
<a style="font-size: 1.1rem; margin-left: 1rem;" target="_blank" href="https://github.com/MiniMax-AI">[GitHub]</a>
3436
</p>
3537
</header>
3638

@@ -57,13 +59,16 @@ <h2>Abstract</h2>
5759
control
5860
via LoRA; text to voice (T2V) by synthesizing timbre features directly from text description; and professional
5961
voice
60-
cloning (PVC) by fine-tuning timbre features with additional data. Welcome to visit
61-
<a href="https://www.minimax.io/audio">MiniMax Audio</a> and
62-
explore our powerful TTS features.
62+
cloning (PVC) by fine-tuning timbre features with additional data.
6363
</p>
6464
</div>
6565

6666
<nav role="navigation" class="toc">
67+
<h2>Explore MiniMax-Speech</h2>
68+
<p>Welcome to visit
69+
<a href="https://www.minimax.io/audio">MiniMax Audio</a> and
70+
explore our powerful TTS features.
71+
</p>
6772
<h2>Contents</h2>
6873
<ol>
6974
<li>
@@ -232,9 +237,8 @@ <h2 id="zero-shot-vs-one-shot-demonstrations">Zero-Shot vs. One-Shot Demonstrati
232237
features based
233238
on the text content, whereas OneShot adheres more strictly to the speaker characteristics (prosody, speech
234239
rate,
235-
emotions, etc.) demonstrated in the audio prompt (The additional input that OneShot has compared to ZeroShot,
236-
see
237-
technical report for details).
240+
emotions, etc.). For details of Zero-Shot and One-Shot, refer to the <a
241+
href="https://arxiv.org/abs/2505.07916" target="_blank">technical report</a>.
238242
</p>
239243
<div class="scroll-wrapper" style="margin-top: 2rem;">
240244
<table style="width: 100%;">

0 commit comments

Comments
 (0)