Skip to content

Commit b2d01dd

Browse files
committed
feat: update speech tech report
1 parent 987176e commit b2d01dd

File tree

3 files changed

+14
-14
lines changed

3 files changed

+14
-14
lines changed

tts_tech_report/MiniMax_Speech.pdf

33.3 KB
Binary file not shown.

tts_tech_report/index.html

Lines changed: 11 additions & 11 deletions
Original file line numberDiff line numberDiff line change
@@ -521,16 +521,16 @@ <h2 id="multilingual-and-cross-lingual-capabilities-demonstrations">Multilingual
521521
<tbody>
522522
<tr class="border-bottom-thin">
523523
<th scope="col">Original Language</th>
524-
<th scope="col">Target Language</th>
524+
<th scope="col">Mixed Language</th>
525525
<th scope="col">Source Audio</th>
526526
<th scope="col">Text</th>
527527
<th scope="col">Minimax<br>Speech_02_HD</th>
528528
<th scope="col">ElevenLabs<br>Multilingual_v2</th>
529529
<th scope="col">OpenAI<br>TTS_1_HD<br>(*not cloned voice)</th>
530530
</tr>
531531
<tr class="border-bottom-thin">
532-
<th>English</th>
533-
<th>Mandarin</th>
532+
<td>English</td>
533+
<td>English + Mandarin</td>
534534
<td>
535535
<audio class="audio-sm" src="assets/audios/Wong_Sourse.mp3" controls></audio>
536536
</td>
@@ -552,8 +552,8 @@ <h2 id="multilingual-and-cross-lingual-capabilities-demonstrations">Multilingual
552552
</td>
553553
</tr>
554554
<tr class="border-bottom-thin">
555-
<th>Mandarin</th>
556-
<th>Cantonese</th>
555+
<td>Mandarin</td>
556+
<td>Mandarin + Cantonese</td>
557557
<td>
558558
<audio class="audio-sm" src="assets/audios/ShiBanYu_Sourse.mp3" controls></audio>
559559
</td>
@@ -573,8 +573,8 @@ <h2 id="multilingual-and-cross-lingual-capabilities-demonstrations">Multilingual
573573
</td>
574574
</tr>
575575
<tr class="border-bottom-thin">
576-
<th>Mandarin</th>
577-
<th>English</th>
576+
<td>Mandarin</td>
577+
<td>Mandarin + English</td>
578578
<td>
579579
<audio class="audio-sm" src="assets/audios/ShuanQ_Sourse.mp3" controls></audio>
580580
</td>
@@ -594,8 +594,8 @@ <h2 id="multilingual-and-cross-lingual-capabilities-demonstrations">Multilingual
594594
</td>
595595
</tr>
596596
<tr class="border-bottom-thin">
597-
<th>English</th>
598-
<th>Spanish</th>
597+
<td>English</td>
598+
<td>English + Spanish</td>
599599
<td>
600600
<audio class="audio-sm" src="assets/audios/CoCo_Sourse.mp3" controls></audio>
601601
</td>
@@ -615,8 +615,8 @@ <h2 id="multilingual-and-cross-lingual-capabilities-demonstrations">Multilingual
615615
</td>
616616
</tr>
617617
<tr class="border-bottom-thin">
618-
<th>Japanese</th>
619-
<th>Korean</th>
618+
<td>Japanese</td>
619+
<td>Japanese + Korean</td>
620620
<td>
621621
<audio class="audio-sm" src="assets/audios/Powerful_Girl_Sourse.mp3" controls></audio>
622622
</td>

tts_tech_report/style.css

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -285,15 +285,15 @@ audio {
285285
}
286286

287287
.audio-sm {
288-
width: 190px;
288+
min-width: 190px;
289289
}
290290

291291
.audio-md {
292-
width: 220px;
292+
min-width: 220px;
293293
}
294294

295295
.audio-lg {
296-
width: 300px;
296+
min-width: 300px;
297297
}
298298

299299

0 commit comments

Comments
 (0)