Skip to content

Commit 7a8a2f3

Browse files
committed
feat: add some demo for speech report
1 parent b2d01dd commit 7a8a2f3

File tree

6 files changed

+77
-6
lines changed

6 files changed

+77
-6
lines changed
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.

tts_tech_report/index.html

Lines changed: 77 additions & 6 deletions
Original file line numberDiff line numberDiff line change
@@ -157,6 +157,39 @@ <h3 id="showcase-with-high-versatility">Showcase with High Versatility</h3>
157157
<audio class="audio-md" src="assets/audios/Warm%20and%20Magnetic.mp3" controls></audio>
158158
</td>
159159
</tr>
160+
<tr class="border-bottom-thin">
161+
<td>
162+
An ASMR Whispering Voice with Generated Breathing and Sound Effects
163+
</td>
164+
<td>
165+
<audio class="audio-md" src="assets/audios/Breathy%20ASMR_Sourse.wav" controls></audio>
166+
</td>
167+
<td>
168+
<audio class="audio-md" src="assets/audios/Breathy%20ASMR.MP3" controls></audio>
169+
</td>
170+
</tr>
171+
<tr class="border-bottom-thin">
172+
<td>
173+
A Robotic Voice with Rich Bass Resonance and Spatial Presence
174+
</td>
175+
<td>
176+
<audio class="audio-md" src="assets/audios/Lucky%20Robot_Sourse.wav" controls></audio>
177+
</td>
178+
<td>
179+
<audio class="audio-md" src="assets/audios/Lucky%20Robot.mp3" controls></audio>
180+
</td>
181+
</tr>
182+
<tr class="border-bottom-thin">
183+
<td>
184+
A Sardonic Mature Female Voice
185+
</td>
186+
<td>
187+
<audio class="audio-md" src="assets/audios/Onee-san_Sourse.wav" controls></audio>
188+
</td>
189+
<td>
190+
<audio class="audio-md" src="assets/audios/Onee-san.wav" controls></audio>
191+
</td>
192+
</tr>
160193
</tbody>
161194
</table>
162195
</div>
@@ -884,8 +917,8 @@ <h2 id="text-prompted-voice-generation-demonstrations">Text-Prompted Voice Gener
884917
</tr>
885918
<tr class="border-bottom-thin">
886919
<td>
887-
男性中年声音,说中文,音色浑厚醇厚,带有自然的磁性,语速偏慢,<br>
888-
音量适中,音调偏低沉。声音整体给人沉稳可靠的感觉,<br>
920+
男性中年声音,说中文,音色浑厚醇厚,带有自然的磁性,<br>
921+
语速偏慢,音量适中,音调偏低沉。声音整体给人沉稳可靠的感觉,<br>
889922
在深度访谈场景中表现出专业性和亲和力,音质清晰,吐字规整有力。
890923
</td>
891924
<td>
@@ -900,9 +933,9 @@ <h2 id="text-prompted-voice-generation-demonstrations">Text-Prompted Voice Gener
900933
</tr>
901934
<tr class="border-bottom-thin">
902935
<td>
903-
说中文的女青年,音色偏甜美,语速比较快,说话时带着一种轻快的感觉,<br>
904-
整体音调较高,像是在直播带货,整体氛围比较活跃<br>
905-
声音清晰,听起来很有亲和力。
936+
说中文的女青年,音色偏甜美,语速比较快,<br>
937+
说话时带着一种轻快的感觉,整体音调较高,像是在直播带货,<br>
938+
整体氛围比较活跃,声音清晰,听起来很有亲和力。
906939
</td>
907940
<td>
908941
亲爱的宝宝们,等了好久的神仙面霜终于到货啦!<br>
@@ -928,7 +961,7 @@ <h2 id="text-prompted-voice-generation-demonstrations">Text-Prompted Voice Gener
928961
<audio class="audio-md" src="assets/audios/体育解说男青年.wav" controls></audio>
929962
</td>
930963
</tr>
931-
<tr>
964+
<tr class="border-bottom-thin">
932965
<td>
933966
中国女青年的声音,音色清脆,说话速度偏快,语调活泼,<br>
934967
像是在做游戏直播,声音中带着愉快的感觉,整体音调较高,<br>
@@ -943,6 +976,44 @@ <h2 id="text-prompted-voice-generation-demonstrations">Text-Prompted Voice Gener
943976
<audio class="audio-md" src="assets/audios/游戏主播女青年.wav" controls></audio>
944977
</td>
945978
</tr>
979+
<tr class="border-bottom-thin">
980+
<td>
981+
English-speaking female voice, sounding relatively young,<br>
982+
with a sweet and pleasant tone. Speaking at a moderate pace<br>
983+
with a touch of energy, similar to someone narrating a<br>
984+
beauty/makeup tutorial video. The overall atmosphere is<br>
985+
relaxed and cheerful.
986+
</td>
987+
<td>
988+
Hi everyone! Today I'll be sharing a soft, romantic<br>
989+
makeup look that's perfect for dates. Many of you have <br>
990+
been asking how to apply this eyeshadow naturally - the<br>
991+
key is using gentle techniques. Let's go through the<br>
992+
steps together...
993+
</td>
994+
<td>
995+
<audio class="audio-md" src="assets/audios/美妆女博主.wav" controls></audio>
996+
</td>
997+
</tr>
998+
<tr>
999+
<td>
1000+
English-speaking middle-aged male voice, slightly husky, <br>
1001+
speaking at a moderate-to-slow pace with a deep tone. Like<br>
1002+
someone telling an old story, conveying a nostalgic feeling,<br>
1003+
with a relaxed and composed manner of speaking.
1004+
</td>
1005+
<td>
1006+
That was back in the late 1970s. I remember when our <br>
1007+
village first got electricity - everyone was so excited. <br>
1008+
In theevenings, people would bring their stools and <br>
1009+
gather under the big banyan tree by the village committee <br>
1010+
office to watch movies projected on the wall. Even now, <br>
1011+
thinking back to those moments still fills me with warmth.
1012+
</td>
1013+
<td>
1014+
<audio class="audio-md" src="assets/audios/回忆男中年.wav" controls></audio>
1015+
</td>
1016+
</tr>
9461017
</tbody>
9471018
</table>
9481019
</div>

0 commit comments

Comments
 (0)