-
Notifications
You must be signed in to change notification settings - Fork 0
/
storyttsdemo.html
98 lines (70 loc) · 4.27 KB
/
storyttsdemo.html
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
<html><head><meta http-equiv="Content-Type" content="text/html; charset=UTF-8">
<title>TTS</title>
</head>
<body data-new-gr-c-s-check-loaded="14.1160.0" data-gr-ext-installed="">
<h2>Audio samples for "Emotion-Coherent Speech Data Augmentation and Self-Supervised Contrastive Style Training for Enhancing Kids's Story Speech Synthesis"</h1>
<h2>Audio samples</h2>
<p>
<p>
<b>Author: Raymond Chung</b><br>
<hr>
<h4>Kid-storybook Page-level Storytelling utterances</h4>
<table border="1">
<tbody><tr>
<th style="width: 230px;"></th><th>Ground truth utterances</th><th>M1: <br>1-sentence utterances</th><th>M2: <br>1-sentence utterances <br>+<br> 2-sentence utterances</th><th>M4: <br>1-sentence utterances <br>+<br> 2-sentence utterances of augmented data <br>+<br> contrastive loss</th>
</tr>
<tr>
<td>Sample 1 <br> "Build strong, safe houses!"
So they packed their bags and waved goodbye.
"Watch out for the Big Bad Wolf," called Mother Pig.
"We will!"</td>
<td><audio controls="" style="width: 300px;"><source src="./audio/thethreelittlepigs_3.wav"></audio></td>
<td><audio controls="" style="width: 300px;"><source src="./audio/a_thethreelittlepigs_3.wav"></audio></td>
<td><audio controls="" style="width: 300px;"><source src="./audio/b_thethreelittlepigs_3.wav"></audio></td>
<td><audio controls="" style="width: 300px;"><source src="./audio/c_thethreelittlepigs_3.wav"></audio></td>
</tr>
<tr>
<td>Sample 2<br> ...they met a man selling straw.
"Can I buy some straw?" asked Pinky Pig.
"I'm going to build a house."</td>
<td><audio controls="" style="width: 300px;"><source src="./audio/thethreelittlepigs_5.wav"></audio></td>
<td><audio controls="" style="width: 300px;"><source src="./audio/a_thethreelittlepigs_5.wav"></audio></td>
<td><audio controls="" style="width: 300px;"><source src="./audio/b_thethreelittlepigs_5.wav"></audio></td>
<td><audio controls="" style="width: 300px;"><source src="./audio/c_thethreelittlepigs_5.wav"></audio></td>
</tr>
<tr>
<td>Sample 3<br>The next day, the Big Bad Wolf went to the straw house.
"Little pig, little pig, let me come in," he called.
"No!" cried Pinky Pig.
"Not by the hair on my chinny-chin-chin." </td>
<td><audio controls="" style="width: 300px;"><source src="./audio/thethreelittlepigs_14.wav"></audio></td>
<td><audio controls="" style="width: 300px;"><source src="./audio/a_thethreelittlepigs_14.wav"></audio></td>
<td><audio controls="" style="width: 300px;"><source src="./audio/b_thethreelittlepigs_14.wav"></audio></td>
<td><audio controls="" style="width: 300px;"><source src="./audio/c_thethreelittlepigs_14.wav"></audio></td>
</tr>
</tbody></table>
<p></p>
<h4>Longer example</h4>
<p> This one-minute kid storytelling speech was generated by the prosposed TTS model (M4) in one step and with each sentence conditioned by its predicted speaking style.</p>
<p> The story is from <a href="https://storyplanets.com/the-jackal-and-the-turtle/">this page</a>.</p>
<audio controls="" style="width: 300px;"><source src="./audio/bonus_story.wav"></audio>
</body><grammarly-desktop-integration data-grammarly-shadow-root="true"><template shadowrootmode="open"><style>
div.grammarly-desktop-integration {
position: absolute;
width: 1px;
height: 1px;
padding: 0;
margin: -1px;
overflow: hidden;
clip: rect(0, 0, 0, 0);
white-space: nowrap;
border: 0;
-moz-user-select: none;
-webkit-user-select: none;
-ms-user-select:none;
user-select:none;
}
div.grammarly-desktop-integration:before {
content: attr(data-content);
}
</style><div aria-label="grammarly-integration" role="group" tabindex="-1" class="grammarly-desktop-integration" data-content="{"mode":"full","isActive":true,"isUserDisabled":false}"></div></template></grammarly-desktop-integration></html>