Skip to content

Commit 9121dfc

Browse files
authored
Merge pull request #1752 from yt605155624/fix_wavernn
[TTS]fix wavernn white noise bug for paddle develop(2.3)
2 parents 32a75cd + 08a4673 commit 9121dfc

File tree

3 files changed

+6
-3
lines changed

3 files changed

+6
-3
lines changed

examples/csmsc/tts3/local/synthesize_e2e.sh

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -109,6 +109,6 @@ if [ ${stage} -le 4 ] && [ ${stop_stage} -ge 4 ]; then
109109
--lang=zh \
110110
--text=${BIN_DIR}/../sentences.txt \
111111
--output_dir=${train_output_path}/test_e2e \
112-
--phones_dict=dump/phone_id_map.txt #\
113-
# --inference_dir=${train_output_path}/inference
112+
--phones_dict=dump/phone_id_map.txt \
113+
--inference_dir=${train_output_path}/inference
114114
fi

examples/csmsc/voc6/README.md

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -114,6 +114,7 @@ The pretrained model can be downloaded here:
114114

115115
The static model can be downloaded here:
116116
- [wavernn_csmsc_static_0.2.0.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/wavernn/wavernn_csmsc_static_0.2.0.zip)
117+
- [wavernn_csmsc_static_1.0.0.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/wavernn/wavernn_csmsc_static_1.0.0.zip) (fix bug for paddle 2.3)
117118

118119
Model | Step | eval/loss
119120
:-------------:|:------------:| :------------:

paddlespeech/t2s/models/wavernn/wavernn.py

Lines changed: 3 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -360,7 +360,9 @@ def generate(self,
360360
x = sample.transpose([1, 0, 2])
361361

362362
elif self.mode == 'RAW':
363-
posterior = F.softmax(logits, axis=1)
363+
# fix bug for paddle 2.3, see https://github.com/PaddlePaddle/Paddle/commit/01f606b4f1ca3e184a59111084ed460ee0798a5a
364+
# posterior = F.softmax(logits, axis=1)
365+
posterior = logits
364366
distrib = paddle.distribution.Categorical(posterior)
365367
# corresponding operate [np.floor((fx + 1) / 2 * mu + 0.5)] in enocde_mu_law
366368
# distrib.sample([1])[0].cast('float32'): [0, 2**bits-1]

0 commit comments

Comments
 (0)