@@ -74,44 +74,4 @@ Also, there is a `metadata.jsonl` in each subfolder. It is a table-like file tha
### Training details can refer to the script of [examples/aishell3/tts3](../../aishell3/tts3).

- ## Pretrained Model(Waiting========)
- Pretrained FastSpeech2 model with no silence in the edge of audios:
- - [fastspeech2_aishell3_ckpt_1.1.0.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/fastspeech2/fastspeech2_aishell3_ckpt_1.1.0.zip)
- - [fastspeech2_conformer_aishell3_ckpt_0.2.0.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/fastspeech2/fastspeech2_conformer_aishell3_ckpt_0.2.0.zip) (Thanks for [@awmmmm](https://github.com/awmmmm)'s contribution)
-
-
- FastSpeech2 checkpoint contains files listed below.
-
- ```text
- fastspeech2_aishell3_ckpt_1.1.0
- ├── default.yaml # default config used to train fastspeech2
- ├── energy_stats.npy # statistics used to normalize energy when training fastspeech2
- ├── phone_id_map.txt # phone vocabulary file when training fastspeech2
- ├── pitch_stats.npy # statistics used to normalize pitch when training fastspeech2
- ├── snapshot_iter_96400.pdz # model parameters and optimizer states
- ├── speaker_id_map.txt # speaker id map file when training a multi-speaker fastspeech2
- └── speech_stats.npy # statistics used to normalize spectrogram when training fastspeech2
- ```
- You can use the following scripts to synthesize for `${BIN_DIR}/../sentences.txt` using pretrained fastspeech2 and parallel wavegan models.
- ```bash
- source path.sh
-
- FLAGS_allocator_strategy=naive_best_fit \
- FLAGS_fraction_of_gpu_memory_to_use=0.01 \
- python3 ${BIN_DIR}/../synthesize_e2e.py \
- --am=fastspeech2_aishell3 \
- --am_config=fastspeech2_aishell3_ckpt_1.1.0/default.yaml \
- --am_ckpt=fastspeech2_aishell3_ckpt_1.1.0/snapshot_iter_96400.pdz \
- --am_stat=fastspeech2_aishell3_ckpt_1.1.0/speech_stats.npy \
- --voc=pwgan_aishell3 \
- --voc_config=pwg_aishell3_ckpt_0.5/default.yaml \
- --voc_ckpt=pwg_aishell3_ckpt_0.5/snapshot_iter_1000000.pdz \
- --voc_stat=pwg_aishell3_ckpt_0.5/feats_stats.npy \
- --lang=zh \
- --text=${BIN_DIR}/../sentences.txt \
- --output_dir=exp/default/test_e2e \
- --phones_dict=fastspeech2_aishell3_ckpt_1.1.0/phone_id_map.txt \
- --speaker_dict=fastspeech2_aishell3_ckpt_1.1.0/speaker_id_map.txt \
- --spk_id=0 \
- --inference_dir=exp/default/inference
- ```
+ ## Pretrained Model