HighCWu
Contributor

@HighCWu HighCWu commented Feb 1, 2023

PR types

Bug fixes

PR changes

Models

Describe

When I used WaveNetDenoiser to train diffusion-related tasks, I found that the loss did not decrease. After comparing the implementation against diffsinger again, I modified the initialization of the last convolution. In addition, stack does not mean the same thing as dilation_by_cycle_length in diffsinger: stack is actually equal to the number of layers divided by dilation_by_cycle_length.
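
The relationship between the two parameters can be sketched as follows. This is only an illustration, assuming the common WaveNet convention in which the dilation doubles at every layer within a cycle and resets to 1 at the start of the next cycle. The function names are hypothetical and are not part of the PaddleSpeech or diffsinger code.

```python
from typing import List


def dilations_from_cycle_length(layers: int, cycle_length: int) -> List[int]:
    """Per-layer dilations when the config gives the length of one dilation cycle.

    Assumed convention: the dilation doubles at each layer and resets every
    `cycle_length` layers.
    """
    return [2 ** (i % cycle_length) for i in range(layers)]


def dilations_from_stacks(layers: int, stacks: int) -> List[int]:
    """Per-layer dilations when the config gives the number of stacks (cycles)."""
    if layers % stacks != 0:
        raise ValueError("layers must be divisible by stacks")
    layers_per_stack = layers // stacks
    return [2 ** (i % layers_per_stack) for i in range(layers)]


def stacks_for_cycle_length(layers: int, cycle_length: int) -> int:
    """Convert a cycle length into the equivalent number of stacks."""
    if layers % cycle_length != 0:
        raise ValueError("layers must be divisible by cycle_length")
    return layers // cycle_length


if __name__ == "__main__":
    # Hypothetical values chosen only for the demonstration.
    layers, cycle_length = 20, 4
    stacks = stacks_for_cycle_length(layers, cycle_length)
    same = dilations_from_stacks(layers, stacks) == dilations_from_cycle_length(
        layers, cycle_length
    )
    print(stacks, same)
```

Under this assumption, passing the cycle length where the number of stacks is expected (or the reverse) yields a different dilation pattern unless the two values happen to coincide, which is why the conversion matters when porting a configuration.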

@mergify mergify bot added the T2S label Feb 1, 2023
@yt605155624 yt605155624 added this to the r1.4.0 milestone Feb 1, 2023
@yt605155624 yt605155624 requested a review from lym0302 February 1, 2023 10:21
Collaborator

@yt605155624 yt605155624 left a comment

LGTM

@yt605155624 yt605155624 merged commit a55fd2e into PaddlePaddle:develop Feb 2, 2023
@yt605155624 yt605155624 mentioned this pull request Feb 8, 2023
Projects
Status: Done
3 participants