Thanks for your great work!
I used the half-body video which is called example_speech. The input length is 10s. However, the result is only 2s.
There only 30 frames generated by the method while processing a half-body video.
Is this caused by some bug or pre-setting?
Thanks!