WER is high：synthesized audio words have a high error rate #178

Open

Open

WER is high：synthesized audio words have a high error rate#178

opened

on Jul 11, 2024

I use Whisper's base model to calculate the word error rate for the audio synthesized by Vallex, but the word error rate is as high as 0.22, is this normal?

Metadata

Assignees

No one assigned

Labels

No labels

No labels

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests