- [x] Add Wav2vec model as STT (ASR and audio classification)
- [x] The torch mel seems to work better than the current one
- [x] Fix model skipping the first few words