Voice Verification

This repository contains a framework for training the speaker verification model described in [2],
with the score normalization post-processing described in [3].

Dependencies

pip install -r requirements.txt

Data Preparation

Generate the train and validation lists. If --split_ratio is -1, 3 files per speaker are held out for validation.

python src/dataprep.py --generate --split_ratio -1 --config yaml/configuration.yaml

In addition to the Python dependencies, wget and ffmpeg must be installed on the system.
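
The per-speaker hold-out rule described above can be sketched as follows. This is a minimal illustration, not the code in src/dataprep.py: the function name split_train_val, the input layout (a dict from speaker ID to file names), the random shuffle, and the reading of a positive ratio as the validation fraction are all assumptions.

```python
import random


def split_train_val(files_by_speaker, split_ratio=-1, seed=0):
    """Split each speaker's files into train and validation lists.

    split_ratio == -1: hold out 3 files per speaker for validation.
    Otherwise: hold out that fraction of each speaker's files
    (assumed meaning; the README only documents the -1 case).
    """
    rng = random.Random(seed)
    train, val = [], []
    for speaker, files in sorted(files_by_speaker.items()):
        files = sorted(files)
        rng.shuffle(files)  # deterministic for a fixed seed
        if split_ratio == -1:
            # Assumption: speakers with fewer than 3 files go entirely to validation.
            n_val = min(3, len(files))
        else:
            n_val = int(len(files) * split_ratio)
        val += [(speaker, f) for f in files[:n_val]]
        train += [(speaker, f) for f in files[n_val:]]
    return train, val
```

Splitting per speaker rather than over the pooled file list keeps every speaker represented in both lists.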

Training

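Select the GPU to use

export CUDA_VISIBLE_DEVICES=5

then set the matching device in args. CUDA renumbers the visible GPUs from 0, so with the setting above physical GPU 5 appears inside the process as device="cuda:0"; device="cuda:5" is valid only when CUDA_VISIBLE_DEVICES is left unset.

Single GPU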

CUDA_VISIBLE_DEVICES=0 python main.py --do_train --config yaml/configuration.yaml
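
How CUDA_VISIBLE_DEVICES changes the device index seen by the process can be sketched with a small helper. The function logical_device is hypothetical and not part of this repository; it assumes the variable holds a comma-separated list of integer GPU indices (CUDA also accepts GPU UUIDs, which this sketch does not handle).

```python
def logical_device(physical_id, cuda_visible_devices):
    """Map a physical GPU index to the device string visible in the process.

    cuda_visible_devices is the value of CUDA_VISIBLE_DEVICES, e.g. "0,1,2".
    Visible GPUs are renumbered from 0 in the order they are listed.
    """
    visible = [int(x) for x in cuda_visible_devices.split(",") if x.strip()]
    if physical_id not in visible:
        raise ValueError(f"GPU {physical_id} is hidden by CUDA_VISIBLE_DEVICES")
    return f"cuda:{visible.index(physical_id)}"


print(logical_device(5, "5"))    # the only visible GPU
print(logical_device(7, "3,7"))  # second entry in the list
```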

Data parallel

CUDA_VISIBLE_DEVICES=0,1,2 python main.py --do_train --config yaml/configuration.yaml --data_parallel

Distributed

CUDA_VISIBLE_DEVICES=0,1,2 python main.py --do_train --config yaml/configuration.yaml --distributed --mixedprec --distributed_backend nccl --port 10001

Note: the best model is saved automatically during training. If initial_model is not provided, the best_state weights are loaded automatically when they are available. Add --augment to train with augmented data.
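
The weight-loading fallback in the note can be pictured roughly as below. This is a sketch under assumptions, not the logic in main.py: the function name resolve_initial_weights, the save_dir argument, and the file name best_state.pt are all hypothetical.

```python
from pathlib import Path


def resolve_initial_weights(initial_model, save_dir, best_name="best_state.pt"):
    """Choose which weights to load before training starts.

    Returns a Path to load, or None to start from scratch.
    """
    if initial_model:
        # An explicitly provided checkpoint takes priority.
        return Path(initial_model)
    candidate = Path(save_dir) / best_name
    # Fall back to the previously saved best weights only if they exist.
    return candidate if candidate.is_file() else None
```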

Inference

Prepare cohorts

CUDA_VISIBLE_DEVICES=0 python main.py --do_infer --prepare --config yaml/configuration.yaml
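
To illustrate what a cohort is used for, here is a minimal sketch of symmetric score normalization (S-norm) over cosine scores, with an optional top-N restriction as in adaptive variants. It is a generic textbook form and may differ from what this repository implements; the function names, the plain-list embeddings, and the eps guard are assumptions.

```python
import math
import statistics


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    return dot / (math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b)))


def s_norm(enroll, test, cohort, top_n=None, eps=1e-8):
    """Normalize the enroll/test score using cohort score statistics."""
    raw = cosine(enroll, test)

    def cohort_stats(emb):
        scores = sorted((cosine(emb, c) for c in cohort), reverse=True)
        if top_n is not None:
            scores = scores[:top_n]  # keep only the closest cohort entries
        return statistics.fmean(scores), statistics.pstdev(scores) + eps

    mu_e, sd_e = cohort_stats(enroll)
    mu_t, sd_t = cohort_stats(test)
    # Average the z-scores computed from the enrollment side and the test side.
    return 0.5 * ((raw - mu_e) / sd_e + (raw - mu_t) / sd_t)
```

Normalizing against cohort statistics puts scores from different speakers and recording conditions on a comparable scale, which is what makes a single global threshold usable.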

Evaluate and tune thresholds

CUDA_VISIBLE_DEVICES=0 python main.py --do_infer --eval --config yaml/configuration.yaml
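
One common way to tune a verification threshold is to pick the operating point where the false-accept and false-reject rates are closest (the equal error rate, EER). The sketch below assumes that criterion; the README does not state which criterion --eval uses, and the function name eer_threshold is hypothetical.

```python
def eer_threshold(scores, labels):
    """Return (threshold, eer) at the point where FAR and FRR are closest.

    labels: 1 for same-speaker trials, 0 for different-speaker trials.
    A trial is accepted when its score is >= threshold.
    """
    n_target = sum(labels)
    n_nontarget = len(labels) - n_target
    best = None
    for thr in sorted(set(scores)):
        false_accepts = sum(1 for s, l in zip(scores, labels) if s >= thr and l == 0)
        false_rejects = sum(1 for s, l in zip(scores, labels) if s < thr and l == 1)
        far = false_accepts / n_nontarget
        frr = false_rejects / n_target
        gap = abs(far - frr)
        if best is None or gap < best[0]:
            best = (gap, thr, (far + frr) / 2)
    return best[1], best[2]
```

This brute-force scan is quadratic in the number of trials; it is meant to show the criterion, not to be fast.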

Run on the test set

CUDA_VISIBLE_DEVICES=0 python main.py --do_infer --test --config yaml/configuration.yaml

Citation

[1] In defence of metric learning for speaker recognition

@inproceedings{chung2020in,
    title={In defence of metric learning for speaker recognition},
    author={Chung, Joon Son and Huh, Jaesung and Mun, Seongkyu and Lee, Minjae and Heo, Hee Soo and Choe, Soyeon and Ham, Chiheon and Jung, Sunghwan and Lee, Bong-Jin and Han, Icksang},
    booktitle={Interspeech},
    year={2020}
}

[2] Clova baseline system for the VoxCeleb Speaker Recognition Challenge 2020

@article{heo2020clova,
    title={Clova baseline system for the {VoxCeleb} Speaker Recognition Challenge 2020},
    author={Heo, Hee Soo and Lee, Bong-Jin and Huh, Jaesung and Chung, Joon Son},
    journal={arXiv preprint arXiv:2009.14153},
    year={2020}
}
