[ICASSP 2025] Voice Conversion for Low-Resource Languages via Knowledge Transfer and Domain-Adversarial Training
Link to paper and pretrained model
- Download WavLM-Large and put it under directory 'wavlm/', download checkpoint and put it under directory checkpoint/
- Install requirements: pip install -r requirements.txt
python3 convert.py --hpfile ./checkpoint/vi_200_10_sr/config.json --ptfile ./checkpoint/vi_200_10_sr/G_266000.pth
- https://github.com/OlaWod/FreeVC
- https://github.com/KrishnaDN/Attentive-Statistics-Pooling-for-Deep-Speaker-Embedding
@INPROCEEDINGS{10889083,
author={Tu, Huu Tuong and Thanh Long, Luong and Huan, Vu and Phuong Thao, Nguyen Thi and Van Thang, Nguyen and Cuong, Nguyen Tien and Thi Thu Trang, Nguyen},
booktitle={ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)},
title={Voice Conversion for Low-Resource Languages via Knowledge Transfer and Domain-Adversarial Training},
year={2025},
pages={1-5},
doi={10.1109/ICASSP49660.2025.10889083}}