Natural speech

No. Source speaker Target speaker
1
2

Converted speech

Cross-lingual VC condition No. Samples
JPN2ENG 1
2
ENG2ENG 1
2
JPNENG2ENG 1
2