For more details on our work, please refer to the paper.
Our implementation is available in the GitHub repository.
Ground Truth
WaveNet (MoL)
WaveGlow
MelGAN
HiFi-GAN V1 (ours)
HiFi-GAN V2 (ours)
HiFi-GAN V3 (ours)

Ground Truth
WaveNet (MoL)
WaveGlow
MelGAN
HiFi-GAN V1 (ours)
HiFi-GAN V2 (ours)
HiFi-GAN V3 (ours)

Ground Truth
WaveGlow (fine-tuned)
HiFi-GAN V1 (ours) (fine-tuned)
HiFi-GAN V2 (ours) (fine-tuned)
HiFi-GAN V3 (ours) (fine-tuned)
WaveGlow (w/o fine-tuning)
HiFi-GAN V1 (ours) (w/o fine-tuning)
HiFi-GAN V2 (ours) (w/o fine-tuning)
HiFi-GAN V3 (ours) (w/o fine-tuning)

baseline
w/o MPD
w/o MSD
w/o MRF
w/o Mel-Spectrogram Loss
MPD p=[2,4,8,16,32]

MelGAN with MPD
MelGAN

HiFi-GAN V1 (500k step)