For more details of our work, please refer to the paper.
Our implementation is available in the github repository.
Contents
| Ground Truth | |||||
| WaveNet (MoL) | |||||
| WaveGlow | |||||
| MelGAN | |||||
| HiFi-GAN V1 (ours) | |||||
| HiFi-GAN V2 (ours) | |||||
| HiFi-GAN V3 (ours) |
| Ground Truth | |||||
| WaveNet (MoL) | |||||
| WaveGlow | |||||
| MelGAN | |||||
| HiFi-GAN V1 (ours) | |||||
| HiFi-GAN V2 (ours) | |||||
| HiFi-GAN V3 (ours) |
| Ground Truth | |||||
| WaveGlow (fine-tuned) | |||||
| HiFi-GAN V1 (ours) (fine-tuned) | |||||
| HiFi-GAN V2 (ours) (fine-tuned) | |||||
| HiFi-GAN V3 (ours) (fine-tuned) | |||||
| WaveGlow (w/o fine-tuning) | |||||
| HiFi-GAN V1 (ours) (w/o fine-tuning) | |||||
| HiFi-GAN V2 (ours) (w/o fine-tuning) | |||||
| HiFi-GAN V3 (ours) (w/o fine-tuning) | |||||
| baseline | |||||
| w/o MPD | |||||
| w/o MSD | |||||
| w/o MRF | |||||
| w/o Mel-Spectrogram Loss | |||||
| MPD p=[2,4,8,16,32] |
| MelGAN with MPD | |||||
| MelGAN |
| HiFi-GAN V1 (500k step) |