Wasserstein GAN and Waveform Loss-Based Acoustic Model Training for Multi-Speaker Text-to-Speech Synthesis Systems Using a WaveNet Vocoder
Related papers
Parallel WaveNet: Fast High-Fidelity Speech Synthesis
Cornell University - arXiv, 2017
Procedia Computer Science, 2021
Reducing over-smoothness in speech synthesis using Generative Adversarial Networks
2019 International Multi-Conference on Engineering, Computer and Information Sciences (SIBIRCON)
WaveNet: A Generative Model for Raw Audio
WaveGlow: A Flow-based Generative Network for Speech Synthesis
ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2019
The Huya Multi-Speaker and Multi-Style Speech Synthesis System for M2voc Challenge 2020
ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2021
WaveNet-Based Speech Synthesis Applied to Czech
Text, Speech, and Dialogue, 2018
Multi-speaker TTS with Deep Learning
2020
Speaker verification-derived loss and data augmentation for DNN-based multispeaker speech synthesis
2021 29th European Signal Processing Conference (EUSIPCO), 2021
Efficient Neural Audio Synthesis
Cornell University - arXiv, 2018
Natural Human Voice Text To Speech Engine (Linnet), for the degree of MSc Applied AI & Data Science
2022
Quasi-fully Convolutional Neural Network with Variational Inference for Speech Synthesis
ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2019
Expediting TTS Synthesis with Adversarial Vocoding
arXiv (Cornell University), 2019
The voice synthesis business: 2022 update
Natural Language Engineering
WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis
Interspeech 2021, 2021
Universal Neural Vocoding with Parallel WaveNet
ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Waveform-Based Speaker Representations for Speech Synthesis
Interspeech 2018
Introducing Prosodic Speaker Identity for a Better Expressive Speech Synthesis Control
10th International Conference on Speech Prosody 2020, 2020
MFCCGAN: A Novel MFCC-Based Speech Synthesizer Using Adversarial Learning
ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Voice generation using deep learning
2016
arXiv (Cornell University), 2023
Integrated speaker-adaptive speech synthesis
2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU)
The Tencent speech synthesis system for Blizzard Challenge 2020
Joint Workshop for the Blizzard Challenge and Voice Conversion Challenge 2020, 2020
Hierarchical RNNs for Waveform-Level Speech Synthesis
2018 IEEE Spoken Language Technology Workshop (SLT), 2018
Interspeech 2017
Text to Speech Synthesis in Celebrity’s Voice
SAMRIDDHI: A Journal of Physical Sciences, Engineering and Technology, 2020
A Comparison Of Deep Learning MOS Predictors For Speech Synthesis Quality
2023 34th Irish Signals and Systems Conference (ISSC)
A Compact Framework for Voice Conversion Using Wavenet Conditioned on Phonetic Posteriorgrams
ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2019
FlowVocoder: A small Footprint Neural Vocoder based Normalizing Flow for Speech Synthesis
Interspeech 2022
SC-GlowTTS: An Efficient Zero-Shot Multi-Speaker Text-To-Speech Model
Interspeech 2021, 2021
A comparison of Vietnamese Statistical Parametric Speech Synthesis Systems
2020 12th International Conference on Knowledge and Systems Engineering (KSE), 2020