Wasserstein GAN and Waveform Loss-Based Acoustic Model Training for Multi-Speaker Text-to-Speech Synthesis Systems Using a WaveNet Vocoder (original) (raw)

Parallel WaveNet: Fast High-Fidelity Speech Synthesis

Cornell University - arXiv, 2017

An objective evaluation of the effects of recording conditions and speaker characteristics in multi-speaker deep neural speech synthesis

Mircea Giurgiu

Procedia Computer Science, 2021

View PDFchevron_right

Reducing over-smoothness in speech synthesis using Generative Adversarial Networks

Evgeniy N Pavlovskiy

2019 International Multi-Conference on Engineering, Computer and Information Sciences (SIBIRCON)

View PDFchevron_right

WAVENET: A GENERATIVE MODEL FOR RAW AUDIO

Mr Swan

View PDFchevron_right

Waveglow: A Flow-based Generative Network for Speech Synthesis

Rafael Valle

ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2019

View PDFchevron_right

The Huya Multi-Speaker and Multi-Style Speech Synthesis System for M2voc Challenge 2020

deyi tuo

ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2021

View PDFchevron_right

WaveNet-Based Speech Synthesis Applied to Czech

Daniel Tihelka

Text, Speech, and Dialogue, 2018

View PDFchevron_right

Multi-speaker TTS with Deep Learning

Ivan Carapinha

2020

View PDFchevron_right

Speaker verification-derived loss and data augmentation for DNN-based multispeaker speech synthesis

Mircea Giurgiu

2021 29th European Signal Processing Conference (EUSIPCO), 2021

View PDFchevron_right

Efficient Neural Audio Synthesis

Edward Lockhart

Cornell University - arXiv, 2018

View PDFchevron_right

Natural Human Voice Text To Speech Engine (Linnet) FOR THE DEGREE OF MSC APPLIED AI & DATA SCIENCE

Oriyomi Adepitan

2022

View PDFchevron_right

Quasi-fully Convolutional Neural Network with Variational Inference for Speech Synthesis

deyi tuo

ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2019

View PDFchevron_right

Expediting TTS Synthesis with Adversarial Vocoding

Shlomo Dubnov

arXiv (Cornell University), 2019

View PDFchevron_right

The voice synthesis business: 2022 update

Robert Dale

Natural Language Engineering

View PDFchevron_right

WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis

Mohammad norouzi

Interspeech 2021, 2021

View PDFchevron_right

Universal Neural Vocoding with Parallel Wavenet

Daniel Korzekwa, Adam Gabrys

ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

View PDFchevron_right

Waveform-Based Speaker Representations for Speech Synthesis

Moquan Wan

Interspeech 2018

View PDFchevron_right

Introducing Prosodic Speaker Identity for a Better Expressive Speech Synthesis Control

Aghilas SINI

10th International Conference on Speech Prosody 2020, 2020

View PDFchevron_right

MFCCGAN: A Novel MFCC-Based Speech Synthesizer Using Adversarial Learning

Majid Behdad

ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

View PDFchevron_right

Voice generation using deep learning

Gonzalo Gómez Sánchez

2016

View PDFchevron_right

Style Description based Text-to-Speech with Conditional Prosodic Layer Normalization based Diffusion GAN

Ankur Narang

arXiv (Cornell University), 2023

View PDFchevron_right

Integrated speaker-adaptive speech synthesis

Moquan Wan

2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU)

View PDFchevron_right

The Tencent speech synthesis system for Blizzard Challenge 2020

Qiao Tian

Joint Workshop for the Blizzard Challenge and Voice Conversion Challenge 2020, 2020

View PDFchevron_right

Hierarchical RNNs for Waveform-Level Speech Synthesis

Moquan Wan

2018 IEEE Spoken Language Technology Workshop (SLT), 2018

View PDFchevron_right

Voice Conversion from Unaligned Corpora Using Variational Autoencoding Wasserstein Generative Adversarial Networks

Hsin-min Wang

Interspeech 2017

View PDFchevron_right

Text to Speech Synthesis in Celebrity’s Voice

Daulappa Bhalke

SAMRIDDHI : A Journal of Physical Sciences, Engineering and Technology, 2020

View PDFchevron_right

A Comparison Of Deep Learning MOS Predictors For Speech Synthesis Quality

Emmanouil Benetos

2023 34th Irish Signals and Systems Conference (ISSC)

View PDFchevron_right

A Compact Framework for Voice Conversion Using Wavenet Conditioned on Phonetic Posteriorgrams

Helen Meng

ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2019

View PDFchevron_right

FlowVocoder: A small Footprint Neural Vocoder based Normalizing Flow for Speech Synthesis

Viet Tran

Interspeech 2022

View PDFchevron_right

SC-GlowTTS: An Efficient Zero-Shot Multi-Speaker Text-To-Speech Model

Sandra Maria Aluísio

Interspeech 2021, 2021

View PDFchevron_right

A comparison of Vietnamese Statistical Parametric Speech Synthesis Systems

Phan Hung Kinh

2020 12th International Conference on Knowledge and Systems Engineering (KSE), 2020

View PDFchevron_right

Wasserstein GAN and Waveform Loss-Based Acoustic Model Training for Multi-Speaker Text-to-Speech Synthesis Systems Using a WaveNet Vocoder (original) (raw)

Related papers