Char2Wav: End-to-End Speech Synthesis (original) (raw)
Related papers
Speech synthesis with neural networks
Orhan Karaali, Ira Gerson, Gerald Corrigan
Arxiv preprint cs/9811031, 1998
Hierarchical RNNs for Waveform-Level Speech Synthesis
2018 IEEE Spoken Language Technology Workshop (SLT), 2018
Journal of Advances in Information Technology
End-to-End Text-to-Speech Synthesis with Unaligned Multiple Language Units Based on Attention
Interspeech 2020, 2020
WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis
Interspeech 2021, 2021
Fast Gated Recurrent Network for Speech Synthesis
IEICE Transactions on Information and Systems
High Quality Streaming Speech Synthesis with Low, Sentence-Length-Independent Latency
Georgia Maniati, Panos Kakoulidis
Interspeech 2020, 2020
Efficient Neural Audio Synthesis
Cornell University - arXiv, 2018
NaturalSpeech: End-to-End Text to Speech Synthesis with Human-Level Quality
Cornell University - arXiv, 2022
RNN-based speech synthesis using a continuous sinusoidal model
2019 International Joint Conference on Neural Networks (IJCNN), 2019
Sequence to Sequence Neural Speech Synthesis with Prosody Modification Capabilities
10th ISCA Speech Synthesis Workshop
SampleRNN: An Unconditional End-to-End Neural Audio Generation Model
ArXiv, 2017
FastSpeech: Fast, Robust and Controllable Text to Speech
2019
2020
Continuous vocoder in feed-forward deep neural network based speech synthesis
2018
2020
End-To-End Speech Synthesis Applied to Brazilian Portuguese
2020
The voice synthesis business: 2022 update
Natural Language Engineering
SOMOS: The Samsung Open MOS Dataset for the Evaluation of Neural Text-to-Speech Synthesis
ArXiv, 2022
Development and Evaluation of Speech Synthesis System Based on Deep Learning Models
Symmetry
Statistical parametric speech synthesis using deep neural networks
2013 IEEE International Conference on Acoustics, Speech and Signal Processing, 2013
UFANS: U-Shaped Fully-Parallel Acoustic Neural Structure for Statistical Parametric Speech Synthesis
PRICAI 2019: Trends in Artificial Intelligence, 2019
Whistler: a trainable text-to-speech system
Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96, 1996
Quasi-fully Convolutional Neural Network with Variational Inference for Speech Synthesis
ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2019
9th ISCA Speech Synthesis Workshop, 2016
Waveglow: A Flow-based Generative Network for Speech Synthesis
ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2019
Voice generation using deep learning
2016
FlowVocoder: A small Footprint Neural Vocoder based Normalizing Flow for Speech Synthesis
Interspeech 2022
On building phonetically and prosodically rich speech corpus for text-to-speech synthesis
2006