SVSNet: An End-to-End Speaker Voice Similarity Assessment Model (original) (raw)
Related papers
MOSNet: Deep Learning-Based Objective Assessment for Voice Conversion
Interspeech 2019
The UFRJ Entry for the Voice Conversion Challenge 2020
Joint Workshop for the Blizzard Challenge and Voice Conversion Challenge 2020, 2020
Voice conversion using deep neural networks
Explicit Prosodic Modelling and Deep Speaker Embedding Learning for Non-standard Voice Conversion
arXiv: Audio and Speech Processing, 2020
Voice Conversion using Convolutional Neural Networks
ArXiv, 2016
A Compact Framework for Voice Conversion Using Wavenet Conditioned on Phonetic Posteriorgrams
ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2019
Voice Cloning: a Multi-Speaker Text-to-Speech Synthesis Approach based on Transfer Learning
ArXiv, 2021
Voice Cloning Using Transfer Learning with Audio Samples
UMT Artificial Intelligence Review (UMT-AIR) , 2023
Learning Explicit Prosody Models and Deep Speaker Embeddings for Atypical Voice Conversion
Interspeech 2021, 2021
Continuous vocoder applied in deep neural network based voice conversion
Multimedia Tools and Applications, 2019
Hierarchical Sequence to Sequence Voice Conversion with Limited Data
2019
High quality voice conversion using prosodic and high-resolution spectral features
Multimedia Tools and Applications, 2015
SOMOS: The Samsung Open MOS Dataset for the Evaluation of Neural Text-to-Speech Synthesis
ArXiv, 2022
Neural Vocoding for CycleGAN-Based Voice Conversion
Anais de XXXVIII Simpósio Brasileiro de Telecomunicações e Processamento de Sinais
Voice Conversion Based on Cross-Domain Features Using Variational Auto Encoders
2018 11th International Symposium on Chinese Spoken Language Processing (ISCSLP), 2018
Voice Conversion Across Arbitrary Speakers Based on a Single Target-Speaker Utterance
Interspeech 2018
ASVspoof 2019: a large-scale public database of synthetic, converted and replayed speech
2020
Unsupervised Cross-Domain Singing Voice Conversion
Interspeech 2020, 2020
ASVspoof 2019: A large-scale public database of synthesized, converted and replayed speech
2020
DNN-Based Cross-Lingual Voice Conversion Using Bottleneck Features
Neural Processing Letters, 2019
Speaker verification-derived loss and data augmentation for DNN-based multispeaker speech synthesis
2021 29th European Signal Processing Conference (EUSIPCO), 2021
Unsupervised Cross-Domain Speech-to-Speech Conversion with Time-Frequency Consistency
ArXiv, 2020
Interspeech 2016, 2016
Self supervised learning for robust voice cloning
Panos Kakoulidis, Karolos Nikitaras
Interspeech 2022
Autotuned voice cloning enabling multilingualism
SC-GlowTTS: An Efficient Zero-Shot Multi-Speaker Text-To-Speech Model
Interspeech 2021, 2021
Voice conversion with limited data and limitless data augmentations
arXiv (Cornell University), 2022
Analysis of speaker similarity in the statistical speech synthesis systems using a hybrid approach
Joint Workshop for the Blizzard Challenge and Voice Conversion Challenge 2020, 2020
Adversarially Trained Autoencoders for Parallel-data-free Voice Conversion
ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2019
IEEE Access, 2018
Effects of Sinusoidal Model on Non-Parallel Voice Conversion with Adversarial Learning
Applied Sciences, 2021
Interspeech 2017
Voice Conversion for Whispered Speech Synthesis
IEEE Signal Processing Letters
Voice conversion from non-parallel corpora using variational auto-encoder
2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2016