SVSNet: An End-to-End Speaker Voice Similarity Assessment Model (original) (raw)

MOSNet: Deep Learning-Based Objective Assessment for Voice Conversion

Yu Tsao

Interspeech 2019

View PDFchevron_right

The UFRJ Entry for the Voice Conversion Challenge 2020

Luiz Biscainho

Joint Workshop for the Blizzard Challenge and Voice Conversion Challenge 2020, 2020

View PDFchevron_right

Voice conversion using deep neural networks

Hy Nguyen

View PDFchevron_right

Explicit Prosodic Modelling and Deep Speaker Embedding Learning for Non-standard Voice Conversion

Helen Meng

arXiv: Audio and Speech Processing, 2020

View PDFchevron_right

Voice Conversion using Convolutional Neural Networks

Shariq Mobin

ArXiv, 2016

View PDFchevron_right

A Compact Framework for Voice Conversion Using Wavenet Conditioned on Phonetic Posteriorgrams

Helen Meng

ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2019

View PDFchevron_right

Voice Cloning: a Multi-Speaker Text-to-Speech Synthesis Approach based on Transfer Learning

Vincent Pollet

ArXiv, 2021

View PDFchevron_right

Voice Cloning Using Transfer Learning with Audio Samples

Usman Nawaz, Usman Ahmed Raza

UMT Artificial Intelligence Review (UMT-AIR) , 2023

View PDFchevron_right

Learning Explicit Prosody Models and Deep Speaker Embeddings for Atypical Voice Conversion

Helen Meng

Interspeech 2021, 2021

View PDFchevron_right

Continuous vocoder applied in deep neural network based voice conversion

Géza Németh

Multimedia Tools and Applications, 2019

View PDFchevron_right

Hierarchical Sequence to Sequence Voice Conversion with Limited Data

Praveen Narayanan

2019

View PDFchevron_right

High quality voice conversion using prosodic and high-resolution spectral features

Minghui Dong

Multimedia Tools and Applications, 2015

View PDFchevron_right

SOMOS: The Samsung Open MOS Dataset for the Evaluation of Neural Text-to-Speech Synthesis

Georgia Maniati

ArXiv, 2022

View PDFchevron_right

Neural Vocoding for CycleGAN-Based Voice Conversion

Luiz W P Biscainho

Anais de XXXVIII Simpósio Brasileiro de Telecomunicações e Processamento de Sinais

View PDFchevron_right

Voice Conversion Based on Cross-Domain Features Using Variational Auto Encoders

Yu Tsao

2018 11th International Symposium on Chinese Spoken Language Processing (ISCSLP), 2018

View PDFchevron_right

Voice Conversion Across Arbitrary Speakers Based on a Single Target-Speaker Utterance

Jinghua Zhong

Interspeech 2018

View PDFchevron_right

ASVspoof 2019: a large-scale public database of synthetic, converted and replayed speech

Yu Tsao

2020

View PDFchevron_right

Unsupervised Cross-Domain Singing Voice Conversion

Yossi Adi

Interspeech 2020, 2020

View PDFchevron_right

ASVspoof 2019: A large-scale public database of synthesized, converted and replayed speech

Tomi Kinnunen

2020

View PDFchevron_right

DNN-Based Cross-Lingual Voice Conversion Using Bottleneck Features

M Kiran Reddy

Neural Processing Letters, 2019

View PDFchevron_right

Speaker verification-derived loss and data augmentation for DNN-based multispeaker speech synthesis

Mircea Giurgiu

2021 29th European Signal Processing Conference (EUSIPCO), 2021

View PDFchevron_right

Unsupervised Cross-Domain Speech-to-Speech Conversion with Time-Frequency Consistency

Mohammad Khan

ArXiv, 2020

View PDFchevron_right

ML Parameter Generation with a Reformulated MGE Training Criterion — Participation in the Voice Conversion Challenge 2016

Eder Blanco

Interspeech 2016, 2016

View PDFchevron_right

Self supervised learning for robust voice cloning

Panos Kakoulidis, Karolos Nikitaras

Interspeech 2022

View PDFchevron_right

Autotuned voice cloning enabling multilingualism

IRJET Journal

View PDFchevron_right

SC-GlowTTS: An Efficient Zero-Shot Multi-Speaker Text-To-Speech Model

Sandra Maria Aluísio

Interspeech 2021, 2021

View PDFchevron_right

Voice conversion with limited data and limitless data augmentations

Oscar Mayor

arXiv (Cornell University), 2022

View PDFchevron_right

Analysis of speaker similarity in the statistical speech synthesis systems using a hybrid approach

Amir H Mohammadi

View PDFchevron_right

Non-parallel Voice Conversion based on Hierarchical Latent Embedding Vector Quantized Variational Autoencoder

Tuấn Hồ

Joint Workshop for the Blizzard Challenge and Voice Conversion Challenge 2020, 2020

View PDFchevron_right

Adversarially Trained Autoencoders for Parallel-data-free Voice Conversion

Gokce Keskin

ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2019

View PDFchevron_right

Wasserstein GAN and Waveform Loss-Based Acoustic Model Training for Multi-Speaker Text-to-Speech Synthesis Systems Using a WaveNet Vocoder

Nobuaki Minematsu

IEEE Access, 2018

View PDFchevron_right

Effects of Sinusoidal Model on Non-Parallel Voice Conversion with Adversarial Learning

Géza Németh

Applied Sciences, 2021

View PDFchevron_right

Voice Conversion from Unaligned Corpora Using Variational Autoencoding Wasserstein Generative Adversarial Networks

Hsin-min Wang

Interspeech 2017

View PDFchevron_right

Voice Conversion for Whispered Speech Synthesis

Marius Cotescu

IEEE Signal Processing Letters

View PDFchevron_right

Voice conversion from non-parallel corpora using variational auto-encoder

Hsin-min Wang

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2016

View PDFchevron_right

SVSNet: An End-to-End Speaker Voice Similarity Assessment Model (original) (raw)

Related papers