ISCA Archive (original) (raw)

Text-Dependent Speaker Verification


A Low-Power Text-Dependent Speaker Verification System with Narrow-Band Feature Pre-Selection and Weighted Dynamic Time Warping Qing He, Gregory Wornell, Wei Ma Deep Neural Network based Text-Dependent Speaker Verification : Preliminary Results Gautam Bhattacharya, Patrick Kenny, Jahangir Alam, Themos Stafylakis Uncertainty Modeling Without Subspace Methods For Text-Dependent Speaker Recognition Patrick Kenny, Themos Stafylakis, Jahangir Alam, Vishwa Gupta, Marcel Kockmann Deep Neural Networks and Hidden Markov Models in i-vector-based Text-Dependent Speaker Verification Hossein Zeinali, Lukas Burget, Hossein Sameti, Ondrej Glembek, Oldrich Plchot

Language Recognition


Between-Class Covariance Correction For Linear Discriminant Analysis in Language Recognition Abhinav Misra, Qian Zhang, Finnian Kelly, John H.L. Hansen Incorporating uncertainty as a Quality Measure in I-Vector Based Language Recognition Amir Hossein Poorjam, Rahim Saeidi, Tomi Kinnunen, Ville Hautamaki Discriminating Languages in a Probabilistic Latent Subspace Aleksandr Sizov, Kong Aik Lee, Tomi Kinnunen Investigation of Senone-based Long-Short Term Memory RNNs for Spoken Language Recognition Yao Tian, Liang He, Yi Liu, Jia Liu Automatic Accent Recognition Systems and the Effects of Data on Performance Georgina Brown The “Sprekend Nederland” project and its application to accent location David van Leeuwen, Rosemary Orr Deep Language: a comprehensive deep learning approach to end-to-end language recognition Trung Ngo Trong, Ville Hautamäki, Kong Aik Lee On the use of phone-gram units in recurrent neural networks for language identification Christian Salamea, Luis Fernando D'Haro, Ricardo Cordoba, Rubén San-Segundo Language Recognition for Dialects and Closely Related Languages Gregory Gelly, Jean-Luc Gauvain, Lori Lamel, Antoine Laurent, Viet Bac Le, Abdel Messaoudi Identification of British English regional accents using fusion of i-vector and multi-accent phonotactic systems Maryam Najafian, Saeid Safavi, Phil Weber, Martin Russell Improvements on Deep Bottleneck Network based I-Vector Representation for Spoken Language Identification Yan Song, Ruilian Cui, Mcloughlin Ian, Lirong Dai

Speaker and Language Recognition Systems


BAT System Description for NIST LRE 2015 Oldrich Plchot, Pavel Matejka, Ondrej Glembek, Radek Fer, Ondrej Novotny, Jan Pesan, Lukas Burget, Niko Brummer, Sandro Cumani The IBM 2016 Speaker Recognition System Seyed Omid Sadjadi, Sriram Ganapathy, Jason Pelecanos The Sheffield language recognition system in NIST LRE 2015 Raymond W. M. Ng, Mauro Nicolao, Oscar Saz, Madina Hasan, Bhusan Chettri, Mortaza Doulaty, Tan Lee, Thomas Hain Analyzing the Effect of Channel Mismatch on the SRI Language Recognition Evaluation 2015 System Mitchell Mclaren, Diego Castán, Luciana Ferrer The MITLL NIST LRE 2015 Language Recognition System Pedro Torres-Carrasquillo, Najim Dehak, Elizabeth Godoy, Douglas Reynolds, Fred Richardson, Stephen Shum, Elliot Singer, Douglas Sturim

Speaker and Language Recognition: deep learning approaches


Augmented Data Training of Joint Acoustic/Phonotactic DNN i-vectors for NIST LRE15 Alan Mccree, Greg Sell, Daniel Garcia-Romero LID-senone Extraction via Deep Neural Networks for End-to-End Language Identification Ma Jin, Yan Song, Ian Mcloughlin, Lirong Dai, Zhongfu Ye On autoencoders in the i-vector space for speaker recognition Timur Pekhovsky, Sergey Novoselov, Aleksei Sholohov, Oleg Kudashev Channel Compensation for Speaker Recognition using MAP Adapted PLDA and Denoising DNNs Fred Richardson, Brian Nemsick, Douglas Reynolds Evaluation of an LSTM-RNN System in Different NIST Language Recognition Frameworks Ruben Zazo, Alicia Lozano-Diez, Joaquin Gonzalez-Rodriguez

Speaker Recognition I


Feature-based likelihood ratios for speaker recognition from linguistically-constrained formant-based i-vectors Javier Franco-Pedroso, Joaquin Gonzalez-Rodriguez Improving Robustness of Speaker Verification Against Mimicked Speech Kuruvachan K George, C Santhosh Kumar, K I Ramachandran, Ashish Panda Multi-channel i-vector combination for robust speaker verification in multi-room domestic environments Alessio Brutti, Alberto Abad Voice Liveness Detection for Speaker Verification based on a Tandem Single/Double-channel Pop Noise Detector Sayaka Shiota, Fernando Villavicencio, Junichi Yamagishi, Nobutaka Ono, Isao Echizen, Tomoko Matsui A PLDA Approach for Language and Text Independent Speaker Recognition Abbas Khosravani, Mohammad Mehdi Homayounpour, Dijana Petrovska-Delacrétaz, Gérard Chollet Spoofing Detection on the ASVspoof2015 Challenge Corpus Employing Deep Neural Networks Md Jahangir Alam, Patrick Kenny, Vishwa Gupta, Themos Stafylakis Age-Related Voice Disguise and its Impact on Speaker Verification Accuracy Rosa González Hautamäki, Md Sahidullah, Tomi Kinnunen, Ville Hautamäki A New Feature for Automatic Speaker Verification Anti-Spoofing: Constant Q Cepstral Coefficients Massimiliano Todisco, Héctor Delgado, Nicholas Evans Multi-Bit Allocation: Preparing Voice Biometrics for Template Protection Marco Paulini, Christian Rathgeb, Andreas Nautsch, Hermine Reichau, Herbert Reininger, Christoph Busch

Special Session 2: NIST 2015 Language Recognition i-Vector Machine Learning Challenge


Summary of the 2015 NIST Language Recognition i-Vector Machine Learning Challenge Audrey Tong, Craig Greenberg, Alvin Martin, Desire Banse, John Howard, Hui Zhao, George Doddington, Daniel Garcia-Romero, Alan McCree, Douglas Reynolds, Elliot Singer, Jaime Hernandez-Cordero, Lisa Mason Out-of-Set i-Vector Selection for Open-set Language Identification Hamid Behravan, Tomi Kinnunen, Ville Hautamäki I2R Submission to the 2015 NIST Language Recognition I-vector Challenge Hanwu Sun, Trung Hieu Nguyen, Guangsen Wang, Kong Aik Lee, Bin Ma, Haizhou Li A Semisupervised Approach for Language Identification based on Ladder Networks Ehud Ben-Reuven, Jacob Goldberger

Speaker Recognition II


Cantonese forensic voice comparison with higher-level features: likelihood ratio-based validation using F-pattern and tonal F0 trajectories over a disyllabic hexaphone Phil Rose, Bruce Xiao Wang I-Vectors for speech activity detection Elie Khoury, Matt Garland Compensation for phonetic nuisance variability in speaker recognition using DNNs Themos Stafylakis, Patrick Kenny, Vishwa Gupta, Jahangir Alam, Marcel Kockmann Local binary patterns as features for speaker recognition Waad Ben Kheder, Driss Matrouf, Moez Ajili, Jean-Francois Bonastre Analysis and Optimization of Bottleneck Features for Speaker Recognition Alicia Lozano-Diez, Anna Silnova, Pavel Matejka, Ondrej Glembek, Oldrich Plchot, Jan Pesan, Lukas Burget, Joaquin Gonzalez-Rodriguez Robustness of Quality-based Score Calibration of Speaker Recognition Systems with respect to low-SNR and short-duration conditions Andreas Nautsch, Rahim Saeidi, Christian Rathgeb, Christoph Busch From Features to Speaker Vectors by means of Restricted Boltzmann Machine Adaptation Pooyan Safari, Omid Ghahabi, Javier Hernando Reducing Noise Bias in the i-Vector Space for Speaker Recognition Yosef Solewicz, Hagai Aronowitz, Timo Becker

Speaker Clustering and Diarization


Semi-supervised On-line Speaker Diarization for Meeting Data with Incremental Maximum A-posteriori Adaptation Giovanni Soldi, Massimiliano Todisco, Héctor Delgado, Christophe Beaugeant, Nicholas Evans Influence of transition cost in the segmentation stage of speaker diarization Beatriz Martínez-González, José M. Pardo, Rubén San-Segundo, J.M. Montero Analysis of the Impact of the Audio Database Characteristics in the Accuracy of a Speaker Clustering System Jesús Jorrín Prieto, Carlos Vaquero, Paola García Short- and Long-Term Speech Features for Hybrid HMM-i-Vector based Speaker Diarization System Abraham Woubie Zewoudie, Jordi Luque, Javier Hernando On the Use of PLDA i-vector Scoring for Clustering Short Segments Itay Salmun, Irit Opher, Itshak Lapidot