IEEE Transactions on Speech and Audio Processing, Volume 10



Volume 10, Number 1, January 2002

Rongshan Yu, Chi Chung Ko:
A warped linear-prediction-based subband audio coding algorithm. 1-8

Eric A. Durant, Gregory H. Wakefield:
Efficient model fitting using a genetic algorithm: pole-zero approximations of HRTFs. 18-27

Chao-Shih Huang, Hsiao-Chuan Wang, Chin-Hui Lee:
Correction to "An SNR-incremental stochastic matching algorithm for noisy speech recognition". 28
Volume 10, Number 2, February 2002

Mark J. F. Gales:
Maximum likelihood multiple subspace projections for hidden Markov models. 37-47

Takeshi Yamada, Satoshi Nakamura, Kiyohiro Shikano:
Distant-talking speech recognition based on a 3-D Viterbi search using a microphone array. 48-56

Yifan Gong:
Noise-dependent Gaussian mixture classifiers for robust rejection decision. 57-64

Shrikanth S. Narayanan, Alexandros Potamianos:
Creating conversational interfaces for children. 65-78

Mohamed Afify, Olivier Siohan, Chin-Hui Lee:
Upper and lower bounds on the mean of noisy speech: application to minimax classification. 79-88

Rita Singh, Bhiksha Raj, Richard M. Stern:
Automatic generation of subword units for speech recognition systems. 89-99

Geert Rombouts, Marc Moonen:
A sparse block exact affine projection algorithm. 100-108

M. Marzinzik, Birger Kollmeier:
Speech pause detection for noise spectrum estimation by tracking power envelope dynamics. 109-118

Johan Hellgren:
Analysis of feedback cancellation in hearing aids with Filtered-x LMS and the direct method of closed loop identification. 119-131
Volume 10, Number 3, March 2002

Liang Gu, Kenneth Rose:
Substate tying with combined parameter training and reduction in tied-mixture HMM design. 137-145

Qi Li, Jinsong Zheng, Augustine Tsai, Qiru Zhou:
Robust endpoint detection and energy normalization for real-time speech and speaker recognition. 146-157

Néstor Becerra Yoma, Miguel Villar Fernandez:
Speaker verification in noise using a stochastic version of the weighted Viterbi algorithm. 158-166

Zihou Meng, Kimihiro Sakagami, Masayuki Morimoto, Guoan Bi, Alex ChiChung Kot:
Extending the sound impulse response of room using extrapolation. 167-172

Jaco Vermaak, Christophe Andrieu, Arnaud Doucet, Simon J. Godsill:
Particle methods for Bayesian modeling and enhancement of speech signals. 173-185

Tom Bäckström, Paavo Alku, Erkki Vilkman:
Time-domain parameterization of the closing phase of glottal airflow waveform from voices over a large intensity range. 186-192

Imed Zitouni:
A hierarchical language model based on variable-length class sequences: the MCnnu approach. 193-198
Volume 10, Number 4, May 2002

William M. Campbell, Khaled T. Assaleh, Charles C. Broun:
Speaker recognition with polynomial classifiers. 205-212

Sven Johansson, Sven Nordebo, Ingvar Claesson:
Convergence analysis of a twin-reference complex least-mean-squares algorithm. 213-221

Nam Phamdo, Udar Mittal:
A joint source-channel speech coder using hybrid digital-analog (HDA) modulation. 222-231

Darryl W. Purnell, Elizabeth C. Botha:
Improved generalization of MCE parameter estimation with application to speech recognition. 232-239
Volume 10, Number 5, July 2002

Stefan Gustafsson, Rainer Martin, Peter Jax, Peter Vary:
A psychoacoustic approach to combined acoustic echo cancellation and noise reduction. 245-256

Tomas Gänsler, Jacob Benesty:
New insights into the stereophonic acoustic echo cancellation problem and an adaptive nonlinearity solution. 257-267

Jen-Tzung Chien:
Quasi-Bayes linear regression for sequential learning of hidden Markov models. 268-278

Ahmed M. Abdelatty Ali, Jan Van der Spiegel, Paul Mueller:
Robust auditory-based speech processing using the average localized synchrony detection. 279-292

George Tzanetakis, Perry R. Cook:
Musical genre classification of audio signals. 293-302

Berlin Chen, Hsin-Min Wang, Lin-Shan Lee:
Discriminating capabilities of syllable-based features and approaches of utilizing them for voice retrieval of speech information in Mandarin Chinese. 303-314

Marie A. Roch, Richard R. Hurtig:
The integral decode: a smoothing technique for robust HMM-based speaker recognition. 315-324

Yuan-Hao Huang, Tzi-Dar Chiueh:
A new audio coding scheme using a forward masking model and perceptually weighted vector quantization. 325-335
Volume 10, Number 6, September 2002

David Burshtein, Sharon Gannot:
Speech enhancement using a mixture-maximum model. 341-351

Lucas C. Parra, Christopher V. Alvino:
Geometric source separation: merging convolutive source separation with geometric beamforming. 352-362

Ran D. Zilca:
Text-independent speaker verification using utterance level scoring and covariance modeling. 363-370

Ivan Magrin-Chagnolleau, Geoffrey Durou, Frédéric Bimbot:
Application of time-frequency principal component analysis to text-independent speaker identification. 371-378

Gerald Schuller, Bin Yu, Dawei Huang, Bernd Edler:
Perceptual audio coding using adaptive pre- and post-filters and lossless compression. 379-390

Ronald M. Aarts, Roy Irwan, Augustus J. E. M. Janssen:
Efficient tracking of the cross-correlation coefficient. 391-402

Ji Ming, Peter Jancovic, Francis Jack Smith:
Robust speech recognition using probabilistic union models. 403-414

Lutz Welling, Hermann Ney, Stephan Kanthak:
Speaker adaptive modeling by vocal tract normalization. 415-426
Volume 10, Number 7, October 2002

Mukund Padmanabhan, George Saon, Jing Huang, Brian Kingsbury, Lidia Mangu:
Automatic speech recognition performance on a voicemail transcription task. 433-442

Néstor Becerra Yoma, Jorge F. Silva:
MAP speaker adaptation of state duration distributions for speech recognition. 443-450

Juan Manuel Huerta:
Alignment-based codeword-dependent cepstral normalization. 451-459

Stephen Cox, Srinandan Dasmahapatra:
High-level approaches to confidence estimation in speech recognition. 460-471

C. Chandra Sekhar, B. Yegnanarayana:
A constraint satisfaction model for recognition of stop consonant-vowel (SCV) utterances. 472-480

Fu-Chiang Chou, Chiu-yu Tseng, Lin-Shan Lee:
A set of corpus-based text-to-speech synthesis technologies for Mandarin Chinese. 481-494

Frank Baumgarte:
Improved audio coding using a psychoacoustic model based on a cochlear filter bank. 495-503

Lie Lu, Hong-Jiang Zhang, Hao Jiang:
Content analysis for audio classification and segmentation. 504-516

Khaled A. Mayyas:
Stereophonic acoustic echo cancellation using lattice orthogonalization. 517-525
Volume 10, Number 8, November 2002

Harry Printz, Isabel Trancoso:
Editorial. 529-530

Eric Chang, Frank Seide, Helen M. Meng, Zhuoran Chen, Yu Shi, Yuk-Chi Li:
A system for spoken query information retrieval on mobile devices. 531-541

Satya Dharanipragada, Salim Roukos:
A multistage algorithm for spotting new words in speech. 542-550

Sabine Deligne, Satya Dharanipragada, Ramesh A. Gopinath, Benoît Maison, Peder A. Olsen, Harry Printz:
A robust high accuracy speech recognition system for mobile applications. 551-561

Imre Varga, Stefanie Aalburg, Bernt Andrassy, Sergey Astrov, Josef G. Bauer, Christophe Beaugeant, Christian Geißler, Harald Höge:
ASR in mobile phones - an industrial approach. 562-569

Alexis Bernard, Abeer Alwan:
Low-bitrate distributed speech recognition for packet-based and wireless communication. 570-579

Constantinos Boulis, Mari Ostendorf, Eve A. Riskin, Scott Otterson:
Graceful degradation of speech recognition performance over packet-erasure networks. 580-590

Hong Kook Kim, Richard V. Cox, Richard C. Rose:
Performance improvement of a bitstream-based front-end for wireless speech recognition in adverse environments. 591-604

Li Deng, Kuansan Wang, Alex Acero, Hsiao-Wuen Hon, Jasha Droppo, Constantinos Boulis, Ye-Yi Wang, Derek Jacoby, Milind Mahajan, Ciprian Chelba, Xuedong Huang:
Distributed speech processing in miPad's multimodal user interface. 605-619

Bruno Bessette, Redwan Salami, Roch Lefebvre, Milan Jelinek, J. Rotola-Pukkila, Janne Vainio, Hannu Mikkola, Kari Järvinen:
The adaptive multirate wideband speech codec (AMR-WB). 620-636

Antonio Servetti, Juan Carlos De Martin:
Perception-based partial encryption of compressed speech. 637-643

Jhing-Fa Wang, Jia-Ching Wang, Han-Chiang Chen, Tai-Lung Chen, Chin-Chan Chang, Ming-Chi Shih:
Chip design of portable speech memopad suitable for persons with visual disabilities. 644-658
