Default Cover Image

2004 IEEE International Conference on Acoustics, Speech, and Signal Processing

May 17 2004 to May 21 2004

Montreal, Que.

Volume:

Table of Contents

Copyright pageFreely available from IEEE.pp. ii
ICASSP 2004 Conference CommitteeFreely available from IEEE.pp. iv
Technical Program CommitteeFreely available from IEEE.pp. v,vi,vii,viii,ix,x,xi
Future ICASSP conferencesFreely available from IEEE.pp. xii
ICASSP 2004 PhiladelphiaFreely available from IEEE.pp. xiii
Conference proceedings overviewFreely available from IEEE.pp. xiv
Table of contentsFreely available from IEEE.pp. xv,xvi,xvii,xviii
ICASSP 2004 technical programFreely available from IEEE.pp. XIX
Non-parallel training for voice conversion by maximum likelihood constrained adaptationFull-text access may be available. Sign in or learn about subscription options.pp. I-1-4 vol.1
Speaking style adaptation using context clustering decision tree for HMM-based speech synthesisFull-text access may be available. Sign in or learn about subscription options.pp. I-5-8 vol.1
High quality voice morphingFull-text access may be available. Sign in or learn about subscription options.pp. I-9-12 vol.1
Algorithm amalgam: morphing waveform based methods, sinusoidal models and STRAIGHTFull-text access may be available. Sign in or learn about subscription options.pp. I-13-16 vol.1
Voice characteristics conversion for TTS using reverse VTLNFull-text access may be available. Sign in or learn about subscription options.pp. I-17-20 vol.1
Voice conversion through transformation of spectral and intonation featuresFull-text access may be available. Sign in or learn about subscription options.pp. I-21-4 vol.1
Discriminative training for speaker identification based on maximum model distance algorithmFull-text access may be available. Sign in or learn about subscription options.pp. I-25-8 vol.1
Discovering relations among discriminative training objectives [speak recognition applications]Full-text access may be available. Sign in or learn about subscription options.pp. I-33-6 vol.1
Disentangling speaker and channel effects in speaker verificationFull-text access may be available. Sign in or learn about subscription options.pp. I-37-40 vol.1
Generalized locally recurrent probabilistic neural networks for text-independent speaker verificationFull-text access may be available. Sign in or learn about subscription options.pp. I-41-4 vol.1
Discrimination power weighted subword-based speaker verificationFull-text access may be available. Sign in or learn about subscription options.pp. I-45-8 vol.1
Soft decoding strategies for distributed speech recognition over IP networksFull-text access may be available. Sign in or learn about subscription options.pp. I-49-52 vol.1
The ETSI extended distributed speech recognition (DSR) standards: server-side speech reconstructionFull-text access may be available. Sign in or learn about subscription options.pp. I-53-6 vol.1
A subvector-based error concealment algorithm for speech recognition over mobile networksFull-text access may be available. Sign in or learn about subscription options.pp. I-57-60 vol.1
A complexity reduction of ETSI advanced front-end for DSRFull-text access may be available. Sign in or learn about subscription options.pp. I-61-4 vol.1
Robust speech recognition techniques evaluation for telephony server based in-car applicationsFull-text access may be available. Sign in or learn about subscription options.pp. I-65-8 vol.1
High-level speaker verification with support vector machinesFull-text access may be available. Sign in or learn about subscription options.pp. I-73-6 vol.1
Using Haar transformed vocal source information for automatic speaker recognitionFull-text access may be available. Sign in or learn about subscription options.pp. I-77-80 vol.1
Text-independent speaker recognition by combining speaker-specific GMM with speaker adapted syllable-based HMMFull-text access may be available. Sign in or learn about subscription options.pp. I-81-4 vol.1
Applying articulatory features to telephone-based speaker verificationFull-text access may be available. Sign in or learn about subscription options.pp. I-85-8 vol.1
Speaker identification using supra-segmental pitch pattern dynamicsFull-text access may be available. Sign in or learn about subscription options.pp. I-89-92 vol.1
Improvement of speaker recognition by combining residual and prosodic features with acoustic featuresFull-text access may be available. Sign in or learn about subscription options.pp. I-93-6 vol.1
Pitch prediction from MFCC vectors for speech reconstructionFull-text access may be available. Sign in or learn about subscription options.pp. I-97-100 vol.1
Algorithm for automatic glottal waveform estimation without the reliance on precise glottal closure informationFull-text access may be available. Sign in or learn about subscription options.pp. I-101-4 vol.1
Tone recognition with fractionized models and outlined featuresFull-text access may be available. Sign in or learn about subscription options.pp. I-105-8 vol.1
Extraction of pitch in adverse conditionsFull-text access may be available. Sign in or learn about subscription options.pp. I-109-12 vol.1
Weighted autocorrelation-based F0 estimation for distant-talking interaction with a distributed microphone networkFull-text access may be available. Sign in or learn about subscription options.pp. I-113-16 vol.1
A novel method for computation of periodicity, aperiodicity and pitch of speech signalsFull-text access may be available. Sign in or learn about subscription options.pp. I-117-20 vol.1
Non-uniform speaker normalization using affine-transformationFull-text access may be available. Sign in or learn about subscription options.pp. I-121-4 vol.1
Product of power spectrum and group delay function for speech recognitionFull-text access may be available. Sign in or learn about subscription options.pp. I-125-8 vol.1
Robust speech feature extraction by growth transformation in reproducing kernel Hilbert spaceFull-text access may be available. Sign in or learn about subscription options.pp. I-133-6 vol.1
Dimensionality reduction using MCE-optimized LDA transformationFull-text access may be available. Sign in or learn about subscription options.pp. I-137-40 vol.1
Speech feature extraction method representing periodicity and aperiodicity in sub bands for robust speech recognitionFull-text access may be available. Sign in or learn about subscription options.pp. I-141-4 vol.1
Low-complexity predictive trellis coded quantization of wideband speech LSF parametersFull-text access may be available. Sign in or learn about subscription options.pp. I-145-8 vol.1
Multiple frame block quantisation of line spectral frequencies using Gaussian mixture modelsFull-text access may be available. Sign in or learn about subscription options.pp. I-149-52 vol.1
Variable-dimension quantization of sinusoidal amplitudes using Gaussian mixture modelsFull-text access may be available. Sign in or learn about subscription options.pp. I-153-6 vol.1
On split quantization of LSF parametersFull-text access may be available. Sign in or learn about subscription options.pp. I-157-60 vol.1
Improved quantization structures using generalized HMM modelling with application to wideband speech codingFull-text access may be available. Sign in or learn about subscription options.pp. I-161-4 vol.1
Waveform quantization of speech using Gaussian mixture modelsFull-text access may be available. Sign in or learn about subscription options.pp. I-165-8 vol.1
Effects on transcription errors on supervised learning in speech recognitionFull-text access may be available. Sign in or learn about subscription options.pp. I-169-72 vol.1
Combination of hidden Markov models with dynamic time warping for speech recognitionFull-text access may be available. Sign in or learn about subscription options.pp. I-173-6 vol.1
Joint decoding for phoneme-grapheme continuous speech recognitionFull-text access may be available. Sign in or learn about subscription options.pp. I-177-80 vol.1
A locally weighted distance measure for example based speech recognitionFull-text access may be available. Sign in or learn about subscription options.pp. I-181-4 vol.1
Light supervision in acoustic model trainingFull-text access may be available. Sign in or learn about subscription options.pp. I-185-8 vol.1
Lightly supervised acoustic model training using consensus networksFull-text access may be available. Sign in or learn about subscription options.pp. I-189-92 vol.1
Spectral entropy based feature for robust ASRFull-text access may be available. Sign in or learn about subscription options.pp. I-193-6 vol.1
Higher order cepstral moment normalization (HOCMN) for robust speech recognitionFull-text access may be available. Sign in or learn about subscription options.pp. I-197-200 vol.1
Robustness of speech recognition using genetic algorithms and a Mel-cepstral subspace approachFull-text access may be available. Sign in or learn about subscription options.pp. I-201-4 vol.1
Phase autocorrelation (PAC) features in entropy based multi-stream for robust speech recognitionFull-text access may be available. Sign in or learn about subscription options.pp. I-205-8 vol.1
Cepstral gain normalization for noise robust speech recognitionFull-text access may be available. Sign in or learn about subscription options.pp. I-209-12 vol.1
Robust speech recognition using cepstral domain missing data techniques and noisy masksFull-text access may be available. Sign in or learn about subscription options.pp. I-213-16 vol.1
Optimal blind separation of convolutive audio mixtures without temporal constraintsFull-text access may be available. Sign in or learn about subscription options.pp. I-217-20 vol.1
Microphone array post-filter for separation of simultaneous non-stationary sourcesFull-text access may be available. Sign in or learn about subscription options.pp. I-221-4 vol.1
Overdetermined blind separation for convolutive mixtures of speech based on multistage ICA using subarray processingFull-text access may be available. Sign in or learn about subscription options.pp. I-225-8 vol.1
Speech enhancement based on a combined multi-channel array with constrained iterative and auditory masked processingFull-text access may be available. Sign in or learn about subscription options.pp. I-229-32 vol.1
Multiple-microphone time-varying filters for robust speech recognitionFull-text access may be available. Sign in or learn about subscription options.pp. I-233-6 vol.1
Noise suppression for automotive applications based on directional informationFull-text access may be available. Sign in or learn about subscription options.pp. I-237-40 vol.1
Meta-data conditional language modelingFull-text access may be available. Sign in or learn about subscription options.pp. I-241-4 vol.1
Exact training of a neural syntactic language modelFull-text access may be available. Sign in or learn about subscription options.pp. I-245-8 vol.1
Development of the 2003 CU-HTK conversational telephone speech transcription systemFull-text access may be available. Sign in or learn about subscription options.pp. I-249-52 vol.1
Vocabulary-independent search in spontaneous speechFull-text access may be available. Sign in or learn about subscription options.pp. I-253-6 vol.1
Cross-lingual latent semantic analysis for language modelingFull-text access may be available. Sign in or learn about subscription options.pp. I-257-60 vol.1
The use of a linguistically motivated language model in conversational speech recognitionFull-text access may be available. Sign in or learn about subscription options.pp. I-261-4 vol.1
A study of design compromises for speech coders in packet networksFull-text access may be available. Sign in or learn about subscription options.pp. I-265-8 vol.1
A scalable speech and audio coding scheme with continuous bitrate flexibilityFull-text access may be available. Sign in or learn about subscription options.pp. I-273-6 vol.1
A multiple description speech coder based on AMR-WB for mobile ad hoc networksFull-text access may be available. Sign in or learn about subscription options.pp. I-277-80 vol.1
A bit-rate/bandwidth scalable speech coder based on ITU-T G.723.1 standardFull-text access may be available. Sign in or learn about subscription options.pp. I-285-8 vol.1
A two-step noise reduction techniqueFull-text access may be available. Sign in or learn about subscription options.pp. I-289-92 vol.1
On the decision-directed estimation approach of Ephraim and MalahFull-text access may be available. Sign in or learn about subscription options.pp. I-293-6 vol.1
Employing Laplacian-Gaussian densities for speech enhancementFull-text access may be available. Sign in or learn about subscription options.pp. I-297-300 vol.1
Robust adaptive Kalman filtering-based speech enhancement algorithmFull-text access may be available. Sign in or learn about subscription options.pp. I-301-4 vol.1
A noise estimation algorithm with rapid adaptation for highly nonstationary environmentsFull-text access may be available. Sign in or learn about subscription options.pp. I-305-8 vol.1
Low distortion speech denoising using an adaptive parametric Wiener filterFull-text access may be available. Sign in or learn about subscription options.pp. I-309-12 vol.1
Performance comparisons of all-pass transform adaptation with maximum likelihood linear regressionFull-text access may be available. Sign in or learn about subscription options.pp. I-313-16 vol.1
Adaptive training using structured transformsFull-text access may be available. Sign in or learn about subscription options.pp. I-317-20 vol.1
MPE-based discriminative linear transform for speaker adaptationFull-text access may be available. Sign in or learn about subscription options.pp. I-321-4 vol.1
A study of various composite kernels for kernel eigenvoice speaker adaptationFull-text access may be available. Sign in or learn about subscription options.pp. I-325-8 vol.1
Feature space GaussianizationFull-text access may be available. Sign in or learn about subscription options.pp. I-329-32 vol.1
Online speaker clusteringFull-text access may be available. Sign in or learn about subscription options.pp. I-333-6 vol.1
Prior knowledge guided MEL based model selection and adaptation for nonnative speech recognitionFull-text access may be available. Sign in or learn about subscription options.pp. I-337-40 vol.1
Enrollment in low-resource speech recognition systemsFull-text access may be available. Sign in or learn about subscription options.pp. I-341-4 vol.1
An investigation into front-end signal processing for speaker normalizationFull-text access may be available. Sign in or learn about subscription options.pp. I-345-8 vol.1
Eigen-MLLRs applied to unsupervised speaker enrollment for large vocabulary continuous speech recognitionFull-text access may be available. Sign in or learn about subscription options.pp. I-349-52 vol.1
Speaker indexing and adaptation using speaker clustering based on statistical model selectionFull-text access may be available. Sign in or learn about subscription options.pp. I-353-6 vol.1
Eigenspace-based MLLR with speaker adaptive training in large vocabulary conversational speech recognitionFull-text access may be available. Sign in or learn about subscription options.pp. I-357-60 vol.1
Showing 100 out of 271