Book Chapter

New sub-band processing framework using non-linear predictive models for speech feature extraction

Publisher DOI

Details

Citation

Chetouani M, Hussain A, Gas B & Zarader J (2005) New sub-band processing framework using non-linear predictive models for speech feature extraction. In: Faundez-Zanuy M, Janer L, Esposito A, Satue-Villar A, Roure J & Espinosa-Duro V (eds.) Nonlinear Analyses and Algorithms for Speech Processing: International Conference on Non-Linear Speech Processing, NOLISP 2005, Barcelona, Spain, April 19-22, 2005, Revised Selected Papers. Lecture Notes in Computer Science, 3817. Berlin Heidelberg: Springer, pp. 284-290. http://link.springer.com/chapter/10.1007/11613107_25#; https://doi.org/10.1007/11613107_25

Abstract
Speech feature extraction methods are commonly based on time and frequency processing approaches. In this paper, we propose a new framework based on sub-band processing and non-linear prediction. The key idea is to pre-process the speech signal by a filter bank. From the resulting signals, non-linear predictors are computed. The feature extraction method involves the association of different Neural Predictive Coding (NPC) models. We apply this new framework to phoneme classification and experiments carried out with the NTIMIT database show an improvement of the classification rates in comparison with the full-band approach. The new method is also shown to give better performance than the traditional Linear Predictive Coding (LPC), Mel Frequency Cepstral Coding (MFCC) and Perceptual Linear Prediction (PLP) methods.

Status	Published
Title of series	Lecture Notes in Computer Science
Number in series	3817
Publication date	31/12/2005
Publisher	Springer
Publisher URL	http://link.springer.com/chapter/10.1007/11613107_25#
Place of publication	Berlin Heidelberg
ISSN of series	0302-9743
ISBN	978-3-540-31257-4