A comparative study of explicit frequency and conventional signal representations for speech recognition

Krishnan, S. ; Rao, P. V. S. (1996) A comparative study of explicit frequency and conventional signal representations for speech recognition Digital Signal Processing, 6 (4). pp. 249-264. ISSN 1051-2004

Full text not available from this repository.

Official URL: http://linkinghub.elsevier.com/retrieve/pii/S10512...

Related URL: http://dx.doi.org/10.1006/dspr.1996.0028

Abstract

The performance of a speech recognition system depends to a large extent on the signal representation strategy. It is therefore important to evaluate various signal representations, with a view to comparing their relative performance on speech recognition tasks. Various comparative studies have been reported earlier in the literature in this respect. However, these studies are limited to comparing some subsets of representations on different kinds of recognition tasks. In this sense, they preclude a fair comparison of the representations on the same task. In contrast, we attempt here a systematic and fairly comprehensive comparison of signal representations along various dimensions (frequency and amplitude scales, speaker normalization, and two statistical classifiers) on a standard (TIMIT) speech database. This study shows that the line spectrum pair frequency representation augmented with spectral amplitudes yields the best recognition performance.

Item Type:Article
Source:Copyright of this article belongs to Elsevier Science.
ID Code:52212
Deposited On:02 Aug 2011 11:10
Last Modified:02 Aug 2011 11:10

Repository Staff Only: item control page