Modeling durations of syllables using neural networks

Sreenivasa Rao, K. ; Yegnanarayana, B. (2007) Modeling durations of syllables using neural networks Computer Speech & Language, 21 (2). pp. 282-295. ISSN 0885-2308

Full text not available from this repository.

Official URL: http://www.sciencedirect.com/science/article/pii/S...

Related URL: http://dx.doi.org/10.1016/j.csl.2006.06.003

Abstract

In this paper, we propose a neural network model for predicting the durations of syllables. A four layer feedforward neural network trained with backpropagation algorithm is used for modeling the duration knowledge of syllables. Broadcast news data in three Indian languages Hindi, Telugu and Tamil is used for this study. The input to the neural network consists of a set of features extracted from the text. These features correspond to phonological, positional and contextual information. The relative importance of the positional and contextual features is examined separately. For improving the accuracy of prediction, further processing is done on the predicted values of the durations. We also propose a two-stage duration model for improving the accuracy of prediction. From the studies we find that 85% of the syllable durations could be predicted from the models within 25% of the actual duration. The performance of the duration models is evaluated using objective measures such as average prediction error (μ), standard deviation (σ) and correlation coefficient (γ).

Item Type:Article
Source:Copyright of this article belongs to Elsevier Science.
ID Code:57740
Deposited On:29 Aug 2011 10:18
Last Modified:29 Aug 2011 10:18

Repository Staff Only: item control page