Script recognition

Rao, P. V. S. (1994) Script recognition Sadhana (Academy Proceedings in Engineering Sciences), 19 (2). pp. 257-270. ISSN 0256-2499

[img]
Preview
PDF - Publisher Version
1MB

Official URL: http://www.ias.ac.in/j_archive/sadhana/19/2/257-27...

Related URL: http://dx.doi.org/10.1007/BF02811898

Abstract

This paper describes an approach for word-based on-line and off-line recognition of handwritten cursive script composed of English lower-case letters. The system uses simple and easily extractable features such as the direction of movement and curvature and the relative locations of regions where these suffer discontinuities. Our approach was evolved based on our concept of 'shape vectors' introduced earlier. We visualise script characters as having shapes which are composed of comparatively straight segments alternating with regions of relatively high curvature. We derive the shape vectors from each script character essentially by identifying regions of least curvature and approximating these by straight lines. That these shape vectors carry adequate information about the identity of the character is established by showing that the original character can be faithfully reconstructed from the shape vectors. We thus use slopes of the shape vectors and relative locations of points of maximum curvature (both highly quantised) as parameters for recognition. The system extracts parameters for individual characters from single specimens written in isolation and uses these to construct feature matrices for words in the vocabulary. These are used for matching with the feature matrices of test words during the recognition phase. The advantage of the system is that it does not require elaborate training. Recognition scores are in the neighbourhood of 94% for vocabulary sizes of 200 words. The approach has been extended for off-line information as well and performs quite well even in this case.

Item Type:Article
Source:Copyright of this article belongs to Indian Academy of Sciences.
Keywords:Character Synthesis; Cursive Script; Feature Matrices; On-line and Off-line Recognition; Overlapping Segments; Script Recognition; Shape Vectors; Tract Segments; Quantisation
ID Code:52205
Deposited On:03 Aug 2011 06:36
Last Modified:18 May 2016 05:50

Repository Staff Only: item control page