Recognition of newspaper printed in Gurumukhi script

来源期刊:中南大学学报(英文版)2019年第9期

论文作者:Rupinder Pal Kaur Manish Kumar Jindal Munish Kumar

文章页码:2495 - 2503

Key words:newspaper recognition; feature extraction; classification; Gurumukhi script; random forest

Abstract: In this work, a system for recognition of newspaper printed in Gurumukhi script is presented. Four feature extraction techniques, namely, zoning features, diagonal features, parabola curve fitting based features, and power curve fitting based features are considered for extracting the statistical properties of the characters printed in the newspaper. Different combinations of these features are also applied to improve the recognition accuracy. For recognition, four classification techniques, namely, k-NN, linear-SVM, decision tree, and random forest are used. A database for the experiments is collected from three major Gurumukhi script newspapers which are Ajit, Jagbani and Punjabi Tribune. Using 5-fold cross validation and random forest classifier, a recognition accuracy of 96.19% with a combination of zoning features, diagonal features and parabola curve fitting based features has been reported. A recognition accuracy of 95.21% with a partitioning strategy of data set (70% data as training data and remaining 30% data as testing data) has been achieved.

Cite this article as: Rupinder Pal Kaur, Manish Kumar Jindal, Munish Kumar. Recognition of newspaper printed in Gurumukhi script [J]. Journal of Central South University, 2019, 26(9): 2495-2503. DOI: https://doi.org/10.1007/ s11771-019-4189-1.

有色金属在线官网  |   会议  |   在线投稿  |   购买纸书  |   科技图书馆

中南大学出版社 技术支持 版权声明   电话:0731-88830515 88830516   传真:0731-88710482   Email:administrator@cnnmol.com

互联网出版许可证:(署)网出证(京)字第342号   京ICP备17050991号-6      京公网安备11010802042557号