Title: Integrated High-Performance and Web-Oriented System of The Kazakh Language Text Recognition

Year of Publication: Nov - 2013
Page Numbers: 25-36
Authors: David L. Zaurbekov, Bakyt M. Kairakbay, Daniyar B. Nurseitov, Yuriy Y. Stolyarov, Ilyas E. Tursunov, Igor A. Ugay
Conference Name: The Second International Conference on Informatics Engineering & Information Science (ICIEIS2013)
- Malaysia

Abstract:


We presented in the paper the integrated high-performance and Web-oriented computer recognition system for the Kazakh language text. The design and integration methodology of the system are based on service-oriented architecture that allows provide an easy, flexible, and extensible integration of any language service into any desktop or mobile client. We have designed and built 4-tier SOA on the basis of W3C Web service standard. Use the high-performance cluster shows significant advantages in the Kazakh text processing, especially for large arrays of texts. The main objective of the developing system is to provide any person easy access to text and documentation in Kazakh language with the following possibility of editing and manipulation different docs through respective Web services OCR and morphological analysis/correction. Developed web services cover the Kazakh language text OCR and morphological analysis for subsequent correction of errors after OCR, and show an acceptable quality of the Kazakh texts recognition which is better than existing tools of text recognition.