Text Extraction from Complex Color Images Using Optical Character Recognition
Journal: International Journal of Science and Research (IJSR) (Vol. 4, No. 7)
Publication Date: 2015-07-05
Authors: Prachi R. Dussawar; Parul Bhanarkar Jha
Page: 730-735
Keywords: Character recognition; Feature Extraction; Feature Matching; Text extraction; Character extraction
Abstract
Optical Character Recognition (OCR) is a system that provides full alphanumeric recognition of printed or handwritten characters by scanning the text image. An OCR system interprets the image of printed or handwritten characters and converts it into a corresponding editable text document. The text image is divided into regions by isolating each line and then the individual characters and spaces. After character extraction, texture and topological features are calculated for every character in the text image, such as corner points, features of different regions, and the ratio of character area to convex area. The features of each uppercase and lowercase letter, digit, and symbol are stored beforehand as templates. Based on the texture and topological features, the system recognizes each character by feature matching, using the similarity between the extracted character and the templates of all characters.
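The pipeline described in the abstract (line isolation, character isolation, feature calculation, template matching) can be illustrated with a minimal Python sketch. It is not the authors' implementation. It assumes an already binarized image stored as nested lists, uses projection profiles for segmentation, and substitutes two simple stand-in features (ink density of the bounding box and aspect ratio) for the corner-point and convex-area features named in the abstract. The names `runs`, `segment`, `features`, and `recognize` are hypothetical.

```python
from typing import Dict, List, Tuple

Image = List[List[int]]  # assumed binary image: 1 = ink, 0 = background


def runs(profile: List[int]) -> List[Tuple[int, int]]:
    """Return [start, end) spans where a projection profile is non-zero."""
    spans, start = [], None
    for i, value in enumerate(profile):
        if value and start is None:
            start = i
        elif not value and start is not None:
            spans.append((start, i))
            start = None
    if start is not None:
        spans.append((start, len(profile)))
    return spans


def segment(img: Image) -> List[List[Image]]:
    """Isolate text lines by row sums, then characters by column sums."""
    lines = []
    for top, bottom in runs([sum(row) for row in img]):
        band = img[top:bottom]
        col_profile = [sum(col) for col in zip(*band)]
        lines.append([[row[left:right] for row in band]
                      for left, right in runs(col_profile)])
    return lines


def features(glyph: Image) -> Tuple[float, float]:
    """Stand-in feature vector: (ink / bounding-box area, width / height).

    The abstract uses the ratio of character area to convex area; the
    bounding box replaces the convex hull here only to keep the sketch short.
    """
    height, width = len(glyph), len(glyph[0])
    ink = sum(map(sum, glyph))
    return (ink / (height * width), width / height)


def recognize(glyph: Image, templates: Dict[str, Tuple[float, float]]) -> str:
    """Return the template label with the smallest squared feature distance."""
    f = features(glyph)
    return min(templates,
               key=lambda ch: sum((a - b) ** 2 for a, b in zip(f, templates[ch])))


# Toy page with two lines of text: "IO" above "OI".
page = [
    [1, 0, 1, 1, 1],
    [1, 0, 1, 0, 1],
    [1, 0, 1, 1, 1],
    [0, 0, 0, 0, 0],
    [1, 1, 1, 0, 1],
    [1, 0, 1, 0, 1],
    [1, 1, 1, 0, 1],
]

# Templates are stored in advance, one feature vector per known character.
templates = {
    "I": features([[1], [1], [1]]),
    "O": features([[1, 1, 1], [1, 0, 1], [1, 1, 1]]),
}

text = ["".join(recognize(g, templates) for g in line) for line in segment(page)]
print(text)  # ['IO', 'OI']
```

Projection profiles only separate characters that have a blank column between them, so touching or italic glyphs, and text on a complex color background, would need binarization and a more robust segmentation step before this sketch applies.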
Last modified: 2021-06-30 21:50:52