Document Image Binarization Using Independent Component Analysis For OCR
Journal: International Journal of Engineering Sciences & Research Technology (IJESRT) (Vol.3, No. 9)Publication Date: 2014-09-30
Authors : Varada Sreeja; G.Guru Prasad;
Page : 161-166
Keywords : Adaptive image contrast; ICA; pixel classification; pixel intensity; thresholding..;
Abstract
The Image binarization plays a vital role in text segmentation which is used in OCR application. Binarization of text in degraded images is a challenging task due to the variations in size, color and font of the text and the results be often affected by complex backgrounds, dissimilar lighting conditions, reflections and shadow. A robust solution to this problem can significantly enhance the precision of scene text recognition algorithms leading to a variety of applications such as scene understanding, navigation, automatic localization and image retrieval. In this paper, we propose a novel method to extract and binarize text as of images that contains complex background. We apply an Independent Component Analysis (ICA) based technique to map out the text region, which is uniform in nature, while removing specularity, shadows and reflections, which are included in the background. This algorithm works better on images with different degradations. We implement our method on various DIBCO datasets.
Other Latest Articles
- Thermoelastic Properties and Specific Heat Curve of C60 in Fcc Phase
- Integrated Solid Waste Management-An Innovative Approach
- Business Viability of off Grid Hybrid Biomass model over on Grid Solar Generation for RE in Developing countries
- Performance Improvement of Reading Brain Function Considering Quantified Analysis of Highly Specialized Neurons (Neural Networks Approach)
- Cloud Storage Security And Providing Integrity Proof
Last modified: 2014-10-14 22:48:45