ResearchBib Share Your Research, Maximize Your Social Impacts
Sign for Notice Everyday Sign up >> Login

A REVIEW ON SEGMENTATION TECHNIQUES OF LINES, WORDS AND CHARACTERS ON GUJARATI HANDWRITTEN DOCUMENT USING OCR

Journal: International Journal of Engineering Sciences & Research Technology (IJESRT) (Vol.5, No. 6)

Publication Date:

Authors : ; ; ; ; ;

Page : 198-208

Keywords : Connected Components; Gujarati Script; Segmentation;

Source : Downloadexternal Find it from : Google Scholarexternal

Abstract

OCR is technique to convert the handwritten or printed document into the digital format by scanning it which can be understandable by a computer. OCR is important and challenging task in many computer vision applications. S egmentation is generally the first stage in any attempt to analyse or interpret an image automatically. Segmentation is separate the document into lines, lines to words and words to characters which has been one of the major laboriousness in handwritten t ext recognition. The role of segmentation is a crucial in most tasks requiring image analysis. The success or failure of a task is often a direct consequence of the success or failure of segmentation. Handwritten text documents contain text in free flow ma nner, also writing style of users may different even sometimes same user’s handwriting are different in different time. That is why segmentation is difficult in case of handwritten text document. As this paper focuses on Gujarati language, it contains more curves, overlapping character & slopes. So, it is very difficult to do segmentation on it. In this paper we have applied some of the segmentation techniques to segment the handwritten Guajarati documen ts & reached to some conclusion .

Last modified: 2016-06-17 16:42:16