A REVIEW ON SEGMENTATION TECHNIQUES OF LINES, WORDS AND CHARACTERS ON GUJARATI HANDWRITTEN DOCUMENT USING OCR
Journal: International Journal of Engineering Sciences & Research Technology (IJESRT) (Vol.5, No. 6)
Publication Date: 2016-06-30
Authors : Nilam Mistry; Sameer Vashi; Vidhi Patel; Kunal Shah; Denish Rixawa p la;
Page : 198-208
Keywords : Connected Components; Gujarati Script; Segmentation;
OCR (optical character recognition) is a technique for converting a scanned handwritten or printed document into a digital format that a computer can process. It is an important and challenging task in many computer vision applications. Segmentation is generally the first stage in any attempt to analyse or interpret an image automatically. Here, segmentation means separating the document into lines, lines into words, and words into characters, which has been one of the major difficulties in handwritten text recognition. Segmentation plays a crucial role in most tasks requiring image analysis: the success or failure of a task is often a direct consequence of the success or failure of segmentation. Handwritten documents contain free-flowing text, writing styles differ between writers, and even the same writer's handwriting can vary over time. That is why segmentation is difficult for handwritten text documents. This paper focuses on the Gujarati script, which contains many curves, overlapping characters, and slopes, so segmenting it is particularly difficult. In this paper we apply several segmentation techniques to handwritten Gujarati documents and draw some conclusions.
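The line, word, and character hierarchy described in the abstract can be illustrated with a projection-profile pass. This is a minimal sketch and not necessarily one of the techniques the paper evaluates. It assumes the page is already binarized into a list of rows with 1 for ink and 0 for background. The names `runs`, `segment_lines`, `segment_columns`, and the `min_gap` threshold are hypothetical and chosen for illustration only.

```python
def runs(profile, min_gap=1):
    """Return (start, end) spans of non-zero entries in profile.

    Spans separated by fewer than min_gap zeros are merged, so a larger
    min_gap groups characters into words. End indices are exclusive.
    """
    spans, start = [], None
    for i, value in enumerate(profile):
        if value and start is None:
            start = i                      # a run of ink begins
        elif not value and start is not None:
            spans.append((start, i))       # the run ends at a blank entry
            start = None
    if start is not None:
        spans.append((start, len(profile)))

    merged = []
    for s, e in spans:
        if merged and s - merged[-1][1] < min_gap:
            merged[-1] = (merged[-1][0], e)  # gap too small: same unit
        else:
            merged.append((s, e))
    return merged


def segment_lines(img):
    """Split a page into text lines using the horizontal projection profile."""
    row_profile = [sum(row) for row in img]
    return runs(row_profile)


def segment_columns(img, top, bottom, min_gap=1):
    """Split one text line (rows top..bottom) using the vertical profile."""
    width = len(img[0])
    col_profile = [sum(img[r][c] for r in range(top, bottom)) for c in range(width)]
    return runs(col_profile, min_gap)


# Tiny synthetic page: two text lines, the first holding two "words".
page = [
    [0, 0, 0, 0, 0, 0, 0, 0],
    [1, 1, 0, 1, 0, 0, 1, 1],
    [1, 0, 0, 1, 0, 0, 0, 1],
    [0, 0, 0, 0, 0, 0, 0, 0],
    [0, 1, 1, 0, 0, 0, 0, 0],
]

lines = segment_lines(page)
top, bottom = lines[0]
words = segment_columns(page, top, bottom, min_gap=2)
chars = segment_columns(page, top, bottom, min_gap=1)
print(lines, words, chars)
```

A sketch like this only works when blank rows and columns actually separate the units. Slanted lines and overlapping or touching characters, which the abstract names as typical of handwritten Gujarati, leave no blank gap in the profile, which is one reason connected-component approaches (listed in the keywords) are often considered instead.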
Last modified: 2016-06-17 16:42:16