High Impact Factor : 4.396 icon | Submit Manuscript Online icon |

A Novel Scheme for the Extraction of Textual Areas of a Scanned Document using Page Layout Segmentation Algorithm

Author(s):

Jyoti , OM INSTITUTE OF TECHNOLOGY & MANAGEMENT, HISAR (HARYANA); Amit Ranjan, OM INSTITUTE OF TECHNOLOGY & MANAGEMENT, HISAR (HARYANA)

Keywords:

Text Extraction, Page Layout Segmentation Algorithm

Abstract

Text extraction using page layout segmentation algorithm in a scanned document is a challenging task in the computer vision. This technique plays a very important role in providing useful and valuable information. Text extraction is a major component for document or textural image analysis. There are various factors texts in documents depend upon such as language, styles, font, sizes, color, background, orientation, fluctuating text lines, crossing or touching text lines. The ascending approach and many other methods to segmentation of scanned documents in the area of background, text, and photographs are considered. Such different algorithms can also be used in the printing industry for selective or enhanced scanning and object-oriented rendering. A page-layout-segmentation technique to extract text from scanned documents has proposed.

Other Details

Paper ID: IJSRDV7I40601
Published in: Volume : 7, Issue : 4
Publication Date: 01/07/2019
Page(s): 514-516

Article Preview

Download Article