Some Methods for Annotation Localization and Writer Identification for Processing Annotated Documents

Please use this identifier to cite or link to this item: http://theses.iitj.ac.in:8080/jspui/handle/123456789/164

Title:	Some Methods for Annotation Localization and Writer Identification for Processing Annotated Documents
Researcher :	Pandey, Shilpa
Supervisor:	Harit, Gaurav
Department:	Computer Science and Engineering
Issue Date:	May-2019
Citation:	Shilpa Pandey. (2019). Some Methods for Annotation Localization and Writer Identification for Processing Annotated Documents (Doctor's thesis). Indian Institute of Technology Jodhpur, Jodhpur.
Abstract:	Documents containing mixed types of text content (printed and handwritten) are proliferating in business and academic environments. They result frequently from annotating printed documents such as bills, administrative forms, birth-certificates, letters, etc. A document can be annotated in many ways. Annotations can be handwritten text, underlines, cuts, marks, special symbols of irregular shapes, and handmade drawings. The annotated text can be multi-oriented and multi-script. Several methods for extracting annotations are outlined in the literature. Most of these systems extract annotations in controlled scenario wherein the layout is predictable, like bank checks, postal address, drafts, forms etc. Extracting handwritten annotations with non-predictable layouts in a real environment remains a difficult task because annotations can be complex due to multi-oriented handwritten text and marks which may overlap with printed text. This thesis is aimed at developing methods for localizing complex annotations in non-predictable layouts and identifying the writer for the handwritten words. We develop methods for localizing annotations, categorizing them as textual and symbolic annotations. We further sub-categorize symbolic annotations as underlines and encirclements; and textual annotations as marginal text and inline text. We apply our methods to localize annotations written on documents such as conference papers, articles, books, office documents etc. We use statistical spectral partitioning to segment out annotations from printed text. In this approach, we work with a reduced feature set to efficiently extract the annotations on a cluttered background. We develop a new feature called Envelope Straightness to enhance the feature set. This has improved performance over the state-of-the-art features. We then investigate the use of two top-down visual saliency models for categorizing annotations. The first model makes use of supervised learning in the form of conditional random fields with a sparse encoding of feature vectors. The second model makes use of a weakly supervised learning formulation for discriminant saliency. The experimental results corroborate our hypothesis that our attention gets directed towards annotated regions in an image, and therefore, top-down saliency models can be learned to give high saliency values for the annotated regions. Along with supervision, these models take advantage of the structure and context of the annotations. For scenarios where multiple writers annotate on the same page we develop a method to identify the writers for the handwritten words. A sliding window technique is used to extract allographic features for sub-word portions. We formulate a supervised framework and exploit the discriminative properties of the features that belong to the same cluster. We propose a new technique for separating ascenders and descenders of hand-written words from its core-region. We use the structural properties of ascenders and descenders to identify the writers of the handwritten words. The work also contributes towards dataset creation and ground truth generation for the various problems addressed in this thesis.
Pagination:	xv, 158p.
URI:	http://theses.iitj.ac.in:8080/jspui/handle/123456789/164
Accession No.:	TP00041
Appears in Collections:	Ph. D. Theses

Files in This Item:

File	Size	Format
TP00041.pdf	477.21 MB	Adobe PDF	View/Open Request a copy
01_title.pdf	54.65 kB	Adobe PDF	View/Open
02_declaration.pdf	41.04 kB	Adobe PDF	View/Open
03_certificate.pdf	40.71 kB	Adobe PDF	View/Open
04_abstract.pdf	32.72 kB	Adobe PDF	View/Open
05_acknowledgements.pdf	60.92 kB	Adobe PDF	View/Open
06_contents.pdf	67.79 kB	Adobe PDF	View/Open
07_list_of_figures.pdf	56.58 kB	Adobe PDF	View/Open
08_list_of_tables.pdf	49.13 kB	Adobe PDF	View/Open
09_list_of_symbols.pdf	59.32 kB	Adobe PDF	View/Open
10_list_of_abbreviations.pdf	46.76 kB	Adobe PDF	View/Open
11_chapter 1.pdf	4.65 MB	Adobe PDF	View/Open
12_chapter 2.pdf	6.4 MB	Adobe PDF	View/Open
13_chapter 3.pdf	277.9 kB	Adobe PDF	View/Open
14_chapter 4.pdf	13.75 MB	Adobe PDF	View/Open
15_chapter 5.pdf	10.13 MB	Adobe PDF	View/Open
16_chapter 6.pdf	23.83 MB	Adobe PDF	View/Open
17_chapter 7.pdf	2.86 MB	Adobe PDF	View/Open
18_chapter 8.pdf	1.8 MB	Adobe PDF	View/Open
19_chapter 9 conclusion and future scope.pdf	110.59 kB	Adobe PDF	View/Open
20_references.pdf	272.39 kB	Adobe PDF	View/Open

Show full item record

IIT Jodhpur Theses Repository

IIT Jodhpur Theses Repository preserves and enables easy access to Ph.D., M.Tech. and M.Sc. Theses to their community.