Please use this identifier to cite or link to this item: http://theses.iitj.ac.in:8080/jspui/handle/123456789/164
Title: Some Methods for Annotation Localization and Writer Identification for Processing Annotated Documents
Researcher : Pandey, Shilpa
Supervisor: Harit, Gaurav
Department: Computer Science and Engineering
Issue Date: May-2019
Citation: Shilpa Pandey. (2019). Some Methods for Annotation Localization and Writer Identification for Processing Annotated Documents (Doctor's thesis). Indian Institute of Technology Jodhpur, Jodhpur.
Abstract: Documents containing mixed types of text content (printed and handwritten) are proliferating in business and academic environments. They result frequently from annotating printed documents such as bills, administrative forms, birth-certificates, letters, etc. A document can be annotated in many ways. Annotations can be handwritten text, underlines, cuts, marks, special symbols of irregular shapes, and handmade drawings. The annotated text can be multi-oriented and multi-script. Several methods for extracting annotations are outlined in the literature. Most of these systems extract annotations in controlled scenario wherein the layout is predictable, like bank checks, postal address, drafts, forms etc. Extracting handwritten annotations with non-predictable layouts in a real environment remains a difficult task because annotations can be complex due to multi-oriented handwritten text and marks which may overlap with printed text. This thesis is aimed at developing methods for localizing complex annotations in non-predictable layouts and identifying the writer for the handwritten words. We develop methods for localizing annotations, categorizing them as textual and symbolic annotations. We further sub-categorize symbolic annotations as underlines and encirclements; and textual annotations as marginal text and inline text. We apply our methods to localize annotations written on documents such as conference papers, articles, books, office documents etc. We use statistical spectral partitioning to segment out annotations from printed text. In this approach, we work with a reduced feature set to efficiently extract the annotations on a cluttered background. We develop a new feature called Envelope Straightness to enhance the feature set. This has improved performance over the state-of-the-art features. We then investigate the use of two top-down visual saliency models for categorizing annotations. The first model makes use of supervised learning in the form of conditional random fields with a sparse encoding of feature vectors. The second model makes use of a weakly supervised learning formulation for discriminant saliency. The experimental results corroborate our hypothesis that our attention gets directed towards annotated regions in an image, and therefore, top-down saliency models can be learned to give high saliency values for the annotated regions. Along with supervision, these models take advantage of the structure and context of the annotations. For scenarios where multiple writers annotate on the same page we develop a method to identify the writers for the handwritten words. A sliding window technique is used to extract allographic features for sub-word portions. We formulate a supervised framework and exploit the discriminative properties of the features that belong to the same cluster. We propose a new technique for separating ascenders and descenders of hand-written words from its core-region. We use the structural properties of ascenders and descenders to identify the writers of the handwritten words. The work also contributes towards dataset creation and ground truth generation for the various problems addressed in this thesis.
Pagination: xv, 158p.
URI: http://theses.iitj.ac.in:8080/jspui/handle/123456789/164
Accession No.: TP00041
Appears in Collections:Ph. D. Theses

Files in This Item:
File Description SizeFormat 
TP00041.pdf477.21 MBAdobe PDFView/Open    Request a copy
01_title.pdf54.65 kBAdobe PDFView/Open
02_declaration.pdf41.04 kBAdobe PDFView/Open
03_certificate.pdf40.71 kBAdobe PDFView/Open
04_abstract.pdf32.72 kBAdobe PDFView/Open
05_acknowledgements.pdf60.92 kBAdobe PDFView/Open
06_contents.pdf67.79 kBAdobe PDFView/Open
07_list_of_figures.pdf56.58 kBAdobe PDFView/Open
08_list_of_tables.pdf49.13 kBAdobe PDFView/Open
09_list_of_symbols.pdf59.32 kBAdobe PDFView/Open
10_list_of_abbreviations.pdf46.76 kBAdobe PDFView/Open
11_chapter 1.pdf4.65 MBAdobe PDFView/Open
12_chapter 2.pdf6.4 MBAdobe PDFView/Open
13_chapter 3.pdf277.9 kBAdobe PDFView/Open
14_chapter 4.pdf13.75 MBAdobe PDFView/Open
15_chapter 5.pdf10.13 MBAdobe PDFView/Open
16_chapter 6.pdf23.83 MBAdobe PDFView/Open
17_chapter 7.pdf2.86 MBAdobe PDFView/Open
18_chapter 8.pdf1.8 MBAdobe PDFView/Open
19_chapter 9 conclusion and future scope.pdf110.59 kBAdobe PDFView/Open
20_references.pdf272.39 kBAdobe PDFView/Open


Items in IIT Jodhpur Theses Repository are protected by copyright, with all rights reserved, unless otherwise indicated.