Result: Form item extraction based on line searching

Title:
Form item extraction based on line searching
Contributors:
READ (READ), Laboratoire Lorrain de Recherche en Informatique et ses Applications (LORIA), Institut National de Recherche en Informatique et en Automatique (Inria)-Université Henri Poincaré - Nancy 1 (UHP)-Université Nancy 2-Institut National Polytechnique de Lorraine (INPL)-Centre National de la Recherche Scientifique (CNRS)-Institut National de Recherche en Informatique et en Automatique (Inria)-Université Henri Poincaré - Nancy 1 (UHP)-Université Nancy 2-Institut National Polytechnique de Lorraine (INPL)-Centre National de la Recherche Scientifique (CNRS), Rangachar Kasturi and Karl Tombre
Source:
International Workshop on Graphics Recognition - GRCE. :69-79
Publisher Information:
CCSD; Springer Berlin / Heidelberg, 1995.
Publication Year:
1995
Collection:
collection:CNRS
collection:INPL
collection:LABO-LORIA-SET
collection:LORIA2
collection:UNIV-LORRAINE
collection:LORIA
collection:AM2I-UL
Original Identifier:
HAL:
Document Type:
Conference conferenceObject<br />Conference papers
Language:
English
Relation:
info:eu-repo/semantics/altIdentifier/doi/10.1007/3-540-61226-2_7
DOI:
10.1007/3-540-61226-2_7
Accession Number:
edshal.inria.00537324v1
Database:
HAL

Further Information

The original publication is available at "www.springerlink.com"
This paper presents an item searching method which has been applied to various kinds of forms. This approach is based on line detection through the Hough transform. After obtaining the straight lines, Hough directions are used to detect the real segments in the image. Segments can correspond either to continuous line, or to black parts of dashed or dotted lines. So, the segments are grouped together and classified between both adjacent line crossing points. Items are located by searching the minimum cycles of the graph constructed from the line intersection points. The last step consists of verifying the line classes based on the homogeneity hypothesis of item sides. This method was applied to French Tax forms and tables coming from scientific publications. The experimental results have demonstrated the robustness and the reliability of such an approach to various forms with different types of item delimiters.