[Back]


Talks and Poster Presentations (with Proceedings-Entry):

M. Goebel, T. Hassan, E. Oro, G. Orsi:
"A Methodology for Evaluating Algorithms for Table Understanding in PDF Documents";
Talk: DocEng 2012 ACM Symposium on Document Engineering, Pairs, France; 2012-09-04 - 2012-09-07; in: "DocEng '12 Proceedings of the 2012 ACM symposium on Document engineering", ACM New York, NY, USA 2012, New York, USA (2012), ISBN: 978-1-4503-1116-8; 45 - 48.



English abstract:
This paper presents a methodology for the evaluation of
table understanding algorithms for PDF documents. The
evaluation takes into account three major tasks: table detection,
table structure recognition and functional analysis.
We provide a general and
exible output model for
each task along with corresponding evaluation metrics and
methods. We also present a methodology for collecting
and ground-truthing PDF documents based on consensusreaching
principles and provide a publicly available groundtruthed
dataset.
Categories and Subject Descriptors: I.7.5
[Document and Text Processing]: Document Capture|
document analysis; H.3.4 [Information Storage and Re-
trieval]: Systems and Software|performance evaluation
Keywords: Table processing, metrics, ground-truth
dataset, performance evaluation, document analysis, document
understanding


"Official" electronic version of the publication (accessed through its Digital Object Identifier - DOI)
http://dx.doi.org/10.1145/2361354.2361365


Created from the Publication Database of the Vienna University of Technology.