Talks and Poster Presentations (with Proceedings-Entry):
M. Goebel, T. Hassan, E. Oro, G. Orsi:
"A Methodology for Evaluating Algorithms for Table Understanding in PDF Documents";
Talk: DocEng 2012 ACM Symposium on Document Engineering,
Pairs, France;
2012-09-04
- 2012-09-07; in: "DocEng '12 Proceedings of the 2012 ACM symposium on Document engineering",
ACM New York, NY, USA ©2012,
New York, USA
(2012),
ISBN: 978-1-4503-1116-8;
45
- 48.
English abstract:
This paper presents a methodology for the evaluation of
table understanding algorithms for PDF documents. The
evaluation takes into account three major tasks: table detection,
table structure recognition and functional analysis.
We provide a general and
exible output model for
each task along with corresponding evaluation metrics and
methods. We also present a methodology for collecting
and ground-truthing PDF documents based on consensusreaching
principles and provide a publicly available groundtruthed
dataset.
Categories and Subject Descriptors: I.7.5
[Document and Text Processing]: Document Capture|
document analysis; H.3.4 [Information Storage and Re-
trieval]: Systems and Software|performance evaluation
Keywords: Table processing, metrics, ground-truth
dataset, performance evaluation, document analysis, document
understanding
"Official" electronic version of the publication (accessed through its Digital Object Identifier - DOI)
http://dx.doi.org/10.1145/2361354.2361365
Created from the Publication Database of the Vienna University of Technology.