[Zurück]


Vorträge und Posterpräsentationen (mit Tagungsband-Eintrag):

M. Goebel, T. Hassan, E. Oro, G. Orsi:
"A Methodology for Evaluating Algorithms for Table Understanding in PDF Documents";
Vortrag: DocEng 2012 ACM Symposium on Document Engineering, Pairs, France; 04.09.2012 - 07.09.2012; in: "DocEng '12 Proceedings of the 2012 ACM symposium on Document engineering", ACM New York, NY, USA ©2012, New York, USA (2012), ISBN: 978-1-4503-1116-8; S. 45 - 48.



Kurzfassung englisch:
This paper presents a methodology for the evaluation of
table understanding algorithms for PDF documents. The
evaluation takes into account three major tasks: table detection,
table structure recognition and functional analysis.
We provide a general and
exible output model for
each task along with corresponding evaluation metrics and
methods. We also present a methodology for collecting
and ground-truthing PDF documents based on consensusreaching
principles and provide a publicly available groundtruthed
dataset.
Categories and Subject Descriptors: I.7.5
[Document and Text Processing]: Document Capture|
document analysis; H.3.4 [Information Storage and Re-
trieval]: Systems and Software|performance evaluation
Keywords: Table processing, metrics, ground-truth
dataset, performance evaluation, document analysis, document
understanding


"Offizielle" elektronische Version der Publikation (entsprechend ihrem Digital Object Identifier - DOI)
http://dx.doi.org/10.1145/2361354.2361365


Erstellt aus der Publikationsdatenbank der Technischen Universität Wien.