Talks and Poster Presentations (with Proceedings-Entry):
A. Gabdulkhakova, T. Hassan:
"Document Understanding of Graphical Content in Natively Digital PDF Documents";
Talk: DocEng'12 ACM Symposium on Document Engineering,
- 2012-09-07; in: "DocEng'12 Proceedings of the 2012 ACM symposium on Document engineering",
This paper presents an object-based method for analysing the content drawn by graphical operators in natively digital PDF documents. We propose that graphical content in a document can be classified either as structural or non-structural and present an output model for our analysis result. Heuristic techniques are used to group the instructions into regions and determine their logical role in the document's structure. Experimental results demonstrate the effectiveness of the algorithm.
Electronic version of the publication:
Created from the Publication Database of the Vienna University of Technology.