

Talks and Poster Presentations (with Proceedings Entry):

W. Gatterbauer, B. Krüpl, W. Holzinger, M. Herzog:
"Web Information Extraction Using Eupeptic Data in Web Tables";
Talk: RAWS 2005, Tocna, Czech Republic; 14.09.2005 - 16.09.2005; in: "Proceedings of the 1st International Workshop on Representation and Analysis of Web Space", V. Snásel, V. Svátek (eds.); Faculty of Electrical Engineering and Computer Science, VSB - Technical University of Ostrava, (2005), ISBN: 80-248-0864-1; pp. 41 - 48.



English abstract:
By leveraging the redundant information on the Web, we are
building a Web information extraction system that concentrates on
eupeptic data in Web tables. We use the term eupeptic to describe
representations of information that allow for easy
interpretation of the subject-predicate-object nature of
individual data items. The system mimics a human approach to
information gathering. It explicitly uses visual cues on rendered
Web pages to locate eupeptic data; it uses keywords to identify
relevant chunks of data that are processed later at a deeper
level; and it expands its initial search to include more pages
when it spots new relevant information.


Online library catalogue of TU Wien:
http://aleph.ub.tuwien.ac.at/F?base=tuw01&func=find-c&ccl_term=AC05936218



Associated projects:
Project lead Reinhard Pichler:
Know it All and Know it Right: High Quality Mining in the Web


Created from the Publication Database of the Vienna University of Technology.