Talks and Poster Presentations (without Proceedings-Entry):

G. Gottlob:
"Web Information Extraction with Lixto: Visual Logic and Expressive Power";
Talk: The 2003 IEEE/WIC International Conference on Web Intelligence (WI 2003), Halifax, Canada (invited); 2003-10-13 - 2003-10-16.



English abstract:
Intelligent Web agents and Web information agents need structured data from disparate Web sources for decision making. Wrapper technology is used to extract structured data from unstructured or poorly structured Web pages whose content changes continually. In this talk I will give a survey of the Lixto approach to Web data extraction. Lixto assists a user in semi-automatically creating wrapper programs by providing a fully visual and interactive user interface. Lixto wrappers are able to extract deeply nested XML data structures from HTML pages. Visual user operations on example pages are directly translated into logical conditions and rules in a declarative logic-based language. Basic features of this system will be demonstrated and theoretical results about its expressive power will be discussed. Time permitting, we will also discuss some more advanced features of the system and some industrial applications. Papers on Lixto are available at www.lixto.com (downloads section). This talk describes joint work with Robert Baumgartner, Sergio Flesca, Marcus Herzog and Christoph Koch.

Created from the Publication Database of the Vienna University of Technology.