[Zurück]


Vorträge und Posterpräsentationen (mit Tagungsband-Eintrag):

L. Andersson, M. Lupu, J. Palotti, F. Piroi, A. Hanbury, A. Rauber:
"Insight to Hyponymy Lexical Relation Extraction in the Patent Genre Versus Other Text Genres";
Vortrag: First International Workshop on Patent Mining and Its Applications (IPaMin 2014), Hildesheim, Germany; 06.10.2014 - 07.10.2014; in: "Patent Mining and Its Applications", CEUR Workshop Proceedings, 1292 (2014), ISSN: 1613-0073.



Kurzfassung englisch:
Due to the large amount of available patent data, it is no longer feasible for industry actors to manually create their own termi- nology lists and ontologies. Furthermore, domain specific the- sauruses are rarely accessible to the research community. In this paper we present extraction of hyponymy lexical relations con- ducted on patent text using lexico-syntactic patterns. We explore the lexico-syntactic patterns. Since this kind of extraction involves Natural Language Processing we also compare the extractions made with and without domain adaptation of the extraction pipeline. We also deployed our modified extraction method to other text genres in order to demonstrate the methodīs portability to other text do- mains. From our study we conclude that the lexico-syntactic pat- terns are portable to domain specific text genre such as the patent genre. We observed that general Natural Language Processing tools, when not adapted to the patent genre, reduce the amount of correct hyponymy lexical relation extractions and increase the number of incomplete extractions. This was also observed in other domain specific text genres.

Schlagworte:
patent, retrieval, hyponymy, text analysis, text extraction