Christina Feilmayr, Stefan Parzer, Birgit Pröll,
"Ontology-based Information Extraction from Tourism Web sites",
in Information Technology and Tourism, Vol. 11, No. 3, pp. 183-196, 2009, ISSN: 1943-4294
Original title:
Ontology-based Information Extraction from Tourism Web sites
Language of the title:
English
Original abstract:
The growing amount of semistructured and unstructured data on heterogeneously designed tourism
websites creates a need for information extraction (IE) mechanisms for semiautomatic data
acquisition in order to build tourism recommender systems or tourism Web portals. In this article
we analyze heterogeneity aspects of individually maintained accommodation websites and discuss
the applicability of different IE types and techniques for this domain. We then develop a
rule/ontology-based IE approach and discuss the components of our prototype crawler. Finally, we
discuss some relevant issues that emerged during the development and evaluation of the prototype.