Logo des Repositoriums
 

A generalized framework for an ontology-based data-extraction system

dc.contributor.authorWessman, Alan
dc.contributor.authorLiddle, Stephen W.
dc.contributor.authorEmbley, David W.
dc.contributor.editorKaschek, Roland
dc.contributor.editorMayr, Heinrich C.
dc.contributor.editorLiddle, Stephen
dc.date.accessioned2019-10-11T08:54:15Z
dc.date.available2019-10-11T08:54:15Z
dc.date.issued2005
dc.description.abstractExtraction of information from semi-structured or unstructured documents, such as web pages, is a useful yet complex task. Ontologies can achieve a high degree of accuracy in data extraction while maintaining resiliency in the face of document changes. Ontologies do not, however, diminish the complexity of a data-extraction system. As research in the field progresses, the need for a modular data-extraction system that decouples the associated processes continues to grow. In this paper we report on the implementation of such a system. The nature of our framework allows new algorithms and ideas to be incorporated into a data extraction system without requiring wholesale rewrites of a large part of the system's source code. It allows researchers to focus their attention on parts of the system relevant to their research without having to worry about introducing incompatibilities with the remaining components. We demonstrate the value of the framework by providing an implementation that exhibits appropriate characteristics.en
dc.identifier.isbn3-88579-392-X
dc.identifier.pissn1617-5468
dc.identifier.urihttps://dl.gi.de/handle/20.500.12116/28350
dc.language.isoen
dc.publisherGesellschaft für Informatik e.V.
dc.relation.ispartofInformation systems technology and its applications, ISTA' 2005
dc.relation.ispartofseriesLecture Notes in Informatics (LNI) - Proceedings, Volume P-63
dc.titleA generalized framework for an ontology-based data-extraction systemen
dc.typeText/Conference Paper
gi.citation.endPage253
gi.citation.publisherPlaceBonn
gi.citation.startPage239
gi.conference.date23.-25. May 2005
gi.conference.locationPalmerston North, New Zealand
gi.conference.sessiontitleRegular Research Papers

Dateien

Originalbündel
1 - 1 von 1
Lade...
Vorschaubild
Name:
GI-Proceedings.63-20.pdf
Größe:
341.44 KB
Format:
Adobe Portable Document Format