Digital Library Gesellschaft für Informatik e.V.
  • Lecture Notes in Informatics
  • Proceedings
  • ISTA - Information Systems Technology and its Applications
  • P063 - ISTA 2005 - Information Systems Technology and its Applications

A generalized framework for an ontology-based data-extraction system

Author:
Wessman, Alan;
Liddle, Stephen W.;
Embley, David W.
Abstract
Extraction of information from semi-structured or unstructured documents, such as web pages, is a useful yet complex task. Ontologies can achieve a high degree of accuracy in data extraction while maintaining resiliency in the face of document changes. Ontologies do not, however, diminish the complexity of a data-extraction system. As research in the field progresses, the need for a modular data-extraction system that decouples the associated processes continues to grow. In this paper we report on the implementation of such a system. The nature of our framework allows new algorithms and ideas to be incorporated into a data-extraction system without requiring wholesale rewrites of a large part of the system's source code. It allows researchers to focus their attention on parts of the system relevant to their research without having to worry about introducing incompatibilities with the remaining components. We demonstrate the value of the framework by providing an implementation that exhibits appropriate characteristics.
  • Citation
Wessman, A., Liddle, S. W. & Embley, D. W. (2005). A generalized framework for an ontology-based data-extraction system. In: Kaschek, R., Mayr, H. C. & Liddle, S. (Eds.), Information systems technology and its applications, ISTA' 2005. Bonn: Gesellschaft für Informatik e.V. (pp. 239-253).
  • BibTeX
@inproceedings{mci/Wessman2005,
  author    = {Wessman, Alan and Liddle, Stephen W. and Embley, David W.},
  title     = {A generalized framework for an ontology-based data-extraction system},
  booktitle = {Information systems technology and its applications, ISTA' 2005},
  year      = {2005},
  editor    = {Kaschek, Roland and Mayr, Heinrich C. and Liddle, Stephen},
  pages     = {239--253},
  publisher = {Gesellschaft für Informatik e.V.},
  address   = {Bonn}
}
Files                      Size      Format
GI-Proceedings.63-20.pdf   341.4Kb   PDF


More Info

ISBN: 3-88579-392-X
ISSN: 1617-5468
Date: 2005
Language: English (en)
Content Type: Text/Conference Paper
Collections
  • P063 - ISTA 2005 - Information Systems Technology and its Applications

Gesellschaft für Informatik e.V. (GI), Contact: GI office
This digital library is based on DSpace.

 

 

