Expected utility of content blocks in web content extraction

dc.contributor.author: Kowalkiewicz, Marek
dc.contributor.editor: Abramowicz, Witold
dc.contributor.editor: Mayr, Heinrich C.
dc.date.accessioned: 2019-08-12T12:38:32Z
dc.date.available: 2019-08-12T12:38:32Z
dc.date.issued: 2006
dc.description.abstract: In this paper we discuss the possible application of new concepts in web content extraction: utility assessment, utility annealing, and dynamic aggregated document generation. After an analysis of the state of the art in web content extraction, we present the results of a survey of Polish managers. The discussion covers a web content extraction system with possible extensions that may help tackle the information overload problem. The discussed extensions go beyond the current state of the art. Utility assessment takes an economic view of the value of information, while utility annealing allows for removing content blocks that cover information already acquired from other content blocks. Combining existing content block extraction technology with the new concepts proposed in the paper makes it possible to generate aggregated documents dynamically. [en]
dc.identifier.isbn: 2-88579-179-X
dc.identifier.pissn: 1617-5468
dc.identifier.uri: https://dl.gi.de/handle/20.500.12116/24152
dc.language.iso: en
dc.publisher: Gesellschaft für Informatik e.V.
dc.relation.ispartof: Business Information Systems – 9th International Conference on Business Information Systems (BIS 2006)
dc.relation.ispartofseries: Lecture Notes in Informatics (LNI) - Proceedings, Volume P-85
dc.title: Expected utility of content blocks in web content extraction [en]
dc.type: Text/Conference Paper
gi.citation.endPage: 352
gi.citation.publisherPlace: Bonn
gi.citation.startPage: 342
gi.conference.date: May 31-June 2 2006
gi.conference.location: Klagenfurt, Austria
gi.conference.sessiontitle: Regular Research Papers

Files

Original bundle
Name: GI-Proceedings-85-29.pdf
Size: 265.36 KB
Format: Adobe Portable Document Format