Expected utility of content blocks in web content extraction

Kowalkiewicz, Marek

dc.contributor.author	Kowalkiewicz, Marek
dc.contributor.editor	Abramowicz, Witold
dc.contributor.editor	Mayr, Heinrich C.
dc.date.accessioned	2019-08-12T12:38:32Z
dc.date.available	2019-08-12T12:38:32Z
dc.date.issued	2006
dc.description.abstract	In this paper we discuss the possible application of new concepts in web content extraction: utility assessment, utility annealing, and dynamic aggregated document generation. After an analysis of the state of the art in web content extraction, the results of a survey study among Polish managers are presented. The discussion covers a web content extraction system with possible extensions that may help tackle the information overload problem. The discussed extensions go beyond the current state of the art. Utility assessment takes an economic view of the value of information, while utility annealing allows for removing content blocks that cover information already acquired from other content blocks. Thanks to existing content block extraction technology and the new concepts proposed in the paper, it is possible to dynamically generate aggregated documents.	en
dc.identifier.isbn	2-88579-179-X
dc.identifier.pissn	1617-5468
dc.identifier.uri	https://dl.gi.de/handle/20.500.12116/24152
dc.language.iso	en
dc.publisher	Gesellschaft für Informatik e.V.
dc.relation.ispartof	Business Information Systems – 9th International Conference on Business Information Systems (BIS 2006)
dc.relation.ispartofseries	Lecture Notes in Informatics (LNI) - Proceedings, Volume P-85
dc.title	Expected utility of content blocks in web content extraction	en
dc.type	Text/Conference Paper
gi.citation.endPage	352
gi.citation.publisherPlace	Bonn
gi.citation.startPage	342
gi.conference.date	May 31-June 2 2006
gi.conference.location	Klagenfurt, Austria
gi.conference.sessiontitle	Regular Research Papers

Files

Original bundle

Name: GI-Proceedings-85-29.pdf
Size: 265.36 KB
Format: Adobe Portable Document Format

Collections

P085 - Business Information Systems - 9th International Conference 2006