Logo des Repositoriums
 

A Hybrid Information Extraction Approach Exploiting Structured Data Within a Text Mining Process

dc.contributor.authorKiefer, Cornelia
dc.contributor.authorReimann, Peter
dc.contributor.authorMitschang, Bernhard
dc.contributor.editorGrust, Torsten
dc.contributor.editorNaumann, Felix
dc.contributor.editorBöhm, Alexander
dc.contributor.editorLehner, Wolfgang
dc.contributor.editorHärder, Theo
dc.contributor.editorRahm, Erhard
dc.contributor.editorHeuer, Andreas
dc.contributor.editorKlettke, Meike
dc.contributor.editorMeyer, Holger
dc.date.accessioned2019-04-11T07:21:14Z
dc.date.available2019-04-11T07:21:14Z
dc.date.issued2019
dc.description.abstractMany data sets encompass structured data fields with embedded free text fields. The text fields allow customers and workers to input information which cannot be encoded in structured fields. Several approaches use structured and unstructured data in isolated analyses. The result of isolated mining of structured data fields misses crucial information encoded in free text. The result of isolated text mining often mainly repeats information already available from structured data. The actual information gain of isolated text mining is thus limited. The main drawback of both isolated approaches is that they may miss crucial information. The hybrid information extraction approach suggested in this paper adresses this issue. Instead of extracting information that in large parts was already available beforehand, it extracts new, valuable information from free texts. Our solution exploits results of analyzing structured data within the text mining process, i.e., structured information guides and improves the information extraction process on textual data. Our main contributions comprise the description of the concept of hybrid information extraction as well as a prototypical implementation and an evaluation with two real-world data sets from aftersales and production with English and German free text fields.en
dc.identifier.doi10.18420/btw2019-10
dc.identifier.isbn978-3-88579-683-1
dc.identifier.pissn1617-5468
dc.identifier.urihttps://dl.gi.de/handle/20.500.12116/21694
dc.language.isoen
dc.publisherGesellschaft für Informatik, Bonn
dc.relation.ispartofBTW 2019
dc.relation.ispartofseriesLecture Notes in Informatics (LNI) – Proceedings, Volume P-289
dc.subjectinformation extraction
dc.subjectclustering
dc.subjecttext mining
dc.subjectfree text fields
dc.titleA Hybrid Information Extraction Approach Exploiting Structured Data Within a Text Mining Processen
gi.citation.endPage168
gi.citation.startPage149
gi.conference.date4.-8. März 2019
gi.conference.locationRostock
gi.conference.sessiontitleWissenschaftliche Beiträge

Dateien

Originalbündel
1 - 1 von 1
Lade...
Vorschaubild
Name:
B3-1.pdf
Größe:
1.37 MB
Format:
Adobe Portable Document Format