Logo des Repositoriums
 

On the impact of document representation on classifier performance in e-mail categorization

dc.contributor.authorBerger, Helmut
dc.contributor.authorKöhle, Monika
dc.contributor.authorMerkl, Dieter
dc.contributor.editorKaschek, Roland
dc.contributor.editorMayr, Heinrich C.
dc.contributor.editorLiddle, Stephen
dc.date.accessioned2019-10-11T08:54:15Z
dc.date.available2019-10-11T08:54:15Z
dc.date.issued2005
dc.description.abstractThis paper provides an analysis of multi-class e-mail categorization performance. In order to investigate this issue, the quality of various classification algorithms based on two distinct document representation formalisms is compared. In particular, both a standard word-based document representation as well as a character n-gram document representation is used. The latter is regarded as highly noise-tolerant and was originally proposed for automatic language identification and as a convenient means for producing compact document indices. Furthermore the impact of using available e-mail specific meta-information on classification performance is explored and the findings are presented.en
dc.identifier.isbn3-88579-392-X
dc.identifier.pissn1617-5468
dc.identifier.urihttps://dl.gi.de/handle/20.500.12116/28349
dc.language.isoen
dc.publisherGesellschaft für Informatik e.V.
dc.relation.ispartofInformation systems technology and its applications, ISTA' 2005
dc.relation.ispartofseriesLecture Notes in Informatics (LNI) - Proceedings, Volume P-63
dc.titleOn the impact of document representation on classifier performance in e-mail categorizationen
dc.typeText/Conference Paper
gi.citation.endPage30
gi.citation.publisherPlaceBonn
gi.citation.startPage19
gi.conference.date23.-25. May 2005
gi.conference.locationPalmerston North, New Zealand
gi.conference.sessiontitleRegular Research Papers

Dateien

Originalbündel
1 - 1 von 1
Lade...
Vorschaubild
Name:
GI-Proceedings.63-2.pdf
Größe:
237.08 KB
Format:
Adobe Portable Document Format