GI LogoGI Logo
  • Login
Digital Library
    • All of DSpace

      • Communities & Collections
      • Titles
      • Authors
      • By Issue Date
      • Subjects
    • This Collection

      • Titles
      • Authors
      • By Issue Date
      • Subjects
Digital Library Gesellschaft für Informatik e.V.
GI-DL
    • English
    • Deutsch
  • English 
    • English
    • Deutsch
View Item 
  •   DSpace Home
  • Lecture Notes in Informatics
  • Proceedings
  • ISTA - Information Systems Technolopgy and its Applications
  • P063 - ISTA 2005 - Information Systems Technolopgy and its Applications
  • View Item
JavaScript is disabled for your browser. Some features of this site may not work without it.
  •   DSpace Home
  • Lecture Notes in Informatics
  • Proceedings
  • ISTA - Information Systems Technolopgy and its Applications
  • P063 - ISTA 2005 - Information Systems Technolopgy and its Applications
  • View Item

On the impact of document representation on classifier performance in e-mail categorization

Author:
Berger, Helmut [DBLP] ;
Köhle, Monika [DBLP] ;
Merkl, Dieter [DBLP]
Abstract
This paper provides an analysis of multi-class e-mail categorization performance. In order to investigate this issue, the quality of various classification algorithms based on two distinct document representation formalisms is compared. In particular, both a standard word-based document representation as well as a character n-gram document representation is used. The latter is regarded as highly noise-tolerant and was originally proposed for automatic language identification and as a convenient means for producing compact document indices. Furthermore the impact of using available e-mail specific meta-information on classification performance is explored and the findings are presented.
  • Citation
  • BibTeX
Berger, H., Köhle, M. & Merkl, D., (2005). On the impact of document representation on classifier performance in e-mail categorization. In: Kaschek, R., Mayr, H. C. & Liddle, S. (Hrsg.), Information systems technology and its applications, ISTA' 2005. Bonn: Gesellschaft für Informatik e.V.. (S. 19-30).
@inproceedings{mci/Berger2005,
author = {Berger, Helmut AND Köhle, Monika AND Merkl, Dieter},
title = {On the impact of document representation on classifier performance in e-mail categorization},
booktitle = {Information systems technology and its applications, ISTA' 2005},
year = {2005},
editor = {Kaschek, Roland AND Mayr, Heinrich C. AND Liddle, Stephen} ,
pages = { 19-30 },
publisher = {Gesellschaft für Informatik e.V.},
address = {Bonn}
}
DateienGroesseFormatAnzeige
GI-Proceedings.63-2.pdf237.0Kb PDF View/Open

Haben Sie fehlerhafte Angaben entdeckt? Sagen Sie uns Bescheid: Send Feedback

More Info

ISBN: 3-88579-392-X
ISSN: 1617-5468
xmlui.MetaDataDisplay.field.date: 2005
Language: en (en)
Content Type: Text/Conference Paper
Collections
  • P063 - ISTA 2005 - Information Systems Technolopgy and its Applications [22]

Show full item record


About uns | FAQ | Help | Imprint | Datenschutz

Gesellschaft für Informatik e.V. (GI), Kontakt: Geschäftsstelle der GI
Diese Digital Library basiert auf DSpace.

 

 


About uns | FAQ | Help | Imprint | Datenschutz

Gesellschaft für Informatik e.V. (GI), Kontakt: Geschäftsstelle der GI
Diese Digital Library basiert auf DSpace.