Logo des Repositoriums
 
Konferenzbeitrag

Approaches to feature selection for document categorization

Lade...
Vorschaubild

Volltext URI

Dokumententyp

Text/Conference Paper

Zusatzinformation

Datum

2003

Zeitschriftentitel

ISSN der Zeitschrift

Bandtitel

Verlag

Gesellschaft für Informatik e.V.

Zusammenfassung

One of the problems faced by document categorization is that terms present in the collection of example documents are numerous. From the point of view of coherence between the models used in document categorization, we analyses the frameworks of both k-NN and NB categorization models and feature selection problem. Two algorithms CBA and IBA to feature selection are proposed. The empirical results done with k-NN and NB classifiers show that the coherence between models in the categorization system can bring benefits for performance.

Beschreibung

Kou, Huaizhong; Gardarin, Georges; Zeitouni, Karina (2003): Approaches to feature selection for document categorization. Natural language processing and information systems. Bonn: Gesellschaft für Informatik e.V.. PISSN: 1617-5468. ISBN: 3-88579-358-X. pp. 141-154. Regular Research Papers. Burg (Spreewald). June 2003

Schlagwörter

Zitierform

DOI

Tags