Classifying documents by distributed P2P clustering
dc.contributor.author | Eisenhardt, Martin | |
dc.contributor.author | Müller, Wolfgang | |
dc.contributor.author | Henrich, Andreas | |
dc.contributor.editor | Dittrich, Klaus R. | |
dc.contributor.editor | König, Wolfgang | |
dc.contributor.editor | Oberweis, Andreas | |
dc.contributor.editor | Rannenberg, Kai | |
dc.contributor.editor | Wahlster, Wolfgang | |
dc.date.accessioned | 2019-11-14T10:42:57Z | |
dc.date.available | 2019-11-14T10:42:57Z | |
dc.date.issued | 2003 | |
dc.description.abstract | Clustering documents into classes is an important task in many Information Retrieval (IR) systems. The resulting grouping makes it possible to describe the contents of a document collection in terms of the classes its documents fall into. Such a compact description is even more desirable when the document collection is spread across different computers and locations; document classes can then be used to describe each partial document collection in a conveniently short form that can easily be exchanged with other nodes on the network. Unfortunately, most clustering schemes cannot easily be distributed. Additionally, the cost of transferring all data to a central clustering service is prohibitive in large-scale systems. In this paper, we introduce an approach that is capable of classifying documents distributed across a Peer-to-Peer (P2P) network. We present measurements taken on a P2P network using synthetic and real-world data sets. | en |
dc.identifier.isbn | 3-88579-364-4 | |
dc.identifier.pissn | 1617-5468 | |
dc.identifier.uri | https://dl.gi.de/handle/20.500.12116/29710 | |
dc.language.iso | en | |
dc.publisher | Gesellschaft für Informatik e.V. | |
dc.relation.ispartof | INFORMATIK 2003 – Innovative Informatikanwendungen, Band 2, Beiträge der 33. Jahrestagung der Gesellschaft für Informatik e.V. (GI) | |
dc.relation.ispartofseries | Lecture Notes in Informatics (LNI) - Proceedings, Volume P-35 | |
dc.title | Classifying documents by distributed P2P clustering | en |
dc.type | Text/Conference Paper | |
gi.citation.endPage | 291 | |
gi.citation.publisherPlace | Bonn | |
gi.citation.startPage | 286 | |
gi.conference.date | September 29 - October 2, 2003 | |
gi.conference.location | Frankfurt am Main | |
gi.conference.sessiontitle | Regular Research Papers |
Files
Original bundle
- Name: GI-Proceedings.35-51.pdf
- Size: 103.12 KB
- Format: Adobe Portable Document Format