Evaluation of GPU-Compression Algorithms for CUDA-Aware MPI

dc.contributor.author	Vogel, Marco
dc.contributor.author	Oden, Lena
dc.date.accessioned	2024-09-25T11:27:24Z
dc.date.available	2024-09-25T11:27:24Z
dc.date.issued	2024
dc.description.abstract	This study evaluates an efficient compression algorithm suitable for use with CUDA-aware MPI, aiming to lessen the latency of extensive GPU message transfers. We examine the performance of various compression algorithms on distinct datasets. Ndzip emerges as the optimal compression algorithm for our needs. Our findings reveal that large message latency can improve depending on the dataset. However, latency may increase for non-compressible data due to overhead when using compression. With well-compressible data, the Cannon algorithm for dense matrix-matrix multiplication can improve performance by up to 30%. For data that is not highly compressible, there’s only a minor performance penalty, as the compression overhead remains relatively small.	en
dc.identifier.issn	0177-0454
dc.identifier.uri	https://dl.gi.de/handle/20.500.12116/44641
dc.language.iso	en
dc.pubPlace	Aachen
dc.publisher	Gesellschaft für Informatik e.V., Fachgruppe PARS
dc.relation.ispartof	PARS-Mitteilungen: Vol. 36
dc.title	Evaluation of GPU-Compression Algorithms for CUDA-Aware MPI	en
dc.type	Text/Journal Article
mci.reference.pages	37-46

Dateien

1 - 1 von 1