A comparison of CUDA and OpenACC: Accelerating the Tsunami Simulation EasyWave

dc.contributor.author: Christgau, Steffen
dc.contributor.author: Spazier, Johannes
dc.contributor.author: Schnor, Bettina
dc.contributor.author: Hammitzsch, Martin
dc.contributor.author: Babeyko, Andrey
dc.contributor.author: Wächter, Joachim
dc.date.accessioned: 2017-06-29T16:28:10Z
dc.date.available: 2017-06-29T16:28:10Z
dc.date.issued: 2014
dc.description.abstract: This paper presents a GPU-accelerated version of the tsunami simulation EasyWave. Different optimization techniques, all following the principle of locality, were applied to the application on two GPU generations (Nvidia Tesla and Fermi), and their performance impact was analyzed for both. The Fermi GPU not only has more cores, but also possesses an L2 cache shared by all streaming multiprocessors. The results show that even the most heavily tuned code on the Tesla does not reach the performance of the unoptimized code on the Fermi GPU. Further, a comparison between CUDA and OpenACC shows that the platform-independent approach does not reach the speed of the native CUDA code. A deeper analysis shows that memory access patterns have a critical impact on the compute kernels' performance, although this seems to be caused by the compiler in use. [en]
dc.identifier.pissn: 0177-0454
dc.language.iso: en
dc.publisher: Gesellschaft für Informatik e.V., Fachgruppe PARS
dc.relation.ispartof: PARS-Mitteilungen: Vol. 31, Nr. 1
dc.title: A comparison of CUDA and OpenACC: Accelerating the Tsunami Simulation EasyWave [en]
dc.type: Text/Journal Article
gi.citation.publisherPlace: Berlin

Files

Original bundle
Name: paper08.pdf
Size: 1.97 MB
Format: Adobe Portable Document Format