A SSE-Based Implementation of a Ray Tracer for the Comparision with Coprocessors in Hybrid Computing System

Hampel, VolkerMaehle, Erik2017-12-062017-12-062011https://dl.gi.de/handle/20.500.12116/8561In this paper we present and discuss our efforts to accelerate a sample application by using the Streaming SIMD Extensions (SSE) to the x64 instruction set. Several approaches to their integration into the source code are tested and evaluated against each other. They are assembler intrinsics, the initial source code combined with different compiler flags, and enhanced code for better SSE inference. Their performances are compared to benchmarks from two hybrid computing systems, which use a Field Programmable Gate Array (FPGA) and a Graphics Processing Unit (GPU), respectively. As the interfaces to manipulated/accelerated code sections are the same in all cases, comparability always is maintained.enGraphic Processing UnitField Programmable Gate ArrayCentral Processing UnitData Flow GraphCompiler OptimizationA SSE-Based Implementation of a Ray Tracer for the Comparision with Coprocessors in Hybrid Computing SystemText/Journal Article10.1007/BF033419920177-0454