2. Performance Evaluation and Benchmarking of Modern GPU
Transcription
2. Performance Evaluation and Benchmarking of Modern GPU
Performance Evaluation and Benchmarking of Modern GPU Architectures Text Desislava Ivanova, Vladimir Kadurin, Yanko Belov Technical University of Sofia, Computer Systems Department, Sofia, Bulgaria GPU advantages • The advantage of heterogeneous modern computing systems based on GPUs for parallel processing is undisputable • GPGPUs have many smaller "cores" which do less work than a conventional CPU core • The heavy parallel work is done by the GPU cores • GPUs offer great performance/consumed power ratio • Many of the already existing codes optimised for serial execution can be tweaked to take advantage of the GPU parallel processing power GPU Architecture The main GPU used for benchmarking is the GTX Titan It is Kepler based with code name GK110 GTX Titan has 15 SMX units each with 192 CUDA cores, for total of 2688 CUDA cores It has 6GB of GDDR5 memory and bandwidth of 288GB/s GPU Technologies Dynamic parallelism Hyper - Q Grid Management Unit GPUDirect Comparison GPUs NVIDIA GeForce GTX 680 AMD RADEON HD7970 • Benchmark programs Computational benchmarks - GEMM and FFT Synthetic benchmarks - 3DMark, 3DMark11, Unigine Heaven 3D games - Battlefield 3 at 4K resolution Experimental Framework Specially selected components so no bottleneck occurs CPU and RAM are high level parts All GPUs are overclocked All benchmarks are completed several times so all external factors can be minimised Experimental results GEMM SGEMM DGEMM TFlops 3189 GTX Titan 1324 2405 HD7970 702 1258 GTX 680 124 0 1000 2000 3000 4000 Experimental results - FFT FFT Single precision FFT Double precision GFlops/s 455 GTX Titan 229 277 GTX 680 60 0 125 250 375 500 Experimental results 3DMark Fire Strike Score Graphics Score Score 8786 9888 GTX Titan 14002 GTX Titan SLI 17980 6422 7098 GTX 680 9021 10743 GTX 680 SLI 7106 7801 HD7970 0 4500 9000 13500 18000 22500 Experimental results 3DMark Extreme Score Graphics Score Score 4923 4587 GTX Titan 9042 8811 GTX Titan SLI 3399 3087 GTX 680 6287 6042 GTX 680 SLI 3266 2965 HD7970 0 2500 5000 7500 10000 Experimental results Unigine Heaven 4.0 Normal Extreme FPS 77 GTX Titan 65 129 GTX Titan SLI 108 48 GTX 680 41 90 GTX 680 SLI 79 50 HD7970 40 0 32.5 65 97.5 130 162.5 Experimental results Battlefield 3 4K resolution Min FPS Average FPS FPS 25 GTX Titan 36 53 GTX Titan SLI 61 15 GTX 680 22 30 GTX 680 SLI 39 17 HD7970 25 0 17.5 35 52.5 70 Experimental results power consumption Standby Heavy load WATTS 72 GTX Titan 371 87 GTX Titan SLI 554 79 GTX 680 312 96 GTX 680 SLI 487 86 HD7970 377 0 150 300 450 600 Conclusion Kepler architecture used in NVIDIA GeForce Titan offers great performance and efficiency Its advantages are significant Despite providing such computing power, it consumes a lot less energy than previous architectures Thank you for your attention! FOR MORE INFORMATION, PLEASE CONTACT: Assist. Prof. PhD Desislava Ivanova d_ivanova@tu-sofia.bg