2. Performance Evaluation and Benchmarking of Modern GPU Architectures
Desislava Ivanova, Vladimir Kadurin, Yanko Belov
Technical University of Sofia, Computer Systems Department, Sofia, Bulgaria
GPU advantages
• The advantage of modern heterogeneous computing systems based on GPUs for parallel processing is indisputable
• GPGPUs have many smaller "cores", each of which does less work than a conventional CPU core
• The heavily parallel work is done by the GPU cores
• GPUs offer a high performance-per-watt ratio
• Many existing codes optimised for serial execution can be adapted to take advantage of the GPU's parallel processing power
GPU Architecture
The main GPU used for benchmarking is the GTX Titan
It is Kepler-based, built on the GK110 chip
GK110 has 15 SMX units, each with 192 CUDA cores; the GTX Titan has 14 of them enabled, for a total of 2688 CUDA cores
It has 6 GB of GDDR5 memory and a memory bandwidth of 288 GB/s
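The core count and bandwidth quoted above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the 14 active SMX units, the 384-bit memory bus and the 6.008 Gbit/s per-pin data rate are NVIDIA's published figures for the GTX Titan rather than values given on the slide, and the function names are made up for this example. Note that 2688 cores corresponds to 14 active SMX units; 15 x 192 would give 2880.

```python
# Cross-check of the GTX Titan figures quoted on the slide.
# Assumptions (not stated on the slide): 14 active SMX units,
# a 384-bit memory bus and a 6.008 Gbit/s effective data rate per pin.

def cuda_cores(smx_units: int, cores_per_smx: int = 192) -> int:
    """Total CUDA cores for a Kepler GPU with the given number of active SMX units."""
    return smx_units * cores_per_smx


def memory_bandwidth_gb_s(bus_width_bits: int, data_rate_gbit_s: float) -> float:
    """Peak memory bandwidth in GB/s: bus width in bytes times per-pin data rate."""
    return bus_width_bits / 8 * data_rate_gbit_s


if __name__ == "__main__":
    print(cuda_cores(14))   # cores with 14 active SMX units
    print(cuda_cores(15))   # cores of a full GK110 with all 15 SMX units
    print(round(memory_bandwidth_gb_s(384, 6.008), 1))
```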
GPU Technologies
Dynamic parallelism
Hyper-Q
Grid Management Unit
GPUDirect
Comparison GPUs
NVIDIA GeForce GTX 680
AMD Radeon HD7970
Benchmark programs
Computational benchmarks - GEMM and FFT
Synthetic benchmarks - 3DMark, 3DMark11, Unigine Heaven
3D games - Battlefield 3 at 4K resolution
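For the computational benchmarks, a FLOP rate is normally derived from the problem size and the measured run time. The slides do not say which operation counts were used, so the sketch below assumes the common conventions: 2*m*n*k operations for a matrix multiply and the nominal 5*n*log2(n) for a complex FFT of length n. The function names and the timings in the usage lines are hypothetical.

```python
import math


def gemm_gflops(m: int, n: int, k: int, seconds: float) -> float:
    """GFlops for C = A (m x k) * B (k x n), counting 2*m*n*k operations."""
    return 2.0 * m * n * k / seconds / 1e9


def fft_gflops(n: int, seconds: float) -> float:
    """GFlops for one complex FFT of length n, using the nominal 5*n*log2(n) count."""
    return 5.0 * n * math.log2(n) / seconds / 1e9


if __name__ == "__main__":
    # Hypothetical timings, chosen only to exercise the formulas.
    print(gemm_gflops(4096, 4096, 4096, 0.05))
    print(fft_gflops(2**20, 0.001))
```

The same formulas apply to single and double precision; only the measured time changes.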
Experimental Framework
Components were specially selected so that no bottleneck occurs
CPU and RAM are high-end parts
All GPUs are overclocked
All benchmarks were run several times to minimise the influence of external factors
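Running each benchmark several times only helps if the samples are then summarised in a way that dampens outliers. The slides do not say how the repeated runs were combined; the sketch below is one reasonable approach, with a hypothetical helper name, that discards warm-up runs and reports the median alongside the mean and spread.

```python
import statistics
import time
from typing import Callable, Dict


def repeat_benchmark(run: Callable[[], object], repeats: int = 5, warmup: int = 1) -> Dict[str, float]:
    """Time `run` several times and summarise the samples (in seconds)."""
    for _ in range(warmup):
        run()  # warm-up runs are executed but not timed
    samples = []
    for _ in range(repeats):
        start = time.perf_counter()
        run()
        samples.append(time.perf_counter() - start)
    return {
        "median": statistics.median(samples),
        "mean": statistics.mean(samples),
        "stdev": statistics.stdev(samples) if len(samples) > 1 else 0.0,
        "min": min(samples),
    }


if __name__ == "__main__":
    # Placeholder CPU workload standing in for a real GPU benchmark.
    print(repeat_benchmark(lambda: sum(i * i for i in range(100_000))))
```

The median is less sensitive than the mean to a single run disturbed by background activity, which is the kind of external factor the repetition is meant to suppress.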
Experimental results - GEMM
Unit: GFlops (the chart axis is labelled "TFlops", but the magnitudes correspond to GFlops)
GTX Titan: SGEMM 3189, DGEMM 1324
HD7970: SGEMM 2405, DGEMM 702
GTX 680: SGEMM 1258, DGEMM 124
Experimental results - FFT
Unit: GFlops
GTX Titan: single precision 455, double precision 229
GTX 680: single precision 277, double precision 60
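The GEMM and FFT figures are easier to compare as ratios. The sketch below uses the values transcribed from the two charts, treated as GFlops, and computes the double-to-single precision ratio per GPU and the speedup of one GPU over another. The dictionary layout and function names are choices made for this example.

```python
# Values transcribed from the GEMM and FFT charts, treated as GFlops.
GEMM = {
    "GTX Titan": {"single": 3189, "double": 1324},
    "HD7970": {"single": 2405, "double": 702},
    "GTX 680": {"single": 1258, "double": 124},
}
FFT = {
    "GTX Titan": {"single": 455, "double": 229},
    "GTX 680": {"single": 277, "double": 60},
}


def dp_sp_ratio(results: dict, gpu: str) -> float:
    """Double-precision throughput as a fraction of single-precision throughput."""
    return results[gpu]["double"] / results[gpu]["single"]


def speedup(results: dict, gpu: str, baseline: str, precision: str) -> float:
    """How many times faster `gpu` is than `baseline` at the given precision."""
    return results[gpu][precision] / results[baseline][precision]


if __name__ == "__main__":
    for name, table in (("GEMM", GEMM), ("FFT", FFT)):
        for gpu in table:
            print(f"{name} {gpu}: DP/SP = {dp_sp_ratio(table, gpu):.2f}")
    print(f"DGEMM, GTX Titan vs GTX 680: {speedup(GEMM, 'GTX Titan', 'GTX 680', 'double'):.1f}x")
```

With these figures the GTX Titan's DGEMM rate is about 10.7 times that of the GTX 680, while its SGEMM rate is about 2.5 times higher, so the largest gap is in double precision.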
Experimental results - 3DMark Fire Strike
GTX Titan: Score 8786, Graphics Score 9888
GTX Titan SLI: Score 14002, Graphics Score 17980
GTX 680: Score 6422, Graphics Score 7098
GTX 680 SLI: Score 9021, Graphics Score 10743
HD7970: Score 7106, Graphics Score 7801
Experimental results - 3DMark Extreme
GTX Titan: Score 4923, Graphics Score 4587
GTX Titan SLI: Score 9042, Graphics Score 8811
GTX 680: Score 3399, Graphics Score 3087
GTX 680 SLI: Score 6287, Graphics Score 6042
HD7970: Score 3266, Graphics Score 2965
Experimental results - Unigine Heaven 4.0
Unit: FPS
GTX Titan: Normal 77, Extreme 65
GTX Titan SLI: Normal 129, Extreme 108
GTX 680: Normal 48, Extreme 41
GTX 680 SLI: Normal 90, Extreme 79
HD7970: Normal 50, Extreme 40
Experimental results - Battlefield 3 at 4K resolution
Unit: FPS
GTX Titan: Min 25, Average 36
GTX Titan SLI: Min 53, Average 61
GTX 680: Min 15, Average 22
GTX 680 SLI: Min 30, Average 39
HD7970: Min 17, Average 25
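The single-card and SLI results can be turned into a scaling factor, which shows how much of the second GPU actually translates into frame rate. The sketch below uses the Unigine Heaven and Battlefield 3 values transcribed from the charts; the table layout and function name are choices made for this example, and a factor of 2.0 would mean perfect scaling.

```python
# (single GPU, two GPUs in SLI) pairs transcribed from the charts, in FPS.
RESULTS = {
    "Unigine Heaven 4.0 Normal": {"GTX Titan": (77, 129), "GTX 680": (48, 90)},
    "Unigine Heaven 4.0 Extreme": {"GTX Titan": (65, 108), "GTX 680": (41, 79)},
    "Battlefield 3 4K average": {"GTX Titan": (36, 61), "GTX 680": (22, 39)},
    "Battlefield 3 4K minimum": {"GTX Titan": (25, 53), "GTX 680": (15, 30)},
}


def sli_scaling(single: float, dual: float) -> float:
    """Speedup from adding the second GPU; 2.0 would be perfect scaling."""
    return dual / single


if __name__ == "__main__":
    for benchmark, gpus in RESULTS.items():
        for gpu, (single, dual) in gpus.items():
            print(f"{benchmark}, {gpu}: {sli_scaling(single, dual):.2f}x")
```

Minimum FPS is a single worst-case sample per run, so scaling factors computed from it are noisier than those computed from averages.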
Experimental results - power consumption
Unit: watts
GTX Titan: Standby 72, Heavy load 371
GTX Titan SLI: Standby 87, Heavy load 554
GTX 680: Standby 79, Heavy load 312
GTX 680 SLI: Standby 96, Heavy load 487
HD7970: Standby 86, Heavy load 377
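Efficiency can be estimated by dividing a benchmark result by the heavy-load power reading. The sketch below pairs the SGEMM figures (treated as GFlops) with the heavy-load watts for the single-card configurations. This pairing is an assumption: the slides do not state which workload produced the heavy-load reading, nor whether the watts refer to the card alone or to the whole system, so the results are only a rough indication.

```python
# Heavy-load power and SGEMM throughput transcribed from the charts.
# Assumption: the heavy-load reading is representative of the SGEMM run.
HEAVY_LOAD_WATTS = {"GTX Titan": 371, "GTX 680": 312, "HD7970": 377}
SGEMM_GFLOPS = {"GTX Titan": 3189, "GTX 680": 1258, "HD7970": 2405}


def per_watt(score: float, watts: float) -> float:
    """Benchmark score delivered per watt of measured power draw."""
    return score / watts


if __name__ == "__main__":
    for gpu, watts in HEAVY_LOAD_WATTS.items():
        print(f"{gpu}: {per_watt(SGEMM_GFLOPS[gpu], watts):.1f} GFlops/W")
```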
Conclusion
The Kepler architecture used in the NVIDIA GeForce GTX Titan offers great performance and efficiency
Its advantages over the compared GPUs are significant
Despite providing such computing power, it consumes a lot less energy than previous architectures
Thank you for your attention!
FOR MORE INFORMATION, PLEASE CONTACT:
Assist. Prof. PhD Desislava Ivanova
d_ivanova@tu-sofia.bg
