TCM
UoC crest

Radeon glmark2 scores

Below are to be found some Radeon scores from the glmark2 benchmark. Version 2014.03 of the benchmark was used, compiled on OpenSuSE 13.1 (gcc 4.8), and run on Ubuntu 16.04 LTS (kernel 4.4.0, Xorg 1.18.3, module version 7.7.0 of free radeon driver, Mesa 11.2.0, DRI2). The default window size of 800x600 was used, and the window manager was fvwm2 (i.e. free of compositing effects). The machines were otherwise idle.

All of the cards were passively cooled, and were purchased for under £50 (some second-hand).

Scores are in frames per second (FPS).

CPUBusGPUclasscoresMHzVRAMscore
Haswell QC 3.1GHzPCIe v3 x8R7 250EGCN 1512800128 bit GDDR5/45004703
Ivy Bridge QC 3.3GHzPCIe v3 x8HD 7750GCN 1512800128 bit GDDR5/45004610
Ivy Bridge QC 3.3GHzPCIe v2 x8HD 6670T'scale 2480800128 bit GDDR5/40001757
Ivy Bridge QC 3.3GHzPCIe v2 x8HD 5670T'scale 2400775128 bit GDDR5/40001719
Ryzen 5 QC 3.2GHzPCIe v2 x16HD 5670T'scale 2400775128 bit GDDR5/40001494
Haswell QC 3.1GHzPCIe v2 x8HD 6570T'scale 2480650128 bit DDR3/1333968
Core2 QC 2.4GHzPCIe v2 x16HD 5570T'scale 2400650128 bit DDR3/1000634
Haswell QC 3.1GHzPCIe v2 x8R5 230 FlexT'scale 216062564 bit DDR3/1600542
Haswell QC 3.1GHzPCIe v2 x8HD 6450T'scale 216062564 bit DDR3/1333483
Core2 QC 2.4GHzPCIe v2 x16HD 5450T'scale 28065064 bit DDR3/1333388
Haswell QC 3.1GHzPCIe v2 x8HD 5450T'scale 28065064 bit DDR3/1333378
Kaby Lake QC 3.5GHzPCIe v2 x8R5 230T'scale 216062564 bit DDR3/1066377
Core2 DC 2.4GHzPCIe x16X550RV3704400?128 bit DDR/500?2981,2
Athlon II DC 3.0GHzRS880/HD 4250T'scale 140500Hyper & sideport275
Core2 QC 2.83GHzPCIe x16HD 3450T'scale 14060064 bit DDR2/800250
Core2 QC 2.4GHzPCIe x16HD 4350T'scale 18060064 bit DDR2/800246
Core2 DC 2.4GHzPCIe x16X1550RV5004550?128 bit DDR2/800?1411
Athlon II DC 3.0GHzRS880/HD 4250T'scale 140500Hyper & sideport117
Core2 DC 2.4GHzPCIe x16X300SERV3704325?64 bit DDR/400?841,2

T'scale: Terascale
GCN: Graphics Core Next

Memory speeds given in MT/s. Memory clock speed is one half this (one quarter for GDDR5).

1 does not support GL version 3.0 and unable to run the Terrain benchmark from this suite. Score is average of those it could run, which will flatter the result as the Terrain benchmark is generally the slowest.

2 failed to complete four other benchmarks correctly, substituting a dummy shader, and failed to run the Jellyfish correctly. Again, this flatters the results as the dummy shader is less work than a proper shader.

The performance of the RS880 may depend on the exact memory configuration (no sideport, exclusively sideport, sideport and hypermemory interleaved in different ways). The precise configuration of the two machines above is not known.

Comments

The benchmark does seem to be testing things which are entirely offloaded to the graphics card, in that very significant changes in CPU power have no impact.

The benchmark seems to be heavily influenced by GPU memory speed, in that the HD 5670 would be expected to be about 20% faster than the HD 5570 if the constraint was GPU processing power. It is over 2.7 times faster, presumably reflecting a memory subsystem four times as fast. Similarly the only difference between the R5 230 and the HD 6350 is that the memory is 20% faster on the former, leading to a score over 12% higher. Conversely the HD6670 has much better GPU performance than the HD5670 (20% more cores and 3.2% higher clock speed), yet the same memory, and scores just 2.2% better. One should be careful when purchasing cards: I purchased the HD 5570 believing it to have DDR3/1333 memory, which would probably have increased its score by about a hundred FPS if it were true.

The GCN-based card seems to be able to make much better use of the available memory bandwidth.

The Terascale and GCN-based cards switch between different power states, changing GPU and memory speed, and GPU voltage. The Terascale-based cards seem incapable of transitioning from their high-power state whilst a 2560x1440 monitor is active, save for the XFX HD5670 (the other cards were mostly Sapphire).

Upgrading from Mesa 11.2.0 to 12.0.6 made no significant difference to the above scores. The Haswell with the HD 5450 achieved an identical score, for instance.

The main focus of the computers in TCM is not games-playing. The embedded RS880 is a little weak, and those that failed some aspects of the test are far from ideal, but there is little pressure to improve these frame-rates. The HD 5670 is in a "public" (to TCM) computer room, so that those needing better performance have access to it.

Video Decoding

The above Radeon cards all include some version of AMD's "Unified Video Decoder" (UVD). This may sit on the same die as the GPU, but it is separately clocked, and is mostly independent in generation from the rest of the GPU. Firefox and Chromium are notorious for not using it, but mplayer and VLC can. UVD 2 can decode two H264 1080p streams simultaneously, and is found on the HD4000 series and later. Only very recent cards (and none of the above) can decode H264 at 4K, H265 or VP9.

The line in /sys/kernel/debug/dri/0/radeon_pm_info reading

uvd    vclk: 0 dclk: 0

refers to the current clock rates of the UVD. Here, quiescent. The program vdpauinfo will describe the capabilities of the UVD.