This website wants to provide insight into the performance of GPUs
used for speeding up GP-GPU algorithms.
With the microbenchmarks we measure specific GPU performance characteristics; each microbenchmarks is targeting one specific characterictic.
At first one is interested in the peak computational or memory performance (measured in flops or bytes per second).
The compute intensity (number of instructions per byte read) of an algorithm determines whether the algorithm is compute-bound or memory-bound.
However, often the computational or memory peak performance is not achieved. Such algorithms are often called occupancy-bound or latency-bound algorithms since the amount of concurrency is not large enough to hide all latencies. It results in an efficiency lower than 100%.