Configuration of Measurement Processes
The configuration of performance measurement processes is a challenging task and has been widely researched in computer science. This page aims to give practitioners a starting point.
When executing a performance measurement, especially of durations below a millisecond, various non-deterministic effects shape the result, including the inaccuracy of the time measurement method, Just-in-Time (JIT) compilation, garbage collection, thread scheduling, and memory fragmentation. Therefore, the performance measurement needs to be repeated, on at least two levels:
- Inside one VM, the measurement needs to be repeated until warmup is finished (e.g. until JIT compilation has completed), i.e. until the steady state is reached.
- The VM start itself needs to be repeated, since warmup may end in different steady states. Performance measurement tools provide the environment for executing these measurements; the concrete configuration is specific to the use case and is left to the user (in this field, a software developer or performance engineer). A minimal sketch of both repetition levels follows this list.
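The following is a minimal, hypothetical sketch of the inner repetition level in Java; the class name, method names, and parameter values are illustrative and not taken from any specific tool. Warmup iterations are executed but discarded, and only the subsequent measurement iterations are recorded; the outer level, restarting the VM, must be driven by an external harness that launches the process repeatedly.

```java
// Hypothetical sketch of the inner repetition level of a performance
// measurement. All names and parameter values are illustrative.
public class InnerRepetition {

    // Stand-in for the code under test, e.g. creation and addition of integers.
    static long workload() {
        long sum = 0;
        for (int i = 0; i < 300; i++) {
            sum += i;
        }
        return sum;
    }

    // One iteration: execute the workload `repetitions` times and return the duration.
    static long runIteration(int repetitions) {
        long start = System.nanoTime();
        long blackhole = 0;
        for (int r = 0; r < repetitions; r++) {
            blackhole += workload();
        }
        long duration = System.nanoTime() - start;
        if (blackhole == 42) {
            System.out.println(blackhole); // consume the result to hinder dead-code elimination
        }
        return duration;
    }

    public static void main(String[] args) {
        final int warmupIterations = 5;      // discarded: wait for JIT compilation to finish
        final int measurementIterations = 5; // recorded for statistical analysis
        final int repetitions = 100_000;

        for (int i = 0; i < warmupIterations; i++) {
            runIteration(repetitions);
        }
        for (int i = 0; i < measurementIterations; i++) {
            System.out.println("Duration (ns): " + runIteration(repetitions));
        }
        // Outer level: an external harness must start this VM repeatedly
        // (e.g. 30+ times) and aggregate the printed durations.
    }
}
```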
By defining artificial workload pairs (e.g. creation and addition of 300/(300+d) integers, or reservation of 20/(20+d) blocks), we evaluated when a performance change can be measured. In summary: a performance change can be measured if the relative change is at least half of the standard deviation of the VM measurements. That standard deviation can be decreased by increasing the warmup and the number of iterations inside a VM. Depending on the relation between the relative change that should (at least) be measurable and the standard deviation of the measurements, more or fewer VM executions are needed. Some practitioners recommend using at least 30 VM starts; to measure a performance change of 0.3 % (e.g. the change between 300 and 301 additions), 400 VM starts, 5 warmup and 5 measurement iterations, and 100,000 repetitions are required.
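As a numeric sketch of this rule of thumb, assuming the criterion is taken literally as "relative change ≥ half the relative standard deviation of the VM measurements" (the standard deviation value below is a made-up example):

```java
// Illustrative check of the rule of thumb: a performance change is
// measurable if the relative change is at least half of the (relative)
// standard deviation of the VM measurements.
public class Detectability {

    static boolean isMeasurable(double relativeChange, double relativeStandardDeviation) {
        return relativeChange >= relativeStandardDeviation / 2;
    }

    public static void main(String[] args) {
        // 300 vs. 301 additions: relative change of 1/300 ≈ 0.33 %.
        double relativeChange = 1.0 / 300;
        // Hypothetical relative standard deviation of the VM measurements: 0.5 %.
        double relativeStandardDeviation = 0.005;
        // 0.33 % >= 0.25 %, so the change is (just barely) measurable.
        System.out.println(isMeasurable(relativeChange, relativeStandardDeviation));
    }
}
```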
If you want to try this with other artificial workload pairs, have a look at the precision-experiments repository.
Our basic approach for measuring the performance of unit tests is described in:
- Reichelt, David Georg, and Stefan Kühne. "How to Detect Performance Changes in Software History: Performance Analysis of Software System Versions." Companion of the 2018 ACM/SPEC International Conference on Performance Engineering. 2018.
If you would like to dig deeper into this topic, consider the following publications:
- Georges, Andy, Dries Buytaert, and Lieven Eeckhout. "Statistically rigorous Java performance evaluation." ACM SIGPLAN Notices 42.10 (2007): 57-76.
- Kalibera, Tomas, and Richard Jones. "Rigorous benchmarking in reasonable time." Proceedings of the 2013 international symposium on memory management. 2013.
- Barrett, Edd, et al. "Virtual machine warmup blows hot and cold." Proceedings of the ACM on Programming Languages 1.OOPSLA (2017): 1-27.