GPU tuning

GPU cards come with very different hardware capabilities and performance. It is possible to fine tune the GPU performance using a few parameters that control the execution of the particle tracking kernel.

Tip

You can check the GPUs installed on your system using fred -gpusetup and then use the provided information to configure your environment using fred -config.

Available environment variables and their default values:

export FGE_PRIMARY_MAX_BATCH=1000000
export FGE_SECONDARY_MULT_FACTOR=1
export FGE_CL_NQUEUEDEV=6
FGE_PRIMARY_MAX_BATCH

Set number of primary particles that are sent to GPU for each kernel invocation.

A large enough buffer will reduce the communication times between CPU and GPU and reduce the number of kernel invocations.

FGE_SECONDARY_MULT_FACTOR

Increase the particle buffer by this integer multiplication factor.

For a proton beam, the number of secondary produced is always less than the primary number, so by default FGE_SECONDARY_MULT_FACTOR is set to 1. For a Carbon beam, the number of secondary fragments per primary ion is up to 3.5 in the therapeutic range, so FGE_SECONDARY_MULT_FACTOR = 4 is typically requested.

Note

Fred will quietly signal the occurance of a particle buffer overflow at the end of a simulation. In that case try to undestand the problem, and readjust the buffer size using FGE_SECONDARY_MULT_FACTOR.

FGE_CL_NQUEUEDEV

Set the number of independent execution queues on each device.

A number >=2 is required for hiding memory transfer latency between CPU and GPU. A number 4-6 of queues is required for increasing performance in the generation of the dose influence matrix Dij.

Tip

In case of memory shortage on the GPU, the variables FGE_PRIMARY_MAX_BATCH, FGE_SECONDARY_MULT_FACTOR, and FGE_CL_NQUEUEDEV can be used to reduce the memory needed by the GPU kernel.

Try changing the values depending on the type of simulation you want to run.

Check the relative performance using for instance fred -benchmark 1e7.

Once you found a satisfactory set of parameters, you could add the envvars to the configuration file of Fred (.fredrc in Linux or Registry in Windows).