-
Notifications
You must be signed in to change notification settings - Fork 101
QUDA Environment Variables
maddyscientist edited this page May 8, 2019
·
31 revisions
Variable name | Function |
---|---|
QUDA_RESOURCE_PATH |
Path where tune cache and profile files will be output |
QUDA_PROFILE_OUTPUT_BASE |
Filename prefix for profile output. Setting this will result in the files $(QUDA_PROFILE_OUTPUT_BASE).tsv and $(QUDA_PROFILE_OUTPUT_BASE_async).tsv and being written out (default is simply profile.tsv and profile_async.tsv ) |
QUDA_ENABLE_P2P |
QUDA_ENABLE_P2P=0 # disable all p2p transfersQUDA_ENABLE_P2P=1 # enable only copy enginesQUDA_ENABLE_P2P=2 # enable only remote writingQUDA_ENABLE_P2P=3 # enable both copy engines and remote writing |
QUDA_ENABLE_TUNING |
Enable / disable kernel autotuning. Default is enabled, disable with QUDA_ENABLE_TUNING=0
|
QUDA_REORDER_LOCATION |
Set where data should be reordered when transferring CPU<->GPU (default is GPU) |
QUDA_ENABLE_DEVICE_MEMORY_POOL |
Enable / disable device memory allocator (default is enabled, disable with QUDA_ENABLE_DEVICE_MEMORY_POOL=0
|
QUDA_ENABLE_PINNED_MEMORY_POOL |
Enable / disable device memory allocator (default is enabled, disable with QUDA_ENABLE_PINNED_MEMORY_POOL=0
|
QUDA_ENABLE_NUMA |
Enabled NUMA placement. Default is enabled, if NUMA has been enabled in cmake, disabled with QUDA_ENABLE_NUMA=0
|
QUDA_MILC_HISQ_RECONSTRUCT |
Set the reconstruct type in the MILC interface used for the long links in the HISQ solver. Allowed values are 9/13/18 with 18 the default |
QUDA_ENABLE_GDR |
Enable GPU-Direct RDMA. Default is disabled, enabled with QUDA_ENABLE_GDR=1
|
QUDA_TEST_GRID_SIZE |
Set the process geometry for the unit tests. Overrides the --gridsize parameter if set. |
QUDA_DEVICE_RESET |
Call cudaDeviceReset in endQuda - this legacy behavior can be useful for profiling, but destroys the CUDA context of other CUDA libraries outside of QUDA (e.g., GPU-aware MPI). Default is disabled, enable with QUDA_DEVICE_RESET=1
|
QUDA_DETERMINISTIC_REDUCE |
Perform all MPI reductions deterministically: setting this flag means that post-tuning or no tuning, QUDA will run completely deterministically regardless of the rank order. Default is disabled, enable with QUDA_DETERMINISTIC_REDUCE=1
|
The below entries are experimental, and not yet in the main develop branch | |
QUDA_TUNE_VERSION_CHECK |
Set QUDA_TUNE_VERSION_CHECK=0 to disable the check that prevents using a tunecache.tsv file from a different QUDA version (feature/blocksolver) |