From d7f876f285aa5019bb14c70e70adc44e85211219 Mon Sep 17 00:00:00 2001 From: Jerome Soumagne Date: Fri, 1 Dec 2023 17:23:10 -0600 Subject: [PATCH] Bump HG version to 2.4.0rc1 Update release notes --- Documentation/CHANGES.md | 65 +++++++------------------------ Documentation/CHANGES_v2.3.1.md | 69 +++++++++++++++++++++++++++++++++ version.txt | 2 +- 3 files changed, 84 insertions(+), 52 deletions(-) create mode 100644 Documentation/CHANGES_v2.3.1.md diff --git a/Documentation/CHANGES.md b/Documentation/CHANGES.md index 5947884d..f6393e83 100644 --- a/Documentation/CHANGES.md +++ b/Documentation/CHANGES.md @@ -1,67 +1,30 @@ ## Summary -This version brings bug fixes and updates to our v2.3.0 release. +This is a preview release of the v2.4.0 release. ## New features -- __[HG info]__ - - Add support for CSV and JSON output formats +- __[NA]__ + - Add init info version compatibility wrappers + - Bump NA version to v4.1.0 + - Add support for `traffic_class` init info (only supported by ofi plugin) - __[HG/NA Perf Test]__ - - Enable sizes to be passed using k/m/g qualifiers + - Add `-f`/`--hostfile` option to select hostfile to write to / read from + - Add `-T`/`--tclass` option to select trafic class + - Autodetect MPI implementation in perf utilities + - MPI can now be autodetected and dynamically loaded in utilities, even if `MERCURY_TESTING_ENABLE_PARALLEL` was turned off. If `MERCURY_TESTING_ENABLE_PARALLEL` is turned on, tests remain manually linked against MPI as they used to be. - __[NA OFI]__ - - Add `tcp_rxm` alias for `tcp;ofi_rxm` - - Find CXI `svc_id` or `vni` if `auth_key` components have zeros (e.g., `auth_key=0:0`) - - Add VNI index for `SLINGSHOT_VNIS` discovery as extra auth_key parameter + - Attempt to distribute multi-NIC domains based on selected CPU ID + - Support selection of traffic classes (single class per NA class) ## Bug fixes -- __[HG/NA]__ - - Fix potential race when checking secondary completion queue -- __[HG]__ - - Prevent multiple threads from entering `HG_Core_progress()` - - Add `HG_ALLOW_MULTI_PROGRESS` CMake option to control behavior (`ON` by default) - - Disable `NA_HAS_MULTI_PROGRESS` if `HG_ALLOW_MULTI_PROGRESS` is `ON` - - Fix expected operation count for handle to be atomic - - Expected operation count can change if extra RPC payload must be transferred - - Let poll events remain private to HG poll wait - - Prevent a race when multiple threads call progress and `HG_ALLOW_MULTI_PROGRESS` is `OFF` - - Separate internal list from user created list of handles - - Address an issue where `HG_Context_unpost()` would unnecessarily wait -- __[HG Core]__ - - Cache disabled response info in proc info - - Add `HG_Core_registered_disable(d)_response()` routines - - Refactor and optimize self RPC code path - - Add additional logging of refcount/expected op count - - Fixes for self RPCs with no response - __[HG Util]__ - - Prevent locking in `hg_request_wait()` - - Concurrent progress in multi-threaded scenarios on the same context could complete another thread's request and let a thread blocked in progress -- __[HG Perf]__ - - Fix tests to be run in parallel with any communicator size -- __[HG Test]__ - - Ensure affinity of class thread is set - - Add concurrent multi RPC test - - Add multi-progress test - - Add multi-progress test with handle creation - - Refactoring of unit test cleanup -- __[NA]__ - - Fix memory leak on `NA_Get_protocol_info()` + - Add missing prototype to `hg_atomic_fence()` definition - __[NA OFI]__ - - Fix `na_ofi_get_protocol_info()` not returning `opx` protocol - - Refactor `na_ofi_getinfo()` to account for `NA_OFI_PROV_NULL` type - - Ensure there are no duplicated entries - - Refactor parsing of init info strings and fix OPX parsing - - Simplify parsing of some address strings - - Bump default CQ size to have a maximum depth of 128k entries - - Remove sockets as the only provider on macOS - - Remove send afer send tagged msg ordering - - Ensure that `rx_ctx_bits` are not set if SEP is not used - - Set CXI domain ops w/ slingshot 2.2 to prevent from potential memory corruptions + - Remove excessive MR count warning message - __[NA Perf]__ - - Prevent tests from being run as parallel tests -- __[CMake]__ - - Pass `INSTALL_NAME_DIR` through target properties - - This fixes an issue seen on macOS where libraries would not be found using `@rpath` + - Ensure perf tests wait on send completion ## :warning: Known Issues diff --git a/Documentation/CHANGES_v2.3.1.md b/Documentation/CHANGES_v2.3.1.md new file mode 100644 index 00000000..5947884d --- /dev/null +++ b/Documentation/CHANGES_v2.3.1.md @@ -0,0 +1,69 @@ +## Summary + +This version brings bug fixes and updates to our v2.3.0 release. + +## New features + +- __[HG info]__ + - Add support for CSV and JSON output formats +- __[HG/NA Perf Test]__ + - Enable sizes to be passed using k/m/g qualifiers +- __[NA OFI]__ + - Add `tcp_rxm` alias for `tcp;ofi_rxm` + - Find CXI `svc_id` or `vni` if `auth_key` components have zeros (e.g., `auth_key=0:0`) + - Add VNI index for `SLINGSHOT_VNIS` discovery as extra auth_key parameter + +## Bug fixes + +- __[HG/NA]__ + - Fix potential race when checking secondary completion queue +- __[HG]__ + - Prevent multiple threads from entering `HG_Core_progress()` + - Add `HG_ALLOW_MULTI_PROGRESS` CMake option to control behavior (`ON` by default) + - Disable `NA_HAS_MULTI_PROGRESS` if `HG_ALLOW_MULTI_PROGRESS` is `ON` + - Fix expected operation count for handle to be atomic + - Expected operation count can change if extra RPC payload must be transferred + - Let poll events remain private to HG poll wait + - Prevent a race when multiple threads call progress and `HG_ALLOW_MULTI_PROGRESS` is `OFF` + - Separate internal list from user created list of handles + - Address an issue where `HG_Context_unpost()` would unnecessarily wait +- __[HG Core]__ + - Cache disabled response info in proc info + - Add `HG_Core_registered_disable(d)_response()` routines + - Refactor and optimize self RPC code path + - Add additional logging of refcount/expected op count + - Fixes for self RPCs with no response +- __[HG Util]__ + - Prevent locking in `hg_request_wait()` + - Concurrent progress in multi-threaded scenarios on the same context could complete another thread's request and let a thread blocked in progress +- __[HG Perf]__ + - Fix tests to be run in parallel with any communicator size +- __[HG Test]__ + - Ensure affinity of class thread is set + - Add concurrent multi RPC test + - Add multi-progress test + - Add multi-progress test with handle creation + - Refactoring of unit test cleanup +- __[NA]__ + - Fix memory leak on `NA_Get_protocol_info()` +- __[NA OFI]__ + - Fix `na_ofi_get_protocol_info()` not returning `opx` protocol + - Refactor `na_ofi_getinfo()` to account for `NA_OFI_PROV_NULL` type + - Ensure there are no duplicated entries + - Refactor parsing of init info strings and fix OPX parsing + - Simplify parsing of some address strings + - Bump default CQ size to have a maximum depth of 128k entries + - Remove sockets as the only provider on macOS + - Remove send afer send tagged msg ordering + - Ensure that `rx_ctx_bits` are not set if SEP is not used + - Set CXI domain ops w/ slingshot 2.2 to prevent from potential memory corruptions +- __[NA Perf]__ + - Prevent tests from being run as parallel tests +- __[CMake]__ + - Pass `INSTALL_NAME_DIR` through target properties + - This fixes an issue seen on macOS where libraries would not be found using `@rpath` + +## :warning: Known Issues + +- __[NA OFI]__ + - [tcp/verbs;ofi_rxm] Using more than 256 peers requires `FI_UNIVERSE_SIZE` to be set. diff --git a/version.txt b/version.txt index 2bf1c1cc..2eca1561 100644 --- a/version.txt +++ b/version.txt @@ -1 +1 @@ -2.3.1 +2.4.0rc1