rapids_cpm_nvbench properly specify usage of external fmt library #376

robertmaynard · 2023-02-17T19:39:18Z

Description

When we are inside a conda env, the linker will be set to ld.bfd, which will try to resolve all undefined symbols at link time.

Since we could be using a shared library version of fmt, we need it on the final link line of consumers. So patch nvbench to understand this requirement.

Checklist

I am familiar with the Contributing Guidelines.
New or existing tests cover these changes.
The documentation is up to date with these changes.
The cmake-format.json is up to date with these changes.


vyasr · 2023-02-22T01:28:31Z

I'm afraid I don't understand this change. Won't any linker require that symbols be resolved at link-time (and if linking dynamically, the requirement will be enforced again at runtime by the loader)? IIUC the change in this PR is propagating the fmt requirement from nvbench to its consumers, but I don't understand how the resulting change in terms of generated compile/link commands is related to the choice of linker.

robertmaynard · 2023-02-22T16:31:31Z

> I'm afraid I don't understand this change. Won't any linker require that symbols be resolved at link-time (and if linking dynamically, the requirement will be enforced again at runtime by the loader)? IIUC the change in this PR is propagating the fmt requirement from nvbench to its consumers, but I don't understand how the resulting change in terms of generated compile/link commands is related to the choice of linker.

Good question. This PR is resolving an interaction between hidden symbols, undefined symbols, and the conda linker ld.bfd.

When compiling things like the libcudf benchmarks we run into the following scenario:

- Usage of rmm, which requests the fmt header-only library
- Usage of nvbench, which has used fmt via the shared library

This results in the final executable having a link error like the following:

/x86_64-conda-linux-gnu/bin/ld: uses_fmt: hidden symbol `_ZN3fmt2v96detail18throw_format_errorEPKc' in CMakeFiles/uses_fmt.dir/use_fmt.cpp.o is referenced by DSO
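For readers who want to see which function the mangled name in this error refers to, here is a minimal sketch that demangles it. It assumes a GCC or Clang toolchain (Itanium C++ ABI), where `abi::__cxa_demangle` from `<cxxabi.h>` is available; the helper name `demangle` is made up for illustration.

```cpp
#include <cassert>
#include <cstdlib>
#include <cxxabi.h>  // abi::__cxa_demangle (GCC/Clang, Itanium C++ ABI)
#include <memory>
#include <string>

// Hypothetical helper: returns the demangled form of an Itanium-ABI symbol
// name, or the input unchanged if it cannot be demangled.
inline std::string demangle(const char* mangled) {
  assert(mangled != nullptr);
  int status = 0;
  // __cxa_demangle returns a malloc'ed buffer, so release it with std::free.
  std::unique_ptr<char, void (*)(void*)> text(
      abi::__cxa_demangle(mangled, nullptr, nullptr, &status), std::free);
  return (status == 0 && text) ? std::string(text.get()) : std::string(mangled);
}
```

Calling `demangle("_ZN3fmt2v96detail18throw_format_errorEPKc")` yields `fmt::v9::detail::throw_format_error(char const*)`, so the symbol in question is a function in fmt's `detail` namespace. Running the name through `c++filt` gives the same answer.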

Since we compiled nvbench against the shared library version of fmt, it has recorded that _ZN3fmt2v96detail18throw_format_errorEPKc is an undefined symbol. When we compile the executable's translation unit, it records that _ZN3fmt2v96detail18throw_format_errorEPKc is weak and PRIVATE (hidden), due to conda injecting -fvisibility-inlines-hidden on the compilation line.

So when we go to link the final executable, the linker notices that we have inconsistent definitions for this symbol: the executable's object file says it is PRIVATE, while the nvbench library says it is PUBLIC and undefined. Since ld.bfd rejects inconsistent specifications, it errors out.

To fix this I have identified two solutions:

1. Ensure that any translation unit compiled under conda which ingests fmt via both shared lib and header has -fno-visibility-inlines-hidden on the compile line.
2. Ensure that any translation unit compiled under conda which ingests fmt via both shared lib and header has -DFMT_SHARED on the compile line.

I went with option 2 as it has the smallest impact on other functions / libraries being used. To implement it, I made the nvbench dependency on fmt public so that we propagate the FMT_SHARED flag to consumers.
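To illustrate why a compile definition can change what the linker sees, here is a minimal sketch. It assumes the library header maps the define to an explicit visibility attribute; that mechanism is an assumption, not something stated in this thread. The `DEMO_*` macros and the `demo` namespace are hypothetical stand-ins and are not fmt's real names.

```cpp
#include <cassert>
#include <string>

// Hypothetical stand-in for the effect of passing -DFMT_SHARED; DEMO_* names
// are illustrative and are not fmt's real macros.
#define DEMO_SHARED 1

#if defined(DEMO_SHARED) && (defined(__GNUC__) || defined(__clang__))
// Explicit attribute: GCC documents that marking a function's visibility
// explicitly negates -fvisibility-inlines-hidden for that function.
#  define DEMO_API __attribute__((visibility("default")))
#  define DEMO_API_MODE "default"
#else
#  define DEMO_API
#  define DEMO_API_MODE "unspecified"
#endif

namespace demo {

// Inline, so each including translation unit emits its own weak definition.
DEMO_API inline std::string format_error_text(const char* message) {
  assert(message != nullptr);
  return std::string("format error: ") + message;
}

// Reports which branch of the macro logic above was taken.
inline const char* api_mode() { return DEMO_API_MODE; }

}  // namespace demo
```

Under this assumption, the inline definition emitted into the executable's object file keeps default visibility even with -fvisibility-inlines-hidden on the compile line, so it no longer disagrees with the undefined reference recorded in the shared library.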

vyasr · 2023-02-23T01:11:02Z

Got it, that makes sense. What happens in the case where a TU links to both rmm (which links to the fmt::fmt-header-only target) and the patched nvbench (which links to fmt::fmt)? Will the two not end up conflicting?

robertmaynard · 2023-02-23T13:42:03Z

> Got it, that makes sense. What happens in the case where a TU links to both rmm (which links to the fmt::fmt-header-only target) and the patched nvbench (which links to fmt::fmt)? Will the two not end up conflicting?

They don't end up conflicting: when fmt is given the defines for both header-only and library mode, it goes with library mode.

robertmaynard · 2023-02-27T14:02:17Z

/merge

robertmaynard added the bug, non-breaking, and 3 - Ready for Review labels Feb 17, 2023

robertmaynard requested a review from a team as a code owner February 17, 2023 19:39

robertmaynard changed the title ~~rapids_cpm_spdlog properly specify usage of external fmt library~~ rapids_cpm_nvbench properly specify usage of external fmt library Feb 21, 2023

robertmaynard force-pushed the bug/ensure_nvbench_uses_header_fmt_only branch from 9df6eb3 to c142539 February 21, 2023 13:11

Avoid using CUDA 11.4 + gcc-11 due to a known compiler bug with std::…

04f44ca

…function

robertmaynard added 2 commits February 22, 2023 09:40

Update cpm_nvbench-conda-fmt.cmake

3202e1f

Update cpm_nvbench-conda-fmt.cmake

98c984d

vyasr approved these changes Feb 23, 2023

rapids-bot bot merged commit 7db9ade into rapidsai:branch-23.04 Feb 27, 2023

robertmaynard deleted the bug/ensure_nvbench_uses_header_fmt_only branch February 27, 2023 14:03
