rocm_helpers missing dependency declarations #61354
Comments
Hi @MrTreev, could you please test with the below configurations of Clang and Bazel and let us know if the problem still persists? Higher versions may not be compatible. Your Clang version appears to be 17.0, whereas the tested version is 16.0.0. The same applies to Bazel: you appear to have 6.1.0 installed, while the tested version for TF 2.13 is 5.3.0.
You can find the build instructions here. |
I certainly can try that either later tonight or tomorrow morning (AEST). I'll get back to you when that's done. |
On the r2.13 branch, I've switched to Bazel 5.3.0. I'll try the other changes tomorrow morning, but so far there is no difference in the error (I've been doing full clean builds each time). |
Hi @MrTreev , Kindly update on this. Thanks! |
Hi @SuryanarayanaY, I'm attempting to get Clang 16 working reliably at the moment. Sadly, the Arch Linux repos currently have only 15 and 17, so I'm having to install it manually, and doing so without breaking the rest of my environment is proving a little tricky. Thankfully I should be able to dedicate a good bit of time to this over the next couple of days, so I hope to have an update soon. |
I've found a set of working ROCm packages with clang-16 included. Since switching to them I get a different error, which I believe can be fixed by adding the files somewhere in the Bazel build system. I'm trying to figure out where exactly at the moment, but if anyone could look at this, that would be appreciated.
|
I found that ROCm's tensorflow-upstream gets further in the build process, so I'm looking at the differences at the moment to try to find a fix. |
I don't think there's a simple fix I can apply, and the best place for my issue is likely in the RadeonOpenCompute fork until the changes I need are merged. |
Hi @MrTreev, did you solve this problem in the end, or did you switch to the TensorFlow in the ROCm repo? |
@guangzlu I switched to the ROCm version, haven't tried the base version on my ROCm hardware recently. I might get some time to try this weekend if it'd help solve issues, but for me the ROCm version is stable and functioning. |
Hi @MrTreev, I found the solution to this issue: add
|
Issue type
Build/Install
Have you reproduced the bug with TensorFlow Nightly?
Yes
Source
source
TensorFlow version
master/nightly
Custom code
Yes
OS platform and distribution
Arch Linux (Linux 6.4.4-arch1-1 #1 SMP PREEMPT_DYNAMIC x86_64 GNU/Linux)
Mobile device
N/A
Python version
3.10
Bazel version
6.1.0
GCC/compiler version
gcc (GCC) 13.1.1 20230714
CUDA/cuDNN version
None
GPU model and memory
AMD Radeon RX 7900 XT
Current behavior?
After adding
#include <stdint.h>
at line 16 of tensorflow/tsl/lib/io/cache.cc
to fix a different error, and using the installation method described in the reproduce field, Bazel gives the error described in the attached log.
This persists across different Bazel versions and full clean builds.
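For readers wondering why the `#include <stdint.h>` workaround above would be needed at all: a minimal sketch, assuming the unrelated error was an undeclared fixed-width integer type such as `uint64_t` (the report does not say which error it was). With GCC 13, some standard library headers reportedly no longer pull in `<cstdint>` transitively, so code that relied on that now needs an explicit include. The helper `LowBits` below is hypothetical and is not taken from cache.cc.

```cpp
// Sketch only: shows a translation unit that uses fixed-width integer
// types and therefore includes <stdint.h> explicitly instead of relying
// on another header to bring the types in.
#include <stdint.h>  // declares uint32_t, uint64_t, ... in the global namespace

// Hypothetical helper (not from TensorFlow): keeps the low 32 bits of a key.
// Without the include above, this may fail to compile on toolchains where
// no other included header happens to declare uint32_t / uint64_t.
inline uint32_t LowBits(uint64_t key) {
  return static_cast<uint32_t>(key & 0xffffffffu);
}

// Compile-time sanity checks on the widths this sketch relies on.
static_assert(sizeof(uint32_t) == 4, "uint32_t must be 4 bytes");
static_assert(sizeof(uint64_t) == 8, "uint64_t must be 8 bytes");
```

This only illustrates the include fix; it does not address the rocm_helpers dependency error that this issue is about.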
I am using the following Arch Linux packages for ROCm:
Standalone code to reproduce the issue
Relevant log output