-
-
Notifications
You must be signed in to change notification settings - Fork 201
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Tuners segfault #243
Comments
I've just renamed this because this has nothing to do with the "alltuners", it is just that whenever you would run a tuner (e.g. |
Greatly thanks, my big brother. Strange, I tried the commands gpu@gpu-FP4:~/yuanshuai/code/CLBlast/build$ ./clblast_tuner_copy_fast --platform 0 --device 1
* Options given/available:
-platform 0 [=default]
-device 1
-precision 32 (single) [=default]
-m 1024 [=default]
-n 1024 [=default]
-alpha 2.00 [=default]
-fraction 1.00 [=default]
-runs 10 [=default]
-max_l2_norm 0.00 [=default]
* Found 144 configuration(s)
* Parameters explored: COPY_DIMX COPY_DIMY COPY_WPT COPY_VW
| ID | total | param | compiles | time | GB/s | status |
x------x-------x---------------------x----------------x--------------x--------x-------------------x
Segmentation fault (core dumped)
gpu@gpu-FP4:~/yuanshuai/code/CLBlast/build$ ./clblast_tuner_copy_fast --platform 0 --device 0
* Options given/available:
-platform 0 [=default]
-device 0 [=default]
-precision 32 (single) [=default]
-m 1024 [=default]
-n 1024 [=default]
-alpha 2.00 [=default]
-fraction 1.00 [=default]
-runs 10 [=default]
-max_l2_norm 0.00 [=default]
* Found 144 configuration(s)
* Parameters explored: COPY_DIMX COPY_DIMY COPY_WPT COPY_VW
| ID | total | param | compiles | time | GB/s | status |
x------x-------x---------------------x----------------x--------------x--------x-------------------x
Segmentation fault (core dumped) I can successfully run opencl program (of course, including opencl kernel files) using our company's internal opencl framework. But strange either, segmentfault using clpeak as below: ./clpeak
Platform: AMD Accelerated Parallel Processing
Device: Carrizo
Driver version : 1912.5 (VM) (Linux x64)
Compute units : 6
Clock frequency : 576 MHz
Segmentation fault (core dumped) Thus, I think maybe some configs in host functions about OCL are not correct or etc. I think maybe I need to check configs in our company framework about OCL next. |
Yes, this looks like something is wrong with your system rather than with CLBlast. To verify you can also try the |
Thanks, I'll have a try 🙇 |
I am experiencing similar problems to the ones described above. When running the invert tuner on an Intel platform it segfaults with one of several back-traces, for example:
or
I also got a segmentation fault with the following back-trace that looks like it originated in the reference kernel itself, because the thread running the tuner stopped at a line that times the reference kernel:
Maybe this is the actual source of the problem? I tried on two different machine with Intel platforms with no luck. I was able to run all tuners on an NVIDIA platform just fine. Your help would be very much appreciated, many thanks in advance! |
I doubt that this is related... the original question made all tuners crash at the very first kernel invocation, whereas you have something specific for the invert tuner (all others work?). Also you report that this only happens on Intel platforms, whereas the original question was on an AMD platform. Nevertheless, it should be fixed :-) Can you give details of your system, e.g. OS, Intel OpenCL version, and device you are testing on. Perhaps I can then try to reproduce. |
Sorry, you are right, it is not directly related. I can open a new issue if you'd prefer that. I am testing on CentOS 7.4.1708 on the following platform and device (clinfo output):
All other tuners are running correctly. I was also able to tune CLBlast version 0.11.0 a few months ago on that same machine. |
OK, thanks for the info. With that I actually managed to reproduce it on Debian system with an Intel CPU and Intel OpenCL. I'll investigate. |
I fixed quite a few issues in the tuner. The tuner now works successfully on my test system, could you also have a go with the latest master? Background info: The invert tuner is new and isn't really used yet, it more like a stub. But still good to not make it crash of course :-) |
Hi guys! I'm not sure how relevant it is, but a couple of months ago I ran into similar problems with my project (I used AMD APP SDK 3.0 on Ubuntu 14.04.3 LTS). In my case both the drivers and the SDK were installed properly, clinfo and other pre-built OpenCL software were working correctly and showing some meaningful information. Eventually I've solved the issue. Here's what I suggest you to check/try:
After these changes both clpeak and all my code began working like a charm. So try it, at least if you're using the same configuration. |
Thanks for the quick fix! I can now successfully run the invert tuner. |
@richardschulze: Good to hear your issue is solved 👍 @kpot: Thanks for the info. I think I'll add your info to the CLBlast documentation under frequently asked questions. I don't think this is something that should go into the mainly CLBlast, right? Or do you think this applies for every platform? |
@CNugteren. I don't think this should be included into CLBlast's config. Such things can and should be controlled externally - by the developers who use CLBlast. But I agree that adding this to the FAQ might be helpful. Because it definitely applies at least to Linux. With The |
@kpot: OK, thanks for the explanation. I've added a note in the installation guide Since @richardschulze issue is confirmed to be solved and @ysh329's issue seems to be more a general OpenCL issue (and can perhaps be solved by the above suggestion), I'm closing this issue. |
Sorry, novice
at the start of |
I think |
I think so - I just removed the entire |
Hi, I wanna tune the GEMM performance on my AMD gpu referring the commands below from readme:
However, I met problems below:
The text was updated successfully, but these errors were encountered: