Skip to content

Commit

Permalink
KMeans spmd timeout fix (#2651)
Browse files Browse the repository at this point in the history
  • Loading branch information
inteldimitrius authored Feb 1, 2024
1 parent 09838b7 commit dc9f31f
Showing 1 changed file with 7 additions and 2 deletions.
Original file line number Diff line number Diff line change
Expand Up @@ -218,9 +218,14 @@ sycl::event extract_and_share_by_index(const bk::context_gpu& ctx,

// Share part
if (rank_count > 1l) {
const auto dst =
// comm.bcast() is sporadically failing by timeout
// TODO: Need to fix comm.bcast() and replace comm.allreduce() with comm.bcast()
if (rank != target) {
pr::fill(queue, place, Float(0), new_deps).wait_and_throw();
}
auto wrap_place =
array<Float>::wrap(queue, place.get_mutable_data(), place.get_count(), new_deps);
comm.bcast(dst, target).wait();
comm.allreduce(wrap_place).wait();
}

return bk::wait_or_pass(new_deps);
Expand Down

0 comments on commit dc9f31f

Please sign in to comment.