I'm trying to use shard_map to compute independently on each CPU device, and then use io_callback() to launch some other computation on the GPU and return the result. I've registered a shard_map rule for io_callback that seems correct in this specific scenario, but I'm getting an XLA assertion error (from https://github.com/openxla/xla/blob/main/xla/hlo/ir/hlo_sharding.cc#L843):
2023-04-25 16:47:53.206001: F external/xla/xla/hlo/ir/hlo_sharding.cc:843] Check failed: !IsManual()
It seems that using a manual sharding strategy is not possible when running callbacks? jax.debug.debug_callback works (see the sketch below), but it can't return anything, so maybe that's why.
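For contrast, here is a minimal sketch of the side-effect-only variant that does work for me (using the public jax.debug.callback wrapper, and assuming the cpu_mesh / shard_map setup from the full example below):

# Side-effect-only callback: runs fine under shard_map, but returns nothing
# into the traced computation.
@partial(shard_map, mesh=cpu_mesh, in_specs=(P("i"),), out_specs=P("i"))
def compute_with_debug(x):
    jax.debug.callback(lambda a: print("shape:", a.shape), x)
    return x + x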
Here's the failing example (invoke with XLA_FLAGS="--xla_force_host_platform_device_count=8"):
from functools import partial

import jax
import jax.numpy as jnp
import numpy as np
from jax._src.callback import io_callback_p
from jax.experimental import io_callback
from jax.experimental.shard_map import register_rule, shard_map
from jax.sharding import Mesh
from jax.sharding import PartitionSpec as P


# shard_map replication rule for io_callback: outputs inherit the inputs' replication.
@register_rule(io_callback_p)
def _io_callback_rule(mesh, *in_rep, **_kwargs):
    return in_rep


def print_args(x):
    print("args shapes :", x.shape)
    return x


# One mesh axis spanning all (forced) host CPU devices.
cpu_mesh = Mesh(np.array(jax.devices("cpu")), axis_names=("i",))


@jax.jit
@partial(shard_map, mesh=cpu_mesh, in_specs=(P("i"),), out_specs=P("i"))
def compute_in_cpu(x):
    x = x + x
    x = io_callback(print_args, x, x, ordered=False)  # works fine if commenting out this line
    return x


if __name__ == "__main__":
    ones = jnp.ones((8000, 3))
    print(ones.devices())
    out = compute_in_cpu(ones)
    print(out.devices())
    print("out shape=", out.shape)
(My actual objective is to do some JAX computation on many CPU devices and some JAX computation on a single GPU. My attempts at this led me to set XLA_FLAGS="--xla_force_host_platform_device_count=8" to create multiple CPU devices, and to call io_callback from them to later do some work on the GPU. Should I instead implement custom CPU and GPU primitives that communicate under the hood, or stage the two steps explicitly, as in the sketch below?)
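For reference, a hypothetical sketch of that staged alternative (it assumes a GPU device is actually available, and the x * 2.0 body is only a placeholder for the real GPU computation):

# Hypothetical alternative: stage the pipeline in plain Python instead of
# calling back to the GPU from inside shard_map.
cpu_out = compute_in_cpu(ones)  # sharded computation across the CPU devices
gpu_in = jax.device_put(cpu_out, jax.devices("gpu")[0])  # explicit CPU -> GPU transfer
gpu_out = jax.jit(lambda x: x * 2.0)(gpu_in)  # placeholder GPU step; jit runs on the
                                              # device the input is committed to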