
Public API to force load custom ops #1151

Closed
guillaumekln opened this issue Feb 25, 2020 · 3 comments · Fixed by #1193

Comments

@guillaumekln
Contributor

guillaumekln commented Feb 25, 2020

Currently, it is inconvenient to load in Python a SavedModel that includes Addons custom ops. Consider the example below:

  • save.py
import tensorflow as tf
import tensorflow_addons as tfa

class Model(tf.keras.Model):
    @tf.function(input_signature=(tf.TensorSpec(shape=[None, 32], dtype=tf.float32),))
    def call(self, x):
        return tfa.activations.gelu(x)

model = Model()
tf.saved_model.save(model, '/tmp/model', signatures=model.call)
  • load.py
import tensorflow as tf
tf.saved_model.load("/tmp/model")

The load will fail because the Addons custom ops are not registered with the TensorFlow runtime. This is expected: tf.load_op_library must first be invoked on the custom op libraries.

However, with the new work on lazy loading (#855) it has become harder to force this op registration. For this model, the user must currently run the following, which relies on internal APIs:

from tensorflow_addons.activations.gelu import _activation_so
_activation_so.ops

If the custom ops are not loaded during the main import (i.e. during import tensorflow_addons), then the package should expose a public API that registers all custom ops.

Any thoughts?

@gabrieldemarmiesse
Member

gabrieldemarmiesse commented Feb 25, 2020

That's definitely a big problem.

I'd be in favor of avoiding an explicit function to force the loading of the ops. It's not great UX, as users will likely expect this to be automatic.

We could try to load all the shared objects (.so files) at import time and throw a warning if something goes wrong, with a mechanism similar to #1137. But then we should have a way of disabling the warnings for users who don't care about custom ops.
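The warn-and-continue idea could be sketched roughly like this. This is a minimal illustration, not TFA's implementation: the helper name and the indirection through load_fn are hypothetical, used so the sketch runs without TensorFlow installed; in practice load_fn would be tf.load_op_library and the caught error would be tf.errors.NotFoundError.

```python
import warnings


def try_load_op_library(load_fn, path):
    """Attempt to load a custom op shared object; warn instead of raising.

    load_fn is a stand-in for tf.load_op_library (hypothetical indirection
    so this sketch runs without TensorFlow). Returns the loaded library,
    or None if loading failed.
    """
    try:
        return load_fn(path)
    except Exception as exc:  # tf.errors.NotFoundError in practice
        warnings.warn(
            f"Could not load custom op library {path!r}; "
            f"custom ops from it will be unavailable: {exc}"
        )
        return None
```

Disabling the warning would then just be a matter of the usual warnings filters (or a package-level flag), rather than a hard failure at import time.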

In a perfect world, one would be able to save a model using the .so, and another user could reload the same model using the python-only equivalent op. Is that possible? Maybe with the keras model saving?

EDIT: Actually, I'm not so sure registering everything at import time is the best UX. If we made a function to register all ops, would it also register the Keras objects, i.e. the register_keras_serializable part?

@guillaumekln
Contributor Author

From the user's perspective, loading everything at import time sounds like the least surprising behavior. In the example above,

import tensorflow as tf
tf.saved_model.load("/tmp/model")

adding the Addons import would seem a natural fix for the SavedModel loading issue:

import tensorflow as tf
import tensorflow_addons as tfa
tf.saved_model.load("/tmp/model")

We can try to load all the SO at import time and then throw a warning if something goes wrong?

IIRC, the custom op errors we faced were mostly segmentation faults on import, right? In that case, it would be difficult to catch the error. We could fork the process to load the custom op safely, but that seems overkill.

Or we could load custom ops by default and add an option to disable this automatic loading for users facing issues.
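The opt-out idea could look something like the following sketch. The environment variable name and helper are hypothetical (the real flag, if adopted, would be chosen by the maintainers); the point is only that the automatic loading path checks a user-controlled switch before touching any .so file.

```python
import os

# Hypothetical opt-out variable name, for illustration only.
_DISABLE_VAR = "TFA_DISABLE_CUSTOM_OP_LOADING"


def should_auto_load_custom_ops(environ=None):
    """Return True unless the user explicitly opted out of loading .so files.

    environ defaults to os.environ; it is a parameter here so the
    behavior is easy to test without mutating the real environment.
    """
    environ = os.environ if environ is None else environ
    return environ.get(_DISABLE_VAR, "") not in ("1", "true", "True")
```

At import time, tensorflow_addons would call should_auto_load_custom_ops() and skip tf.load_op_library entirely when it returns False.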

@gabrieldemarmiesse
Member

When loading a .so, if there is any kind of issue, TensorFlow throws a tensorflow.errors.NotFoundError rather than segfaulting. So it's easy to emit a warning and continue if there is a problem at import time.

I believe we need to discuss this statement further:

From the user perspective, loading everything at import time sounds like the less surprising behavior.

If we take the "load the .so files at import time" approach:

import tensorflow as tf
import tensorflow_addons as tfa
tf.saved_model.load("/tmp/model")

  • If the user is using PyCharm, it's going to tell them: tensorflow_addons imported but unused
  • If the user is using Visual Studio Code, it's going to tell them: tensorflow_addons imported but unused
  • If the user is using flake8, it's going to tell them: tensorflow_addons imported but unused
  • If the user has a colleague who has never worked with Addons, the colleague is going to say that tensorflow_addons is imported but unused, and push a commit removing the import.

This doesn't happen if we go for:

import tensorflow as tf
import tensorflow_addons as tfa
tfa.register_all_ops()
tf.saved_model.load("/tmp/model")

When reading code written with a library that has a well-designed API (like keras/pygithub/numpy...), it's possible to follow a piece of code without reading the documentation beforehand. In this case it's really clear what is happening, unlike when the shared objects are loaded at import time. A piece of code is read many more times than it's written, so we should optimize for readability.

If we follow the principle of least surprise, doing any significant work at import time seems strange and unintuitive to many users, as demonstrated by the warnings from flake8, PyCharm, and Visual Studio Code. It's also against the Zen of Python:

Explicit is better than implicit

There are other benefits to this approach: we catch errors early, and we avoid adding an environment variable for people who don't want to load the .so files or who want to debug what happens at import time. Because the function call means "load everything" and not "maybe load everything", we can throw a hard error, which is easier to debug. People who want to use TFA with a version of TensorFlow other than the recommended one can simply not call the function, and they won't have issues with the shared objects.

I would recommend making a new function as proposed by @guillaumekln in his first message:

def register_all_ops(keras_objects=True, custom_kernels=True):
    if keras_objects:
        # _addons_keras_objects: a module-level list of the package's
        # Keras classes and functions (sketch; not an existing name).
        for obj in _addons_keras_objects:
            tf.keras.utils.register_keras_serializable("Addons")(obj)
    if custom_kernels:
        # _custom_kernel_paths: a module-level list of paths to the
        # compiled .so files (sketch; not an existing name).
        for custom_kernel_path in _custom_kernel_paths:
            tf.load_op_library(custom_kernel_path)
