Speed up TF tests by reducing hidden layer counts #24595
Conversation
# Make sure fit works with tf.data.Dataset and results are consistent
dataset = tf.data.Dataset.from_tensor_slices(prepared_for_class)

if sample_weight is not None:
    # Add in the sample weight
    weighted_dataset = dataset.map(lambda x: (x, None, tf.convert_to_tensor(0.5, dtype=tf.float32)))
else:
    weighted_dataset = dataset
# Pass in all samples as a batch to match other `fit` calls
weighted_dataset = weighted_dataset.batch(len(dataset))
dataset = dataset.batch(len(dataset))
# Reinitialize to fix batchnorm again
model.set_weights(model_weights)

# To match the other calls, don't pass sample weights in the validation data
history3 = model.fit(
    weighted_dataset,
    validation_data=dataset,
    steps_per_epoch=1,
    validation_steps=1,
    shuffle=False,
)
val_loss3 = history3.history["val_loss"][0]
self.assertTrue(not isnan(val_loss3))
accuracy3 = {key: val[0] for key, val in history3.history.items() if key.endswith("accuracy")}
self.check_keras_fit_results(val_loss1, val_loss3)
self.assertEqual(history1.history.keys(), history3.history.keys())
if metrics:
    self.assertTrue(len(accuracy1) == len(accuracy3) > 0, "Missing metrics!")
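As an aside for readers unfamiliar with the pattern: the three-element tuples produced by the map() call rely on Keras's (inputs, targets, sample_weights) convention for tf.data; targets are None above because the labels already live inside the prepared input dict. Here is a minimal, self-contained sketch of the same convention (illustrative only, not code from the PR):

import tensorflow as tf

# Toy data: 8 samples, 4 features, binary integer labels.
x = tf.random.normal((8, 4))
y = tf.random.uniform((8,), maxval=2, dtype=tf.int32)
dataset = tf.data.Dataset.from_tensor_slices((x, y))

# Attach a constant per-sample weight of 0.5, mirroring the map() call above.
weighted = dataset.map(lambda xi, yi: (xi, yi, tf.constant(0.5)))

model = tf.keras.Sequential([tf.keras.layers.Dense(2)])
model.compile(
    optimizer="adam",
    loss=tf.keras.losses.SparseCategoricalCrossentropy(from_logits=True),
)
# Keras unpacks each batch as (inputs, targets, sample_weights).
model.fit(weighted.batch(8), epochs=1)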
I added this section in test_keras_fit a long time ago when I was paranoid about issues from fitting tf.data.Dataset, but I believe it should not be possible for this section to fail if the other model tests pass, so I removed it!
It would be nice if you could show the timing for one model (before vs. after) 🙏. Thanks.
The documentation is not available anymore as the PR was closed or merged.
@ydshieh testing locally, BERT went from 510 seconds -> 220 seconds.
@@ -57,7 +57,7 @@ def __init__(
         use_labels=True,
         vocab_size=99,
         hidden_size=32,
-        num_hidden_layers=5,
+        num_hidden_layers=1,
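For a sense of scale, here is a minimal, illustrative sketch (not the tester code itself) of the kind of tiny model such a config produces. BertConfig and TFBertModel are the public transformers classes; the num_attention_heads and intermediate_size values are assumptions chosen only so the config is valid:

from transformers import BertConfig, TFBertModel

# Illustrative only: mirrors the tester fields shown in the diff above.
config = BertConfig(
    vocab_size=99,
    hidden_size=32,
    num_hidden_layers=1,   # value from the diff; the review below suggests 2 instead
    num_attention_heads=4,  # assumed: must divide hidden_size
    intermediate_size=37,   # assumed small value
)
model = TFBertModel(config)
# A 1-2 layer model gives TF far fewer ops to trace than the old 5-layer one,
# which is where the test-time saving comes from.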
Let's not use 1 but 2 if possible. 1 is kind of an exceptional case.
Fixed!
Works for me, except let's not use 1 as the number of layers 🙏. Let's also get an approval from @sgugger, as I am not the one who decided to use 5.
I don't know if it was @sgugger either - a lot of this code is really old! I see
I know he is probably not the one who decided to use
Thanks!
A lot of our slow TF tests are caused by TF compilation. TF compilation isn't really affected by layer width at all - the main thing is just the number of operations it has to build a graph for. By reducing the number of hidden layers, compilation gets much faster, hopefully without interfering with test coverage at all.
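As a rough illustration of that claim (a minimal sketch, not the transformers test harness): first-call tracing time for a tf.function grows with the number of stacked layers, i.e. the number of ops in the graph, while widening each layer changes it very little.

import time
import tensorflow as tf

def first_trace_time(num_layers, width):
    # A plain stack of Dense layers: depth sets the number of graph ops,
    # width sets the size of each op.
    model = tf.keras.Sequential(
        [tf.keras.layers.Dense(width, activation="relu") for _ in range(num_layers)]
    )
    fn = tf.function(model)
    x = tf.random.normal((4, width))
    start = time.perf_counter()
    fn(x)  # the first call triggers tracing / graph construction
    return time.perf_counter() - start

for layers, width in [(2, 32), (5, 32), (2, 512)]:
    print(f"{layers} layers, width {width}: {first_trace_time(layers, width):.3f}s to trace")

Real transformer layers each expand into many more ops (attention, layer norm, MLP), so the gap between 5 layers and 2 is much larger in the actual tests, which is consistent with the BERT timing reported above.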