Simplify AutoAugment graph #4751

stiepan · 2023-03-28T17:56:35Z

Category:

Refactoring (Redesign of existing code that doesn't affect functionality

Description:

This PR combines a couple of improvements to AutoAugment processing graph.

The list of augmentations to split processing into is created independently for each stage - this can reduces the size of split-merge tree generated by the select operation.

For example, the default v0 policy uses 25 sub-policies, each sub-policy is a sequence of (at most) 2 augmentations. Even though there are 25 sub-policies, there are only 11 unique augmentations present in those sub-polcies. DALI already splits the computation only in 11 parts. However, it can be further improved if the operators used in given stages of sub-policies form even smaller groups. In case of the default v0 policy, it is 8 vs 11.

In other words, when the comutation is split to apply the "i-th" augmentation in a sequence according to selected sub-polciy, there is no need to take into account all augmentations present in all sub-polcies, but only the augmentations at the i-th position in all sub-policies.

The signed_bin helper that handles random negation of magnitudes can now accept additional shape parameter. With that, if you have a multiple random augmentations to apply in a sequence, you can generate the random signs of magnitudes once for all stages, rather than do it for every stage separately.

Additional information:

Affected modules and functionalities:

AutoAugment (points 1., 2.), RandAugment (point 2.).

Key points relevant for the review:

Tests:

Checklist

Documentation

DALI team only

Requirements

Implements new requirements
Affects existing requirements
N/A

REQ IDs: N/A

JIRA TASK: DALI-3357

Signed-off-by: Kamil Tokarski <[email protected]>

stiepan · 2023-03-28T17:57:48Z

dali/python/nvidia/dali/auto_aug/core/_augmentation.py

+        self._random_sign = random_sign
+        self._signed_magnitude_idx = signed_magnitude_idx
+
+    def __getitem__(self, idx: int):


This enables the signed_bin(mutli_stage_bin)[stage_idx].

stiepan · 2023-03-30T08:55:07Z

!build

dali-automaton · 2023-03-30T09:00:24Z

CI MESSAGE: [7762008]: BUILD STARTED

dali-automaton · 2023-03-30T11:47:57Z

CI MESSAGE: [7762008]: BUILD PASSED

klecki · 2023-04-04T12:55:26Z

dali/python/nvidia/dali/auto_aug/auto_augment.py

+    The output is a tuple of matrix `m` and per stage operators augmentations list `augments`,
+    such that for policy `sub_policy_idx` as the `stage_idx`-ith operation in a sequence, the
+    `augments[stage_idx][m[sub_policy_idx][stage_idx]]` operator should be called.


If I may try to expand the comment:

Suggested change

The output is a tuple of matrix `m` and per stage operators augmentations list `augments`,

such that for policy `sub_policy_idx` as the `stage_idx`-ith operation in a sequence, the

`augments[stage_idx][m[sub_policy_idx][stage_idx]]` operator should be called.

The output is a tuple `(m, augments)`, where `augments` is a list of augmentations to be

used for given stage - each entry contains the reduced list of unique augmentations to be

used for that stage.

The `m` matrix contains the mapping from the original sub_policy_id, to the index within the

reduced list, for every stage.T

That is for policy `sub_policy_idx` as the `stage_idx`-ith operation in a sequence, the

`augments[stage_idx][m[sub_policy_idx][stage_idx]]` operator should be called.

Sure, I'll merge this one, revase the #4753 and expand the comment there.

klecki · 2023-04-04T13:11:01Z

dali/python/nvidia/dali/auto_aug/auto_augment.py

    for stage_id in range(max_policy_len):
-        magnitude_bin = magnitude_bins[stage_id]
-        if use_signed_magnitudes:
-            magnitude_bin = signed_bin(magnitude_bin)
        if should_run[stage_id] < run_probabilities[stage_id]:
-            op_kwargs = dict(sample=sample, magnitude_bin=magnitude_bin,
+            op_kwargs = dict(sample=sample, magnitude_bin=magnitude_bins[stage_id],
                             num_magnitude_bins=policy.num_magnitude_bins, **kwargs)
-            sample = _pretty_select(augmentations, aug_ids[stage_id], op_kwargs,
+            sample = _pretty_select(augmentations[stage_id], aug_ids[stage_id], op_kwargs,
                                    auto_aug_name='apply_auto_augment',
                                    ref_suite_name='get_image_net_policy')


Honestly, this is so cool ;)

stiepan added 3 commits March 28, 2023 17:06

Separate sets of augmentations for each stage in AA

3614612

Signed-off-by: Kamil Tokarski <[email protected]>

Allow multi-dim signed magnitudes

dbba131

Signed-off-by: Kamil Tokarski <[email protected]>

Docs

acc2d59

Signed-off-by: Kamil Tokarski <[email protected]>

stiepan commented Mar 28, 2023

View reviewed changes

jantonguirao assigned szalpal and jantonguirao Mar 29, 2023

jantonguirao approved these changes Mar 29, 2023

View reviewed changes

stiepan added the automatic augmentations Automatic augmentations (AutoAugment, RandAugment, TrivialAugment and more) support in DALI. label Mar 29, 2023

szalpal assigned klecki and unassigned szalpal Mar 31, 2023

klecki approved these changes Apr 4, 2023

View reviewed changes

stiepan merged commit c0fc10d into NVIDIA:main Apr 4, 2023

stiepan mentioned this pull request Apr 4, 2023

Add more AutoAugment policies #4753

Merged

18 tasks

stiepan mentioned this pull request May 31, 2023

[AA] Fix augmentation mapping for sub-policies of different length #4887

Merged

18 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Simplify AutoAugment graph #4751

Simplify AutoAugment graph #4751

stiepan commented Mar 28, 2023 •

edited

Loading

stiepan Mar 28, 2023

stiepan commented Mar 30, 2023

dali-automaton commented Mar 30, 2023

dali-automaton commented Mar 30, 2023

klecki Apr 4, 2023

stiepan Apr 4, 2023

klecki Apr 4, 2023

-    The output is a tuple of matrix `m` and per stage operators augmentations list `augments`,
-    such that for policy `sub_policy_idx` as the `stage_idx`-ith operation in a sequence, the
-    `augments[stage_idx][m[sub_policy_idx][stage_idx]]` operator should be called.
+    The output is a tuple `(m, augments)`, where `augments` is a list of augmentations to be
+    used for given stage - each entry contains the reduced list of unique augmentations to be
+    used for that stage.
+    The `m` matrix contains the mapping from the original sub_policy_id, to the index within the
+    reduced list, for every stage.T
+    That is for policy `sub_policy_idx` as the `stage_idx`-ith operation in a sequence, the
+    `augments[stage_idx][m[sub_policy_idx][stage_idx]]` operator should be called.

Simplify AutoAugment graph #4751

Simplify AutoAugment graph #4751

Conversation

stiepan commented Mar 28, 2023 • edited Loading

Category:

Description:

Additional information:

Affected modules and functionalities:

Key points relevant for the review:

Tests:

Checklist

Documentation

DALI team only

Requirements

stiepan Mar 28, 2023

Choose a reason for hiding this comment

stiepan commented Mar 30, 2023

dali-automaton commented Mar 30, 2023

dali-automaton commented Mar 30, 2023

klecki Apr 4, 2023

Choose a reason for hiding this comment

stiepan Apr 4, 2023

Choose a reason for hiding this comment

klecki Apr 4, 2023

Choose a reason for hiding this comment

stiepan commented Mar 28, 2023 •

edited

Loading