Jpcbertoldo/mvtec ad loco 2 #553

jpcbertoldo · 2022-09-11T12:25:24Z

Description

Create a new dataset: MVTec LOCO Anomaly Detection.

"LOCO" stands for "LOgical COnstraints"

I based myself on anomalib/data/mvtec.py.

Fixes Add support for MVTEC LOCO AD #471

`imread_strategy`

The dataset supports an option imread_strategy which allows the user how to choose when the images are loaded:

onthefly: behaviour I found in mvtec.py, the images are loaded upon demand during the training;
preload: all the images are cached in the memory (RAM, not GPU) when the dataset is being initialized.

`anotype` and `super_anotype`

Besides providing the binary label, I also create the dataset with two other categorical values:

super_anotype: is it a logical or structural anomaly? (or a normal?)
anotype: "what is the problem with the image?", mvtec ad also has different types of anomalies for each category but this is particularly more interesting here because there are many types of logical violations possible.

I specifically included this because I am interested in evaluating separately by those types but I will later create an issue for that feature.

`mask` vs. `masks`

MVTec LOCO's logical anomalies may include several anoamlies in a single image and to properly evaluate them one needs to consider them separately so they are segmented in different mask files in the ground truth.

Since the rest of library expects a tensor mask (SINGULAR), I merge them all into a single binary maks (with loss information because they cannot be separated anymore).

In order to later peform proper evaluation there is a second tensor masks (PLURAL) which encodes each anomalous region with a different value (0 is a normal pixel, and 1, 2, ..., N are anomalous pixels).

things in `MVTecAD` but not in `MVTecLOCO`

1) `self.transform_config_val = self.transform_config_train`

        if self.transform_config_train is not None and self.transform_config_val is None:
            self.transform_config_val = self.transform_config_train

Is there a good reason for assuming this?

For me it could make sense that self.transform_config_val could have light data augmentations (say, tiny brightness changes) but that should not be repeated in the validation set.

2) `split_normal_images_in_train_set(samples, split_ratio, seed)`

MVTec LOCO already defines fixed validation sets so i did not include the option of doing it dinamically like in MVTec AD.

Checklists

Changes

Bug fix (non-breaking change which fixes an issue)
Refactor (non-breaking change which refactors the code base)
New feature (non-breaking change which adds functionality)
Breaking change (fix or feature that would cause existing functionality to not work as expected)
This change requires a documentation update

Checklist

My code follows the pre-commit style and check guidelines of this project.
I have performed a self-review of my code
I have commented my code, particularly in hard-to-understand areas
I have made corresponding changes to the documentation
My changes generate no new warnings
I have added tests that prove my fix is effective or that my feature works
[] New and existing tests pass locally with my changes

review-notebook-app · 2022-09-11T12:25:27Z

Check out this pull request on

See visual diffs & provide feedback on Jupyter Notebooks.

Powered by ReviewNB

jpcbertoldo · 2022-09-11T12:26:48Z

anomalib/data/mvtec_loco.py

+                "mask_paths": str(self.samples.iloc[index]["mask_paths"]),
+                # TODO CHECK IF THE DOUBLE CALL TO PREPROCESS WILL WORK WITH ALBUMENTATIONS
+                "masks": self.pre_process(image=image, mask=mask_dict["masks"])["mask"],
+                "mask": self.pre_process(image=image, mask=mask_dict["mask"])["mask"],


self.pre_process is being called for the 3rd time here, will that create any problems?

I'm thinking that maybe the random transforms will apply the same transform every two times (for the image and for the mask).

jpcbertoldo · 2022-09-11T12:27:15Z

anomalib/data/mvtec_loco.py

+        category: str,
+        task: str = TASK_SEGMENTATION,
+        imread_strategy: str = IMREAD_STRATEGY_PRELOAD,
+        image_size: Optional[Union[int, Tuple[int, int]]] = None,


The images in this dataset are not squared.
The ratio of widh/height can end up too different than the original image when the image size is given as an int.

Maybe we should add a warning here?

jpcbertoldo

ready to review

* comet benchmarking enabled * updated BM docs * tweaked comment * commen changet * fixed end of file Co-authored-by: Samet Akcay <[email protected]>

jpcbertoldo · 2022-09-15T15:29:51Z

Please ignore for now, I will wait for PR #558.
Also, I realize that the pre-loading of data is probably unnecessary so I'll remove it to make the PR simpler.

…olkit#565)

…kit#567) * Fix category check * Fix config file

…openvinotoolkit#570) * move sample generation to datamodule instead of dataset * move sample generation from init to setup * remove inference stage and add base classes * replace dataset classes with AnomalibDataset * move setup to base class, create samples as class method * update docstrings * refactor btech to new format * allow training with no anomalous data * remove MVTec name from comment * raise NotImplementedError in base class * allow both png and bmp images for btech * use label_index to check if dataset contains anomalous images * refactor getitem in dataset class * use iloc for indexing * move dataloader getters to base class * refactor to add validate stage in setup * Add warning message when there is no config file passed * Extract get_transforms and get_height_and_width functions * refactor pre-processor and fix visualizer normalization issue * Revert thenew data refactor * rename variable * Revert the changes not merged yet * Fix tests * Fix tests * Address codacy concerns Co-authored-by: Dick Ameln <[email protected]>

Check for successful openvino conversion

* added hpo * lint fixed * Update hyperparameter_optimization.rst * fixed file lint * fixed documentation images * added sweep doc image * updated hpo docs to include images * fixed linting errors * added config folder to store sample sweeps * fixed docs for new location of config files * not needed. moved to config directory * not needed moved to config directory * renamed to configs * changed to "configs" * fixed grammar

* fix patchcore image-level score computation * docstring and comment * remove default value for n_neighbors * torch.Tensor -> Tensor

…otoolkit#589) * Fix CFlow anomaly map generator * Refactor anomaly_map

* Add benchmark to tutorial * Move export to tutorials * Move hpo to tutorials * Move inference to tutorials * Move logging to tutorials * Create installation in tutorials * Create training to tutorials * Create tutorials index * Update conf.py file * Add anomalib logos to logos directory * Add data docs * Add algos * Add model docs * Add reference api * Remove blank line in metrics * Add reference guide * Add how to guides * Add developer guide * Add blog to how-to-guide * Remove guides directory * Add train custom data to how-to-guides * Fix typos * Add notebooks to how-to-guides * Add anomalib favicon * Add missing algo descriptions * Rename Reference to Reference Guide * Add how to add a new model * fix typos * Merge PR 544 * Minor refactor (openvinotoolkit#587) * 🛠 Fix PatchCore image-level score computation (openvinotoolkit#580) * fix patchcore image-level score computation * docstring and comment * remove default value for n_neighbors * torch.Tensor -> Tensor * Minor refactor Co-authored-by: Dick Ameln <[email protected]> Co-authored-by: Ashwin Vaidya <[email protected]> * Address Dicks comments Co-authored-by: Ashwin Vaidya <[email protected]> Co-authored-by: Dick Ameln <[email protected]> Co-authored-by: Ashwin Vaidya <[email protected]>

* Add notebook for hpo * Reference notebook in docs Co-authored-by: Ashwin Vaidya <[email protected]>

* Fix comet hpo + refactoring + fix metriccallback in benchmarking * Move sweep runners + utils to anomalib Co-authored-by: Ashwin Vaidya <[email protected]>

* Add util to convert single value to tuple * Update documentation * Remove unused pytest import * Address PR comments * update text in documentation Co-authored-by: Ashwin Vaidya <[email protected]>

* refactor export callback * refactor export functions * Rename export_convert to export * Rename optimize to export + fix tests * Fix imports * Address tests * Add nosec to surpress subprocess warnings * Add nosec to surpress run

Address docs build dependency issues

* move sample generation to datamodule instead of dataset * move sample generation from init to setup * remove inference stage and add base classes * replace dataset classes with AnomalibDataset * move setup to base class, create samples as class method * update docstrings * refactor btech to new format * allow training with no anomalous data * remove MVTec name from comment * raise NotImplementedError in base class * allow both png and bmp images for btech * use label_index to check if dataset contains anomalous images * refactor getitem in dataset class * use iloc for indexing * move dataloader getters to base class * refactor to add validate stage in setup * implement alternative datamodules solution * small improvements * improve design * remove unused constructor arguments * adapt btech to new design * add prepare_data method for mvtec * implement more generic random splitting function * update docstrings for folder module * ensure type consistency when performing operations on dataset * change imports * change variable names * replace pass with NotImplementedError * allow training on folder without test images * use relative path for normal_test_dir * fix dataset tests * update validation set parameter in configs * change default argument * use setter for samples * hint options for val_split_mode * update assert message and docstring * revert name change dataset vs datamodule * typing and docstrings * remove samples argument from dataset constructor * val/test -> eval * remove Split.Full from enum * sort samples when setting * update warn message * formatting * use setter when creating samples in dataset classes * add tests for new dataset class * add test case for label aware random split * update parameter name in inferencers * move _setup implementation to base class * address codacy issues * fix pylint issues * codacy * update example dataset config in docs * fix test * move base classes to separate files (avoid circular import) * add base classes * update docstring * fix imports * validation_split_mode -> val_split_mode * update docs * Update anomalib/data/base/dataset.py Co-authored-by: Joao P C Bertoldo <[email protected]> * get length from self.samples * assert unique indices * check is_setup for individual datasets Co-authored-by: Joao P C Bertoldo <[email protected]> * remove assert in __getitem_\ Co-authored-by: Joao P C Bertoldo <[email protected]> * Update anomalib/data/btech.py Co-authored-by: Joao P C Bertoldo <[email protected]> * clearer assert message * clarify list inversion in comment * comments and typing * validate contents of samples dataframe before setting * add file paths check * add seed to random_split function * fix expected columns * fix typo * add seed parameter to datamodules * set global seed in test entrypoint * add NONE option to valsplitmode * clarify setup behaviour in docstring * fix typo Co-authored-by: Joao P C Bertoldo <[email protected]> Co-authored-by: Joao P C Bertoldo <[email protected]>

…anomalib into jpcbertoldo/mvtec-ad-loco-2

djdameln · 2023-04-05T14:44:52Z

@jpcbertoldo I am closing this because it has been inactive for a long time and is outdated. Feel free to re-open if you resume working on this.

jpcbertoldo added 8 commits September 11, 2022 11:49

copy mvtec and add some config conts in comments

434a7ad

first version building the dataset

92ad7e2

pass pre-commit hooks

7a80914

remove todos and correct a const

375ac65

remove todos and correct a const

f63d5a9

create notebook and make small corrections

fd2d917

manage multiple masks

43f154e

add unit tests for mvtec loco

49a5a5c

github-actions bot added Data Notebooks Tests labels Sep 11, 2022

jpcbertoldo commented Sep 11, 2022

View reviewed changes

jpcbertoldo marked this pull request as ready for review September 11, 2022 12:29

Benchmarking tool with Comet (openvinotoolkit#545)

e9809c4

* comet benchmarking enabled * updated BM docs * tweaked comment * commen changet * fixed end of file Co-authored-by: Samet Akcay <[email protected]>

samet-akcay and others added 12 commits September 16, 2022 15:46

🐞 Fix: Add map_location when loading the weights (openvinotoolkit#562)

7305246

Add patchcore to openvino export test + upgrade lightning (openvinoto…

a055416

…olkit#565)

🐞 Fix category check for folder dataset in anomalib CLI (openvinotool…

4860abc

…kit#567) * Fix category check * Fix config file

🔨 Check for successful openvino conversion (openvinotoolkit#571)

de1bea2

Check for successful openvino conversion

🛠 Fix PatchCore image-level score computation (openvinotoolkit#580)

6c59e1b

* fix patchcore image-level score computation * docstring and comment * remove default value for n_neighbors * torch.Tensor -> Tensor

🛠 Fix anomaly map computation in CFlow when batch size is 1. (openvin…

7d20aa2

…otoolkit#589) * Fix CFlow anomaly map generator * Refactor anomaly_map

📝 📊 Add notebook for hpo (openvinotoolkit#592)

d80d37d

* Add notebook for hpo * Reference notebook in docs Co-authored-by: Ashwin Vaidya <[email protected]>

🐞 Fix comet HPO (openvinotoolkit#597)

35df574

* Fix comet hpo + refactoring + fix metriccallback in benchmarking * Move sweep runners + utils to anomalib Co-authored-by: Ashwin Vaidya <[email protected]>

✨ Replace keys from benchmarking script (openvinotoolkit#595)

2de548d

* Add util to convert single value to tuple * Update documentation * Remove unused pytest import * Address PR comments * update text in documentation Co-authored-by: Ashwin Vaidya <[email protected]>

ashwinvaidya17 and others added 12 commits October 20, 2022 10:52

🖌 refactor export callback (openvinotoolkit#640)

406f79a

* refactor export callback * refactor export functions * Rename export_convert to export * Rename optimize to export + fix tests * Fix imports * Address tests * Add nosec to surpress subprocess warnings * Add nosec to surpress run

🐞 Address docs build (openvinotoolkit#639)

d78f995

Address docs build dependency issues

copy mvtec and add some config conts in comments

668ce5d

first version building the dataset

c0ea4d6

pass pre-commit hooks

478b2b9

remove todos and correct a const

7b9c096

remove todos and correct a const

1ecffc5

create notebook and make small corrections

c4c4463

manage multiple masks

f5321ac

add unit tests for mvtec loco

13ab40f

Merge branch 'jpcbertoldo/mvtec-ad-loco-2' of github.com:jpcbertoldo/…

106b616

…anomalib into jpcbertoldo/mvtec-ad-loco-2

github-actions bot added Benchmarking Callbacks CI CLI Config Dependencies Pull requests that update a dependency file HPO Inference Metrics Metric Component. Post-Processing The components that are related to post-processing Pre-Processing Setup Tools labels Oct 31, 2022

djdameln closed this Apr 5, 2023

lemonbuilder mentioned this pull request Sep 12, 2023

[Task]: logical anomaly detection #1341

Closed

willyfh mentioned this pull request Jan 13, 2024

🚀 v1 - Add support for MVTec LOCO dataset and sPRO metric #1635

Closed

8 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Jpcbertoldo/mvtec ad loco 2 #553

Jpcbertoldo/mvtec ad loco 2 #553

jpcbertoldo commented Sep 11, 2022

review-notebook-app bot commented Sep 11, 2022

jpcbertoldo Sep 11, 2022

jpcbertoldo Sep 11, 2022

jpcbertoldo left a comment

jpcbertoldo commented Sep 15, 2022

djdameln commented Apr 5, 2023

Jpcbertoldo/mvtec ad loco 2 #553

Jpcbertoldo/mvtec ad loco 2 #553

Conversation

jpcbertoldo commented Sep 11, 2022

Description

imread_strategy

anotype and super_anotype

mask vs. masks

things in MVTecAD but not in MVTecLOCO

1) self.transform_config_val = self.transform_config_train

2) split_normal_images_in_train_set(samples, split_ratio, seed)

Checklists

Changes

Checklist

review-notebook-app bot commented Sep 11, 2022

jpcbertoldo Sep 11, 2022

Choose a reason for hiding this comment

jpcbertoldo Sep 11, 2022

Choose a reason for hiding this comment

jpcbertoldo left a comment

Choose a reason for hiding this comment

jpcbertoldo commented Sep 15, 2022

djdameln commented Apr 5, 2023

`imread_strategy`

`anotype` and `super_anotype`

`mask` vs. `masks`

things in `MVTecAD` but not in `MVTecLOCO`

1) `self.transform_config_val = self.transform_config_train`

2) `split_normal_images_in_train_set(samples, split_ratio, seed)`