Skip to content

Latest commit

 

History

History
33 lines (26 loc) · 1.14 KB

README.md

File metadata and controls

33 lines (26 loc) · 1.14 KB

Siamese NN implementation

Pytorch Siamese NN based on ResNet/RegNet with some tweaks to work with dataset crawled from internet.
It compares two images (mostly persons photos) to classify them as similar or not (the same photo or not).

Fine-tuned classification network based on google/vit

Dataset

Labeling done by Active learning:

  1. First, created manually 100 examples
  2. Train the model (overfitting is okay) for the beginning
  3. Use model to get predictions on 1000+ more examples
  4. Take 100 with the highest error and tune the model
  5. Repeat till the dataset is ready

Which images are similar?

These images should be considered as similar, regardless images have diff size, color, background. img.png

These images should be considered as different img.png

It means, the NN should pay attention to the main character's object on the photo, and should disregard the background.

The goal to implement the NN, which is able to detect the difference.

Phase 2

  • add data aug
  • track f1 score
  • Optimize model to work on CPU
  • Prepare model for inference