Developed models to classify mixed patterns of proteins in microscope images
During training I used several manual steps (a rough sketch of the loop follows this list):
- I stopped training when the model started to overfit, then reduced the learning rate and continued training.
- I used checkpointing to save the best weights and loaded them as the initial weights for the next training run.
- I hand-picked different training and validation sets instead of using cross-validation.
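The sketch below outlines that manual workflow, assuming a PyTorch `model`, `train_loader`, and `val_loader` already exist; it is only an illustration of the steps above, not the exact notebook code. The checkpoint path reuses `best_checkpoint/weight.pth` from this repo.

```python
import torch

def train_stage(model, train_loader, val_loader, lr, epochs, device="cuda",
                ckpt_path="best_checkpoint/weight.pth", resume=False):
    """One manual training stage: train, watch the validation loss for overfitting,
    keep the best weights, and optionally warm-start from the previous stage."""
    if resume:
        model.load_state_dict(torch.load(ckpt_path))  # start from the best weights so far

    optimizer = torch.optim.AdamW(model.parameters(), lr=lr)
    criterion = torch.nn.BCEWithLogitsLoss()           # multi-label targets
    best_val = float("inf")

    for epoch in range(epochs):
        model.train()
        for images, targets in train_loader:
            images, targets = images.to(device), targets.to(device)
            optimizer.zero_grad()
            loss = criterion(model(images), targets)
            loss.backward()
            optimizer.step()

        # Validation pass: a validation loss that keeps rising while the training
        # loss falls is the overfitting signal used to stop the stage.
        model.eval()
        val_loss, n_batches = 0.0, 0
        with torch.no_grad():
            for images, targets in val_loader:
                images, targets = images.to(device), targets.to(device)
                val_loss += criterion(model(images), targets).item()
                n_batches += 1
        val_loss /= max(n_batches, 1)

        if val_loss < best_val:
            best_val = val_loss
            torch.save(model.state_dict(), ckpt_path)  # checkpoint the best weights

# Stage 1: train from scratch; stage 2: reload the best checkpoint with a lower LR.
# train_stage(model, train_loader, val_loader, lr=1e-3, epochs=10)
# train_stage(model, train_loader, val_loader, lr=1e-4, epochs=10, resume=True)
```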
`image_classification.ipynb` is for training the classification model.
`ensemble.ipynb` is for merging multiple predicted probabilities into the final submission.
Below is the solution I used for the final submission.
- EfficientNet-B0 vs EfficientNet-B1 vs EfficientNet-B2 vs EfficientNet-B4 vs ResNet101 vs DenseNet121: results show that the EfficientNets perform better than the ResNets and DenseNets.
- EfficientNet-B1 vs EfficientNet-B2: I used both of them in the ensemble.
- I didn't use EfficientNet-B4 because the score dropped when I resized the images.
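For reference, the compared backbones can be created with the `timm` library as below; this is a hedged sketch rather than the notebook's exact setup, and `NUM_CLASSES` is a placeholder for the number of protein labels.

```python
import timm

NUM_CLASSES = 28  # placeholder: set to the actual number of protein pattern labels

# Backbones compared during model selection. Each head outputs one logit per class
# so the models can be trained with a multi-label (sigmoid) loss.
candidates = [
    "efficientnet_b0", "efficientnet_b1", "efficientnet_b2", "efficientnet_b4",
    "resnet101", "densenet121",
]
models = {name: timm.create_model(name, pretrained=True, num_classes=NUM_CLASSES)
          for name in candidates}
```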
- AdamW vs Adam: the AdamW optimizer converges faster than Adam. See more details in *Why AdamW matters* and *AdamW and Super-convergence is now the fastest way to train neural nets*.
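A minimal sketch of the two optimizers; the learning rate and weight decay here are illustrative values, not necessarily the ones used in the notebook.

```python
import torch

model = torch.nn.Linear(512, 28)  # placeholder model for illustration

# Adam folds weight decay into the gradient (L2 penalty), while AdamW decouples it
# from the adaptive update, which in practice converged faster in this project.
adam  = torch.optim.Adam(model.parameters(),  lr=1e-4, weight_decay=1e-2)
adamw = torch.optim.AdamW(model.parameters(), lr=1e-4, weight_decay=1e-2)
```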
- OneCycle vs CosineAnnealingWarmRestarts:
  - CosineAnnealingWarmRestarts converges faster, but the better F1 scores (>0.82) come from OneCycle with an initial learning rate of 0.0001 and 26 epochs.
  - I used CosineAnnealingWarmRestarts for finding the best model, then OneCycle for generating a single submission result before ensembling.
  - This article explains why choosing the right number of epochs and learning rate matters for OneCycle.
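Below is a hedged sketch of how the two schedulers can be configured in PyTorch with the settings mentioned above (learning rate 1e-4, 26 epochs); the steps-per-epoch value is a placeholder for `len(train_loader)`.

```python
import torch

model = torch.nn.Linear(512, 28)     # placeholder model for illustration
EPOCHS = 26                          # epoch count that gave the best OneCycle scores
STEPS_PER_EPOCH = 100                # placeholder: len(train_loader) in practice

# OneCycleLR: ramps the LR up to max_lr and back down once over the whole run.
opt_a = torch.optim.AdamW(model.parameters(), lr=1e-4)
one_cycle = torch.optim.lr_scheduler.OneCycleLR(
    opt_a, max_lr=1e-4, epochs=EPOCHS, steps_per_epoch=STEPS_PER_EPOCH)

# CosineAnnealingWarmRestarts: cosine decay with periodic restarts (every 5 epochs here).
opt_b = torch.optim.AdamW(model.parameters(), lr=1e-4)
cosine_restarts = torch.optim.lr_scheduler.CosineAnnealingWarmRestarts(
    opt_b, T_0=5 * STEPS_PER_EPOCH)

# Both schedulers are stepped once per batch in this setup:
#   optimizer.step(); scheduler.step()
```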
- I use the augmentations below during training and turn them off at validation and test time (see the sketch after this list).
- RandomHorizontalFlip
- RandomVerticalFlip
- RandomRotation
- ColorJitter(brightness=0.2, saturation=0.2, contrast=0.2)
- Resize() (used only when searching for hyperparameters; I get higher scores without using Resize())
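A sketch of this augmentation pipeline with torchvision transforms; the rotation angle is a placeholder since the original value isn't listed, and Resize() is left commented out because it lowered the score.

```python
from torchvision import transforms

# Training-time augmentation (the rotation angle is a placeholder value).
train_tfms = transforms.Compose([
    transforms.RandomHorizontalFlip(),
    transforms.RandomVerticalFlip(),
    transforms.RandomRotation(degrees=15),
    transforms.ColorJitter(brightness=0.2, saturation=0.2, contrast=0.2),
    # transforms.Resize(256),  # only while searching hyperparameters; hurt the final score
    transforms.ToTensor(),
])

# Validation/test time: augmentation is turned off, only tensor conversion remains.
eval_tfms = transforms.Compose([
    transforms.ToTensor(),
])
```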
- Choose a loss function that aligns with the F1-score metric: binary cross-entropy or focal loss. When the focal loss decreases, the F1 score doesn't increase much; I get better F1 scores from binary cross-entropy.
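The two candidate losses, sketched for multi-label logits; the focal loss below is a common formulation and may differ in detail from the variant tried in the notebook.

```python
import torch
import torch.nn.functional as F

def bce_loss(logits, targets):
    """Multi-label binary cross-entropy, the loss that gave the better F1 scores."""
    return F.binary_cross_entropy_with_logits(logits, targets)

def focal_loss(logits, targets, gamma=2.0, alpha=0.25):
    """A common multi-label focal loss; down-weights easy examples via (1 - p_t)**gamma."""
    bce = F.binary_cross_entropy_with_logits(logits, targets, reduction="none")
    p_t = torch.exp(-bce)  # probability assigned to the true label
    return (alpha * (1.0 - p_t) ** gamma * bce).mean()
```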
- I ensemble the 6 best checkpoints of my model.
- I set 0.5 as the threshold for the predicted classes, then fill missing classes using lower thresholds of 0.46 and 0.445 respectively; argmax() is used for predicting the rest of the missing classes (see the sketch below).
- At first I filled the missing classes with the mode class (class 4), then changed to filling them with the argmax of the probabilities the model generated.
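A hedged sketch of this post-processing, assuming each checkpoint produces an array of per-class probabilities; only one fallback threshold is shown for brevity, so the exact handling of the 0.46/0.445 pair may differ from the notebook.

```python
import numpy as np

def ensemble_and_threshold(prob_list, main_thr=0.5, fallback_thr=0.46):
    """Average the checkpoints' probabilities, threshold at 0.5, then fill samples
    that end up with no predicted class: first with a lower threshold, then argmax.
    `prob_list` is a list of (num_images, num_classes) arrays, one per checkpoint."""
    probs = np.mean(prob_list, axis=0)              # merge the checkpoint predictions
    preds = (probs > main_thr).astype(int)

    empty = preds.sum(axis=1) == 0                  # images with no class above 0.5
    preds[empty] = (probs[empty] > fallback_thr).astype(int)

    still_empty = preds.sum(axis=1) == 0            # fall back to the most likely class
    preds[still_empty, probs[still_empty].argmax(axis=1)] = 1
    return preds
```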
`best_checkpoint/weight.pth` stores the best model weights saved during training.