Add federated contrastive learning baseline SimCLR and its linear probe evaluation #278
Conversation
class SimCLRTransform():
    def __init__(self, is_sup, image_size=32):
Please provide Python-style docstrings for the newly added classes and functions.
transform_train = SimCLRTransform(is_sup=False, image_size=32)
transform_test = T.Compose([
    T.ToTensor(),
    T.Normalize((0.5, 0.5, 0.5), (0.5, 0.5, 0.5))])
Is it conventional to use 0.5 rather than the sample mean?
Setting both parameters to 0.5 and combining with T.ToTensor() forces the data to be scaled to the [-1, 1] interval.
Here the first argument is the mean of the signals; to my knowledge, it is usually calculated from the available examples.
T.ToTensor() maps image values from [0, 255] to [0, 1], and T.Normalize(0.5, 0.5) then maps [0, 1] to [-1, 1] via (x - mean) / std.
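For reference, a minimal sketch (not the PR's code) of the per-channel arithmetic behind ToTensor followed by Normalize with mean = std = 0.5:

```python
# Stand-in functions mirroring what torchvision's ToTensor and Normalize
# do to a single channel value.
def to_tensor(pixel_uint8):
    """ToTensor: map a raw 0-255 channel value into [0.0, 1.0]."""
    return pixel_uint8 / 255.0

def normalize(x, mean=0.5, std=0.5):
    """Normalize: (x - mean) / std maps [0, 1] onto [-1, 1]."""
    return (x - mean) / std

for raw in (0, 128, 255):
    print(raw, "->", normalize(to_tensor(raw)))
# The endpoints 0 and 255 map to -1.0 and 1.0 respectively, confirming
# the [-1, 1] scaling described above.
```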
federatedscope/cl/model/SimCLR.py
Outdated
class Bottleneck(nn.Module):
    expansion = 4
Is it necessary to use a class attribute?
representations = torch.cat([z1, z2], dim=0)
similarity_matrix = F.cosine_similarity(representations.unsqueeze(1), representations.unsqueeze(0), dim=-1)

l_pos = torch.diag(similarity_matrix, N)
I think, up to now, similarity_matrix is a 2N-by-2N matrix. Am I wrong? Why do we need to take the diagonals above and below the main diagonal?
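The reading above is right: after the concatenation, similarity_matrix is 2N-by-2N. A pure-Python sketch with toy embeddings (a stand-in for the torch code, not the PR's implementation) shows why the offset-N diagonal is taken: it holds exactly the positive pairs, i.e. view 1 of example i against view 2 of example i; the offset -N diagonal holds the symmetric copies.

```python
import math

def cosine(u, v):
    """Cosine similarity between two vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    return dot / (nu * nv)

# Two augmented views of N=2 toy examples; z1[i] and z2[i] form a positive pair.
z1 = [[1.0, 0.0], [0.0, 1.0]]
z2 = [[0.9, 0.1], [0.1, 0.9]]
reps = z1 + z2                  # mirrors torch.cat([z1, z2], dim=0): 2N rows
N = len(z1)

# mirrors F.cosine_similarity(...): a 2N-by-2N matrix
sim = [[cosine(u, v) for v in reps] for u in reps]

# torch.diag(sim, N) reads the entries sim[i][i + N]: precisely the
# similarities between the two views of the same example.
l_pos = [sim[i][i + N] for i in range(N)]
print(l_pos)  # high values, since each pair comes from the same example
```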
federatedscope/cl/trainer/trainer.py
Outdated
# print(len(x), x[0].size(), x[1].size(), label.size())
x1, x2 = x[0], x[1]
z1, z2 = ctx.model(x1, x2)
if len(label.size()) == 0:
When will we enter such a branch?
We enter this branch in contrastive learning with two augmented views of the data.
I mean, when does len(label.size()) become zero?
I followed torch_trainer and added this branch. Should I remove it?
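For reference, the guard fires only for a 0-dim ("scalar") label tensor, whose shape is empty. A sketch with plain shape tuples (in torch, torch.tensor(3).size() is torch.Size([]), which behaves like the empty tuple here), e.g. when an unbatched scalar label slips through without a batch dimension:

```python
def needs_unsqueeze(shape):
    """Mirror the trainer's `if len(label.size()) == 0` guard."""
    return len(shape) == 0

scalar_shape = ()        # shape of a 0-dim label tensor
batched_shape = (32,)    # shape of a batch of 32 labels

print(needs_unsqueeze(scalar_shape))   # True: wrap the scalar into a batch of 1
print(needs_unsqueeze(batched_shape))  # False: already batched
```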
@@ -10,7 +10,7 @@ def get_optimizer(model, type, lr, **kwargs):
     if isinstance(type, str):
         if hasattr(torch.optim, type):
             if isinstance(model, torch.nn.Module):
-                return getattr(torch.optim, type)(model.parameters(), lr,
+                return getattr(torch.optim, type)(filter(lambda p: p.requires_grad, model.parameters()), lr,
Is any pFL algorithm affected by such a change? @yxdyc
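As a sanity check on this concern: when every parameter has requires_grad=True, the filter is a no-op, so an algorithm should only be affected if it deliberately freezes parameters. A minimal sketch with stand-in parameter objects (not real torch tensors; all names are illustrative) of what the filter changes for linear-probe evaluation, where the encoder is frozen:

```python
# Stand-in for torch.nn.Parameter: only the requires_grad flag matters here.
class Param:
    def __init__(self, name, requires_grad):
        self.name = name
        self.requires_grad = requires_grad

params = [
    Param("encoder.conv1.weight", requires_grad=False),  # frozen backbone
    Param("encoder.conv2.weight", requires_grad=False),
    Param("linear_head.weight", requires_grad=True),     # trainable probe
    Param("linear_head.bias", requires_grad=True),
]

# The change in get_optimizer: hand the optimizer only trainable parameters.
trainable = list(filter(lambda p: p.requires_grad, params))
print([p.name for p in trainable])
# prints ['linear_head.weight', 'linear_head.bias']: the frozen encoder
# parameters never reach the optimizer.
```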
For any code snippet copied from or inspired by another place, please provide a copyright notice in your files.
This PR includes a new trainer, which is designed for conducting contrastive learning. @DavdGao could you have a look at that part for us? Thanks!
It seems that no readily available splitter has been adopted in your experiments, right? So how do you construct the non-IIDness? We have planned to start with the LDA splitter. Please conduct the experiments accordingly.
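For reference, a minimal pure-Python sketch of the idea behind an LDA-style splitter: for each class, client shares are drawn from a symmetric Dirichlet(alpha), so smaller alpha yields stronger label skew across clients. Function names and defaults here are illustrative, not FederatedScope's API.

```python
import random

def dirichlet(alpha, k, rng):
    """Sample a symmetric Dirichlet(alpha) vector via normalized Gamma draws."""
    draws = [rng.gammavariate(alpha, 1.0) for _ in range(k)]
    total = sum(draws)
    return [d / total for d in draws]

def lda_split(labels, num_clients, alpha=0.5, seed=0):
    """Partition sample indices across clients with Dirichlet label skew."""
    rng = random.Random(seed)
    clients = [[] for _ in range(num_clients)]
    by_class = {}
    for idx, y in enumerate(labels):
        by_class.setdefault(y, []).append(idx)
    for y, idxs in by_class.items():
        rng.shuffle(idxs)
        props = dirichlet(alpha, num_clients, rng)
        # Convert the proportions into contiguous slice boundaries; the last
        # client absorbs any rounding remainder.
        start = 0
        for c in range(num_clients):
            end = len(idxs) if c == num_clients - 1 \
                else start + int(props[c] * len(idxs))
            clients[c].extend(idxs[start:end])
            start = end
    return clients

labels = [i % 10 for i in range(1000)]  # toy CIFAR-10-like label list
parts = lda_split(labels, num_clients=5, alpha=0.5)
print([len(p) for p in parts])  # skewed, but together a full partition
```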
Please see the inline comments and follow the contributor guidelines to format your code.
config = config
return data_dict, config


def Cifar4LP(config):
Duplicated code from line 118 to line 156.
federatedscope/cl/model/SimCLR.py
Outdated
# Model class
class ResNet(nn.Module):
Please import ResNet from https://github.com/alibaba/FederatedScope/blob/master/federatedscope/contrib/model/resnet.py if there is no other concern.
In PR #267, a ResNet model was already added in federatedscope/contrib/model/resnet.py. Please check whether we still need to add a new ResNet.
Shell scripts for reproducing the results of standalone and FedAvg should be provided.
Please see the inline comments and keep the code consistent with the master branch.
Please add a new unit test for the new trainer and dataset
federatedscope/core/aggregator.py
Outdated
@@ -104,12 +104,37 @@ def _para_weighted_avg(self, models, recover_fun=None):
         return avg_model


-class NoCommunicationAggregator(Aggregator):
+class NoCommunicationAggregator(ClientsAvgAggregator):
@yxdyc Please have a look at this change to local mode, thanks.
# Split data into dict
data_dict = dict()

# Splitter
Please refer to the splitter.
# Build dict of Dataloader |
federatedscope/cl/model/SimCLR.py
Outdated
@@ -0,0 +1,235 @@
import torch
This file looks like a copy from https://github.com/akhilmathurs/orchestra/blob/main/models.py.
Please consider the copyright issues.
@@ -0,0 +1,41 @@
import torch
@@ -0,0 +1,190 @@
import math
To reproduce baselines, the sample_splitter is needed: https://github.com/akhilmathurs/orchestra/blob/228d7a6379b6788e7dc288d3a9557d62b940c47a/utils.py#L44
T.RandomResizedCrop(32, scale=(0.5, 1.0), interpolation=T.InterpolationMode.BICUBIC),
T.RandomHorizontalFlip(p=0.5),
T.ToTensor(),
T.Normalize((0.5, 0.5, 0.5), (0.5, 0.5, 0.5))
I am still very curious about why we have to use such a mean and std. If it is conventional usage in CL, please explain it for us. The ultimate image classification task does not use this transformation, right?
T.ToTensor() maps image values from [0, 255] to [0, 1], and T.Normalize(0.5, 0.5) then maps [0, 1] to [-1, 1] via (x - mean) / std. The data augmentation is already time-consuming, and computing the sample mean and std would take even more time.
splitter = get_splitter(config)
data_train = splitter(data_train)
data_val = data_train
data_test = splitter(data_test)
Although the original train and test data of CIFAR-10 are IID, how do we ensure that splitting them separately with our splitter keeps a specific client's train and test data identically distributed?
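One possible way to address this (a sketch of the idea, not the current implementation): draw the per-client, per-class Dirichlet proportions once, then apply the same proportions to both the train and the test pool, so that each client's train and test label distributions match by construction. All names and sizes below are illustrative.

```python
import random

def dirichlet(alpha, k, rng):
    """Symmetric Dirichlet(alpha) sample via normalized Gamma draws."""
    draws = [rng.gammavariate(alpha, 1.0) for _ in range(k)]
    total = sum(draws)
    return [d / total for d in draws]

rng = random.Random(42)
num_clients, num_classes = 3, 4
# Draw each class's split across clients ONCE...
props = [dirichlet(0.5, num_clients, rng) for _ in range(num_classes)]

def expected_counts(per_class_pool):
    """...then apply the same proportions to any pool (train or test)."""
    return [[props[y][c] * per_class_pool for y in range(num_classes)]
            for c in range(num_clients)]

train_counts = expected_counts(500)  # e.g. 500 train images per class
test_counts = expected_counts(100)   # e.g. 100 test images per class
# For every client c and class y, train_counts[c][y] == 5 * test_counts[c][y],
# i.e. each client's train and test label distributions coincide.
```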