FedGlobalContrast and FedSimCLR baseline #354

Merged: 56 commits into alibaba:master on Nov 8, 2022

Conversation

@xkxxfyf (Contributor) commented on Aug 30, 2022

Current problem: computing the global contrastive loss at the epoch level is slow because of the epoch's data size, but computing it at the batch level incurs more communication cost.
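To make the trade-off concrete, here is a purely illustrative back-of-envelope sketch (all sizes below are assumptions, not measurements from this PR): batch-level exchange means many small communication rounds per epoch, while epoch-level exchange means one large transfer but a slow loss computation over the whole epoch's embeddings.

```python
# Purely illustrative estimate of what one client exchanges per epoch.
# All numbers are assumptions for illustration, not measurements from this PR.
num_samples = 50_000          # assumed local training-set size
batch_size = 256
embedding_dim = 128
bytes_per_float = 4

num_batches = num_samples // batch_size
per_batch_payload = batch_size * embedding_dim * bytes_per_float   # bytes uploaded per step
per_epoch_payload = num_samples * embedding_dim * bytes_per_float  # bytes uploaded once per epoch

print(f"batch-level:  {num_batches} communication rounds, "
      f"{per_batch_payload / 1e6:.2f} MB each")
print(f"epoch-level:  1 communication round, "
      f"{per_epoch_payload / 1e6:.2f} MB total")
```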

@rayrayraykk added the Feature (New feature) label on Aug 31, 2022
@joneswong self-assigned this on Aug 31, 2022
# Split data into dict
data_dict = dict()
splitter = get_splitter(config)
data_train = splitter(data_train)
Collaborator

Splitting data_train and data_test separately cannot ensure the i.i.d. property between the train and test sets of the same client.

Collaborator

@rayrayraykk any suggestions on modifying the splitter so that it can split (data1, data2, ...) with the same categorical distributions?

Collaborator

We already have an argument to control the distribution. See:

splitter(dataset[split], prior=train_label_distribution)

Contributor Author

I will add a parameter to decide whether or not to use the same splitter for training and evaluation.

Collaborator

Keep the train/val/test splits i.i.d.:

def __call__(self, dataset, prior=None):

for idx in range(1, self._cfg.federate.client_num + 1)
}
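A minimal sketch of the suggested behavior (the helper name and the way the per-client label lists are derived are assumptions, not the PR's actual code): split the training data first, collect each client's labels, and reuse them as the prior when splitting the test data, so each client's train/test pair keeps the same categorical distribution.

```python
def split_train_test_consistently(splitter, data_train, data_test):
    # Split the training data first.
    train_splits = splitter(data_train)

    # Collect each client's training labels (assuming samples are (x, y) pairs).
    train_label_distribution = [
        [y for _, y in client_data] for client_data in train_splits
    ]

    # Reuse the same per-client label prior when splitting the test data,
    # so the train and test of each client stay identically distributed.
    test_splits = splitter(data_test, prior=train_label_distribution)
    return train_splits, test_splits
```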

def _register_default_handlers(self):
Collaborator

Is it possible to call the base class's method to execute the first four registrations?
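A minimal sketch of that suggestion (the message type and callback name below are illustrative, not the PR's exact identifiers): call the base class to get the default registrations, then add only the handlers specific to global contrastive training.

```python
def _register_default_handlers(self):
    # Reuse the registrations already performed by the base worker class
    # instead of repeating them line by line.
    super()._register_default_handlers()

    # Register only the handler that is specific to this worker,
    # e.g. the one receiving the global contrastive loss.
    self.register_handlers('global_loss', self.callback_funcs_for_global_loss)
```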

if other_client_id != client_id]
# print("start cal loss")
self.loss_list[client_id] = global_loss_fn(z1, z2, others_z2)
print(self.loss_list[client_id])
Collaborator

no print!

Contributor Author

it's deleted
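For context, a minimal sketch of what a server-side global contrastive loss of this shape could look like, with other clients' embeddings acting as extra negatives and the debug output going through logging instead of print (function and variable names are assumptions for illustration, not the PR's implementation of global_NT_xentloss):

```python
import logging

import torch
import torch.nn.functional as F

logger = logging.getLogger(__name__)


def global_nt_xent_loss(z1, z2, others_z2=(), temperature=0.5):
    """NT-Xent-style loss where other clients' embeddings serve as extra negatives."""
    z1 = F.normalize(z1, dim=1)   # local view 1, shape (B, D)
    z2 = F.normalize(z2, dim=1)   # local view 2, shape (B, D)
    if len(others_z2) > 0:
        negatives = torch.cat([F.normalize(z, dim=1) for z in others_z2], dim=0)
    else:
        negatives = z1.new_zeros((0, z1.size(1)))

    pos = torch.sum(z1 * z2, dim=1, keepdim=True) / temperature   # (B, 1)
    neg = z1 @ negatives.t() / temperature                        # (B, N_neg)

    logits = torch.cat([pos, neg], dim=1)                         # (B, 1 + N_neg)
    labels = torch.zeros(z1.size(0), dtype=torch.long, device=z1.device)
    loss = F.cross_entropy(logits, labels)

    logger.debug("global contrastive loss: %.4f", loss.item())    # no print statements
    return loss
```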

GlobalContrastFL (Fedgc) Client: receives the aggregated model weights from the server and updates the local
weights; it also receives the global loss from the server to train the model and update the weights locally.
"""
def _register_default_handlers(self):
Collaborator

Is it possible to call the base class's method to avoid repeating several of these lines?

round, sender, content = message.state, message.sender, message.content
global_loss = content['global_loss']
model_para = self.trainer.train_with_global_loss(global_loss)
self.trainer.update(model_para)
Collaborator

It is extremely confusing to update the local model with a state_dict produced by itself.

Contributor Author

it will be deleted
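A minimal sketch of the simplified callback after dropping the redundant self-update (only the names quoted above are taken from the PR; the rest is illustrative): let the trainer perform the local step with the received global loss, rather than feeding the resulting state_dict back through update().

```python
def callback_funcs_for_global_loss(self, message):
    # Unpack the server message carrying the aggregated global contrastive loss.
    round, sender, content = message.state, message.sender, message.content
    global_loss = content['global_loss']

    # The trainer updates the model in place during this call, so there is
    # no need to re-apply its own state_dict via self.trainer.update().
    self.trainer.train_with_global_loss(global_loss)
```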

@joneswong (Collaborator) left a comment

1. It is OK to alternate between local and global updates. However, we suggested enabling a combination of these losses in a single gradient-descent step, and it seems this implementation still fails to do that.
2. There are still many issues remaining.
3. This PR would not pass a linter check, IMO, right? @rayrayraykk

The quality of this PR is exceptionally low. Please complete it ASAP.
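A minimal sketch of what combining the two losses in a single gradient-descent step could look like (the weighting factor and function name are assumptions for illustration, not the PR's code):

```python
def combined_update(optimizer, local_loss, global_loss, global_loss_weight=1.0):
    """Apply the local SimCLR loss and the global contrastive loss in one step.

    Both losses are assumed to be scalar tensors still attached to the
    computation graph of the current local model.
    """
    optimizer.zero_grad()
    total_loss = local_loss + global_loss_weight * global_loss
    total_loss.backward()
    optimizer.step()
    return total_loss.detach()
```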

@rayrayraykk (Collaborator), replying to the comment above:

Yes, the unit-test provided still does not work.

@rayrayraykk (Collaborator) left a comment

Run pre-commit run --all-files and fix the formatting issues before merging. Also, please see the inline comments.

Once done, please remove [WIP] from the title of this PR.

import numpy as np
import torch.nn as nn
import torch.nn.functional as F
import networkx as nx
Collaborator

delete if never used

Contributor Author

sure

@@ -0,0 +1,105 @@
import torch
import numpy as np
Collaborator

delete if never used

Contributor Author

sure


return loss

# def compute_global_NT_xentloss(z1, z2, others_z2=[], temperature=0.5, device='cpu'):
Collaborator

delete if never used

Contributor Author

sure

from federatedscope.core.workers.server import Server
from federatedscope.core.auxiliaries.utils import merge_dict
from federatedscope.cl.fedgc.utils import global_NT_xentloss
from torchviz import make_dot, make_dot_from_trace
Collaborator

Please add torchviz to the [cl] dependency group; as it stands, I can't run CL with the minimal version of FS.

Contributor Author

It's deleted in the new PR.
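For reference, a minimal sketch of how the suggested optional dependency could have been declared (the setup.py fragment below is hypothetical, not FederatedScope's actual packaging file):

```python
# Hypothetical setup.py fragment declaring torchviz under the [cl] extra,
# installable via: pip install federatedscope[cl]
from setuptools import setup

setup(
    name='federatedscope',
    extras_require={
        'cl': ['torchviz'],
    },
)
```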

return data, modified_config


register_data("Cifar4CL", load_cifar_dataset)
Collaborator

Since you use register_data, your function should take two args:
def load_cifar_dataset(config, client_cfgs=None):

@xkxxfyf (Contributor Author) commented on Oct 28, 2022

I have deleted register_data and finished the unit test.
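For reference, a minimal sketch of the two-argument loader signature the reviewer asked for (assuming register_data is imported from federatedscope.register; the function body is an illustrative placeholder, not the PR's code):

```python
from federatedscope.register import register_data


def load_cifar_dataset(config, client_cfgs=None):
    # Build and split the CIFAR dataset according to `config`
    # (loading details omitted; this only illustrates the expected signature).
    data = ...
    modified_config = config
    return data, modified_config


register_data("Cifar4CL", load_cifar_dataset)
```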

@xkxxfyf changed the title from "[WIP] FedGlobalContrast and FedSimCLR baseline" to "FedGlobalContrast and FedSimCLR baseline" on Oct 31, 2022
@xkxxfyf requested a review from joneswong on October 31, 2022 at 11:52
@joneswong (Collaborator) left a comment

The functionalities have been validated. However, the performance of FedSimCLR cannot be reproduced exactly. We annotate this as a TODO.

@joneswong merged commit 94e0d97 into alibaba:master on Nov 8, 2022
Labels: Feature (New feature)
Participants: 3