Add more models and datasets from external packages. #42

rayrayraykk · 2022-04-27T12:04:11Z

Add pre-trained transformers to NLP models.
Add datasets from hugging face.
add datasets from openml.

…fic.

joneswong

LGTM. Please help us review the usage of Transformer and the datasets (e.g., SST-2)? @yxdyc Thanks!

yxdyc

Good job, plz see the inline comments.

yxdyc · 2022-05-07T02:29:02Z

enviroment/docker_files/federatedscope-torch1.8-application.Dockerfile

@@ -42,11 +42,13 @@ RUN conda install -y pytorch==1.8.0 torchvision==0.9.0 torchaudio==0.8.0 cudatoo
 # for graph
 RUN conda install -y pyg==2.0.1 -c pyg  \
    && conda install -y rdkit=2021.09.4 -c conda-forge \
+    && conda install -y nltk \


nltk should be put at the back of the line (the NLP part below)?

NLTK is used for generating features of some graph datasets.

yxdyc · 2022-05-07T02:45:27Z

federatedscope/nlp/baseline/fedavg_bert_on_sst2.yaml

+  local_update_steps: 1
+  total_round_num: 400
+  batch_or_epoch: 'epoch'
+  client_num: 1


why this baseline uses only 1 client?

yxdyc · 2022-05-07T02:54:11Z

federatedscope/core/auxiliaries/data_builder.py

    DATA_LOAD_FUNCS = {
        'torchvision': load_torchvision_data,
        'torchtext': load_torchtext_data,
        'torchaudio': load_torchaudio_data,
-        'torch_geometric': load_torch_geometric_data
+        'torch_geometric': load_torch_geometric_data,
+        'datasets': load_datasets_data,


the name "datasets" is too general, how about huggingface_datasets

yxdyc · 2022-05-07T02:59:57Z

federatedscope/nlp/baseline/fedavg_bert_on_sst2.yaml

+federate:
+  mode: standalone
+  local_update_steps: 1
+  total_round_num: 400


In centralized mode, the fine-tuning often takes only a few epochs. Maybe we can set the total_round_num to be in the order of dozens, e.g., 40?

Add more models and datasets from external packages.

Add pre-trained transformers as NLP model.

eba979f

rayrayraykk requested review from joneswong and yxdyc April 27, 2022 12:04

rayrayraykk added the Feature New feature label Apr 27, 2022

rayrayraykk added 9 commits April 28, 2022 11:55

TODO:@ZHEN, please fix online aggregator when the device is not speci…

c8ba16e

…fic.

Add a example for transformers.

f2611f3

Merge branch 'github-master' into transformers

d505fcf

Fix minor bugs

5bbee1e

merge master

5fbe43e

Add datasets from hugging face.

8b72cdb

Formatted and fix minor bugs.

a0a1719

Add datasets and scripts for openml.

c04163b

Modify the example yaml of openml datasets.

94f93db

rayrayraykk changed the title ~~Add pre-trained transformers to NLP model.~~ Add more models and datasets from external packages. May 6, 2022

joneswong previously approved these changes May 6, 2022

View reviewed changes

yxdyc reviewed May 7, 2022

View reviewed changes

merge master

d6bab16

rayrayraykk dismissed joneswong’s stale review via d6bab16 May 7, 2022 08:08

rename and modify some val

a74114c

yxdyc approved these changes May 7, 2022

View reviewed changes

yxdyc merged commit 9828965 into alibaba:master May 7, 2022

rayrayraykk deleted the transformers branch May 13, 2022 07:04

AnthonyXuan pushed a commit to AnthonyXuan/FederatedScope that referenced this pull request Aug 10, 2023

Merge pull request alibaba#42 from rayrayraykk/transformers

a3ec959

Add more models and datasets from external packages.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add more models and datasets from external packages. #42

Add more models and datasets from external packages. #42

rayrayraykk commented Apr 27, 2022 •

edited

Loading

joneswong left a comment

yxdyc left a comment

yxdyc May 7, 2022

rayrayraykk May 7, 2022

yxdyc May 7, 2022

yxdyc May 7, 2022

yxdyc May 7, 2022

Add more models and datasets from external packages. #42

Add more models and datasets from external packages. #42

Conversation

rayrayraykk commented Apr 27, 2022 • edited Loading

joneswong left a comment

Choose a reason for hiding this comment

yxdyc left a comment

Choose a reason for hiding this comment

yxdyc May 7, 2022

Choose a reason for hiding this comment

rayrayraykk May 7, 2022

Choose a reason for hiding this comment

yxdyc May 7, 2022

Choose a reason for hiding this comment

yxdyc May 7, 2022

Choose a reason for hiding this comment

yxdyc May 7, 2022

Choose a reason for hiding this comment

rayrayraykk commented Apr 27, 2022 •

edited

Loading