
R1 loss - called with all params #3

Open
moshebeutel opened this issue Sep 7, 2024 · 2 comments
Comments

@moshebeutel

Hi, reading your paper I understood that the first custom loss is regularized to minimize the change in the personalized parameters. However, at line 112 in ours.py the diffs are computed using all of model.parameters() and w_glob, not just the personalized ones. At the end of each iteration only the personalized parameters are updated, but I don't think this is equivalent to computing the diff over only the personalized parameters, as described in the paper. Am I missing something?
Thanks 🙏 Moshe

@moshebeutel
Author

The same holds for the second loss and the shared parameters.

@xiyuanyang45
Owner

xiyuanyang45 commented Sep 10, 2024

Is there an alternative way to compute the update over only a selected subset of the parameters?
I think the current code is a workable implementation of the algorithm, but I welcome discussion of other possible implementations.
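One possible alternative, sketched below under the assumption that the model is a PyTorch `nn.Module` and that `w_glob` is a state-dict-like mapping from parameter names to tensors: filter by parameter name via `named_parameters()` so the squared-difference term only covers the personalized subset. The helper name `partial_l2_diff` and the name set `personalized_keys` are hypothetical, not from the repo.

```python
import torch
import torch.nn as nn

def partial_l2_diff(model: nn.Module, w_glob: dict, personalized_keys: set) -> torch.Tensor:
    """Sum of squared differences to the global weights, computed only
    over parameters whose name is in `personalized_keys` (hypothetical
    sketch, not the repo's actual ours.py code)."""
    diff = torch.zeros(())
    for name, param in model.named_parameters():
        if name in personalized_keys:
            diff = diff + torch.sum((param - w_glob[name]) ** 2)
    return diff

# Tiny usage example with a stand-in two-layer model: treat the second
# layer as "personalized" and snapshot the current weights as w_glob.
model = nn.Sequential(nn.Linear(4, 3), nn.Linear(3, 2))
w_glob = {name: p.detach().clone() for name, p in model.named_parameters()}
personalized = {name for name in w_glob if name.startswith("1.")}

loss = partial_l2_diff(model, w_glob, personalized)
```

With this filtering, perturbing a shared parameter leaves the loss unchanged, while perturbing a personalized one increases it, which matches the paper's description of regularizing only the personalized change.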
