ENH: Enable OFT adapter for mixed adapter models #1204
Conversation
This PR makes it possible to use the newly added OFT adapter in mixed adapter type models, similar to LoRA, LoHa, etc.

Notes

Adding the integration was pretty straightforward, which is a good sign.

The difficult part was actually about the tests. This stems from the fact that OFT is (if my understanding is correct) never commutative. What I mean is that even if the adapters are applied to the last layer of a model, it makes a difference whether we apply, say, first LoRA, then OFT vs first OFT, then LoRA (see the sketch below).

This is different for the other adapters that were added so far for mixed models, as they basically do:

- Xa = X + dXa
- Xab = Xa + dXb = X + dXa + dXb = X + dXb + dXa = Xb + dXa = Xba

IIUC, this is not true for OFT, so when OFT is used, I had to ensure that no test was applied that (implicitly) assumes commutativity. Ping @okotaku and @lukaskuhn-lku: is my understanding correct?

Furthermore, I had to increase the model size, see this comment: huggingface#1160 (comment)
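For illustration, here is a minimal sketch (not code from this PR) of why additive adapter updates commute while an OFT-style orthogonal rotation does not. The matrices are random stand-ins, not actual PEFT adapter weights:

```python
import torch

torch.manual_seed(0)

W = torch.randn(4, 4)          # base weight
dA = torch.randn(4, 4) * 0.1   # additive update, e.g. a LoRA delta B @ A
dB = torch.randn(4, 4) * 0.1   # a second additive update
Q, _ = torch.linalg.qr(torch.randn(4, 4))  # orthogonal matrix, OFT-style rotation

# Additive updates commute: the order of application does not matter.
print(torch.allclose((W + dA) + dB, (W + dB) + dA))  # True

# Mixing an additive update with an orthogonal rotation does not commute:
# "LoRA then OFT" rotates the LoRA delta as well, "OFT then LoRA" does not.
lora_then_oft = Q @ (W + dA)
oft_then_lora = (Q @ W) + dA
print(torch.allclose(lora_then_oft, oft_then_lora))  # False in general
```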
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint.
No point in including "Config".
On CPU, apparently higher tolerance is required than on GPU.
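As a rough sketch of what that means in a test (this is not the PR's actual test code, and the tolerance values are placeholders), the comparison on CPU gets a looser atol/rtol than on GPU:

```python
import torch

def assert_close(actual, expected, device):
    # Looser tolerances on CPU, where larger numerical deviations were observed.
    atol, rtol = (1e-3, 1e-3) if device == "cpu" else (1e-5, 1e-5)
    assert torch.allclose(actual, expected, atol=atol, rtol=rtol)
```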
As far as my understanding of OFT goes, your assumption about commutativity is correct.
@BenjaminBossan It's correct.
Thanks @BenjaminBossan!
Thanks @lukaskuhn-lku and @okotaku for your input, and @younesbelkada for the review.
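For reference, a rough usage sketch of what this PR enables, assuming the PEFT mixed-model API (`get_peft_model(..., mixed=True)`); the toy model, target module names, and config values below are placeholders chosen for illustration:

```python
import torch
import torch.nn as nn
from peft import get_peft_model, LoraConfig, OFTConfig

# Toy base model; "lin0"/"lin1" and the layer sizes are placeholders.
class MLP(nn.Module):
    def __init__(self):
        super().__init__()
        self.lin0 = nn.Linear(32, 32)
        self.lin1 = nn.Linear(32, 2)

    def forward(self, x):
        return self.lin1(torch.relu(self.lin0(x)))

base_model = MLP()
lora_config = LoraConfig(r=8, target_modules=["lin0"])
oft_config = OFTConfig(r=8, target_modules=["lin0"])

# mixed=True returns a PeftMixedModel that can hold adapters of different types.
model = get_peft_model(base_model, lora_config, adapter_name="lora", mixed=True)
model.add_adapter("oft", oft_config)

# Activate both adapters; since OFT is not commutative, the order in which the
# adapters are applied affects the output.
model.set_adapter(["lora", "oft"])
```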