Hi Wafaa!
Currently, stopes is focused only on translation models, and ALTI+ has been implemented only for seq2seq transformers such as NLLB. We are not planning to adapt ALTI+ to other architectures at the moment.
If I learn that colleagues working with LLMs implement ALTI+ or a similar attribution method for their models, I will update this thread accordingly.
Otherwise, you might consider contributing such an adaptation yourself. LLaMA is a decoder-only transformer, so the part of the ALTI+ code responsible for the decoder can in principle be adapted to it. Of course, the adaptation will depend on the chosen framework and on the exact implementation details of the LLM code; a rough sketch of the idea is below.
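For illustration only, here is a minimal sketch of what such an adaptation could start from. It is not the stopes ALTI+ implementation: it assumes a HuggingFace LLaMA-style causal LM (the checkpoint name is just a placeholder) and uses plain attention rollout over the decoder's self-attention maps as a crude stand-in for ALTI's layer-wise contribution aggregation. A faithful port would also need to decompose the attention-block outputs through the value projections, residual connections, and layer norms, as ALTI+ does for the NLLB decoder.

```python
# Sketch: attention-rollout attribution for a decoder-only (LLaMA-style) model.
# NOT the stopes ALTI+ code; the checkpoint name below is a placeholder.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "meta-llama/Llama-2-7b-hf"  # placeholder: any LLaMA-style causal LM
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(
    model_name,
    attn_implementation="eager",  # needed so attention weights are returned
)
model.eval()

inputs = tokenizer("The quick brown fox jumps over the lazy dog", return_tensors="pt")

with torch.no_grad():
    outputs = model(**inputs, output_attentions=True)

# outputs.attentions: one (batch, heads, seq, seq) tensor per decoder layer.
rollout = None
for layer_attn in outputs.attentions:
    attn = layer_attn.mean(dim=1)[0]                      # average heads -> (seq, seq)
    attn = attn + torch.eye(attn.size(0), dtype=attn.dtype)  # account for residual stream
    attn = attn / attn.sum(dim=-1, keepdim=True)          # re-normalize rows
    rollout = attn if rollout is None else attn @ rollout  # compose layers bottom-up

# rollout[i, j] approximates how much input token j contributes to position i.
tokens = tokenizer.convert_ids_to_tokens(inputs["input_ids"][0])
for tok, score in sorted(zip(tokens, rollout[-1].tolist()), key=lambda x: -x[1]):
    print(f"{tok:>12s}  {score:.3f}")
```

The printed scores rank source tokens by their (rolled-out) contribution to the last position; replacing the head-averaged attention with ALTI's norm-based token contributions is the part that would need to be ported from the existing decoder code in stopes.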
Can you offer support for the ALTI attribution method for LLMs such as LLaMA?