feat: pass model parameters to HFLocalInvocationLayer via model_kwargs, enabling direct model usage #4956
Conversation
assert isinstance(layer.pipe.tokenizer, T5TokenizerFast)
...
@pytest.mark.integration
This should be mocked and made a unit test.
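For context, a minimal sketch of what the mocked unit-test variant could look like; the patch target, the HFLocalInvocationLayer import path, and the constructor signature are assumptions based on this diff, not verbatim from the PR:

```python
import pytest
from unittest.mock import MagicMock, patch

from transformers import T5TokenizerFast

from haystack.nodes.prompt.invocation_layer import HFLocalInvocationLayer


@pytest.mark.unit
@patch("haystack.nodes.prompt.invocation_layer.hugging_face.pipeline")
def test_pipe_uses_t5_tokenizer(mock_pipeline):
    # A MagicMock built with spec=T5TokenizerFast passes isinstance checks,
    # so the original assertion still holds without downloading a model.
    mock_pipeline.return_value.tokenizer = MagicMock(spec=T5TokenizerFast)
    layer = HFLocalInvocationLayer(model_name_or_path="google/flan-t5-base")
    assert isinstance(layer.pipe.tokenizer, T5TokenizerFast)
```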
Took another pass
Thanks, learned a ton! LMK if there are any additional changes needed.
🚢
Related Issues
Problem:
As large language model (LLM) development continues to advance, there has been a notable shift towards custom torch architectures that are not yet supported in Hugging Face transformers. This trend poses a challenge for our existing setup, which relies heavily on the transformers pipeline approach. Take, for example, the open-source MPT models. Because the custom MPT architecture is not yet part of transformers, and because it includes many custom architectural approaches, ranging from FlashAttention and ALiBi to QK LayerNorm, one needs to create an MPT model "by hand". To complicate things, some models, MPT among them, use tokenizers borrowed from other existing models.
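For illustration, a minimal sketch of that hand-rolled setup (the model names, dtype, and trust_remote_code approach are illustrative assumptions, not part of this PR's diff):

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# MPT's architecture is not in transformers, so the model code is loaded
# from the model repository itself via trust_remote_code=True.
model = AutoModelForCausalLM.from_pretrained(
    "mosaicml/mpt-7b",  # illustrative; any custom-architecture checkpoint
    torch_dtype=torch.bfloat16,
    trust_remote_code=True,
)

# MPT ships no tokenizer of its own; it reuses the GPT-NeoX tokenizer.
tokenizer = AutoTokenizer.from_pretrained("EleutherAI/gpt-neox-20b")
```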
This custom-architecture trend is only likely to accelerate, racing ahead of transformers.
Proposed Changes:
This PR proposes directly integrating Hugging Face models with PromptNode via the model parameter in model_kwargs, allowing for more flexibility and compatibility with rapidly evolving LLM architectures.
Assuming the above-mentioned model and tokenizer have already been created, our users would simply create a PromptNode like this:
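A sketch of the intended usage, reusing the model and tokenizer objects from the snippet above; the exact model_kwargs keys are my reading of this PR's diff, so treat them as assumptions:

```python
from haystack.nodes import PromptNode

# The pre-built model and tokenizer are forwarded through model_kwargs to
# HFLocalInvocationLayer, which passes them on to the transformers pipeline.
prompt_node = PromptNode(
    model_name_or_path="mosaicml/mpt-7b",
    model_kwargs={"model": model, "tokenizer": tokenizer},
)

print(prompt_node("What is a large language model?"))
```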
Key Benefits:
Enhanced flexibility: this minor change allows us to incorporate the latest LLMs into the Haystack ecosystem, including those based on custom torch architectures, increasing our ability to stay current with LLM advancements.
Future-proof: as LLM research outpaces transformers, this change ensures PromptNode is prepared to adapt quickly to new developments.
How did you test it?
New unit tests, manual tests, and a custom demo Colab
Notes for the reviewer
Checklist
fix:, feat:, build:, chore:, ci:, docs:, style:, refactor:, perf:, test:.