This repository has been archived by the owner on Aug 30, 2024. It is now read-only.

Add support for large prompts that don't fit in cmd line #133

Merged
merged 6 commits into from
Mar 14, 2024

Conversation

aahouzi
Member

@aahouzi aahouzi commented Feb 20, 2024

Type of Change

  • Testing the prompt evaluation phase requires a large number of tokens (e.g., 1000+), which usually doesn't fit on the command line. This PR allows the user to provide large prompts via a txt file, just like llama.cpp.

Description

  • This PR adds support for large prompts via txt files, just like llama.cpp. It is a useful feature for testing Neural Speed during the prompt evaluation phase.

Expected Behavior & Potential Risk

  • N/A

How has this PR been tested?

# Run script
NEURAL_SPEED_VERBOSE=1 python scripts/run.py <huggingface-llama2> --weight_dtype int4 --compute_dtype int8 --group_size -1 -f summarize-keynote-2105-tokens.txt --ctx_size 2109

# Inference script
NEURAL_SPEED_VERBOSE=1 python scripts/inference.py --model_name llama2 -m llama_files/ne_llama_int4.bin -n 512 -f summarize-keynote-2105-tokens.txt --ctx_size 2109
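The `-f` option above mirrors llama.cpp's file-based prompt input. A minimal sketch of how such an option can be wired up with `argparse` is shown below; the flag names match the commands above, but the function names (`build_parser`, `resolve_prompt`) are illustrative, not Neural Speed's actual API.

```python
import argparse

def build_parser():
    # Accept the prompt either inline (-p) or from a text file (-f),
    # but not both at once.
    parser = argparse.ArgumentParser(description="Prompt input sketch")
    group = parser.add_mutually_exclusive_group(required=True)
    group.add_argument("-p", "--prompt", help="prompt given inline on the command line")
    group.add_argument("-f", "--file", help="path to a .txt file holding a large prompt")
    return parser

def resolve_prompt(args):
    # Prefer the file if given, since large prompts exceed command-line limits.
    if args.file:
        with open(args.file, encoding="utf-8") as fh:
            return fh.read()
    return args.prompt
```

Reading the prompt from a file sidesteps shell argument-length limits (e.g., `ARG_MAX` on Linux), which is why multi-thousand-token prompts are passed this way.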

Dependency Change?

  • N/A

@aahouzi
Member Author

aahouzi commented Mar 13, 2024

@zhenwei-intel I think this PR is ready to be merged. Can you please complete the review?

@hshen14 hshen14 merged commit e76a58e into intel:main Mar 14, 2024
6 checks passed
@aahouzi aahouzi deleted the large_prompt_feat branch March 17, 2024 12:42