[Neural Speed] Improvements to run.py script #87

aahouzi · 2024-01-23T13:13:53Z

Type of Change

Tested with same commands provided in the README file, on diverse models requiring token access ID: llama2-7b, llama2-13b, llama2-70b. The script completes its executions as expected.

huggingface_hub package, but I guess this is already a dependency of transformers ?

scripts/inference.py

aahouzi added 6 commits January 22, 2024 02:53

snapshot_download weights

694fcf9

Merge branch 'intel:main' into main

a2e178c

No need for condition (Neural Speed is built as Python package)

699b20d

Support model weights download

8aaba86

Fix some typos

77a423e

Add support for convert.py

40d2c84

kevinintel requested review from Zhenzhong1 and zhenwei-intel and removed request for Zhenzhong1 January 26, 2024 10:00

Zhenzhong1 requested review from a32543254 and VincyZhang January 29, 2024 05:12

Zhenzhong1 reviewed Jan 29, 2024

View reviewed changes

scripts/inference.py Outdated Show resolved Hide resolved

Seperate concerns (PR intel#92 solves it)

b13074d

aahouzi requested a review from Zhenzhong1 January 29, 2024 10:24

Zhenzhong1 approved these changes Jan 29, 2024

View reviewed changes

aahouzi closed this Jan 31, 2024

aahouzi reopened this Jan 31, 2024

VincyZhang merged commit 33ffaf0 into intel:main Feb 21, 2024
5 checks passed