🧞 Fix genai exporter arguments bug and add Genai phi2 example #1061
Conversation
```python
@staticmethod
def is_accelerator_agnostic(accelerator_spec: AcceleratorSpec) -> bool:
    return False
```
Basically, hardware dependent means the pass needs to create an InferenceSession and run on the target.
Yes, like Mike says, a search point can be invalid for a particular EP while the pass is accelerator agnostic.
A pass is only non-agnostic if its output changes based on the accelerator.
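The rule described above can be sketched as follows. This is a hypothetical illustration, not Olive's actual class hierarchy: `AcceleratorSpec`, `PassBase`, and `GenAIModelExporterLike` are stand-in names, and only the `is_accelerator_agnostic` signature is taken from the diff in this PR.

```python
from dataclasses import dataclass


@dataclass
class AcceleratorSpec:
    accelerator_type: str    # e.g. "cpu" or "gpu"
    execution_provider: str  # e.g. "CPUExecutionProvider"


class PassBase:
    @staticmethod
    def is_accelerator_agnostic(accelerator_spec: AcceleratorSpec) -> bool:
        # Default: the pass produces the same output for every accelerator.
        return True


class GenAIModelExporterLike(PassBase):
    @staticmethod
    def is_accelerator_agnostic(accelerator_spec: AcceleratorSpec) -> bool:
        # The exporter's output differs per EP (fp16 inputs for CUDA int4,
        # fp32 for CPU int4), so it must report non-agnostic.
        return False
```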
Actually, line 81 depends on the `accelerator_spec`, so it satisfies the requirement.
It seems we need an extra pass config like `target_ep`. Then `is_accelerator_agnostic` can be True.
Synced with Mike: the `accelerator_spec` here is the target device, so the current change is fine.
For the genai exporter, the output will be different for different EPs.
For CUDA int4, the input shape is changed to fp16, but for CPU int4, it is still fp32.
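The EP-dependent behavior described above can be summarized in a small sketch. `input_dtype` is a hypothetical helper (not part of onnxruntime-genai or Olive) that just encodes the rule stated in this thread:

```python
def input_dtype(precision: str, execution_provider: str) -> str:
    """Input dtype chosen by the genai exporter, per the discussion above:
    int4 on CUDA uses fp16 inputs; int4 on CPU keeps fp32 inputs."""
    if precision == "int4" and execution_provider == "CUDAExecutionProvider":
        return "fp16"
    return "fp32"
```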
Please make sure the search point is always an Enum. If there is any possibility of using a str for `precision`, we'd better add
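The suggestion above (keep `precision` search points as an Enum while still tolerating the string values this PR fixes) can be sketched with a str-valued Enum. The `Precision` class below is a hypothetical illustration, not the actual Olive type:

```python
from enum import Enum


class Precision(str, Enum):
    """Str-valued Enum: members compare equal to their string form,
    so both Precision.INT4 and the raw string "int4" are accepted."""
    INT4 = "int4"
    FP16 = "fp16"
    FP32 = "fp32"
```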
Describe your changes
With the latest onnxruntime-genai, there are a few bugs:
a. `precision` can only accept string values (int4, fp16, fp32) but not the Enum
b. `execution_provider` accepts the EP value but not the device value
c. the `GenAIModelExporter` pass is not EP agnostic. For CUDA int4, the input shape is changed to fp16, but for CPU int4, it is still fp32.

Also adds a phi2 example.
Checklist before requesting a review
`lintrunner -a`
(Optional) Issue link