Skip to content
This repository has been archived by the owner on Aug 30, 2024. It is now read-only.

[GPTQ Enhence] Support Mistral-GPTQ #144

Merged
merged 3 commits into from
Mar 4, 2024
Merged

Conversation

Zhenzhong1
Copy link
Contributor

Type of Change

Feature Add

Related PR: #140

Description

image

Expected Behavior & Potential Risk

N/A

How has this PR been tested?

Manually

Dependency Change?

N/A

@Zhenzhong1 Zhenzhong1 marked this pull request as ready for review March 1, 2024 04:21
@Zhenzhong1 Zhenzhong1 requested a review from a32543254 March 1, 2024 04:21
Copy link
Contributor

@a32543254 a32543254 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@a32543254
Copy link
Contributor

could you also change here for permute func ?
https://github.com/intel/neural-speed/blob/main/neural_speed/convert/convert_mistral.py

@VincyZhang VincyZhang merged commit 96dc559 into main Mar 4, 2024
11 checks passed
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants