Skip to content
This repository was archived by the owner on Aug 30, 2024. It is now read-only.

[GPTQ Enhence] Support GPTQ for Baichuan2-13B & Falcon 7B & Phi-1.5 #169

Merged
merged 14 commits into from
Mar 15, 2024

Conversation

Zhenzhong1
Copy link
Contributor

@Zhenzhong1 Zhenzhong1 commented Mar 13, 2024

Type of Change

New Feature

Description

Expected Behavior & Potential Risk

N/A

How has this PR been tested?

Baichuan2-13b
image

falcon7b
image

phi-1-5
image

Dependency Change?

N/A

@Zhenzhong1 Zhenzhong1 changed the title [GPTQ Enhence] Support GPTQ & AWQ inference for Baichuan2-13B-Chat-GPTQ [GPTQ Enhence] Support GPTQ inference for Baichuan2-13B-Chat-GPTQ Mar 13, 2024
@Zhenzhong1 Zhenzhong1 changed the title [GPTQ Enhence] Support GPTQ inference for Baichuan2-13B-Chat-GPTQ [GPTQ Enhence] Support GPTQ inference for Baichuan2-13B & Falcon 7B & Falcon 40B Mar 14, 2024
@Zhenzhong1 Zhenzhong1 changed the title [GPTQ Enhence] Support GPTQ inference for Baichuan2-13B & Falcon 7B & Falcon 40B [GPTQ Enhence] Support GPTQ inference for Baichuan2-13B & Falcon 7B/40B Mar 14, 2024
@Zhenzhong1 Zhenzhong1 changed the title [GPTQ Enhence] Support GPTQ inference for Baichuan2-13B & Falcon 7B/40B [GPTQ Enhence] Support GPTQ inference for Baichuan2-13B & Falcon 7B Mar 14, 2024
@Zhenzhong1 Zhenzhong1 changed the title [GPTQ Enhence] Support GPTQ inference for Baichuan2-13B & Falcon 7B [GPTQ Enhence] Support GPTQ for Baichuan2-13B & Falcon 7B Mar 14, 2024
@Zhenzhong1 Zhenzhong1 changed the title [GPTQ Enhence] Support GPTQ for Baichuan2-13B & Falcon 7B [GPTQ Enhence] Support GPTQ for Baichuan2-13B & Falcon 7B & Phi-1.5 Mar 14, 2024
# Conflicts:
#	neural_speed/convert/convert_quantized_bloom.py
Copy link
Contributor

@a32543254 a32543254 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@Zhenzhong1 Zhenzhong1 requested a review from VincyZhang March 15, 2024 02:14
@VincyZhang VincyZhang merged commit eed9b30 into main Mar 15, 2024
11 checks passed
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants