Highlights:
1. Fix install issue in #387
2. Support exporting GGUF q4_0 and q4_1 formats in #393
3. Fix LLM command-line seqlen issue in #399
What's Changed
- fix a critical bug in static activation quantization by @wenhuach21 in #392
- vlm 70B+ on a single card by @n1ck-guo in #395
- enhance calibration dataset and add awq pre quantization warning by @wenhuach21 in #396
- support awq format for vlms by @WeiweiZhang1 in #398
- [critical bug] fix llm example seqlen issue by @WeiweiZhang1 in #399
- fix device auto issue by @wenhuach21 in #400
- Fix auto-round install & bump version to 0.4.4 by @XuehaoSun in #387
- fix dtype converting issue by @wenhuach21 in #403
- support for deepseek vl2 by @n1ck-guo in #401
- llm_layer_config_bugfix by @WeiweiZhang1 in #406
- support awq with qbits (sym only) by @wenhuach21 in #402
- support exporting gguf q4_0 and q4_1 formats by @n1ck-guo in #393
Full Changelog: v0.4.3...v0.4.4