This repository includes the official pytorch implementation of Set AutoRregressive Modeling (SAR), presented in our paper:
Customize Your Visual Autoregressive Recipe with Set Autoregressive Modeling
Wenze Liu, Le Zhuo, Yi Xin, Sheng Xia, Peng Gao, Xiangyu Yue
MMLab, CUHK & Shanghai AI Lab & Nanjing University
Currently we are working to organize the code.
- [2024.10.15] arXiv preprint is available.
Welcome to Set AutoRegressive Modeling (SAR)! SAR extends causal learning from next-token prediction to the next-set setting. We show that AR and MAR are unified under the SAR paradigm with special choices of sequence order and output intervals. Further, a seamless pathway between AR and MAR is built by manipulating the order and intervals, where models trained in the transition states enjoy both merits of AR and MAR, such as few-step inference, KV cache acceleration, image editing, etc.
- Generalized AR Paradigm: SAR conceptually unifies existing AR approaches, and provides broader design space to customize the training/inference process.
- Fully Masked Transformer: A new transformer architecture suitable for SAR.
- Transition States: The transition states in SAR offer new AR variants, which integrate both advantages of AR and MAR.
We run the code on:
- Python 3.11
- PyTorch 2.3.1
The code is built upon LlamaGen, MAR, VAR and MAE (PyTorch). Thank for their great work.
@article{liu2024customize,
title={Customize Your Visual Autoregressive Recipe with Set Autoregressive Modeling},
author={Liu, Wenze and Zhuo, Le and Xin, Yi and Xia, Sheng and Gao, Peng and Yue, Xiangyu},
journal={arXiv preprint arXiv:2410.10511},
year={2024}
}
You can contact me via email [email protected] if any questions.