GPLQ: A General, Practical, and Lightning QAT Method for Vision Transformers
Published in NeurIPS 2025, 2025
Recommended citation: Guang Liang, Xinyao Liu, Jianxin Wu. GPLQ: A General, Practical, and Lightning QAT Method for Vision Transformers. NeurIPS 2025. https://arxiv.org/abs/2506.11784
Proposes an “activation-first” ViT quantization framework that is 100x faster than traditional methods while achieving SOTA-level generalization performance.
| Download paper here | Code | arXiv |