EfficientQAT: Efficient Quantization-Aware Training for Large Language Models • Paper • 2407.11062 • Published 13 days ago • 4
Spectra: A Comprehensive Study of Ternary, Quantized, and FP16 Language Models • Paper • 2407.12327 • Published 6 days ago • 64