Accumulator-Aware Post-Training Quantization for Large Language Models
Ian Colbert, Giuseppe Franco, Fabian Grob, Jinjie Zhang, Rayan Saab
Action editor: Jundong Li
https://openreview.net/forum?id=p6l0579yj7
#quantization #quantizing #multiplications