NIPQ: Noise proxy-based Integrated Pseudo-Quantization

Shin, Juncheol; So, Junhyuk; Park, Sein; Kang, Seungyeop; Yoo, Sungjoo; Park, Eunhyeok

arXiv:2206.00820 (cs)

[Submitted on 2 Jun 2022 (v1), last revised 1 Jul 2023 (this version, v2)]

Abstract: The straight-through estimator (STE), which enables gradient flow through non-differentiable functions via approximation, has been favored in studies on quantization-aware training (QAT). However, STE causes unstable convergence during QAT, resulting in notable quality degradation at low precision. Recently, pseudo-quantization training has been proposed as an alternative that updates the learnable parameters using pseudo-quantization noise instead of STE. In this study, we propose a novel noise proxy-based integrated pseudo-quantization (NIPQ) that enables unified support of pseudo-quantization for both activations and weights by integrating the idea of truncation into the pseudo-quantization framework. NIPQ updates all of the quantization parameters (e.g., bit-width and truncation boundary) as well as the network parameters via gradient descent without STE instability. According to our extensive experiments, NIPQ outperforms existing quantization algorithms in various vision and language applications by a large margin.
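To make the contrast in the abstract concrete, the following is a minimal sketch of the general idea only, not the authors' implementation. It compares a hard quantizer (truncate, then round, which is the non-differentiable step STE approximates) with a noise proxy (truncate, then add noise of one quantization step in width). The symmetric signed grid, the uniform noise distribution, and all names (`step_size`, `clip`, `hard_quantize`, `pseudo_quantize`, `alpha`, `bits`) are assumptions chosen for illustration; the abstract does not specify them.

```python
import random


def step_size(alpha, bits):
    # Assumed convention: symmetric signed grid with 2**(bits - 1) - 1
    # positive levels spanning the truncation boundary [-alpha, alpha].
    return alpha / (2 ** (bits - 1) - 1)


def clip(x, alpha):
    # Truncation: limit x to the boundary [-alpha, alpha].
    return max(-alpha, min(alpha, x))


def hard_quantize(x, alpha, bits):
    # Truncate, then round to the nearest grid point. The rounding step has
    # zero gradient almost everywhere, which is why QAT usually needs STE.
    step = step_size(alpha, bits)
    return round(clip(x, alpha) / step) * step


def pseudo_quantize(x, alpha, bits, rng):
    # Truncate, then add uniform noise in [-step/2, step/2] in place of
    # rounding. The output depends smoothly on alpha and bits through step,
    # so in an autodiff framework both could receive ordinary gradients.
    step = step_size(alpha, bits)
    return clip(x, alpha) + rng.uniform(-0.5, 0.5) * step


rng = random.Random(0)
alpha, bits = 2.0, 4
xs = [rng.gauss(0.0, 1.0) for _ in range(1000)]
hard = [hard_quantize(x, alpha, bits) for x in xs]
soft = [pseudo_quantize(x, alpha, bits, rng) for x in xs]

# Both outputs stay within half a step of the truncated input.
step = step_size(alpha, bits)
print(max(abs(h - clip(x, alpha)) for h, x in zip(hard, xs)) <= step / 2 + 1e-12)
print(max(abs(s - clip(x, alpha)) for s, x in zip(soft, xs)) <= step / 2 + 1e-12)
```

In this sketch the noise has the same support as the rounding error, so the noisy output stands in for the quantized value during training while remaining differentiable with respect to the quantization parameters.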

Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2206.00820 [cs.LG]
	(or arXiv:2206.00820v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2206.00820

Submission history

From: Eunhyeok Park
[v1] Thu, 2 Jun 2022 01:17:40 UTC (167 KB)
[v2] Sat, 1 Jul 2023 08:27:18 UTC (1,947 KB)
