On neural and dimensional collapse in supervised and unsupervised contrastive learning with hard negative sampling

Jiang, Ruijie; Nguyen, Thuan; Aeron, Shuchin; Ishwar, Prakash

Computer Science > Machine Learning

arXiv:2311.05139 (cs)

[Submitted on 9 Nov 2023]

Title:On neural and dimensional collapse in supervised and unsupervised contrastive learning with hard negative sampling

Authors:Ruijie Jiang, Thuan Nguyen, Shuchin Aeron, Prakash Ishwar

View PDF

Abstract:For a widely-studied data model and general loss and sample-hardening functions we prove that the Supervised Contrastive Learning (SCL), Hard-SCL (HSCL), and Unsupervised Contrastive Learning (UCL) risks are minimized by representations that exhibit Neural Collapse (NC), i.e., the class means form an Equianglular Tight Frame (ETF) and data from the same class are mapped to the same representation. We also prove that for any representation mapping, the HSCL and Hard-UCL (HUCL) risks are lower bounded by the corresponding SCL and UCL risks. Although the optimality of ETF is known for SCL, albeit only for InfoNCE loss, its optimality for HSCL and UCL under general loss and hardening functions is novel. Moreover, our proofs are much simpler, compact, and transparent. We empirically demonstrate, for the first time, that ADAM optimization of HSCL and HUCL risks with random initialization and suitable hardness levels can indeed converge to the NC geometry if we incorporate unit-ball or unit-sphere feature normalization. Without incorporating hard negatives or feature normalization, however, the representations learned via ADAM suffer from dimensional collapse (DC) and fail to attain the NC geometry.

Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2311.05139 [cs.LG]
	(or arXiv:2311.05139v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2311.05139

Submission history

From: Ruijie Jiang [view email]
[v1] Thu, 9 Nov 2023 04:40:32 UTC (5,667 KB)

Computer Science > Machine Learning

Title:On neural and dimensional collapse in supervised and unsupervised contrastive learning with hard negative sampling

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:On neural and dimensional collapse in supervised and unsupervised contrastive learning with hard negative sampling

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators