Discriminability-enforcing loss to improve representation learning

Croitoru, Florinel-Alin; Grigore, Diana-Nicoleta; Ionescu, Radu Tudor

Computer Science > Computer Vision and Pattern Recognition

arXiv:2202.07073 (cs)

[Submitted on 14 Feb 2022 (v1), last revised 7 Apr 2022 (this version, v2)]

Title:Discriminability-enforcing loss to improve representation learning

Authors:Florinel-Alin Croitoru, Diana-Nicoleta Grigore, Radu Tudor Ionescu

View PDF

Abstract:During the training process, deep neural networks implicitly learn to represent the input data samples through a hierarchy of features, where the size of the hierarchy is determined by the number of layers. In this paper, we focus on enforcing the discriminative power of the high-level representations, that are typically learned by the deeper layers (closer to the output). To this end, we introduce a new loss term inspired by the Gini impurity, which is aimed at minimizing the entropy (increasing the discriminative power) of individual high-level features with respect to the class labels. Although our Gini loss induces highly-discriminative features, it does not ensure that the distribution of the high-level features matches the distribution of the classes. As such, we introduce another loss term to minimize the Kullback-Leibler divergence between the two distributions. We conduct experiments on two image classification data sets (CIFAR-100 and Caltech 101), considering multiple neural architectures ranging from convolutional networks (ResNet-17, ResNet-18, ResNet-50) to transformers (CvT). Our empirical results show that integrating our novel loss terms into the training objective consistently outperforms the models trained with cross-entropy alone, without increasing the inference time at all.

Comments:	Accepted in CVPR Workshops
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Cite as:	arXiv:2202.07073 [cs.CV]
	(or arXiv:2202.07073v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2202.07073

Submission history

From: Radu Tudor Ionescu [view email]
[v1] Mon, 14 Feb 2022 22:31:37 UTC (180 KB)
[v2] Thu, 7 Apr 2022 17:45:34 UTC (185 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Discriminability-enforcing loss to improve representation learning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Discriminability-enforcing loss to improve representation learning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators