Orthogonalising gradients to speed up neural network optimisation

Tuddenham, Mark; Prügel-Bennett, Adam; Hare, Jonathan

Computer Science > Machine Learning

arXiv:2202.07052 (cs)

[Submitted on 14 Feb 2022]

Title:Orthogonalising gradients to speed up neural network optimisation

Authors:Mark Tuddenham, Adam Prügel-Bennett, Jonathan Hare

View PDF

Abstract:The optimisation of neural networks can be sped up by orthogonalising the gradients before the optimisation step, ensuring the diversification of the learned representations. We orthogonalise the gradients of the layer's components/filters with respect to each other to separate out the intermediate representations. Our method of orthogonalisation allows the weights to be used more flexibly, in contrast to restricting the weights to an orthogonalised sub-space. We tested this method on ImageNet and CIFAR-10 resulting in a large decrease in learning time, and also obtain a speed-up on the semi-supervised learning BarlowTwins. We obtain similar accuracy to SGD without fine-tuning and better accuracy for naïvely chosen hyper-parameters.

Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2202.07052 [cs.LG]
	(or arXiv:2202.07052v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2202.07052

Submission history

From: Mark Tuddenham [view email]
[v1] Mon, 14 Feb 2022 21:46:07 UTC (2,931 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2022-02

Change to browse by:

References & Citations

export BibTeX citation

Computer Science > Machine Learning

Title:Orthogonalising gradients to speed up neural network optimisation

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Orthogonalising gradients to speed up neural network optimisation

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators