Defining an action of SO(d)-rotations on images generated by projections of d-dimensional objects: Applications to pose inference with Geometric VAEs

Nicolas Legendre; Khanh Dao Duc; Nina Miolane

Colloq Traitement Signal Imag. 2022 Sep:28:329-332.

Authors

Nicolas Legendre¹,², Khanh Dao Duc², Nina Miolane³

Affiliations

¹ Department of Mathematics, Centrale Supelec, 3 Rue Joliot Curie, Gif-sur-Yvette, 91190, France.
² Department of Mathematics, University of British Columbia, 1984 Mathematics Road, Vancouver, BC V6T 1Z4, Canada.
³ Department of Electrical and Computer Engineering, Harold Frank Hall, Santa Barbara, California, 93106, United States.

PMID: 37469528
PMCID: PMC10354539

Abstract (in English and French)

Recent advances in variational autoencoders (VAEs) have enabled learning latent manifolds as compact Lie groups, such as SO(d). Since this approach assumes that the data lie on a subspace that is homeomorphic to the Lie group itself, we investigate whether this assumption holds for images generated by projecting a d-dimensional volume with unknown pose in SO(d). After examining different theoretical candidates for the group and the image space, we show that attempts to define a group action on the data space generally fail unless the volume satisfies more specific geometric constraints. Using geometric VAEs, our experiments confirm that these constraints are key to proper pose inference, and we discuss the potential of these results for applications and future work.

Les récents progrès dans le domaine des autoencodeurs variationnels (VAEs) ont permis l’apprentissage de variétés latentes sur des groupes de Lie compacts, tels que SO(d). Une telle approche supposant l’espace des données homéomorphe au groupe de Lie, nous étudions ici la validité de cette hypothèse dans le contexte d’images générées par projection d’un volume de dimension d, dont la pose dans SO(d) est inconnue. Après examen de différents candidats définissant l’espace des images et groupe, on montre que l’on ne peut de manière générale obtenir une action de groupe, sans une contrainte supplémentaire sur le volume. En appliquant des VAEs géométriques, nos expériences confirment que ces contraintes géométriques sont essentielles pour l’inférence de la pose associée au volume projeté, et nous discutons pour conclure des applications potentielles de ces résultats.
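
The claim that a group action on the image space generally fails can be illustrated with a toy case for d = 2. The sketch below is not the authors' experiment: it assumes a "volume" made of two point masses and an orthographic projection onto the first axis, and the names `rotation`, `project`, and `image` are hypothetical helpers introduced for illustration. It checks whether the candidate rule "rotate the pose by g, then project" depends only on the image, which is a necessary condition for a well-defined action on images.

```python
import numpy as np

def rotation(theta):
    """2D rotation matrix for angle theta (an element of SO(2))."""
    c, s = np.cos(theta), np.sin(theta)
    return np.array([[c, -s], [s, c]])

def project(points):
    """Toy projection: keep the first coordinate of each point mass.
    The 'image' is the sorted multiset of projected positions."""
    return np.sort(points[:, 0])

def image(R, volume):
    """Image obtained by rotating the volume with pose R, then projecting."""
    return project(volume @ R.T)

# Assumed toy volume: two point masses with no special symmetry constraint.
volume = np.array([[1.0, 0.0], [0.0, 1.0]])

R1 = rotation(0.0)           # first pose
R2 = rotation(-np.pi / 2)    # second pose, giving the same image as R1
g = rotation(np.pi / 2)      # group element we try to act with

# Both poses produce the same image ...
same_before = np.allclose(image(R1, volume), image(R2, volume))
# ... but after composing each pose with g, the images differ.
same_after = np.allclose(image(g @ R1, volume), image(g @ R2, volume))

print(same_before, same_after)  # True False
```

Because two poses with identical images lead to different images after applying the same rotation g, the rule g · image(R) = image(gR) is not well defined on images for this volume, which is consistent with the need for additional geometric constraints on the volume.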

Grants and funding

R01 GM144965/GM/NIGMS NIH HHS/United States
