Self-supervised audio-visual co-segmentation

A Rouditchenko, H Zhao, C Gan… - ICASSP 2019-2019 …, 2019 - ieeexplore.ieee.org
… More recently, [1, 16, 11] used audio-visual correspondence to separate sound sources. An…
Our contribution is to develop a model for audio-visual co-segmentation using videos. In the …

Self-supervised learning of audio-visual objects from video

T Afouras, A Owens, JS Chung, A Zisserman - Computer Vision–ECCV …, 2020 - Springer
… upon recent works on self-supervised audio-visual localization. … facilitate a number of
audio-visual downstream tasks that … In each case, we significantly outperform other self-supervised

Self-supervised object detection from audio-visual correspondence

T Afouras, YM Asano, F Fagan… - Proceedings of the …, 2022 - openaccess.thecvf.com
… Inspired by recent work in self-supervised learning, we seek to replace this source of ex…
clustering with audio-visual co-segmentation achieving combined audio-visual source separation. …

Self-supervised segmentation and source separation on videos

A Rouditchenko, H Zhao, C Gan… - Proceedings of the …, 2019 - openaccess.thecvf.com
… Joint audio-visual training and independent image and audio inference. After our neural …
The learning method is self-supervised because the neural networks do not require labelled …

Audio-visual segmentation

J Zhou, J Wang, J Zhang, W Sun, J Zhang… - … on Computer Vision, 2022 - Springer
… Look, listen, and attend: co-attention network for self-supervised audio-visual representation
learning. In: Proceedings of the 28th ACM International Conference on Multimedia (ACM), …

Self-supervised learning by cross-modal audio-video clustering

H Alwassel, D Mahajan, B Korbar… - Advances in …, 2020 - proceedings.neurips.cc
… video models from self-supervised audio-visual information. At … prediction an enriching
self-supervised task compared to … Self-supervised audio-visual co-segmentation. In ICASSP, …

Annotation-free audio-visual segmentation

J Liu, Y Wang, C Ju, C Ma… - Proceedings of the …, 2024 - openaccess.thecvf.com
Audio-visual scene analysis with self-supervised multisensory features. In Proceedings of
the … Self-supervised audio-visual co-segmentation. In ICASSP 2019-2019 IEEE International …

Audio-visual segmentation with semantics

J Zhou, X Shen, J Wang, J Zhang, W Sun… - arXiv preprint arXiv …, 2023 - arxiv.org
… network for self-supervised audio-visual representation learning,… AA Efros, “Audio-visual
scene analysis with self-supervised … Torralba, “Self-supervised audio-visual co-segmentation,” in …

Weakly-supervised audio-visual segmentation

S Mo, B Raj - Advances in Neural Information Processing …, 2024 - proceedings.neurips.cc
Self-supervised equivariant attention mechanism for weakly supervised semantic
segmentation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …

Weakly-supervised audio-visual sound source detection and separation

T Rahman, L Sigal - 2021 IEEE International Conference on …, 2021 - ieeexplore.ieee.org
… We propose an audio-visual co-segmentation, where the network learns both what … [5]
Andrew Owens and Alexei A Efros, “Audio-visual scene analysis with self-supervised multisensory …