Google Scholar

Self-supervised audio-visual co-segmentation

A Rouditchenko, H Zhao, C Gan… - ICASSP 2019-2019 …, 2019 - ieeexplore.ieee.org

… More recently, [1, 16, 11] used audio-visual correspondence to separate sound sources. An…
Our contribution is to develop a model for audio-visual co-segmentation using videos. In the …

Cited by 125

[PDF] arxiv.org

Self-supervised learning of audio-visual objects from video

T Afouras, A Owens, JS Chung, A Zisserman - Computer Vision–ECCV …, 2020 - Springer

… upon recent works on self-supervised audio-visual localization. … facilitate a number of
audio-visual downstream tasks that … In each case, we significantly outperform other self-supervised …

Cited by 258

[PDF] thecvf.com

Self-supervised object detection from audio-visual correspondence

T Afouras, YM Asano, F Fagan… - Proceedings of the …, 2022 - openaccess.thecvf.com

… Inspired by recent work in self-supervised learning, we seek to replace this source of ex…
clustering with audio-visual co-segmentation achieving combined audio-visual source separation. …

Cited by 53

[PDF] thecvf.com

Self-supervised segmentation and source separation on videos

A Rouditchenko, H Zhao, C Gan… - Proceedings of the …, 2019 - openaccess.thecvf.com

… Joint audio-visual training and independent image and audio inference. After our neural …
The learning method is self-supervised because the neural networks do not require labelled …

Cited by 6

[PDF] arxiv.org

Audio–visual segmentation

J Zhou, J Wang, J Zhang, W Sun, J Zhang… - … on Computer Vision, 2022 - Springer

… Look, listen, and attend: co-attention network for self-supervised audio-visual representation
learning. In: Proceedings of the 28th ACM International Conference on Multimedia (ACM), …

Cited by 114

[PDF] neurips.cc

Self-supervised learning by cross-modal audio-video clustering

H Alwassel, D Mahajan, B Korbar… - Advances in …, 2020 - proceedings.neurips.cc

… video models from self-supervised audio-visual information. At … prediction an enriching
self-supervised task compared to … Self-supervised audio-visual co-segmentation. In ICASSP, …

Cited by 473

[PDF] thecvf.com

Annotation-free audio-visual segmentation

J Liu, Y Wang, C Ju, C Ma… - Proceedings of the …, 2024 - openaccess.thecvf.com

… Audio-visual scene analysis with self-supervised multisensory features. In Proceedings of
the … Self-supervised audio-visual co-segmentation. In ICASSP 2019-2019 IEEE International …

Cited by 27

[PDF] arxiv.org

Audio-visual segmentation with semantics

J Zhou, X Shen, J Wang, J Zhang, W Sun… - arXiv preprint arXiv …, 2023 - arxiv.org

… network for self-supervised audio-visual representation learning,… AA Efros, “Audio-visual
scene analysis with self-supervised … Torralba, “Self-supervised audio-visual co-segmentation,” in …

Cited by 23

[PDF] neurips.cc

Weakly-supervised audio-visual segmentation

S Mo, B Raj - Advances in Neural Information Processing …, 2024 - proceedings.neurips.cc

… Self-supervised equivariant attention mechanism for weakly supervised semantic
segmentation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …

Cited by 6

[PDF] arxiv.org

Weakly-supervised audio-visual sound source detection and separation

T Rahman, L Sigal - 2021 IEEE International Conference on …, 2021 - ieeexplore.ieee.org

… We propose an audio-visual co-segmentation, where the network learns both what … [5]
Andrew Owens and Alexei A Efros, “Audio-visual scene analysis with self-supervised multisensory …

Cited by 8
