Zum Hauptinhalt springen

Showing 1–5 of 5 results for author: Phung, S L

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.18967  [pdf, other

    cs.CV

    Structural Attention: Rethinking Transformer for Unpaired Medical Image Synthesis

    Authors: Vu Minh Hieu Phan, Yutong Xie, Bowen Zhang, Yuankai Qi, Zhibin Liao, Antonios Perperidis, Son Lam Phung, Johan W. Verjans, Minh-Son To

    Abstract: Unpaired medical image synthesis aims to provide complementary information for an accurate clinical diagnostics, and address challenges in obtaining aligned multi-modal medical scans. Transformer-based models excel in imaging translation tasks thanks to their ability to capture long-range dependencies. Although effective in supervised training settings, their performance falters in unpaired image… ▽ More

    Submitted 28 August, 2024; v1 submitted 27 June, 2024; originally announced June 2024.

    Comments: MICCAI version before camera ready

  2. arXiv:2304.07199  [pdf, other

    cs.CV

    CROVIA: Seeing Drone Scenes from Car Perspective via Cross-View Adaptation

    Authors: Thanh-Dat Truong, Chi Nhan Duong, Ashley Dowling, Son Lam Phung, Jackson Cothren, Khoa Luu

    Abstract: Understanding semantic scene segmentation of urban scenes captured from the Unmanned Aerial Vehicles (UAV) perspective plays a vital role in building a perception model for UAV. With the limitations of large-scale densely labeled data, semantic scene segmentation for UAV views requires a broad understanding of an object from both its top and side views. Adapting from well-annotated autonomous driv… ▽ More

    Submitted 14 April, 2023; originally announced April 2023.

  3. arXiv:2211.09663  [pdf, other

    cs.CV

    Multi-Camera Multi-Object Tracking on the Move via Single-Stage Global Association Approach

    Authors: Pha Nguyen, Kha Gia Quach, Chi Nhan Duong, Son Lam Phung, Ngan Le, Khoa Luu

    Abstract: The development of autonomous vehicles generates a tremendous demand for a low-cost solution with a complete set of camera sensors capturing the environment around the car. It is essential for object detection and tracking to address these new challenges in multi-camera settings. In order to address these challenges, this work introduces novel Single-Stage Global Association Tracking approaches to… ▽ More

    Submitted 17 November, 2022; originally announced November 2022.

    Comments: In review PR journal. arXiv admin note: text overlap with arXiv:2204.09151

  4. arXiv:2203.10233  [pdf, other

    cs.CV

    DirecFormer: A Directed Attention in Transformer Approach to Robust Action Recognition

    Authors: Thanh-Dat Truong, Quoc-Huy Bui, Chi Nhan Duong, Han-Seok Seo, Son Lam Phung, Xin Li, Khoa Luu

    Abstract: Human action recognition has recently become one of the popular research topics in the computer vision community. Various 3D-CNN based methods have been presented to tackle both the spatial and temporal dimensions in the task of video action recognition with competitive results. However, these methods have suffered some fundamental limitations such as lack of robustness and generalization, e.g., h… ▽ More

    Submitted 18 March, 2022; originally announced March 2022.

    Comments: Accepted to CVPR 2022

  5. arXiv:2108.03267  [pdf, other

    cs.CV

    BiMaL: Bijective Maximum Likelihood Approach to Domain Adaptation in Semantic Scene Segmentation

    Authors: Thanh-Dat Truong, Chi Nhan Duong, Ngan Le, Son Lam Phung, Chase Rainwater, Khoa Luu

    Abstract: Semantic segmentation aims to predict pixel-level labels. It has become a popular task in various computer vision applications. While fully supervised segmentation methods have achieved high accuracy on large-scale vision datasets, they are unable to generalize on a new test environment or a new domain well. In this work, we first introduce a new Un-aligned Domain Score to measure the efficiency o… ▽ More

    Submitted 6 August, 2021; originally announced August 2021.

    Comments: Accepted to ICCV 2021