Skip to main content

Showing 1–50 of 207 results for author: Do, T

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.10412  [pdf, other

    cs.HC

    Cultural Reflections in Virtual Reality: The Effects of User Ethnicity in Avatar Matching Experiences on Sense of Embodiment

    Authors: Tiffany D. Do, Juanita Benjamin, Camille Isabella Protko, Ryan P. McMahan

    Abstract: Matching avatar characteristics to a user can impact sense of embodiment (SoE) in VR. However, few studies have examined how participant demographics may interact with these matching effects. We recruited a diverse and racially balanced sample of 78 participants to investigate the differences among participant groups when embodying both demographically matched and unmatched avatars. We found that… ▽ More

    Submitted 14 July, 2024; originally announced July 2024.

    Comments: To appear in IEEE Transactions on Visualization and Computer Graphics

  2. arXiv:2407.09503  [pdf, other

    cs.CV cs.HC cs.NE

    PARSE-Ego4D: Personal Action Recommendation Suggestions for Egocentric Videos

    Authors: Steven Abreu, Tiffany D. Do, Karan Ahuja, Eric J. Gonzalez, Lee Payne, Daniel McDuff, Mar Gonzalez-Franco

    Abstract: Intelligent assistance involves not only understanding but also action. Existing ego-centric video datasets contain rich annotations of the videos, but not of actions that an intelligent assistant could perform in the moment. To address this gap, we release PARSE-Ego4D, a new set of personal action recommendation annotations for the Ego4D dataset. We take a multi-stage approach to generating and e… ▽ More

    Submitted 14 June, 2024; originally announced July 2024.

  3. arXiv:2407.07229  [pdf, other

    astro-ph.IM cs.AI

    Using Galaxy Evolution as Source of Physics-Based Ground Truth for Generative Models

    Authors: Yun Qi Li, Tuan Do, Evan Jones, Bernie Boscoe, Kevin Alfaro, Zooey Nguyen

    Abstract: Generative models producing images have enormous potential to advance discoveries across scientific fields and require metrics capable of quantifying the high dimensional output. We propose that astrophysics data, such as galaxy images, can test generative models with additional physics-motivated ground truths in addition to human judgment. For example, galaxies in the Universe form and change ove… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

    Comments: 20 pages, 14 figures, 1 Table, code: https://github.com/astrodatalab/li2024_public, training data: https://zenodo.org/records/11117528

  4. arXiv:2407.07003  [pdf, other

    cs.CV cs.AI

    Learning to Complement and to Defer to Multiple Users

    Authors: Zheng Zhang, Wenjie Ai, Kevin Wells, David Rosewarne, Thanh-Toan Do, Gustavo Carneiro

    Abstract: With the development of Human-AI Collaboration in Classification (HAI-CC), integrating users and AI predictions becomes challenging due to the complex decision-making process. This process has three options: 1) AI autonomously classifies, 2) learning to complement, where AI collaborates with users, and 3) learning to defer, where AI defers to users. Despite their interconnected nature, these optio… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

    Comments: ECCV 2024

  5. arXiv:2407.02721  [pdf, ps, other

    cs.LG cs.CV

    Model and Feature Diversity for Bayesian Neural Networks in Mutual Learning

    Authors: Cuong Pham, Cuong C. Nguyen, Trung Le, Dinh Phung, Gustavo Carneiro, Thanh-Toan Do

    Abstract: Bayesian Neural Networks (BNNs) offer probability distributions for model parameters, enabling uncertainty quantification in predictions. However, they often underperform compared to deterministic neural networks. Utilizing mutual learning can effectively enhance the performance of peer BNNs. In this paper, we propose a novel approach to improve BNNs performance through deep mutual learning. The p… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: Accepted to NeurIPS 2023

  6. arXiv:2407.00609  [pdf, other

    cs.CV cs.LG

    ESGNN: Towards Equivariant Scene Graph Neural Network for 3D Scene Understanding

    Authors: Quang P. M. Pham, Khoi T. N. Nguyen, Lan C. Ngo, Truong Do, Truong Son Hy

    Abstract: Scene graphs have been proven to be useful for various scene understanding tasks due to their compact and explicit nature. However, existing approaches often neglect the importance of maintaining the symmetry-preserving property when generating scene graphs from 3D point clouds. This oversight can diminish the accuracy and robustness of the resulting scene graphs, especially when handling noisy, m… ▽ More

    Submitted 30 June, 2024; originally announced July 2024.

  7. arXiv:2406.12593  [pdf, other

    cs.IR cs.AI cs.CL cs.LG

    PromptDSI: Prompt-based Rehearsal-free Instance-wise Incremental Learning for Document Retrieval

    Authors: Tuan-Luc Huynh, Thuy-Trang Vu, Weiqing Wang, Yinwei Wei, Trung Le, Dragan Gasevic, Yuan-Fang Li, Thanh-Toan Do

    Abstract: Differentiable Search Index (DSI) utilizes Pre-trained Language Models (PLMs) for efficient document retrieval without relying on external indexes. However, DSIs need full re-training to handle updates in dynamic corpora, causing significant computational inefficiencies. We introduce PromptDSI, a rehearsal-free, prompt-based approach for instance-wise incremental learning in document retrieval. Pr… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: 21 pages

  8. arXiv:2406.09332  [pdf, other

    cs.RO

    RoTipBot: Robotic Handling of Thin and Flexible Objects using Rotatable Tactile Sensors

    Authors: Jiaqi Jiang, Xuyang Zhang, Daniel Fernandes Gomes, Thanh-Toan Do, Shan Luo

    Abstract: This paper introduces RoTipBot, a novel robotic system for handling thin, flexible objects. Different from previous works that are limited to singulating them using suction cups or soft grippers, RoTipBot can grasp and count multiple layers simultaneously, emulating human handling in various environments. Specifically, we develop a novel vision-based tactile sensor named RoTip that can rotate and… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: 20 pages, 21 figures

  9. arXiv:2406.08815  [pdf, other

    cs.RO eess.SY

    Deep Reinforcement Learning-based Quadcopter Controller: A Practical Approach and Experiments

    Authors: Truong-Dong Do, Nguyen Xuan Mung, Sung Kyung Hong

    Abstract: Quadcopters have been studied for decades thanks to their maneuverability and capability of operating in a variety of circumstances. However, quadcopters suffer from dynamical nonlinearity, actuator saturation, as well as sensor noise that make it challenging and time consuming to obtain accurate dynamic models and achieve satisfactory control performance. Fortunately, deep reinforcement learning… ▽ More

    Submitted 18 June, 2024; v1 submitted 13 June, 2024; originally announced June 2024.

    Comments: 6 pages, 5 figures, 3 tables

  10. arXiv:2406.07107  [pdf, other

    cs.LG

    Agnostic Sharpness-Aware Minimization

    Authors: Van-Anh Nguyen, Quyen Tran, Tuan Truong, Thanh-Toan Do, Dinh Phung, Trung Le

    Abstract: Sharpness-aware minimization (SAM) has been instrumental in improving deep neural network training by minimizing both the training loss and the sharpness of the loss landscape, leading the model into flatter minima that are associated with better generalization properties. In another aspect, Model-Agnostic Meta-Learning (MAML) is a framework designed to improve the adaptability of models. MAML opt… ▽ More

    Submitted 11 June, 2024; v1 submitted 11 June, 2024; originally announced June 2024.

    Comments: Under review

  11. arXiv:2406.04090  [pdf, other

    cs.LG cs.CV eess.IV eess.SP

    Interpretable Lightweight Transformer via Unrolling of Learned Graph Smoothness Priors

    Authors: Tam Thuc Do, Parham Eftekhar, Seyed Alireza Hosseini, Gene Cheung, Philip Chou

    Abstract: We build interpretable and lightweight transformer-like neural networks by unrolling iterative optimization algorithms that minimize graph smoothness priors -- the quadratic graph Laplacian regularizer (GLR) and the $\ell_1$-norm graph total variation (GTV) -- subject to an interpolation constraint. The crucial insight is that a normalized signal-dependent graph learning module amounts to a varian… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

  12. arXiv:2405.03206  [pdf, other

    cs.CL cs.AI

    Vietnamese AI Generated Text Detection

    Authors: Quang-Dan Tran, Van-Quan Nguyen, Quang-Huy Pham, K. B. Thang Nguyen, Trong-Hop Do

    Abstract: In recent years, Large Language Models (LLMs) have become integrated into our daily lives, serving as invaluable assistants in completing tasks. Widely embraced by users, the abuse of LLMs is inevitable, particularly in using them to generate text content for various purposes, leading to difficulties in distinguishing between text generated by LLMs and that written by humans. In this study, we pre… ▽ More

    Submitted 6 May, 2024; originally announced May 2024.

  13. arXiv:2404.19252  [pdf, other

    cs.CL

    Exploiting Hatred by Targets for Hate Speech Detection on Vietnamese Social Media Texts

    Authors: Cuong Nhat Vo, Khanh Bao Huynh, Son T. Luu, Trong-Hop Do

    Abstract: The growth of social networks makes toxic content spread rapidly. Hate speech detection is a task to help decrease the number of harmful comments. With the diversity in the hate speech created by users, it is necessary to interpret the hate speech besides detecting it. Hence, we propose a methodology to construct a system for targeted hate speech detection from online streaming texts from social m… ▽ More

    Submitted 30 April, 2024; originally announced April 2024.

  14. arXiv:2404.16556  [pdf, other

    cs.CV

    Conditional Distribution Modelling for Few-Shot Image Synthesis with Diffusion Models

    Authors: Parul Gupta, Munawar Hayat, Abhinav Dhall, Thanh-Toan Do

    Abstract: Few-shot image synthesis entails generating diverse and realistic images of novel categories using only a few example images. While multiple recent efforts in this direction have achieved impressive results, the existing approaches are dependent only upon the few novel samples available at test time in order to generate new images, which restricts the diversity of the generated images. To overcome… ▽ More

    Submitted 28 April, 2024; v1 submitted 25 April, 2024; originally announced April 2024.

  15. arXiv:2404.08511  [pdf, other

    cs.AI cs.CL

    Leveraging Multi-AI Agents for Cross-Domain Knowledge Discovery

    Authors: Shiva Aryal, Tuyen Do, Bisesh Heyojoo, Sandeep Chataut, Bichar Dip Shrestha Gurung, Venkataramana Gadhamshetty, Etienne Gnimpieba

    Abstract: In the rapidly evolving field of artificial intelligence, the ability to harness and integrate knowledge across various domains stands as a paramount challenge and opportunity. This study introduces a novel approach to cross-domain knowledge discovery through the deployment of multi-AI agents, each specialized in distinct knowledge domains. These AI agents, designed to function as domain-specific… ▽ More

    Submitted 12 April, 2024; originally announced April 2024.

  16. arXiv:2404.08396  [pdf, other

    cs.IT

    Joint Computation Offloading and Target Tracking in Integrated Sensing and Communication Enabled UAV Networks

    Authors: Trinh Van Chien, Mai Dinh Cong, Nguyen Cong Luong, Tri Nhu Do, Dong In Kim, Symeon Chatzinotas

    Abstract: In this paper, we investigate a joint computation offloading and target tracking in Integrated Sensing and Communication (ISAC)-enabled unmanned aerial vehicle (UAV) network. Therein, the UAV has a computing task that is partially offloaded to the ground UE for execution. Meanwhile, the UAV uses the offloading bit sequence to estimate the velocity of a ground target based on an autocorrelation fun… ▽ More

    Submitted 12 April, 2024; originally announced April 2024.

    Comments: 5 pages, 3 figures, 1 table. Accepted by IEEE Communications Letters

  17. arXiv:2404.03612  [pdf, other

    cs.HC

    Creator Hearts: Investigating the Impact Positive Signals from YouTube Creators in Shaping Comment Section Behavior

    Authors: Frederick Choi, Charlotte Lambert, Vinay Koshy, Sowmya Pratipati, Tue Do, Eshwar Chandrasekharan

    Abstract: Much of the research in online moderation focuses on punitive actions. However, emerging research has shown that positive reinforcement is effective at encouraging desirable behavior on online platforms. We extend this research by studying the "creator heart" feature on YouTube, quantifying their primary effects on comments that receive hearts and on videos where hearts have been given. We find th… ▽ More

    Submitted 4 April, 2024; originally announced April 2024.

  18. arXiv:2404.02330  [pdf, other

    cs.CL cs.AI

    Comparative Study of Domain Driven Terms Extraction Using Large Language Models

    Authors: Sandeep Chataut, Tuyen Do, Bichar Dip Shrestha Gurung, Shiva Aryal, Anup Khanal, Carol Lushbough, Etienne Gnimpieba

    Abstract: Keywords play a crucial role in bridging the gap between human understanding and machine processing of textual data. They are essential to data enrichment because they form the basis for detailed annotations that provide a more insightful and in-depth view of the underlying data. Keyword/domain driven term extraction is a pivotal task in natural language processing, facilitating information retrie… ▽ More

    Submitted 2 April, 2024; originally announced April 2024.

  19. arXiv:2403.08969  [pdf, other

    cs.HC cs.LG

    The Full-scale Assembly Simulation Testbed (FAST) Dataset

    Authors: Alec G. Moore, Tiffany D. Do, Nayan N. Chawla, Antonia Jimenez Iriarte, Ryan P. McMahan

    Abstract: In recent years, numerous researchers have begun investigating how virtual reality (VR) tracking and interaction data can be used for a variety of machine learning purposes, including user identification, predicting cybersickness, and estimating learning gains. One constraint for this research area is the dearth of open datasets. In this paper, we present a new open dataset captured with our VR-ba… ▽ More

    Submitted 13 March, 2024; originally announced March 2024.

  20. arXiv:2403.05894  [pdf, other

    cs.CV

    Frequency Attention for Knowledge Distillation

    Authors: Cuong Pham, Van-Anh Nguyen, Trung Le, Dinh Phung, Gustavo Carneiro, Thanh-Toan Do

    Abstract: Knowledge distillation is an attractive approach for learning compact deep neural networks, which learns a lightweight student model by distilling knowledge from a complex teacher model. Attention-based knowledge distillation is a specific form of intermediate feature-based knowledge distillation that uses attention mechanisms to encourage the student to better mimic the teacher. However, most of… ▽ More

    Submitted 9 March, 2024; originally announced March 2024.

    Comments: Appear to WACV 2024

  21. arXiv:2402.11421  [pdf, other

    cs.RO eess.SY

    Analysis of Fatigue-Induced Compensatory Movements in Bicep Curls: Gaining Insights for the Deployment of Wearable Sensors

    Authors: Ming Xuan Chua, Yoshiro Okubo, Shuhua Peng, Thanh Nho Do, Chun Hui Wang, Liao Wu

    Abstract: A common challenge in Bicep Curls rehabilitation is muscle compensation, where patients adopt alternative movement patterns when the primary muscle group cannot act due to injury or fatigue, significantly decreasing the effectiveness of rehabilitation efforts. The problem is exacerbated by the growing trend toward transitioning from in-clinic to home-based rehabilitation, where constant monitoring… ▽ More

    Submitted 25 May, 2024; v1 submitted 17 February, 2024; originally announced February 2024.

    Comments: 11 pages, 7 figures, accepted by T-MRB

  22. arXiv:2402.03805  [pdf, other

    cs.SE

    Automated Description Generation for Software Patches

    Authors: Thanh Trong Vu, Tuan-Dung Bui, Thanh-Dat Do, Thu-Trang Nguyen, Hieu Dinh Vo, Son Nguyen

    Abstract: Software patches are pivotal in refining and evolving codebases, addressing bugs, vulnerabilities, and optimizations. Patch descriptions provide detailed accounts of changes, aiding comprehension and collaboration among developers. However, manual description creation poses challenges in terms of time consumption and variations in quality and detail. In this paper, we propose PATCHEXPLAINER, an ap… ▽ More

    Submitted 6 February, 2024; originally announced February 2024.

    Comments: Pre-print version of PATCHEXPLAINER

  23. Stepping into the Right Shoes: The Effects of User-Matched Avatar Ethnicity and Gender on Sense of Embodiment in Virtual Reality

    Authors: Tiffany D. Do, Camille Isabella Protko, Ryan P. McMahan

    Abstract: In many consumer virtual reality (VR) applications, users embody predefined characters that offer minimal customization options, frequently emphasizing storytelling over user choice. We explore whether matching a user's physical characteristics, specifically ethnicity and gender, with their virtual self-avatar affects their sense of embodiment in VR. We conducted a 2 x 2 within-subjects experiment… ▽ More

    Submitted 14 July, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

    Comments: In IEEE Transactions on Visualization and Computer Graphics

    Journal ref: In IEEE Transactions on Visualization and Computer Graphics, vol. 30, no. 5, pp. 2434-2443, May 2024

  24. arXiv:2402.02319  [pdf

    cs.RO

    Smart Textile-Driven Soft Spine Exosuit for Lifting Tasks in Industrial Applications

    Authors: Kefan Zhu, Bibhu Sharma, Phuoc Thien Phan, James Davies, Mai Thanh Thai, Trung Thien Hoang, Chi Cong Nguyen, Adrienne Ji, Emanuele Nicotra, Nigel H. Lovell, Thanh Nho Do

    Abstract: Work related musculoskeletal disorders (WMSDs) are often caused by repetitive lifting, making them a significant concern in occupational health. Although wearable assist devices have become the norm for mitigating the risk of back pain, most spinal assist devices still possess a partially rigid structure that impacts the user comfort and flexibility. This paper addresses this issue by presenting a… ▽ More

    Submitted 3 February, 2024; originally announced February 2024.

    Comments: 6 pages, 7 figures

  25. arXiv:2401.18083  [pdf, other

    cs.CV cs.RO

    Improved Scene Landmark Detection for Camera Localization

    Authors: Tien Do, Sudipta N. Sinha

    Abstract: Camera localization methods based on retrieval, local feature matching, and 3D structure-based pose estimation are accurate but require high storage, are slow, and are not privacy-preserving. A method based on scene landmark detection (SLD) was recently proposed to address these limitations. It involves training a convolutional neural network (CNN) to detect a few predetermined, salient, scene-spe… ▽ More

    Submitted 31 January, 2024; originally announced January 2024.

    Comments: To be presented at 3DV 2024

  26. arXiv:2401.14203  [pdf, other

    eess.SP cs.IT

    Statistical Characterization of RIS-assisted UAV Communications in Terrestrial and Non-Terrestrial Networks Under Channel Aging

    Authors: Thanh Luan Nguyen, Georges Kaddoum, Tri Nhu Do, Zygmunt J. Haas

    Abstract: This paper studies the statistical characterization of ground-to-air (G2A) and reconfigurable intelligent surface (RIS)-assisted air-to-ground (A2G) communications with unmanned aerial vehicles (UAVs) in terrestrial and non-terrestrial networks under the impact of channel aging. We first model the G2A and A2G signal-to-noise ratios (SNRs) as non-central complex Gaussian quadratic random variable… ▽ More

    Submitted 30 January, 2024; v1 submitted 25 January, 2024; originally announced January 2024.

    Comments: 6 pages, 3 figures and 7 subfigures, IEEE ICC'24 (Revision)

  27. arXiv:2401.01387  [pdf, other

    cs.CV

    DiffAugment: Diffusion based Long-Tailed Visual Relationship Recognition

    Authors: Parul Gupta, Tuan Nguyen, Abhinav Dhall, Munawar Hayat, Trung Le, Thanh-Toan Do

    Abstract: The task of Visual Relationship Recognition (VRR) aims to identify relationships between two interacting objects in an image and is particularly challenging due to the widely-spread and highly imbalanced distribution of <subject, relation, object> triplets. To overcome the resultant performance bias in existing VRR approaches, we introduce DiffAugment -- a method which first augments the tail clas… ▽ More

    Submitted 1 March, 2024; v1 submitted 1 January, 2024; originally announced January 2024.

  28. arXiv:2311.13539  [pdf, other

    eess.IV cs.LG eess.SP

    Learned Nonlinear Predictor for Critically Sampled 3D Point Cloud Attribute Compression

    Authors: Tam Thuc Do, Philip A. Chou, Gene Cheung

    Abstract: We study 3D point cloud attribute compression via a volumetric approach: assuming point cloud geometry is known at both encoder and decoder, parameters $θ$ of a continuous attribute function $f: \mathbb{R}^3 \mapsto \mathbb{R}$ are quantized to $\hatθ$ and encoded, so that discrete samples $f_{\hatθ}(\mathbf{x}_i)$ can be recovered at known 3D points $\mathbf{x}_i \in \mathbb{R}^3$ at the decoder.… ▽ More

    Submitted 22 November, 2023; originally announced November 2023.

  29. arXiv:2311.13172  [pdf, other

    cs.CV

    Learning to Complement with Multiple Humans

    Authors: Zheng Zhang, Cuong Nguyen, Kevin Wells, Thanh-Toan Do, Gustavo Carneiro

    Abstract: Real-world image classification tasks tend to be complex, where expert labellers are sometimes unsure about the classes present in the images, leading to the issue of learning with noisy labels (LNL). The ill-posedness of the LNL task requires the adoption of strong assumptions or the use of multiple noisy labels per training image, resulting in accurate models that work well in isolation but fail… ▽ More

    Submitted 1 May, 2024; v1 submitted 22 November, 2023; originally announced November 2023.

    Comments: Under review

  30. An Application of Vector Autoregressive Model for Analyzing the Impact of Weather And Nearby Traffic Flow On The Traffic Volume

    Authors: Anh Thi-Hoang Nguyen, Dung Ha Nguyen, Trong-Hop Do

    Abstract: This paper aims to predict the traffic flow at one road segment based on nearby traffic volume and weather conditions. Our team also discover the impact of weather conditions and nearby traffic volume on the traffic flow at a target point. The analysis results will help solve the problem of traffic flow prediction and develop an optimal transport network with efficient traffic movement and minimal… ▽ More

    Submitted 12 November, 2023; originally announced November 2023.

    Comments: International Conference on Computing and Communication Technologies (RIVF2022)

    Report number: D1-2022-48

  31. arXiv:2310.18986  [pdf, other

    cs.CV

    Controllable Group Choreography using Contrastive Diffusion

    Authors: Nhat Le, Tuong Do, Khoa Do, Hien Nguyen, Erman Tjiputra, Quang D. Tran, Anh Nguyen

    Abstract: Music-driven group choreography poses a considerable challenge but holds significant potential for a wide range of industrial applications. The ability to generate synchronized and visually appealing group dance motions that are aligned with music opens up opportunities in many fields such as entertainment, advertising, and virtual performances. However, most of the recent works are not able to ge… ▽ More

    Submitted 3 November, 2023; v1 submitted 29 October, 2023; originally announced October 2023.

  32. VALID: A perceptually validated Virtual Avatar Library for Inclusion and Diversity

    Authors: Tiffany D. Do, Steve Zelenty, Mar Gonzalez-Franco, Ryan P. McMahan

    Abstract: As consumer adoption of immersive technologies grows, virtual avatars will play a prominent role in the future of social computing. However, as people begin to interact more frequently through virtual avatars, it is important to ensure that the research community has validated tools to evaluate the effects and consequences of such technologies. We present the first iteration of a new, freely avail… ▽ More

    Submitted 30 October, 2023; v1 submitted 19 September, 2023; originally announced September 2023.

    Journal ref: Frontiers in Virtual Reality 4 (2023)

  33. arXiv:2309.02902  [pdf, other

    cs.CL

    ViCGCN: Graph Convolutional Network with Contextualized Language Models for Social Media Mining in Vietnamese

    Authors: Chau-Thang Phan, Quoc-Nam Nguyen, Chi-Thanh Dang, Trong-Hop Do, Kiet Van Nguyen

    Abstract: Social media processing is a fundamental task in natural language processing with numerous applications. As Vietnamese social media and information science have grown rapidly, the necessity of information-based mining on Vietnamese social media has become crucial. However, state-of-the-art research faces several significant drawbacks, including imbalanced data and noisy data on social media platfo… ▽ More

    Submitted 6 September, 2023; originally announced September 2023.

  34. arXiv:2308.15660  [pdf, other

    cs.CV

    Unveiling Camouflage: A Learnable Fourier-based Augmentation for Camouflaged Object Detection and Instance Segmentation

    Authors: Minh-Quan Le, Minh-Triet Tran, Trung-Nghia Le, Tam V. Nguyen, Thanh-Toan Do

    Abstract: Camouflaged object detection (COD) and camouflaged instance segmentation (CIS) aim to recognize and segment objects that are blended into their surroundings, respectively. While several deep neural network models have been proposed to tackle those tasks, augmentation methods for COD and CIS have not been thoroughly explored. Augmentation strategies can help improve the performance of models by inc… ▽ More

    Submitted 29 August, 2023; originally announced August 2023.

  35. arXiv:2308.15005  [pdf, other

    cs.CV

    Few-Shot Object Detection via Synthetic Features with Optimal Transport

    Authors: Anh-Khoa Nguyen Vu, Thanh-Toan Do, Vinh-Tiep Nguyen, Tam Le, Minh-Triet Tran, Tam V. Nguyen

    Abstract: Few-shot object detection aims to simultaneously localize and classify the objects in an image with limited training samples. However, most existing few-shot object detection methods focus on extracting the features of a few samples of novel classes that lack diversity. Hence, they may not be sufficient to capture the data distribution. To address that limitation, in this paper, we propose a novel… ▽ More

    Submitted 29 August, 2023; v1 submitted 28 August, 2023; originally announced August 2023.

  36. arXiv:2308.11621  [pdf, other

    cs.NI cs.AI

    Reinforcement Learning -based Adaptation and Scheduling Methods for Multi-source DASH

    Authors: Nghia T. Nguyen, Long Luu, Phuong L. Vo, Thi Thanh Sang Nguyen, Cuong T. Do, Ngoc-thanh Nguyen

    Abstract: Dynamic adaptive streaming over HTTP (DASH) has been widely used in video streaming recently. In DASH, the client downloads video chunks in order from a server. The rate adaptation function at the video client enhances the user's quality-of-experience (QoE) by choosing a suitable quality level for each video chunk to download based on the network condition. Today networks such as content delivery… ▽ More

    Submitted 25 July, 2023; originally announced August 2023.

    Comments: 19 pages

    MSC Class: 14J60 (Primary) 14F05; 14J26 (Secondary) 14J60 (Primary) 14F05; 14J26 (Secondary) ACM Class: C.2.4; I.2.11

  37. arXiv:2307.04223  [pdf

    cs.CV cs.AI

    Real-time Human Detection in Fire Scenarios using Infrared and Thermal Imaging Fusion

    Authors: Truong-Dong Do, Nghe-Nhan Truong, My-Ha Le

    Abstract: Fire is considered one of the most serious threats to human lives which results in a high probability of fatalities. Those severe consequences stem from the heavy smoke emitted from a fire that mostly restricts the visibility of escaping victims and rescuing squad. In such hazardous circumstances, the use of a vision-based human detection system is able to improve the ability to save more lives. T… ▽ More

    Submitted 9 July, 2023; originally announced July 2023.

    Comments: 5 pages, 6 figures, 2 tables

  38. arXiv:2306.14418  [pdf, other

    cs.SE

    Context-Encoded Code Change Representation for Automated Commit Message Generation

    Authors: Thanh Trong Vu, Thanh-Dat Do, Hieu Dinh Vo

    Abstract: Changes in source code are an inevitable part of software development. They are the results of indispensable activities such as fixing bugs or improving functionality. Descriptions for code changes (commit messages) help people better understand the changes. However, due to a lack of motivation and time pressure, writing high-quality commit messages remains reluctantly considered. Several methods… ▽ More

    Submitted 26 June, 2023; originally announced June 2023.

    Comments: 16 pages

  39. arXiv:2306.10755  [pdf, other

    cs.CL

    Unsupervised Open-domain Keyphrase Generation

    Authors: Lam Thanh Do, Pritom Saha Akash, Kevin Chen-Chuan Chang

    Abstract: In this work, we study the problem of unsupervised open-domain keyphrase generation, where the objective is a keyphrase generation model that can be built without using human-labeled data and can perform consistently across domains. To solve this problem, we propose a seq2seq model that consists of two modules, namely \textit{phraseness} and \textit{informativeness} module, both of which can be bu… ▽ More

    Submitted 19 June, 2023; originally announced June 2023.

    Comments: Accepted to ACL 2023. arXiv admin note: text overlap with arXiv:1207.4169 by other authors

  40. arXiv:2306.09591  [pdf, other

    cs.CV cs.RO

    A Vision-based Autonomous Perching Approach for Nano Aerial Vehicles

    Authors: Truong-Dong Do, Sung Kyung Hong

    Abstract: Over the past decades, quadcopters have been investigated, due to their mobility and flexibility to operate in a wide range of environments. They have been used in various areas, including surveillance and monitoring. During a mission, drones do not have to remain active once they have reached a target location. To conserve energy and maintain a static position, it is possible to perch and stop th… ▽ More

    Submitted 15 June, 2023; originally announced June 2023.

    Comments: 6 pages, 6 figures, 2 tables. arXiv admin note: substantial text overlap with arXiv:2304.14838

  41. arXiv:2306.04178  [pdf, other

    cs.LG cs.CG

    Optimal Transport Model Distributional Robustness

    Authors: Van-Anh Nguyen, Trung Le, Anh Tuan Bui, Thanh-Toan Do, Dinh Phung

    Abstract: Distributional robustness is a promising framework for training deep learning models that are less vulnerable to adversarial examples and data distribution shifts. Previous works have mainly focused on exploiting distributional robustness in the data space. In this work, we explore an optimal transport-based distributional robustness framework in model spaces. Specifically, we examine a model dist… ▽ More

    Submitted 1 November, 2023; v1 submitted 7 June, 2023; originally announced June 2023.

    Comments: Accepted at NeurIPs 2023

    Journal ref: Advances in Neural Information Processing Systems, 2023

  42. arXiv:2305.19486  [pdf, other

    cs.CV

    Instance-dependent Noisy-label Learning with Graphical Model Based Noise-rate Estimation

    Authors: Arpit Garg, Cuong Nguyen, Rafael Felix, Thanh-Toan Do, Gustavo Carneiro

    Abstract: Deep learning faces a formidable challenge when handling noisy labels, as models tend to overfit samples affected by label noise. This challenge is further compounded by the presence of instance-dependent noise (IDN), a realistic form of label noise arising from ambiguous sample information. To address IDN, Label Noise Learning (LNL) incorporates a sample selection stage to differentiate clean and… ▽ More

    Submitted 4 July, 2024; v1 submitted 30 May, 2023; originally announced May 2023.

    Comments: ECCV 2024

  43. arXiv:2305.06042  [pdf, other

    cs.LG

    Blockwise Principal Component Analysis for monotone missing data imputation and dimensionality reduction

    Authors: Tu T. Do, Mai Anh Vu, Tuan L. Vo, Hoang Thien Ly, Thu Nguyen, Steven A. Hicks, Michael A. Riegler, Pål Halvorsen, Binh T. Nguyen

    Abstract: Monotone missing data is a common problem in data analysis. However, imputation combined with dimensionality reduction can be computationally expensive, especially with the increasing size of datasets. To address this issue, we propose a Blockwise principal component analysis Imputation (BPI) framework for dimensionality reduction and imputation of monotone missing data. The framework conducts Pri… ▽ More

    Submitted 10 January, 2024; v1 submitted 10 May, 2023; originally announced May 2023.

  44. arXiv:2304.14838  [pdf, other

    eess.SY cs.CV cs.RO

    Vision-based Target Pose Estimation with Multiple Markers for the Perching of UAVs

    Authors: Truong-Dong Do, Nguyen Xuan-Mung, Sung-Kyung Hong

    Abstract: Autonomous Nano Aerial Vehicles have been increasingly popular in surveillance and monitoring operations due to their efficiency and maneuverability. Once a target location has been reached, drones do not have to remain active during the mission. It is possible for the vehicle to perch and stop its motors in such situations to conserve energy, as well as maintain a static position in unfavorable f… ▽ More

    Submitted 25 April, 2023; originally announced April 2023.

    Comments: 5 pages, 6 figures, 2 tables

  45. arXiv:2304.08396  [pdf, other

    cs.SE

    Code-centric Learning-based Just-In-Time Vulnerability Detection

    Authors: Son Nguyen, Thu-Trang Nguyen, Thanh Trong Vu, Thanh-Dat Do, Kien-Tuan Ngo, Hieu Dinh Vo

    Abstract: Attacks against computer systems exploiting software vulnerabilities can cause substantial damage to the cyber-infrastructure of our modern society and economy. To minimize the consequences, it is vital to detect and fix vulnerabilities as soon as possible. Just-in-time vulnerability detection (JIT-VD) discovers vulnerability-prone ("dangerous") commits to prevent them from being merged into sourc… ▽ More

    Submitted 17 April, 2023; originally announced April 2023.

  46. Instance-level Few-shot Learning with Class Hierarchy Mining

    Authors: Anh-Khoa Nguyen Vu, Thanh-Toan Do, Nhat-Duy Nguyen, Vinh-Tiep Nguyen, Thanh Duc Ngo, Tam V. Nguyen

    Abstract: Few-shot learning is proposed to tackle the problem of scarce training data in novel classes. However, prior works in instance-level few-shot learning have paid less attention to effectively utilizing the relationship between categories. In this paper, we exploit the hierarchical information to leverage discriminative and relevant features of base classes to effectively classify novel objects. The… ▽ More

    Submitted 14 April, 2023; originally announced April 2023.

    Comments: accepted by IEEE Transactions on Image Processing

  47. arXiv:2304.07444  [pdf, other

    cs.CV

    The Art of Camouflage: Few-shot Learning for Animal Detection and Segmentation

    Authors: Thanh-Danh Nguyen, Anh-Khoa Nguyen Vu, Nhat-Duy Nguyen, Vinh-Tiep Nguyen, Thanh Duc Ngo, Thanh-Toan Do, Minh-Triet Tran, Tam V. Nguyen

    Abstract: Camouflaged object detection and segmentation is a new and challenging research topic in computer vision. There is a serious issue of lacking data of camouflaged objects such as camouflaged animals in natural scenes. In this paper, we address the problem of few-shot learning for camouflaged object detection and segmentation. To this end, we first collect a new dataset, CAMO-FS, for the benchmark.… ▽ More

    Submitted 21 January, 2024; v1 submitted 14 April, 2023; originally announced April 2023.

    Comments: Under-review Journal

  48. arXiv:2304.06053  [pdf, other

    cs.CV

    TextANIMAR: Text-based 3D Animal Fine-Grained Retrieval

    Authors: Trung-Nghia Le, Tam V. Nguyen, Minh-Quan Le, Trong-Thuan Nguyen, Viet-Tham Huynh, Trong-Le Do, Khanh-Duy Le, Mai-Khiem Tran, Nhat Hoang-Xuan, Thang-Long Nguyen-Ho, Vinh-Tiep Nguyen, Tuong-Nghiem Diep, Khanh-Duy Ho, Xuan-Hieu Nguyen, Thien-Phuc Tran, Tuan-Anh Yang, Kim-Phat Tran, Nhu-Vinh Hoang, Minh-Quang Nguyen, E-Ro Nguyen, Minh-Khoi Nguyen-Nhat, Tuan-An To, Trung-Truc Huynh-Le, Nham-Tan Nguyen, Hoang-Chau Luong , et al. (8 additional authors not shown)

    Abstract: 3D object retrieval is an important yet challenging task that has drawn more and more attention in recent years. While existing approaches have made strides in addressing this issue, they are often limited to restricted settings such as image and sketch queries, which are often unfriendly interactions for common users. In order to overcome these limitations, this paper presents a novel SHREC chall… ▽ More

    Submitted 9 August, 2023; v1 submitted 12 April, 2023; originally announced April 2023.

    Comments: Accepted to Computers and Graphics (3DOR, Journal Track)

  49. arXiv:2304.05731  [pdf, other

    cs.CV

    SketchANIMAR: Sketch-based 3D Animal Fine-Grained Retrieval

    Authors: Trung-Nghia Le, Tam V. Nguyen, Minh-Quan Le, Trong-Thuan Nguyen, Viet-Tham Huynh, Trong-Le Do, Khanh-Duy Le, Mai-Khiem Tran, Nhat Hoang-Xuan, Thang-Long Nguyen-Ho, Vinh-Tiep Nguyen, Nhat-Quynh Le-Pham, Huu-Phuc Pham, Trong-Vu Hoang, Quang-Binh Nguyen, Trong-Hieu Nguyen-Mau, Tuan-Luc Huynh, Thanh-Danh Le, Ngoc-Linh Nguyen-Ha, Tuong-Vy Truong-Thuy, Truong Hoai Phong, Tuong-Nghiem Diep, Khanh-Duy Ho, Xuan-Hieu Nguyen, Thien-Phuc Tran , et al. (9 additional authors not shown)

    Abstract: The retrieval of 3D objects has gained significant importance in recent years due to its broad range of applications in computer vision, computer graphics, virtual reality, and augmented reality. However, the retrieval of 3D objects presents significant challenges due to the intricate nature of 3D models, which can vary in shape, size, and texture, and have numerous polygons and vertices. To this… ▽ More

    Submitted 9 August, 2023; v1 submitted 12 April, 2023; originally announced April 2023.

    Comments: Accepted to Computers & Graphics (3DOR 2023, Journal track)

  50. arXiv:2304.00335  [pdf, other

    eess.SP cs.CV

    Volumetric Attribute Compression for 3D Point Clouds using Feedforward Network with Geometric Attention

    Authors: Tam Thuc Do, Philip A. Chou, Gene Cheung

    Abstract: We study 3D point cloud attribute compression using a volumetric approach: given a target volumetric attribute function $f : \mathbb{R}^3 \rightarrow \mathbb{R}$, we quantize and encode parameter vector $θ$ that characterizes $f$ at the encoder, for reconstruction $f_{\hatθ}(\mathbf{x})$ at known 3D points $\mathbf{x}$'s at the decoder. Extending a previous work Region Adaptive Hierarchical Transf… ▽ More

    Submitted 1 April, 2023; originally announced April 2023.