Skip to main content

Showing 1–50 of 249 results for author: Nguyen, C

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.03788  [pdf, other

    cs.CV cs.CL

    Meta-optimized Angular Margin Contrastive Framework for Video-Language Representation Learning

    Authors: Thong Nguyen, Yi Bin, Xiaobao Wu, Xinshuai Dong, Zhiyuan Hu, Khoi Le, Cong-Duy Nguyen, See-Kiong Ng, Luu Anh Tuan

    Abstract: Data quality stands at the forefront of deciding the effectiveness of video-language representation learning. However, video-text pairs in previous data typically do not align perfectly with each other, which might lead to video-language representations that do not accurately reflect cross-modal semantics. Moreover, previous data also possess an uneven distribution of concepts, thereby hampering t… ▽ More

    Submitted 4 July, 2024; originally announced July 2024.

    Comments: Accepted to ECCV 2024

  2. arXiv:2407.02721  [pdf, ps, other

    cs.LG cs.CV

    Model and Feature Diversity for Bayesian Neural Networks in Mutual Learning

    Authors: Cuong Pham, Cuong C. Nguyen, Trung Le, Dinh Phung, Gustavo Carneiro, Thanh-Toan Do

    Abstract: Bayesian Neural Networks (BNNs) offer probability distributions for model parameters, enabling uncertainty quantification in predictions. However, they often underperform compared to deterministic neural networks. Utilizing mutual learning can effectively enhance the performance of peer BNNs. In this paper, we propose a novel approach to improve BNNs performance through deep mutual learning. The p… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: Accepted to NeurIPS 2023

  3. arXiv:2407.02662  [pdf, other

    cs.SI cs.CL cs.CY

    Supporters and Skeptics: LLM-based Analysis of Engagement with Mental Health (Mis)Information Content on Video-sharing Platforms

    Authors: Viet Cuong Nguyen, Mini Jain, Abhijat Chauhan, Heather Jaime Soled, Santiago Alvarez Lesmes, Zihang Li, Michael L. Birnbaum, Sunny X. Tang, Srijan Kumar, Munmun De Choudhury

    Abstract: Over one in five adults in the US lives with a mental illness. In the face of a shortage of mental health professionals and offline resources, online short-form video content has grown to serve as a crucial conduit for disseminating mental health help and resources. However, the ease of content creation and access also contributes to the spread of misinformation, posing risks to accurate diagnosis… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: 12 pages, in submission to ICWSM

  4. arXiv:2407.02264  [pdf, other

    cs.CV cs.SD eess.AS

    SOAF: Scene Occlusion-aware Neural Acoustic Field

    Authors: Huiyu Gao, Jiahao Ma, David Ahmedt-Aristizabal, Chuong Nguyen, Miaomiao Liu

    Abstract: This paper tackles the problem of novel view audio-visual synthesis along an arbitrary trajectory in an indoor scene, given the audio-video recordings from other known trajectories of the scene. Existing methods often overlook the effect of room geometry, particularly wall occlusion to sound propagation, making them less accurate in multi-room environments. In this work, we propose a new approach… ▽ More

    Submitted 2 July, 2024; v1 submitted 2 July, 2024; originally announced July 2024.

  5. arXiv:2407.00535  [pdf, other

    cs.CE cs.CV

    AI-powered multimodal modeling of personalized hemodynamics in aortic stenosis

    Authors: Caglar Ozturk, Daniel H. Pak, Luca Rosalia, Debkalpa Goswami, Mary E. Robakowski, Raymond McKay, Christopher T. Nguyen, James S. Duncan, Ellen T. Roche

    Abstract: Aortic stenosis (AS) is the most common valvular heart disease in developed countries. High-fidelity preclinical models can improve AS management by enabling therapeutic innovation, early diagnosis, and tailored treatment planning. However, their use is currently limited by complex workflows necessitating lengthy expert-driven manual operations. Here, we propose an AI-powered computational framewo… ▽ More

    Submitted 29 June, 2024; originally announced July 2024.

    Comments: CO and DHP contributed equally to this work. JSD and ETR are corresponding authors

  6. arXiv:2406.05615  [pdf, other

    cs.CL

    Video-Language Understanding: A Survey from Model Architecture, Model Training, and Data Perspectives

    Authors: Thong Nguyen, Yi Bin, Junbin Xiao, Leigang Qu, Yicong Li, Jay Zhangjie Wu, Cong-Duy Nguyen, See-Kiong Ng, Luu Anh Tuan

    Abstract: Humans use multiple senses to comprehend the environment. Vision and language are two of the most vital senses since they allow us to easily communicate our thoughts and perceive the world around us. There has been a lot of interest in creating video-language understanding systems with human-like senses since a video-language pair can mimic both our linguistic medium and visual environment with te… ▽ More

    Submitted 1 July, 2024; v1 submitted 8 June, 2024; originally announced June 2024.

    Comments: Accepted at ACL 2024 (Findings)

  7. arXiv:2406.03820  [pdf, other

    cs.NI cs.AI cs.CR cs.ET cs.LG

    A Survey on Intelligent Internet of Things: Applications, Security, Privacy, and Future Directions

    Authors: Ons Aouedi, Thai-Hoc Vu, Alessio Sacco, Dinh C. Nguyen, Kandaraj Piamrat, Guido Marchetto, Quoc-Viet Pham

    Abstract: The rapid advances in the Internet of Things (IoT) have promoted a revolution in communication technology and offered various customer services. Artificial intelligence (AI) techniques have been exploited to facilitate IoT operations and maximize their potential in modern application scenarios. In particular, the convergence of IoT and AI has led to a new networking paradigm called Intelligent IoT… ▽ More

    Submitted 21 June, 2024; v1 submitted 6 June, 2024; originally announced June 2024.

    Comments: This work has been accepted by IEEE Communications Surveys & Tutorials

  8. arXiv:2405.19723  [pdf, other

    cs.CV cs.AI

    Encoding and Controlling Global Semantics for Long-form Video Question Answering

    Authors: Thong Thanh Nguyen, Zhiyuan Hu, Xiaobao Wu, Cong-Duy T Nguyen, See-Kiong Ng, Anh Tuan Luu

    Abstract: Seeking answers effectively for long videos is essential to build video question answering (videoQA) systems. Previous methods adaptively select frames and regions from long videos to save computations. However, this fails to reason over the whole sequence of video, leading to sub-optimal performance. To address this problem, we introduce a state space layer (SSL) into multi-modal Transformer to e… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: Work in progress

  9. arXiv:2405.14442  [pdf, other

    cs.ET nlin.CD

    Fully parallel implementation of digital memcomputing on FPGA

    Authors: Dyk Chung Nguyen, Yuriy V. Pershin

    Abstract: We present a fully parallel digital memcomputing solver implemented on a field-programmable gate array (FPGA) board. For this purpose, we have designed an FPGA code that solves the ordinary differential equations associated with digital memcomputing in parallel. A feature of the code is the use of only integer-type variables and integer constants to enhance optimization. Consequently, each integra… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

  10. arXiv:2405.08542  [pdf, other

    cs.CE

    Industrial Metaverse: Enabling Technologies, Open Problems, and Future Trends

    Authors: Shiying Zhang, Jun Li, Long Shi, Ming Ding, Dinh C. Nguyen, Wen Chen, Zhu Han

    Abstract: As an emerging technology that enables seamless integration between the physical and virtual worlds, the Metaverse has great potential to be deployed in the industrial production field with the development of extended reality (XR) and next-generation communication networks. This deployment, called the Industrial Metaverse, is used for product design, production operations, industrial quality inspe… ▽ More

    Submitted 14 May, 2024; originally announced May 2024.

    Comments: 26 pages, 8 figures

  11. arXiv:2404.18960  [pdf

    q-bio.QM cs.LG

    Leak Proof CMap; a framework for training and evaluation of cell line agnostic L1000 similarity methods

    Authors: Steven Shave, Richard Kasprowicz, Abdullah M. Athar, Denise Vlachou, Neil O. Carragher, Cuong Q. Nguyen

    Abstract: The Connectivity Map (CMap) is a large publicly available database of cellular transcriptomic responses to chemical and genetic perturbations built using a standardized acquisition protocol known as the L1000 technique. Databases such as CMap provide an exciting opportunity to enrich drug discovery efforts, providing a 'known' phenotypic landscape to explore and enabling the development of state o… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

  12. arXiv:2404.14908  [pdf, other

    cs.CV

    Mining Supervision for Dynamic Regions in Self-Supervised Monocular Depth Estimation

    Authors: Hoang Chuong Nguyen, Tianyu Wang, Jose M. Alvarez, Miaomiao Liu

    Abstract: This paper focuses on self-supervised monocular depth estimation in dynamic scenes trained on monocular videos. Existing methods jointly estimate pixel-wise depth and motion, relying mainly on an image reconstruction loss. Dynamic regions1 remain a critical challenge for these methods due to the inherent ambiguity in depth and motion estimation, resulting in inaccurate depth estimation. This paper… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

    Comments: Accepted to CVPR2024

  13. arXiv:2404.14044  [pdf, other

    cs.CV

    HashPoint: Accelerated Point Searching and Sampling for Neural Rendering

    Authors: Jiahao Ma, Miaomiao Liu, David Ahmedt-Aristizaba, Chuong Nguyen

    Abstract: In this paper, we address the problem of efficient point searching and sampling for volume neural rendering. Within this realm, two typical approaches are employed: rasterization and ray tracing. The rasterization-based methods enable real-time rendering at the cost of increased memory and lower fidelity. In contrast, the ray-tracing-based methods yield superior quality but demand longer rendering… ▽ More

    Submitted 11 May, 2024; v1 submitted 22 April, 2024; originally announced April 2024.

    Comments: CVPR2024 Highlight

    Journal ref: The IEEE/CVF Conference on Computer Vision and Pattern Recognition 2024

  14. arXiv:2404.11792  [pdf, other

    cs.AI

    Enhancing Q&A with Domain-Specific Fine-Tuning and Iterative Reasoning: A Comparative Study

    Authors: Zooey Nguyen, Anthony Annunziata, Vinh Luong, Sang Dinh, Quynh Le, Anh Hai Ha, Chanh Le, Hong An Phan, Shruti Raghavan, Christopher Nguyen

    Abstract: This paper investigates the impact of domain-specific model fine-tuning and of reasoning mechanisms on the performance of question-answering (Q&A) systems powered by large language models (LLMs) and Retrieval-Augmented Generation (RAG). Using the FinanceBench SEC financial filings dataset, we observe that, for RAG, combining a fine-tuned embedding model with a fine-tuned LLM achieves better accura… ▽ More

    Submitted 19 April, 2024; v1 submitted 17 April, 2024; originally announced April 2024.

    Comments: Fixed typo of OODA's score on harder-question set in Table 2

  15. arXiv:2404.07626  [pdf, other

    cs.CV

    Homography Guided Temporal Fusion for Road Line and Marking Segmentation

    Authors: Shan Wang, Chuong Nguyen, Jiawei Liu, Kaihao Zhang, Wenhan Luo, Yanhao Zhang, Sundaram Muthu, Fahira Afzal Maken, Hongdong Li

    Abstract: Reliable segmentation of road lines and markings is critical to autonomous driving. Our work is motivated by the observations that road lines and markings are (1) frequently occluded in the presence of moving vehicles, shadow, and glare and (2) highly structured with low intra-class shape variance and overall high appearance consistency. To solve these issues, we propose a Homography Guided Fusion… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

    Comments: Accepted by ICCV 2023

  16. arXiv:2403.19443  [pdf, other

    cs.CL

    Mixed Preference Optimization: Reinforcement Learning with Data Selection and Better Reference Model

    Authors: Qi Gou, Cam-Tu Nguyen

    Abstract: Large Language Models (LLMs) have become increasingly popular due to their ability to process and generate natural language. However, as they are trained on massive datasets of text, LLMs can inherit harmful biases and produce outputs that are not aligned with human values. This paper studies two main approaches to LLM alignment: Reinforcement Learning with Human Feedback (RLHF) and contrastive le… ▽ More

    Submitted 28 March, 2024; originally announced March 2024.

  17. arXiv:2403.17486  [pdf, other

    cs.CL

    KDMCSE: Knowledge Distillation Multimodal Sentence Embeddings with Adaptive Angular margin Contrastive Learning

    Authors: Cong-Duy Nguyen, Thong Nguyen, Xiaobao Wu, Anh Tuan Luu

    Abstract: Previous work on multimodal sentence embedding has proposed multimodal contrastive learning and achieved promising results. However, by taking the rest of the batch as negative samples without reviewing when forming contrastive pairs, those studies encountered many suspicious and noisy negative examples, significantly affecting the methods' overall performance. In this work, we propose KDMCSE (Kno… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

    Comments: Accepted to NAACL 2024

  18. arXiv:2403.07763  [pdf, other

    cs.NI cs.ET

    Emerging Technologies for 6G Non-Terrestrial-Networks: From Academia to Industrial Applications

    Authors: Cong T. Nguyen, Yuris Mulya Saputra, Nguyen Van Huynh, Tan N. Nguyen, Dinh Thai Hoang, Diep N Nguyen, Van-Quan Pham, Miroslav Voznak, Symeon Chatzinotas, Dinh-Hieu Tran

    Abstract: Terrestrial networks form the fundamental infrastructure of modern communication systems, serving more than 4 billion users globally. However, terrestrial networks are facing a wide range of challenges, from coverage and reliability to interference and congestion. As the demands of the 6G era are expected to be much higher, it is crucial to address these challenges to ensure a robust and efficient… ▽ More

    Submitted 3 July, 2024; v1 submitted 12 March, 2024; originally announced March 2024.

    Comments: 35 pages

  19. arXiv:2402.18998  [pdf, other

    cs.CV

    COFT-AD: COntrastive Fine-Tuning for Few-Shot Anomaly Detection

    Authors: Jingyi Liao, Xun Xu, Manh Cuong Nguyen, Adam Goodge, Chuan Sheng Foo

    Abstract: Existing approaches towards anomaly detection~(AD) often rely on a substantial amount of anomaly-free data to train representation and density models. However, large anomaly-free datasets may not always be available before the inference stage; in which case an anomaly detection model must be trained with only a handful of normal samples, a.k.a. few-shot anomaly detection (FSAD). In this paper, we… ▽ More

    Submitted 29 February, 2024; originally announced February 2024.

    Comments: IEEE Transactions on Image Processing

  20. arXiv:2402.17269  [pdf, other

    cs.LG

    Curriculum Learning Meets Directed Acyclic Graph for Multimodal Emotion Recognition

    Authors: Cam-Van Thi Nguyen, Cao-Bach Nguyen, Quang-Thuy Ha, Duc-Trong Le

    Abstract: Emotion recognition in conversation (ERC) is a crucial task in natural language processing and affective computing. This paper proposes MultiDAG+CL, a novel approach for Multimodal Emotion Recognition in Conversation (ERC) that employs Directed Acyclic Graph (DAG) to integrate textual, acoustic, and visual features within a unified framework. The model is enhanced by Curriculum Learning (CL) to ad… ▽ More

    Submitted 8 March, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

    Comments: Accepted by LREC-COLING 2024

  21. arXiv:2402.15677  [pdf, other

    eess.SY cs.MA

    Consensus seeking in diffusive multidimensional networks with a repeated interaction pattern and time-delays

    Authors: Hoang Huy Vu, Quyen Ngoc Nguyen, Chuong Van Nguyen, Tuynh Van Pham, Minh Hoang Trinh

    Abstract: This paper studies a consensus problem in multidimensional networks having the same agent-to-agent interaction pattern under both intra- and cross-layer time delays. Several conditions for the agents to globally asymptotically achieve a consensus are derived, which involve the overall network's structure, the local interacting pattern, and the values of the time delays. The validity of these condi… ▽ More

    Submitted 23 February, 2024; originally announced February 2024.

    Comments: 6 pages, 7 figures, submitted to a journal

  22. arXiv:2402.13549  [pdf, ps, other

    cs.IT eess.SY

    Q-learning-based Joint Design of Adaptive Modulation and Precoding for Physical Layer Security in Visible Light Communications

    Authors: Duc M. T. Hoang, Thanh V. Pham, Anh T. Pham, Chuyen T Nguyen

    Abstract: There has been an increasing interest in physical layer security (PLS), which, compared with conventional cryptography, offers a unique approach to guaranteeing information confidentiality against eavesdroppers. In this paper, we study a joint design of adaptive $M$-ary pulse amplitude modulation (PAM) and precoding, which aims to optimize wiretap visible-light channels' secrecy capacity and bit e… ▽ More

    Submitted 21 February, 2024; originally announced February 2024.

  23. arXiv:2402.12503  [pdf, other

    cs.LG

    PARCv2: Physics-aware Recurrent Convolutional Neural Networks for Spatiotemporal Dynamics Modeling

    Authors: Phong C. H. Nguyen, Xinlun Cheng, Shahab Azarfar, Pradeep Seshadri, Yen T. Nguyen, Munho Kim, Sanghun Choi, H. S. Udaykumar, Stephen Baek

    Abstract: Modeling unsteady, fast transient, and advection-dominated physics problems is a pressing challenge for physics-aware deep learning (PADL). The physics of complex systems is governed by large systems of partial differential equations (PDEs) and ancillary constitutive models with nonlinear structures, as well as evolving state fields exhibiting sharp gradients and rapidly deforming material interfa… ▽ More

    Submitted 24 May, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

  24. arXiv:2402.07577  [pdf, other

    cs.CL

    Topic Modeling as Multi-Objective Contrastive Optimization

    Authors: Thong Nguyen, Xiaobao Wu, Xinshuai Dong, Cong-Duy T Nguyen, See-Kiong Ng, Anh Tuan Luu

    Abstract: Recent representation learning approaches enhance neural topic models by optimizing the weighted linear combination of the evidence lower bound (ELBO) of the log-likelihood and the contrastive learning objective that contrasts pairs of input documents. However, document-level contrastive learning might capture low-level mutual information, such as word ratio, which disturbs topic modeling. Moreove… ▽ More

    Submitted 9 March, 2024; v1 submitted 12 February, 2024; originally announced February 2024.

    Comments: Accepted at ICLR 2024 (poster)

  25. arXiv:2402.06682  [pdf, other

    cs.CR cs.AI cs.DC cs.LG

    Private Knowledge Sharing in Distributed Learning: A Survey

    Authors: Yasas Supeksala, Dinh C. Nguyen, Ming Ding, Thilina Ranbaduge, Calson Chua, Jun Zhang, Jun Li, H. Vincent Poor

    Abstract: The rise of Artificial Intelligence (AI) has revolutionized numerous industries and transformed the way society operates. Its widespread use has led to the distribution of AI and its underlying data across many intelligent systems. In this light, it is crucial to utilize information in learning processes that are either distributed or owned by different entities. As a result, modern data-driven se… ▽ More

    Submitted 8 February, 2024; originally announced February 2024.

    Comments: Manuscript submitted to ACM

  26. arXiv:2402.03832  [pdf, other

    cs.CL

    Rethinking Skill Extraction in the Job Market Domain using Large Language Models

    Authors: Khanh Cao Nguyen, Mike Zhang, Syrielle Montariol, Antoine Bosselut

    Abstract: Skill Extraction involves identifying skills and qualifications mentioned in documents such as job postings and resumes. The task is commonly tackled by training supervised models using a sequence labeling approach with BIO tags. However, the reliance on manually annotated data limits the generalizability of such approaches. Moreover, the common BIO setting limits the ability of the models to capt… ▽ More

    Submitted 6 February, 2024; originally announced February 2024.

    Comments: Published at NLP4HR 2024 (EACL Workshop)

  27. arXiv:2402.02319  [pdf

    cs.RO

    Smart Textile-Driven Soft Spine Exosuit for Lifting Tasks in Industrial Applications

    Authors: Kefan Zhu, Bibhu Sharma, Phuoc Thien Phan, James Davies, Mai Thanh Thai, Trung Thien Hoang, Chi Cong Nguyen, Adrienne Ji, Emanuele Nicotra, Nigel H. Lovell, Thanh Nho Do

    Abstract: Work related musculoskeletal disorders (WMSDs) are often caused by repetitive lifting, making them a significant concern in occupational health. Although wearable assist devices have become the norm for mitigating the risk of back pain, most spinal assist devices still possess a partially rigid structure that impacts the user comfort and flexibility. This paper addresses this issue by presenting a… ▽ More

    Submitted 3 February, 2024; originally announced February 2024.

    Comments: 6 pages, 7 figures

  28. arXiv:2402.02021  [pdf, other

    cs.LG cs.CV

    Transfer Learning in ECG Diagnosis: Is It Effective?

    Authors: Cuong V. Nguyen, Cuong D. Do

    Abstract: The adoption of deep learning in ECG diagnosis is often hindered by the scarcity of large, well-labeled datasets in real-world scenarios, leading to the use of transfer learning to leverage features learned from larger datasets. Yet the prevailing assumption that transfer learning consistently outperforms training from scratch has never been systematically validated. In this study, we conduct the… ▽ More

    Submitted 26 June, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

  29. arXiv:2401.17897  [pdf, ps, other

    cs.CL

    Employing Label Models on ChatGPT Answers Improves Legal Text Entailment Performance

    Authors: Chau Nguyen, Le-Minh Nguyen

    Abstract: The objective of legal text entailment is to ascertain whether the assertions in a legal query logically follow from the information provided in one or multiple legal articles. ChatGPT, a large language model, is robust in many natural language processing tasks, including legal text entailment: when we set the temperature = 0 (the ChatGPT answers are deterministic) and prompt the model, it achieve… ▽ More

    Submitted 31 January, 2024; originally announced January 2024.

    Comments: 15 pages

  30. arXiv:2401.15625  [pdf, other

    cs.CR cs.AI

    Generative AI-enabled Blockchain Networks: Fundamentals, Applications, and Case Study

    Authors: Cong T. Nguyen, Yinqiu Liu, Hongyang Du, Dinh Thai Hoang, Dusit Niyato, Diep N. Nguyen, Shiwen Mao

    Abstract: Generative Artificial Intelligence (GAI) has recently emerged as a promising solution to address critical challenges of blockchain technology, including scalability, security, privacy, and interoperability. In this paper, we first introduce GAI techniques, outline their applications, and discuss existing solutions for integrating GAI into blockchains. Then, we discuss emerging solutions that demon… ▽ More

    Submitted 28 January, 2024; originally announced January 2024.

  31. arXiv:2401.14420  [pdf, other

    cs.CR

    A Novel Blockchain Based Information Management Framework for Web 3.0

    Authors: Md Arif Hassan, Cong T. Nguyen, Chi-Hieu Nguyen, Dinh Thai Hoang, Diep N. Nguyen, Eryk Dutkiewicz

    Abstract: Web 3.0 is the third generation of the World Wide Web (WWW), concentrating on the critical concepts of decentralization, availability, and increasing client usability. Although Web 3.0 is undoubtedly an essential component of the future Internet, it currently faces critical challenges, including decentralized data collection and management. To overcome these challenges, blockchain has emerged as o… ▽ More

    Submitted 23 January, 2024; originally announced January 2024.

  32. arXiv:2401.14113  [pdf, other

    cs.CL

    On the Affinity, Rationality, and Diversity of Hierarchical Topic Modeling

    Authors: Xiaobao Wu, Fengjun Pan, Thong Nguyen, Yichao Feng, Chaoqun Liu, Cong-Duy Nguyen, Anh Tuan Luu

    Abstract: Hierarchical topic modeling aims to discover latent topics from a corpus and organize them into a hierarchy to understand documents with desirable semantic granularity. However, existing work struggles with producing topic hierarchies of low affinity, rationality, and diversity, which hampers document understanding. To overcome these challenges, we in this paper propose Transport Plan and Context-… ▽ More

    Submitted 31 January, 2024; v1 submitted 25 January, 2024; originally announced January 2024.

    Comments: Accepted to AAAI2024 conference. Our code is available at https://github.com/bobxwu/TraCo

  33. arXiv:2401.10901  [pdf, other

    cs.CY

    Enabling Technologies for Web 3.0: A Comprehensive Survey

    Authors: Md Arif Hassan, Mohammad Behdad Jamshidi, Bui Duc Manh, Nam H. Chu, Chi-Hieu Nguyen, Nguyen Quang Hieu, Cong T. Nguyen, Dinh Thai Hoang, Diep N. Nguyen, Nguyen Van Huynh, Mohammad Abu Alsheikh, Eryk Dutkiewicz

    Abstract: Web 3.0 represents the next stage of Internet evolution, aiming to empower users with increased autonomy, efficiency, quality, security, and privacy. This evolution can potentially democratize content access by utilizing the latest developments in enabling technologies. In this paper, we conduct an in-depth survey of enabling technologies in the context of Web 3.0, such as blockchain, semantic web… ▽ More

    Submitted 29 December, 2023; originally announced January 2024.

  34. arXiv:2401.08723  [pdf, other

    cs.CR cs.CV cs.DC cs.LG

    HierSFL: Local Differential Privacy-aided Split Federated Learning in Mobile Edge Computing

    Authors: Minh K. Quan, Dinh C. Nguyen, Van-Dinh Nguyen, Mayuri Wijayasundara, Sujeeva Setunge, Pubudu N. Pathirana

    Abstract: Federated Learning is a promising approach for learning from user data while preserving data privacy. However, the high requirements of the model training process make it difficult for clients with limited memory or bandwidth to participate. To tackle this problem, Split Federated Learning is utilized, where clients upload their intermediate model training outcomes to a cloud server for collaborat… ▽ More

    Submitted 16 January, 2024; originally announced January 2024.

    Comments: 6 Pages, 5 figures, IEEE Virtual Conference on Communications 2023

  35. arXiv:2401.03917  [pdf, other

    cs.MS

    Toward a comprehensive simulation framework for hypergraphs: a Python-base approach

    Authors: Quoc Chuong Nguyen, Trung Kien Le

    Abstract: Hypergraphs, or generalization of graphs such that edges can contain more than two nodes, have become increasingly prominent in understanding complex network analysis. Unlike graphs, hypergraphs have relatively few supporting platforms, and such dearth presents a barrier to more widespread adaptation of hypergraph computational toolboxes that could enable further research in several areas. Here, w… ▽ More

    Submitted 8 January, 2024; originally announced January 2024.

    Comments: 13 pages, 3 figures

  36. arXiv:2401.03551  [pdf, other

    cs.CL cs.IR

    CAPTAIN at COLIEE 2023: Efficient Methods for Legal Information Retrieval and Entailment Tasks

    Authors: Chau Nguyen, Phuong Nguyen, Thanh Tran, Dat Nguyen, An Trieu, Tin Pham, Anh Dang, Le-Minh Nguyen

    Abstract: The Competition on Legal Information Extraction/Entailment (COLIEE) is held annually to encourage advancements in the automatic processing of legal texts. Processing legal documents is challenging due to the intricate structure and meaning of legal language. In this paper, we outline our strategies for tackling Task 2, Task 3, and Task 4 in the COLIEE 2023 competition. Our approach involved utiliz… ▽ More

    Submitted 7 January, 2024; originally announced January 2024.

  37. arXiv:2401.00165  [pdf, other

    cs.CL

    Mitigating the Impact of False Negatives in Dense Retrieval with Contrastive Confidence Regularization

    Authors: Shiqi Wang, Yeqin Zhang, Cam-Tu Nguyen

    Abstract: In open-domain Question Answering (QA), dense retrieval is crucial for finding relevant passages for answer generation. Typically, contrastive learning is used to train a retrieval model that maps passages and queries to the same semantic space. The objective is to make similar ones closer and dissimilar ones further apart. However, training such a system is challenging due to the false negative i… ▽ More

    Submitted 13 January, 2024; v1 submitted 30 December, 2023; originally announced January 2024.

    Comments: Accepted by AAAI24

  38. arXiv:2312.06950  [pdf, other

    cs.CV cs.CL

    READ-PVLA: Recurrent Adapter with Partial Video-Language Alignment for Parameter-Efficient Transfer Learning in Low-Resource Video-Language Modeling

    Authors: Thong Nguyen, Xiaobao Wu, Xinshuai Dong, Khoi Le, Zhiyuan Hu, Cong-Duy Nguyen, See-Kiong Ng, Luu Anh Tuan

    Abstract: Fully fine-tuning pretrained large-scale transformer models has become a popular paradigm for video-language modeling tasks, such as temporal language grounding and video-language summarization. With a growing number of tasks and limited training data, such full fine-tuning approach leads to costly model storage and unstable training. To overcome these shortcomings, we introduce lightweight adapte… ▽ More

    Submitted 11 December, 2023; originally announced December 2023.

    Comments: Accepted at AAAI 2024

  39. arXiv:2312.02549  [pdf, other

    cs.CV cs.CL

    DemaFormer: Damped Exponential Moving Average Transformer with Energy-Based Modeling for Temporal Language Grounding

    Authors: Thong Nguyen, Xiaobao Wu, Xinshuai Dong, Cong-Duy Nguyen, See-Kiong Ng, Luu Anh Tuan

    Abstract: Temporal Language Grounding seeks to localize video moments that semantically correspond to a natural language query. Recent advances employ the attention mechanism to learn the relations between video moments and the text query. However, naive attention might not be able to appropriately capture such relations, resulting in ineffective distributions where target video moments are difficult to sep… ▽ More

    Submitted 5 December, 2023; originally announced December 2023.

    Comments: Accepted at EMNLP 2023 (Findings)

  40. arXiv:2312.02541  [pdf, other

    eess.IV cs.CV

    Explainable Severity ranking via pairwise n-hidden comparison: a case study of glaucoma

    Authors: Hong Nguyen, Cuong V. Nguyen, Shrikanth Narayanan, Benjamin Y. Xu, Michael Pazzani

    Abstract: Primary open-angle glaucoma (POAG) is a chronic and progressive optic nerve condition that results in an acquired loss of optic nerve fibers and potential blindness. The gradual onset of glaucoma results in patients progressively losing their vision without being consciously aware of the changes. To diagnose POAG and determine its severity, patients must undergo a comprehensive dilated eye examina… ▽ More

    Submitted 5 December, 2023; originally announced December 2023.

    Comments: 4 pages

  41. arXiv:2312.02227  [pdf, other

    cs.LG cs.CL

    Improving Multimodal Sentiment Analysis: Supervised Angular Margin-based Contrastive Learning for Enhanced Fusion Representation

    Authors: Cong-Duy Nguyen, Thong Nguyen, Duc Anh Vu, Luu Anh Tuan

    Abstract: The effectiveness of a model is heavily reliant on the quality of the fusion representation of multiple modalities in multimodal sentiment analysis. Moreover, each modality is extracted from raw input and integrated with the rest to construct a multimodal representation. Although previous methods have proposed multimodal representations and achieved promising results, most of them focus on forming… ▽ More

    Submitted 3 December, 2023; originally announced December 2023.

  42. arXiv:2312.01592  [pdf, other

    cs.CL

    Expand BERT Representation with Visual Information via Grounded Language Learning with Multimodal Partial Alignment

    Authors: Cong-Duy Nguyen, The-Anh Vu-Le, Thong Nguyen, Tho Quan, Luu Anh Tuan

    Abstract: Language models have been supervised with both language-only objective and visual grounding in existing studies of visual-grounded language learning. However, due to differences in the distribution and scale of visual-grounded datasets and language corpora, the language model tends to mix up the context of the tokens that occurred in the grounded data with those that do not. As a result, during re… ▽ More

    Submitted 9 January, 2024; v1 submitted 3 December, 2023; originally announced December 2023.

  43. arXiv:2312.00656  [pdf, other

    cs.LG cs.AI stat.ML

    Simple Transferability Estimation for Regression Tasks

    Authors: Cuong N. Nguyen, Phong Tran, Lam Si Tung Ho, Vu Dinh, Anh T. Tran, Tal Hassner, Cuong V. Nguyen

    Abstract: We consider transferability estimation, the problem of estimating how well deep learning models transfer from a source to a target task. We focus on regression tasks, which received little previous attention, and propose two simple and computationally efficient approaches that estimate transferability based on the negative regularized mean squared error of a linear regression model. We prove novel… ▽ More

    Submitted 3 December, 2023; v1 submitted 1 December, 2023; originally announced December 2023.

    Comments: Paper published at The 39th Conference on Uncertainty in Artificial Intelligence (UAI) 2023

  44. arXiv:2311.15836  [pdf, other

    cs.CV

    Syn3DWound: A Synthetic Dataset for 3D Wound Bed Analysis

    Authors: Léo Lebrat, Rodrigo Santa Cruz, Remi Chierchia, Yulia Arzhaeva, Mohammad Ali Armin, Joshua Goldsmith, Jeremy Oorloff, Prithvi Reddy, Chuong Nguyen, Lars Petersson, Michelle Barakat-Johnson, Georgina Luscombe, Clinton Fookes, Olivier Salvado, David Ahmedt-Aristizabal

    Abstract: Wound management poses a significant challenge, particularly for bedridden patients and the elderly. Accurate diagnostic and healing monitoring can significantly benefit from modern image analysis, providing accurate and precise measurements of wounds. Despite several existing techniques, the shortage of expansive and diverse training datasets remains a significant obstacle to constructing machine… ▽ More

    Submitted 3 March, 2024; v1 submitted 27 November, 2023; originally announced November 2023.

    Comments: In the IEEE International Symposium on Biomedical Imaging (ISBI) 2024

  45. arXiv:2311.13172  [pdf, other

    cs.CV

    Learning to Complement with Multiple Humans

    Authors: Zheng Zhang, Cuong Nguyen, Kevin Wells, Thanh-Toan Do, Gustavo Carneiro

    Abstract: Real-world image classification tasks tend to be complex, where expert labellers are sometimes unsure about the classes present in the images, leading to the issue of learning with noisy labels (LNL). The ill-posedness of the LNL task requires the adoption of strong assumptions or the use of multiple noisy labels per training image, resulting in accurate models that work well in isolation but fail… ▽ More

    Submitted 1 May, 2024; v1 submitted 22 November, 2023; originally announced November 2023.

    Comments: Under review

  46. arXiv:2311.11378  [pdf, other

    cs.CV cs.AI

    Inspecting Explainability of Transformer Models with Additional Statistical Information

    Authors: Hoang C. Nguyen, Haeil Lee, Junmo Kim

    Abstract: Transformer becomes more popular in the vision domain in recent years so there is a need for finding an effective way to interpret the Transformer model by visualizing it. In recent work, Chefer et al. can visualize the Transformer on vision and multi-modal tasks effectively by combining attention layers to show the importance of each image patch. However, when applying to other variants of Transf… ▽ More

    Submitted 19 November, 2023; originally announced November 2023.

  47. arXiv:2311.09542  [pdf, other

    cs.CL

    Pregnant Questions: The Importance of Pragmatic Awareness in Maternal Health Question Answering

    Authors: Neha Srikanth, Rupak Sarkar, Heran Mane, Elizabeth M. Aparicio, Quynh C. Nguyen, Rachel Rudinger, Jordan Boyd-Graber

    Abstract: Questions posed by information-seeking users often contain implicit false or potentially harmful assumptions. In a high-risk domain such as maternal and infant health, a question-answering system must recognize these pragmatic constraints and go beyond simply answering user questions, examining them in context to respond helpfully. To achieve this, we study assumptions and implications, or pragmat… ▽ More

    Submitted 2 April, 2024; v1 submitted 15 November, 2023; originally announced November 2023.

    Comments: Accepted to NAACL 2024

  48. arXiv:2311.05192  [pdf, other

    cs.CV

    TransReg: Cross-transformer as auto-registration module for multi-view mammogram mass detection

    Authors: Hoang C. Nguyen, Chi Phan, Hieu H. Pham

    Abstract: Screening mammography is the most widely used method for early breast cancer detection, significantly reducing mortality rates. The integration of information from multi-view mammograms enhances radiologists' confidence and diminishes false-positive rates since they can examine on dual-view of the same breast to cross-reference the existence and location of the lesion. Inspired by this, we present… ▽ More

    Submitted 9 November, 2023; originally announced November 2023.

  49. Conversation Understanding using Relational Temporal Graph Neural Networks with Auxiliary Cross-Modality Interaction

    Authors: Cam-Van Thi Nguyen, Anh-Tuan Mai, The-Son Le, Hai-Dang Kieu, Duc-Trong Le

    Abstract: Emotion recognition is a crucial task for human conversation understanding. It becomes more challenging with the notion of multimodal data, e.g., language, voice, and facial expressions. As a typical solution, the global- and the local context information are exploited to predict the emotional label for every single sentence, i.e., utterance, in the dialogue. Specifically, the global representatio… ▽ More

    Submitted 30 January, 2024; v1 submitted 8 November, 2023; originally announced November 2023.

    Comments: EMNLP 2023

    Journal ref: The 2023 Conference on Empirical Methods in Natural Language Processing

  50. arXiv:2311.04224  [pdf, other

    eess.SP cs.CV cs.LG

    MELEP: A Novel Predictive Measure of Transferability in Multi-Label ECG Diagnosis

    Authors: Cuong V. Nguyen, Hieu Minh Duong, Cuong D. Do

    Abstract: In practical electrocardiography (ECG) interpretation, the scarcity of well-annotated data is a common challenge. Transfer learning techniques are valuable in such situations, yet the assessment of transferability has received limited attention. To tackle this issue, we introduce MELEP, which stands for Muti-label Expected Log of Empirical Predictions, a measure designed to estimate the effectiven… ▽ More

    Submitted 12 June, 2024; v1 submitted 27 October, 2023; originally announced November 2023.

    Comments: Accepted to the Journal of Healthcare Informatics Research