Showing 1–50 of 84 results for author: Luu, T

  1. arXiv:2408.00023  [pdf, other]

    cs.LG

    On the Perturbed States for Transformed Input-robust Reinforcement Learning

    Authors: Tung M. Luu, Haeyong Kang, Tri Ton, Thanh Nguyen, Chang D. Yoo

    Abstract: Reinforcement Learning (RL) agents demonstrating proficiency in a training environment exhibit vulnerability to adversarial perturbations in input observations during deployment. This underscores the importance of building a robust agent before its real-world deployment. To alleviate the challenging point, prior works focus on developing robust training-based procedures, encompassing efforts to fo…

    Submitted 2 August, 2024; v1 submitted 31 July, 2024; originally announced August 2024.

    Comments: 12 pages (Code: https://github.com/tunglm2203/tirl)

  2. arXiv:2407.10998  [pdf, other]

    cs.CL cs.LG

    Discrete Diffusion Language Model for Long Text Summarization

    Authors: Do Huu Dat, Do Duc Anh, Anh Tuan Luu, Wray Buntine

    Abstract: While diffusion models excel at conditionally generating high-quality images, prior works in discrete diffusion models were not evaluated on conditional long-text generation. In this work, we address the limitations of prior discrete diffusion models for conditional long-text generation, particularly in long sequence-to-sequence tasks such as abstractive summarization. Despite fast decoding speeds c…

    Submitted 25 June, 2024; originally announced July 2024.

  3. arXiv:2407.06826  [pdf, other]

    cs.AI

    VRDSynth: Synthesizing Programs for Multilingual Visually Rich Document Information Extraction

    Authors: Thanh-Dat Nguyen, Tung Do-Viet, Hung Nguyen-Duy, Tuan-Hai Luu, Hung Le, Bach Le, Patanamon Thongtanunam

    Abstract: Businesses need to query visually rich documents (VRDs) like receipts, medical records, and insurance forms to make decisions. Existing techniques for extracting entities from VRDs struggle with new layouts or require extensive pre-training data. We introduce VRDSynth, a program synthesis method to automatically extract entity relations from multilingual VRDs without pre-training data. To capture…

    Submitted 9 July, 2024; originally announced July 2024.

    Comments: Accepted in ISSTA'24

  4. arXiv:2407.01734  [pdf, other]

    quant-ph cs.AI

    Universal Quantum Tomography With Deep Neural Networks

    Authors: Nhan T. Luu, Thang C. Truong

    Abstract: Quantum state tomography is a crucial technique for characterizing the state of a quantum system, which is essential for many applications in quantum technologies. In recent years, there has been growing interest in leveraging neural networks to enhance the efficiency and accuracy of quantum state tomography. Still, many of them did not include mixed quantum states, since pure states are arguably l…

    Submitted 3 July, 2024; v1 submitted 1 July, 2024; originally announced July 2024.

    Comments: 10 pages, 5 figures, 17 illustration, 1 table

  5. arXiv:2405.19723  [pdf, other]

    cs.CV cs.AI

    Encoding and Controlling Global Semantics for Long-form Video Question Answering

    Authors: Thong Thanh Nguyen, Zhiyuan Hu, Xiaobao Wu, Cong-Duy T Nguyen, See-Kiong Ng, Anh Tuan Luu

    Abstract: Seeking answers effectively for long videos is essential to build video question answering (videoQA) systems. Previous methods adaptively select frames and regions from long videos to save computations. However, this fails to reason over the whole sequence of video, leading to sub-optimal performance. To address this problem, we introduce a state space layer (SSL) into multi-modal Transformer to e…

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: Work in progress

  6. arXiv:2405.17978  [pdf, other]

    cs.CL cs.AI

    FASTopic: A Fast, Adaptive, Stable, and Transferable Topic Modeling Paradigm

    Authors: Xiaobao Wu, Thong Nguyen, Delvin Ce Zhang, William Yang Wang, Anh Tuan Luu

    Abstract: Topic models have been evolving rapidly over the years, from conventional to recent neural models. However, existing topic models generally struggle with either effectiveness, efficiency, or stability, highly impeding their practical applications. In this paper, we propose FASTopic, a fast, adaptive, stable, and transferable topic model. FASTopic follows a new paradigm: Dual Semantic-relation Reco…

    Submitted 28 May, 2024; originally announced May 2024.

  7. arXiv:2405.17957  [pdf, other]

    cs.CL cs.AI

    Modeling Dynamic Topics in Chain-Free Fashion by Evolution-Tracking Contrastive Learning and Unassociated Word Exclusion

    Authors: Xiaobao Wu, Xinshuai Dong, Liangming Pan, Thong Nguyen, Anh Tuan Luu

    Abstract: Dynamic topic models track the evolution of topics in sequential documents, which have derived various applications like trend analysis and opinion mining. However, existing models suffer from repetitive topic and unassociated topic issues, failing to reveal the evolution and hindering further applications. To address these issues, we break the tradition of simply chaining topics in existing work…

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: Accepted to ACL 2024 Findings

  8. Metadata Integration for Spam Reviews Detection on Vietnamese E-commerce Websites

    Authors: Co Van Dinh, Son T. Luu

    Abstract: The problem of detecting spam reviews (opinions) has received significant attention in recent years, especially with the rapid development of e-commerce. Spam reviews are often classified based on comment content, but in some cases, it is insufficient for models to accurately determine the review label. In this work, we introduce the ViSpamReviews v2 dataset, which includes metadata of reviews wit…

    Submitted 1 August, 2024; v1 submitted 21 May, 2024; originally announced May 2024.

    Comments: Published in the International Journal of Asian Language Processing (IJALP)

  9. arXiv:2405.11206  [pdf, other]

    cs.LG cs.AI cs.RO

    Towards Robust Policy: Enhancing Offline Reinforcement Learning with Adversarial Attacks and Defenses

    Authors: Thanh Nguyen, Tung M. Luu, Tri Ton, Chang D. Yoo

    Abstract: Offline reinforcement learning (RL) addresses the challenge of expensive and high-risk data exploration inherent in RL by pre-training policies on vast amounts of offline data, enabling direct deployment or fine-tuning in real-world environments. However, this training paradigm can compromise policy robustness, leading to degraded performance in practical conditions due to observation perturbation…

    Submitted 18 May, 2024; originally announced May 2024.

    Journal ref: International Conference on Pattern Recognition and Artificial Intelligence (ICPRAI) 2024

  10. arXiv:2404.19252  [pdf, other]

    cs.CL

    Exploiting Hatred by Targets for Hate Speech Detection on Vietnamese Social Media Texts

    Authors: Cuong Nhat Vo, Khanh Bao Huynh, Son T. Luu, Trong-Hop Do

    Abstract: The growth of social networks makes toxic content spread rapidly. Hate speech detection is a task to help decrease the number of harmful comments. With the diversity in the hate speech created by users, it is necessary to interpret the hate speech besides detecting it. Hence, we propose a methodology to construct a system for targeted hate speech detection from online streaming texts from social m…

    Submitted 30 April, 2024; originally announced April 2024.

  11. arXiv:2403.17486  [pdf, other]

    cs.CL

    KDMCSE: Knowledge Distillation Multimodal Sentence Embeddings with Adaptive Angular margin Contrastive Learning

    Authors: Cong-Duy Nguyen, Thong Nguyen, Xiaobao Wu, Anh Tuan Luu

    Abstract: Previous work on multimodal sentence embedding has proposed multimodal contrastive learning and achieved promising results. However, by taking the rest of the batch as negative samples without reviewing when forming contrastive pairs, those studies encountered many suspicious and noisy negative examples, significantly affecting the methods' overall performance. In this work, we propose KDMCSE (Kno…

    Submitted 26 March, 2024; originally announced March 2024.

    Comments: Accepted to NAACL 2024

  12. arXiv:2403.10258  [pdf, other]

    cs.CL

    Is Translation All You Need? A Study on Solving Multilingual Tasks with Large Language Models

    Authors: Chaoqun Liu, Wenxuan Zhang, Yiran Zhao, Anh Tuan Luu, Lidong Bing

    Abstract: Large language models (LLMs) have demonstrated multilingual capabilities; yet, they are mostly English-centric due to the imbalanced training corpora. Existing works leverage this phenomenon to improve their multilingual performances through translation, primarily on natural language processing (NLP) tasks. This work extends the evaluation from NLP tasks to real user queries and from English-centr…

    Submitted 20 June, 2024; v1 submitted 15 March, 2024; originally announced March 2024.

    Comments: 19 pages

  13. arXiv:2403.03435  [pdf, ps, other]

    cs.CL

    VLSP 2023 -- LTER: A Summary of the Challenge on Legal Textual Entailment Recognition

    Authors: Vu Tran, Ha-Thanh Nguyen, Trung Vo, Son T. Luu, Hoang-Anh Dang, Ngoc-Cam Le, Thi-Thuy Le, Minh-Tien Nguyen, Truong-Son Nguyen, Le-Minh Nguyen

    Abstract: In this new era of rapid AI development, especially in language processing, the demand for AI in the legal domain is increasingly critical. In the context where research in other languages such as English, Japanese, and Chinese has been well-established, we introduce the first fundamental research for the Vietnamese language in the legal domain: legal textual entailment recognition through the Vie…

    Submitted 5 March, 2024; originally announced March 2024.

  14. arXiv:2403.02990  [pdf, other]

    cs.CL cs.AI

    Data Augmentation using Large Language Models: Data Perspectives, Learning Paradigms and Challenges

    Authors: Bosheng Ding, Chengwei Qin, Ruochen Zhao, Tianze Luo, Xinze Li, Guizhen Chen, Wenhan Xia, Junjie Hu, Anh Tuan Luu, Shafiq Joty

    Abstract: In the rapidly evolving field of large language models (LLMs), data augmentation (DA) has emerged as a pivotal technique for enhancing model performance by diversifying training examples without the need for additional data collection. This survey explores the transformative impact of LLMs on DA, particularly addressing the unique challenges and opportunities they present in the context of natural…

    Submitted 2 July, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

  15. arXiv:2402.18909  [pdf, other]

    cs.CL cs.AI

    Updating Language Models with Unstructured Facts: Towards Practical Knowledge Editing

    Authors: Xiaobao Wu, Liangming Pan, William Yang Wang, Anh Tuan Luu

    Abstract: Knowledge editing aims to inject knowledge updates into language models to keep them correct and up-to-date. However, its current evaluation strategies are notably impractical: they solely update with well-curated structured facts (triplets with subjects, relations, and objects), whereas real-world knowledge updates commonly emerge in unstructured texts like news articles. In this paper, we propos…

    Submitted 29 February, 2024; originally announced February 2024.

  16. arXiv:2402.16030  [pdf, other]

    cs.CL cs.AI

    Don't Forget Your Reward Values: Language Model Alignment via Value-based Calibration

    Authors: Xin Mao, Feng-Lin Li, Huimin Xu, Wei Zhang, Anh Tuan Luu

    Abstract: While Reinforcement Learning from Human Feedback (RLHF) significantly enhances the generation quality of Large Language Models (LLMs), recent studies have raised concerns regarding the complexity and instability associated with the Proximal Policy Optimization (PPO) algorithm, proposing a series of order-based calibration methods as viable alternatives. This paper delves further into current order…

    Submitted 25 February, 2024; originally announced February 2024.

    Comments: 19 pages, Under review

  17. arXiv:2402.07844  [pdf, other]

    cs.SE cs.CL

    Mercury: A Code Efficiency Benchmark for Code Large Language Models

    Authors: Mingzhe Du, Anh Tuan Luu, Bin Ji, Qian Liu, See-Kiong Ng

    Abstract: Amidst the recent strides in evaluating Large Language Models for Code (Code LLMs), existing benchmarks have mainly focused on the functional correctness of generated code, neglecting the importance of their computational efficiency. To fill the gap, we present Mercury, the first code efficiency benchmark for Code LLMs. It comprises 1,889 Python tasks, each accompanied by adequate solutions that s…

    Submitted 11 June, 2024; v1 submitted 12 February, 2024; originally announced February 2024.

  18. arXiv:2402.07577  [pdf, other]

    cs.CL

    Topic Modeling as Multi-Objective Contrastive Optimization

    Authors: Thong Nguyen, Xiaobao Wu, Xinshuai Dong, Cong-Duy T Nguyen, See-Kiong Ng, Anh Tuan Luu

    Abstract: Recent representation learning approaches enhance neural topic models by optimizing the weighted linear combination of the evidence lower bound (ELBO) of the log-likelihood and the contrastive learning objective that contrasts pairs of input documents. However, document-level contrastive learning might capture low-level mutual information, such as word ratio, which disturbs topic modeling. Moreove…

    Submitted 9 March, 2024; v1 submitted 12 February, 2024; originally announced February 2024.

    Comments: Accepted at ICLR 2024 (poster)

  19. arXiv:2402.03271  [pdf, other]

    cs.CL cs.AI cs.LG

    Uncertainty of Thoughts: Uncertainty-Aware Planning Enhances Information Seeking in Large Language Models

    Authors: Zhiyuan Hu, Chumin Liu, Xidong Feng, Yilun Zhao, See-Kiong Ng, Anh Tuan Luu, Junxian He, Pang Wei Koh, Bryan Hooi

    Abstract: In the face of uncertainty, the ability to *seek information* is of fundamental importance. In many practical applications, such as medical diagnosis and troubleshooting, the information needed to solve the task is not initially given and has to be actively sought by asking follow-up questions (for example, a doctor asking a patient for more details about their symptoms). In this work, we introduc…

    Submitted 30 May, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

    Comments: Update Results

  20. arXiv:2402.02655  [pdf, other]

    cs.CL

    VlogQA: Task, Dataset, and Baseline Models for Vietnamese Spoken-Based Machine Reading Comprehension

    Authors: Thinh Phuoc Ngo, Khoa Tran Anh Dang, Son T. Luu, Kiet Van Nguyen, Ngan Luu-Thuy Nguyen

    Abstract: This paper presents the development process of a Vietnamese spoken language corpus for machine reading comprehension (MRC) tasks and provides insights into the challenges and opportunities associated with using real-world data for machine reading comprehension tasks. The existing MRC corpora in Vietnamese mainly focus on formal written documents such as Wikipedia articles, online newspapers, or te…

    Submitted 6 April, 2024; v1 submitted 4 February, 2024; originally announced February 2024.

    Comments: To appear as the main conference paper at EACL 2024

  21. A Survey on Neural Topic Models: Methods, Applications, and Challenges

    Authors: Xiaobao Wu, Thong Nguyen, Anh Tuan Luu

    Abstract: Topic models have been prevalent for decades to discover latent topics and infer topic proportions of documents in an unsupervised fashion. They have been widely used in various applications like text analysis and context recommendation. Recently, the rise of neural networks has facilitated the emergence of a new research field -- Neural Topic Models (NTMs). Different from conventional topic model…

    Submitted 24 June, 2024; v1 submitted 27 January, 2024; originally announced January 2024.

    Comments: Accepted to Artificial Intelligence Review. See https://doi.org/10.1007/s10462-023-10661-7 and a paper list at https://github.com/BobXWu/Paper-Neural-Topic-Models

  22. arXiv:2401.14113  [pdf, other]

    cs.CL

    On the Affinity, Rationality, and Diversity of Hierarchical Topic Modeling

    Authors: Xiaobao Wu, Fengjun Pan, Thong Nguyen, Yichao Feng, Chaoqun Liu, Cong-Duy Nguyen, Anh Tuan Luu

    Abstract: Hierarchical topic modeling aims to discover latent topics from a corpus and organize them into a hierarchy to understand documents with desirable semantic granularity. However, existing work struggles with producing topic hierarchies of low affinity, rationality, and diversity, which hampers document understanding. To overcome these challenges, we in this paper propose Transport Plan and Context-…

    Submitted 31 January, 2024; v1 submitted 25 January, 2024; originally announced January 2024.

    Comments: Accepted to AAAI2024 conference. Our code is available at https://github.com/bobxwu/TraCo

  23. LAMPAT: Low-Rank Adaption for Multilingual Paraphrasing Using Adversarial Training

    Authors: Khoi M. Le, Trinh Pham, Tho Quan, Anh Tuan Luu

    Abstract: Paraphrases are texts that convey the same meaning while using different words or sentence structures. They can be used as an automatic data augmentation tool for many Natural Language Processing tasks, especially when dealing with low-resource languages, where data shortage is a significant problem. To generate a paraphrase in multilingual settings, previous studies have leveraged the knowledge fro…

    Submitted 23 June, 2024; v1 submitted 8 January, 2024; originally announced January 2024.

    Comments: First two authors contribute equally. Accepted at AAAI 2024

  24. arXiv:2312.11109  [pdf, other]

    cs.LG

    Graph Transformers for Large Graphs

    Authors: Vijay Prakash Dwivedi, Yozen Liu, Anh Tuan Luu, Xavier Bresson, Neil Shah, Tong Zhao

    Abstract: Transformers have recently emerged as powerful neural networks for graph learning, showcasing state-of-the-art performance on several graph property prediction tasks. However, these results have been limited to small-scale graphs, where the computational feasibility of the global attention mechanism is possible. The next goal is to scale up these architectures to handle very large graphs on the sc…

    Submitted 18 December, 2023; originally announced December 2023.

  25. arXiv:2312.01661  [pdf, other]

    cs.CL cs.AI

    ChatGPT as a Math Questioner? Evaluating ChatGPT on Generating Pre-university Math Questions

    Authors: Phuoc Pham Van Long, Duc Anh Vu, Nhat M. Hoang, Xuan Long Do, Anh Tuan Luu

    Abstract: Mathematical questioning is crucial for assessing students' problem-solving skills. Since manually creating such questions requires substantial effort, automatic methods have been explored. Existing state-of-the-art models rely on fine-tuning strategies and struggle to generate questions that heavily involve multiple steps of logical and arithmetic reasoning. Meanwhile, large language models (LLMs)…

    Submitted 27 February, 2024; v1 submitted 4 December, 2023; originally announced December 2023.

    Comments: Accepted at the 39th ACM/SIGAPP Symposium On Applied Computing (SAC 2024), Main Conference

  26. arXiv:2311.03970  [pdf, other]

    cs.CV

    Bias and Diversity in Synthetic-based Face Recognition

    Authors: Marco Huber, Anh Thi Luu, Fadi Boutros, Arjan Kuijper, Naser Damer

    Abstract: Synthetic data is emerging as a substitute for authentic data to solve ethical and legal challenges in handling authentic face data. The current models can create real-looking face images of people who do not exist. However, it is a known and sensitive problem that face recognition systems are susceptible to bias, i.e. performance differences between different demographic and non-demographics attr…

    Submitted 7 November, 2023; originally announced November 2023.

    Comments: Accepted for presentation at WACV2024

  27. arXiv:2310.14248  [pdf, other]

    cs.CL

    From Static to Dynamic: A Continual Learning Framework for Large Language Models

    Authors: Mingzhe Du, Anh Tuan Luu, Bin Ji, See-kiong Ng

    Abstract: The vast number of parameters in large language models (LLMs) endows them with remarkable capabilities, allowing them to excel in a variety of natural language processing tasks. However, this complexity also presents challenges, making LLMs difficult to train and inhibiting their ability to continuously assimilate new knowledge, which may lead to inaccuracies in their outputs. To mitigate these is…

    Submitted 22 October, 2023; originally announced October 2023.

  28. DimCL: Dimensional Contrastive Learning For Improving Self-Supervised Learning

    Authors: Thanh Nguyen, Trung Pham, Chaoning Zhang, Tung Luu, Thang Vu, Chang D. Yoo

    Abstract: Self-supervised learning (SSL) has gained remarkable success, for which contrastive learning (CL) plays a key role. However, the recent development of new non-CL frameworks has achieved comparable or better performance with high improvement potential, prompting researchers to enhance these frameworks further. Assimilating CL into non-CL frameworks has been thought to be beneficial, but empirical e…

    Submitted 21 September, 2023; originally announced September 2023.

    Journal ref: IEEE Access 2023

  29. arXiv:2309.08949  [pdf, other]

    cs.CL

    Enhancing Large Language Model Induced Task-Oriented Dialogue Systems Through Look-Forward Motivated Goals

    Authors: Zhiyuan Hu, Yue Feng, Yang Deng, Zekun Li, See-Kiong Ng, Anh Tuan Luu, Bryan Hooi

    Abstract: Recently, the development of large language models (LLMs) has significantly enhanced question answering and dialogue generation, making them increasingly popular in current practical scenarios. Unlike general dialogue systems, which emphasize semantic performance, task-oriented dialogue (ToD) systems aim to achieve the dialogue goal efficiently and successfully…

    Submitted 16 September, 2023; originally announced September 2023.

    Comments: 7 Pages

  30. arXiv:2309.06908  [pdf, other]

    cs.CL cs.AI cs.IR cs.LG

    Towards the TopMost: A Topic Modeling System Toolkit

    Authors: Xiaobao Wu, Fengjun Pan, Anh Tuan Luu

    Abstract: Topic models have a rich history with various applications and have recently been reinvigorated by neural topic modeling. However, these numerous topic models adopt totally distinct datasets, implementations, and evaluations. This impedes quick utilization and fair comparisons, and thereby hinders their research progress and applications. To tackle this challenge, we in this paper propose a Topic…

    Submitted 14 June, 2024; v1 submitted 13 September, 2023; originally announced September 2023.

    Comments: Accepted to ACL 2024 System Demonstrations Track

  31. arXiv:2309.04646  [pdf, ps, other]

    cs.CL cs.AI

    Efficient Finetuning Large Language Models For Vietnamese Chatbot

    Authors: Vu-Thuan Doan, Quoc-Truong Truong, Duc-Vu Nguyen, Vinh-Tiep Nguyen, Thuy-Ngan Nguyen Luu

    Abstract: Large language models (LLMs), such as GPT-4, PaLM, and LLaMa, have been shown to achieve remarkable performance across a variety of natural language tasks. Recent advancements in instruction tuning give LLMs the ability to follow users' instructions and produce human-like responses. However, the high costs associated with training and implementing LLMs pose challenges to academic research.…

    Submitted 8 September, 2023; originally announced September 2023.

    Comments: arXiv admin note: text overlap with arXiv:2304.08177, arXiv:2303.16199 by other authors

  32. arXiv:2309.01219  [pdf, other]

    cs.CL cs.AI cs.CY cs.LG

    Siren's Song in the AI Ocean: A Survey on Hallucination in Large Language Models

    Authors: Yue Zhang, Yafu Li, Leyang Cui, Deng Cai, Lemao Liu, Tingchen Fu, Xinting Huang, Enbo Zhao, Yu Zhang, Yulong Chen, Longyue Wang, Anh Tuan Luu, Wei Bi, Freda Shi, Shuming Shi

    Abstract: While large language models (LLMs) have demonstrated remarkable capabilities across a range of downstream tasks, a significant concern revolves around their propensity to exhibit hallucinations: LLMs occasionally generate content that diverges from the user input, contradicts previously generated context, or misaligns with established world knowledge. This phenomenon poses a substantial challenge…

    Submitted 24 September, 2023; v1 submitted 3 September, 2023; originally announced September 2023.

    Comments: work in progress; 32 pages

  33. A Text-based Approach For Link Prediction on Wikipedia Articles

    Authors: Anh Hoang Tran, Tam Minh Nguyen, Son T. Luu

    Abstract: This paper presents our work in the DSAA 2023 Challenge about Link Prediction for Wikipedia Articles. We use traditional machine learning models with POS tags (part-of-speech tags) features extracted from text to train the classification model for predicting whether two nodes have a link. Then, we use these tags to test on various machine learning models. We obtained the results by F1 score at 0.9…

    Submitted 6 November, 2023; v1 submitted 1 September, 2023; originally announced September 2023.

    Comments: Accepted by DSAA 2023 Conference in the DSAA Student Competition Section

  34. arXiv:2307.14761  [pdf]

    cs.SE

    Literature Survey on how to cluster and define Living Labs, Real World Laboratories and similar research infrastructures

    Authors: Troung Giang Luu, Tanja Zylowski, Sascha Alpers, Andreas Oberweis

    Abstract: In today's world, where societal challenges in the areas of digitalization, demographic change and sustainability are becoming increasingly complex, new innovation structures are needed to meet these challenges. Living Labs or also Real World Laboratories prove to be such. Through their applied methods such as co-creation, they integrate users into research, making it more user-centric. Which othe…

    Submitted 27 July, 2023; originally announced July 2023.

  35. Unlocking the Potential of User Feedback: Leveraging Large Language Model as User Simulator to Enhance Dialogue System

    Authors: Zhiyuan Hu, Yue Feng, Anh Tuan Luu, Bryan Hooi, Aldo Lipani

    Abstract: Dialogue systems and large language models (LLMs) have gained considerable attention. However, the direct utilization of LLMs as task-oriented dialogue (TOD) models has been found to underperform compared to smaller task-specific models. Nonetheless, it is crucial to acknowledge the significant potential of LLMs and explore improved approaches for leveraging their impressive abilities. Motivated b…

    Submitted 19 October, 2023; v1 submitted 16 June, 2023; originally announced June 2023.

    Comments: Accepted by CIKM 2023

  36. arXiv:2306.08456  [pdf, other]

    cs.CL

    PoetryDiffusion: Towards Joint Semantic and Metrical Manipulation in Poetry Generation

    Authors: Zhiyuan Hu, Chumin Liu, Yue Feng, Anh Tuan Luu, Bryan Hooi

    Abstract: Controllable text generation is a challenging and meaningful field in natural language generation (NLG). Especially, poetry generation is a typical one with well-defined and strict conditions for text generation which is an ideal playground for the assessment of current methodologies. While prior works succeeded in controlling either semantic or metrical aspects of poetry generation, simultaneousl…

    Submitted 19 December, 2023; v1 submitted 14 June, 2023; originally announced June 2023.

    Comments: Accepted by AAAI2024

  37. arXiv:2306.04217  [pdf, other]

    cs.CL

    Effective Neural Topic Modeling with Embedding Clustering Regularization

    Authors: Xiaobao Wu, Xinshuai Dong, Thong Nguyen, Anh Tuan Luu

    Abstract: Topic models have been prevalent for decades with various applications. However, existing topic models commonly suffer from the notorious topic collapsing: discovered topics semantically collapse towards each other, leading to highly repetitive topics, insufficient topic discovery, and damaged model interpretability. In this paper, we propose a new neural topic model, Embedding Clustering Regulari…

    Submitted 7 June, 2023; originally announced June 2023.

    Comments: Accepted to ICML 2023 conference

  38. arXiv:2305.15872  [pdf, other]

    cs.CL cs.AI

    Jointprop: Joint Semi-supervised Learning for Entity and Relation Extraction with Heterogeneous Graph-based Propagation

    Authors: Yandan Zheng, Anran Hao, Anh Tuan Luu

    Abstract: Semi-supervised learning has been an important approach to address challenges in extracting entities and relations from limited data. However, current semi-supervised works handle the two tasks (i.e., Named Entity Recognition and Relation Extraction) separately and ignore the cross-correlation of entity and relation instances as well as the existence of similar instances across unlabeled data. To…

    Submitted 25 May, 2023; originally announced May 2023.

  39. arXiv:2305.12744  [pdf, other]

    cs.CL cs.AI

    Fact-Checking Complex Claims with Program-Guided Reasoning

    Authors: Liangming Pan, Xiaobao Wu, Xinyuan Lu, Anh Tuan Luu, William Yang Wang, Min-Yen Kan, Preslav Nakov

    Abstract: Fact-checking real-world claims often requires collecting multiple pieces of evidence and applying complex multi-step reasoning. In this paper, we present Program-Guided Fact-Checking (ProgramFC), a novel fact-checking model that decomposes complex claims into simpler sub-tasks that can be solved using a shared library of specialized functions. We first leverage the in-context learning ability of…

    Submitted 22 May, 2023; originally announced May 2023.

    Comments: ACL 2023 (main conference, long paper)

  40. arXiv:2305.12678  [pdf, other]

    cs.CL

    Gradient-Boosted Decision Tree for Listwise Context Model in Multimodal Review Helpfulness Prediction

    Authors: Thong Nguyen, Xiaobao Wu, Xinshuai Dong, Anh Tuan Luu, Cong-Duy Nguyen, Zhen Hai, Lidong Bing

    Abstract: Multimodal Review Helpfulness Prediction (MRHP) aims to rank product reviews based on predicted helpfulness scores and has been widely applied in e-commerce via presenting customers with useful reviews. Previous studies commonly employ fully-connected neural networks (FCNNs) as the final score predictor and pairwise loss as the training objective. However, FCNNs have been shown to perform ineffici…

    Submitted 25 May, 2023; v1 submitted 21 May, 2023; originally announced May 2023.

    Comments: Published in ACL 2023 (Findings)

  41. arXiv:2305.11442  [pdf, other]

    cs.CL cs.AI cs.LG

    Zero-Shot Text Classification via Self-Supervised Tuning

    Authors: Chaoqun Liu, Wenxuan Zhang, Guizhen Chen, Xiaobao Wu, Anh Tuan Luu, Chip Hong Chang, Lidong Bing

    Abstract: Existing solutions to zero-shot text classification either conduct prompting with pre-trained language models, which is sensitive to the choices of templates, or rely on large-scale annotated data of relevant tasks for meta-tuning. In this work, we propose a new paradigm based on self-supervised learning to solve zero-shot text classification tasks by tuning the language models with unlabeled data…

    Submitted 25 May, 2023; v1 submitted 19 May, 2023; originally announced May 2023.

    Comments: Accepted to the Findings of ACL 2023

  42. arXiv:2304.13409  [pdf, other]

    cs.CV

    Efficient Explainable Face Verification based on Similarity Score Argument Backpropagation

    Authors: Marco Huber, Anh Thi Luu, Philipp Terhörst, Naser Damer

    Abstract: Explainable Face Recognition is gaining growing attention as the use of the technology is gaining ground in security-critical applications. Understanding why two face images are matched or not matched by a given face recognition system is important to operators, users, and developers to increase trust, accountability, develop better systems, and highlight unfair behavior. In this work, we propose…

    Submitted 7 November, 2023; v1 submitted 26 April, 2023; originally announced April 2023.

    Comments: Accepted at WACV 2024

  43. arXiv:2304.03544  [pdf, other]

    cs.CL

    InfoCTM: A Mutual Information Maximization Perspective of Cross-Lingual Topic Modeling

    Authors: Xiaobao Wu, Xinshuai Dong, Thong Nguyen, Chaoqun Liu, Liangming Pan, Anh Tuan Luu

    Abstract: Cross-lingual topic models have been prevalent for cross-lingual text analysis by revealing aligned latent topics. However, most existing methods suffer from producing repetitive topics that hinder further analysis and performance decline caused by low-coverage dictionaries. In this paper, we propose the Cross-lingual Topic Modeling with Mutual Information (InfoCTM). Instead of the direct alignmen…

    Submitted 27 March, 2024; v1 submitted 7 April, 2023; originally announced April 2023.

    Comments: Accepted to AAAI2023 conference. Code is available at https://github.com/BobXWu/InfoCTM

  44. arXiv:2303.18162  [pdf, other]

    cs.CL

    A Multiple Choices Reading Comprehension Corpus for Vietnamese Language Education

    Authors: Son T. Luu, Khoi Trong Hoang, Tuong Quang Pham, Kiet Van Nguyen, Ngan Luu-Thuy Nguyen

    Abstract: Machine reading comprehension has been an interesting and challenging task in recent years, with the purpose of extracting useful information from texts. To attain the computer ability to understand the reading text and answer relevant information, we introduce ViMMRC 2.0 - an extension of the previous ViMMRC for the task of multiple-choice reading comprehension in Vietnamese Textbooks which conta…

    Submitted 31 March, 2023; originally announced March 2023.

  45. Integrating Image Features with Convolutional Sequence-to-sequence Network for Multilingual Visual Question Answering

    Authors: Triet Minh Thai, Son T. Luu

    Abstract: Visual Question Answering (VQA) is a task that requires computers to give correct answers for the input questions based on the images. This task can be solved by humans with ease but is a challenge for computers. The VLSP2022-EVJVQA shared task carries the Visual Question Answering task in the multilingual domain on a newly released dataset: UIT-EVJVQA, in which the questions and answers are writt…

    Submitted 3 September, 2023; v1 submitted 22 March, 2023; originally announced March 2023.

    Comments: VLSP2022-EVJVQA

  46. arXiv:2211.12878  [pdf, other]

    cs.CL

    Mitigating Data Sparsity for Short Text Topic Modeling by Topic-Semantic Contrastive Learning

    Authors: Xiaobao Wu, Anh Tuan Luu, Xinshuai Dong

    Abstract: To overcome the data sparsity issue in short text topic modeling, existing methods commonly rely on data augmentation or the data characteristic of short texts to introduce more word co-occurrence information. However, most of them do not make full use of the augmented data or the data characteristic: they insufficiently learn the relations among samples in data, leading to dissimilar topic distri…

    Submitted 23 November, 2022; originally announced November 2022.

    Comments: Accepted to EMNLP2022 main conference

  47. arXiv:2211.10065  [pdf, other]

    cs.LG

    How to train your draGAN: A task oriented solution to imbalanced classification

    Authors: Leon O. Guertler, Andri Ashfahani, Anh Tuan Luu

    Abstract: The long-standing challenge of building effective classification models for small and imbalanced datasets has seen little improvement since the creation of the Synthetic Minority Over-sampling Technique (SMOTE) over 20 years ago. Though GAN based models seem promising, there has been a lack of purpose built architectures for solving the aforementioned problem, as most previous studies focus on app…

    Submitted 18 November, 2022; originally announced November 2022.

    Comments: 94 Datasets; under review (Elsevier Neural Networks)

  48. Improving Sentiment Analysis By Emotion Lexicon Approach on Vietnamese Texts

    Authors: An Long Doan, Son T. Luu

    Abstract: The sentiment analysis task has various applications in practice. In the sentiment analysis task, words and phrases that represent positive and negative emotions are important. Finding out the words that represent the emotion from the text can improve the performance of the classification models for the sentiment analysis task. In this paper, we propose a methodology that combines the emotion lexi…

    Submitted 3 December, 2022; v1 submitted 5 October, 2022; originally announced October 2022.

    Comments: Published at the International Conference on Asian Language Processing (IALP 2022)

  49. arXiv:2209.08263  [pdf, other]

    cs.CV

    Scalable SoftGroup for 3D Instance Segmentation on Point Clouds

    Authors: Thang Vu, Kookhoi Kim, Tung M. Luu, Thanh Nguyen, Junyeong Kim, Chang D. Yoo

    Abstract: This paper considers a network referred to as SoftGroup for accurate and scalable 3D instance segmentation. Existing state-of-the-art methods produce hard semantic predictions followed by grouping instance segmentation results. Unfortunately, errors stemming from hard decisions propagate into the grouping, resulting in poor overlap between predicted instances and ground truth and substantial false…

    Submitted 23 December, 2023; v1 submitted 17 September, 2022; originally announced September 2022.

    Comments: Accepted by TPAMI. Extension of arXiv:2203.01509

  50. arXiv:2209.06668  [pdf, other]

    cs.CL

    UIT-ViCoV19QA: A Dataset for COVID-19 Community-based Question Answering on Vietnamese Language

    Authors: Triet Minh Thai, Ngan Ha-Thao Chu, Anh Tuan Vo, Son T. Luu

    Abstract: For the last two years, from 2020 to 2021, COVID-19 has broken disease prevention measures in many countries, including Vietnam, and negatively impacted various aspects of human life and the social community. Besides, the misleading information in the community and fake news about the pandemic are also serious situations. Therefore, we present the first Vietnamese community-based question answerin…

    Submitted 14 September, 2022; originally announced September 2022.

    Comments: Accepted as poster paper at The 36th annual Meeting of Pacific Asia Conference on Language, Information and Computation (PACLIC 36). The dataset and code are available at https://github.com/minhtriet2397/UIT-ViCoV19QA