Zum Hauptinhalt springen

Showing 51–100 of 3,810 results for author: Nguyen, T

.
  1. arXiv:2407.17790  [pdf, other

    cs.LG cs.AR

    Exploring the Limitations of Kolmogorov-Arnold Networks in Classification: Insights to Software Training and Hardware Implementation

    Authors: Van Duy Tran, Tran Xuan Hieu Le, Thi Diem Tran, Hoai Luan Pham, Vu Trung Duong Le, Tuan Hai Vu, Van Tinh Nguyen, Yasuhiko Nakashima

    Abstract: Kolmogorov-Arnold Networks (KANs), a novel type of neural network, have recently gained popularity and attention due to the ability to substitute multi-layer perceptions (MLPs) in artificial intelligence (AI) with higher accuracy and interoperability. However, KAN assessment is still limited and cannot provide an in-depth analysis of a specific domain. Furthermore, no study has been conducted on t… ▽ More

    Submitted 25 July, 2024; v1 submitted 25 July, 2024; originally announced July 2024.

    Comments: 6 pages, 3 figures, 2 tables

  2. arXiv:2407.17053  [pdf, other

    cs.SE cs.CR cs.LG

    Automated Code-centric Software Vulnerability Assessment: How Far Are We? An Empirical Study in C/C++

    Authors: Anh The Nguyen, Triet Huynh Minh Le, M. Ali Babar

    Abstract: Background: The C and C++ languages hold significant importance in Software Engineering research because of their widespread use in practice. Numerous studies have utilized Machine Learning (ML) and Deep Learning (DL) techniques to detect software vulnerabilities (SVs) in the source code written in these languages. However, the application of these techniques in function-level SV assessment has be… ▽ More

    Submitted 3 August, 2024; v1 submitted 24 July, 2024; originally announced July 2024.

    Comments: Accepted as a full paper in the technical track at The International Symposium on Empirical Software Engineering and Measurement (ESEM) 2024

  3. arXiv:2407.16946  [pdf, other

    cs.SE

    Automatic Categorization of GitHub Actions with Transformers and Few-shot Learning

    Authors: Phuong T. Nguyen, Juri Di Rocco, Claudio Di Sipio, Mudita Shakya, Davide Di Ruscio, Massimiliano Di Penta

    Abstract: In the GitHub ecosystem, workflows are used as an effective means to automate development tasks and to set up a Continuous Integration and Delivery (CI/CD pipeline). GitHub Actions (GHA) have been conceived to provide developers with a practical tool to create and maintain workflows, avoiding reinventing the wheel and cluttering the workflow with shell commands. Properly leveraging the power of Gi… ▽ More

    Submitted 23 July, 2024; originally announced July 2024.

    Comments: The paper has been peer-reviewed and accepted for publication in the Proceedings of the 18th International Symposium on Empirical Software Engineering and Measurement (ESEM 2024)

  4. arXiv:2407.15812  [pdf, ps, other

    math.AP

    On the stability of blowup solutions to the complex Ginzburg-Landau equation in R^d

    Authors: Jiajie Chen, Thomas Y. Hou, Van Tien Nguyen, Yixuan Wang

    Abstract: Building upon the idea in \cite{HNWarXiv24}, we establish stability of the type-I blowup with log correction for the complex Ginzburg-Landau equation. In the amplitude-phase representation, a generalized dynamic rescaling formulation is introduced, with modulation parameters capturing the spatial translation and rotation symmetries of the equation and novel additional modulation parameters perturb… ▽ More

    Submitted 22 July, 2024; originally announced July 2024.

    Comments: 38 pages

    MSC Class: 35Q56

  5. arXiv:2407.15468  [pdf, ps, other

    math.ST

    Efficient influence functions for Sobol' indices under two designs of experiments

    Authors: Thierry Klein, Agnès Lagnoux, Paul Rochet, Thi Mong Ngoc Nguyen

    Abstract: In this note, we are interested in the asymptotic efficiency of Sobol' indices esti-mators. After recalling the basis of asymptotic efficiency, we compute the efficientinfluence functions for Sobol' indices in two different contexts: the Pick-Freeze andthe given-data settings.

    Submitted 22 July, 2024; originally announced July 2024.

  6. arXiv:2407.13904  [pdf, other

    stat.ME

    In defense of MAR over latent ignorability (or latent MAR) for outcome missingness in studying principal causal effects: a causal graph view

    Authors: Trang Quynh Nguyen

    Abstract: This paper concerns outcome missingness in principal stratification analysis. We revisit a common assumption known as latent ignorability or latent missing-at-random (LMAR), often considered a relaxation of missing-at-random (MAR). LMAR posits that the outcome is independent of its missingness if one conditions on principal stratum (which is partially unobservable) in addition to observed variable… ▽ More

    Submitted 18 July, 2024; originally announced July 2024.

  7. arXiv:2407.13842  [pdf, other

    cs.RO cs.CV

    Language-Driven 6-DoF Grasp Detection Using Negative Prompt Guidance

    Authors: Toan Nguyen, Minh Nhat Vu, Baoru Huang, An Vuong, Quan Vuong, Ngan Le, Thieu Vo, Anh Nguyen

    Abstract: 6-DoF grasp detection has been a fundamental and challenging problem in robotic vision. While previous works have focused on ensuring grasp stability, they often do not consider human intention conveyed through natural language, hindering effective collaboration between robots and users in complex 3D environments. In this paper, we present a new approach for language-driven 6-DoF grasp detection i… ▽ More

    Submitted 25 July, 2024; v1 submitted 18 July, 2024; originally announced July 2024.

    Comments: Accepted at ECCV 2024

  8. arXiv:2407.12094  [pdf, other

    cs.CL

    Identifying Speakers in Dialogue Transcripts: A Text-based Approach Using Pretrained Language Models

    Authors: Minh Nguyen, Franck Dernoncourt, Seunghyun Yoon, Hanieh Deilamsalehy, Hao Tan, Ryan Rossi, Quan Hung Tran, Trung Bui, Thien Huu Nguyen

    Abstract: We introduce an approach to identifying speaker names in dialogue transcripts, a crucial task for enhancing content accessibility and searchability in digital media archives. Despite the advancements in speech recognition, the task of text-based speaker identification (SpeakerID) has received limited attention, lacking large-scale, diverse datasets for effective model training. Addressing these ga… ▽ More

    Submitted 16 July, 2024; originally announced July 2024.

    Comments: accepted to INTERSPEECH 2024

  9. arXiv:2407.12064  [pdf, other

    eess.IV cs.CL cs.CV cs.LG cs.MM

    LiteGPT: Large Vision-Language Model for Joint Chest X-ray Localization and Classification Task

    Authors: Khai Le-Duc, Ryan Zhang, Ngoc Son Nguyen, Tan-Hanh Pham, Anh Dao, Ba Hung Ngo, Anh Totti Nguyen, Truong-Son Hy

    Abstract: Vision-language models have been extensively explored across a wide range of tasks, achieving satisfactory performance; however, their application in medical imaging remains underexplored. In this work, we propose a unified framework - LiteGPT - for the medical imaging. We leverage multiple pre-trained visual encoders to enrich information and enhance the performance of vision-language models. To… ▽ More

    Submitted 15 July, 2024; originally announced July 2024.

    Comments: Preprint, 19 pages

  10. arXiv:2407.12034  [pdf, other

    cs.CL cs.AI cs.LG

    Understanding Transformers via N-gram Statistics

    Authors: Timothy Nguyen

    Abstract: Transformer based large-language models (LLMs) display extreme proficiency with language yet a precise understanding of how they work remains elusive. One way of demystifying transformer predictions would be to describe how they depend on their context in terms of simple template functions. This paper takes a first step in this direction by considering families of functions (i.e. rules) formed out… ▽ More

    Submitted 30 June, 2024; originally announced July 2024.

  11. arXiv:2407.11771  [pdf, other

    cs.CV cs.AI cs.LG

    XEdgeAI: A Human-centered Industrial Inspection Framework with Data-centric Explainable Edge AI Approach

    Authors: Truong Thanh Hung Nguyen, Phuc Truong Loc Nguyen, Hung Cao

    Abstract: Recent advancements in deep learning have significantly improved visual quality inspection and predictive maintenance within industrial settings. However, deploying these technologies on low-resource edge devices poses substantial challenges due to their high computational demands and the inherent complexity of Explainable AI (XAI) methods. This paper addresses these challenges by introducing a no… ▽ More

    Submitted 16 July, 2024; originally announced July 2024.

    Comments: 28 pages, preprint submitted to Information Fusion journal

  12. arXiv:2407.11525  [pdf, ps, other

    math.NT

    On a Theorem of Nathanson on Diophantine Approximation

    Authors: Jaroslav Hančl, Tho Phuoc Nguyen

    Abstract: In 1974, M. B. Nathanson proved that every irrational number $α$ represented by a simple continued fraction with infinitely many elements greater than or equal to $k$ is approximable by an infinite number of rational numbers $p/q$ satisfying $|α-p/q|<1/(\sqrt{k^2+4}q^2)$. In this paper we refine this result.

    Submitted 16 July, 2024; originally announced July 2024.

    MSC Class: 11J82; 11A55

  13. arXiv:2407.11194  [pdf, other

    astro-ph.IM astro-ph.EP astro-ph.GA astro-ph.SR cs.AI cs.CL

    AstroMLab 1: Who Wins Astronomy Jeopardy!?

    Authors: Yuan-Sen Ting, Tuan Dung Nguyen, Tirthankar Ghosal, Rui Pan, Hardik Arora, Zechang Sun, Tijmen de Haan, Nesar Ramachandra, Azton Wells, Sandeep Madireddy, Alberto Accomazzi

    Abstract: We present a comprehensive evaluation of proprietary and open-weights large language models using the first astronomy-specific benchmarking dataset. This dataset comprises 4,425 multiple-choice questions curated from the Annual Review of Astronomy and Astrophysics, covering a broad range of astrophysical topics. Our analysis examines model performance across various astronomical subfields and asse… ▽ More

    Submitted 15 July, 2024; originally announced July 2024.

    Comments: 45 pages, 12 figures, 7 tables. Submitted to ApJ. Comments welcome. AstroMLab homepage: https://astromlab.org/

  14. arXiv:2407.11166  [pdf, ps, other

    math.NT

    On a Theorem of Legendre on Diophantine Approximation

    Authors: Jaroslav Hančl, Tho Phuoc Nguyen

    Abstract: Legendre's theorem states that every irreducible fraction $\frac{p}{q}$ which satisfies the inequality $\left |α-\frac{p}{q} \right | < \frac{1}{2q^2}$ is convergent to $α$. Later Barbolosi and Jager improved this theorem. In this paper we refine these results.

    Submitted 15 July, 2024; originally announced July 2024.

    MSC Class: 11J82; 11A55

  15. arXiv:2407.11078  [pdf, other

    cs.LG cs.AI cs.CV

    Overcoming Catastrophic Forgetting in Federated Class-Incremental Learning via Federated Global Twin Generator

    Authors: Thinh Nguyen, Khoa D Doan, Binh T. Nguyen, Danh Le-Phuoc, Kok-Seng Wong

    Abstract: Federated Class-Incremental Learning (FCIL) increasingly becomes important in the decentralized setting, where it enables multiple participants to collaboratively train a global model to perform well on a sequence of tasks without sharing their private data. In FCIL, conventional Federated Learning algorithms such as FedAVG often suffer from catastrophic forgetting, resulting in significant perfor… ▽ More

    Submitted 13 July, 2024; originally announced July 2024.

    MSC Class: 68T07 (Primary); 68T45 (Secondary)

  16. KAT: Dependency-aware Automated API Testing with Large Language Models

    Authors: Tri Le, Thien Tran, Duy Cao, Vy Le, Tien Nguyen, Vu Nguyen

    Abstract: API testing has increasing demands for software companies. Prior API testing tools were aware of certain types of dependencies that needed to be concise between operations and parameters. However, their approaches, which are mostly done manually or using heuristic-based algorithms, have limitations due to the complexity of these dependencies. In this paper, we present KAT (Katalon API Testing), a… ▽ More

    Submitted 14 July, 2024; originally announced July 2024.

    Comments: ICST 2024

  17. arXiv:2407.09740  [pdf, ps, other

    cond-mat.mtrl-sci cond-mat.mes-hall cond-mat.supr-con

    Ferroelectric AlBN Films by Molecular Beam Epitaxy

    Authors: Chandrashekhar Savant, Ved Gund, Kazuki Nomoto, Takuya Maeda, Shubham Jadhav, Joongwon Lee, Madhav Ramesh, Eungkyun Kim, Thai-Son Nguyen, Yu-Hsin Chen, Joseph Casamento, Farhan Rana, Amit Lal, Huili, Xing, Debdeep Jena

    Abstract: We report the properties of molecular beam epitaxy deposited AlBN thin films on a recently developed epitaxial nitride metal electrode Nb2N. While a control AlN thin film exhibits standard capacitive behavior, distinct ferroelectric switching is observed in the AlBN films with increasing Boron mole fraction. The measured remnant polarization Pr of 15 uC/cm2 and coercive field Ec of 1.45 MV/cm in t… ▽ More

    Submitted 17 July, 2024; v1 submitted 12 July, 2024; originally announced July 2024.

    Comments: DOI: 10.1063/5.0181217

  18. arXiv:2407.09281  [pdf, other

    cs.AI

    Predicting and Understanding Human Action Decisions: Insights from Large Language Models and Cognitive Instance-Based Learning

    Authors: Thuy Ngoc Nguyen, Kasturi Jamale, Cleotilde Gonzalez

    Abstract: Large Language Models (LLMs) have demonstrated their capabilities across various tasks, from language translation to complex reasoning. Understanding and predicting human behavior and biases are crucial for artificial intelligence (AI) assisted systems to provide useful assistance, yet it remains an open question whether these models can achieve this. This paper addresses this gap by leveraging th… ▽ More

    Submitted 5 August, 2024; v1 submitted 12 July, 2024; originally announced July 2024.

  19. arXiv:2407.09035  [pdf, other

    eess.IV cs.CV

    GPC: Generative and General Pathology Image Classifier

    Authors: Anh Tien Nguyen, Jin Tae Kwak

    Abstract: Deep learning has been increasingly incorporated into various computational pathology applications to improve its efficiency, accuracy, and robustness. Although successful, most previous approaches for image classification have crucial drawbacks. There exist numerous tasks in pathology, but one needs to build a model per task, i.e., a task-specific model, thereby increasing the number of models, t… ▽ More

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: MICCAI-MedAGI 2023 (Best Paper Honorable Mention)

  20. arXiv:2407.09030  [pdf, other

    eess.IV cs.CV

    CAMP: Continuous and Adaptive Learning Model in Pathology

    Authors: Anh Tien Nguyen, Keunho Byeon, Kyungeun Kim, Boram Song, Seoung Wan Chae, Jin Tae Kwak

    Abstract: There exist numerous diagnostic tasks in pathology. Conventional computational pathology formulates and tackles them as independent and individual image classification problems, thereby resulting in computational inefficiency and high costs. To address the challenges, we propose a generic, unified, and universal framework, called a continuous and adaptive learning model in pathology (CAMP), for pa… ▽ More

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: Under review

  21. arXiv:2407.08872  [pdf, other

    cs.CV

    Visual Multi-Object Tracking with Re-Identification and Occlusion Handling using Labeled Random Finite Sets

    Authors: Linh Van Ma, Tran Thien Dat Nguyen, Changbeom Shim, Du Yong Kim, Namkoo Ha, Moongu Jeon

    Abstract: This paper proposes an online visual multi-object tracking (MOT) algorithm that resolves object appearance-reappearance and occlusion. Our solution is based on the labeled random finite set (LRFS) filtering approach, which in principle, addresses disappearance, appearance, reappearance, and occlusion via a single Bayesian recursion. However, in practice, existing numerical approximations cause rea… ▽ More

    Submitted 11 July, 2024; originally announced July 2024.

  22. arXiv:2407.08470  [pdf, other

    cs.CV cs.AI

    Brain Tumor Segmentation in MRI Images with 3D U-Net and Contextual Transformer

    Authors: Thien-Qua T. Nguyen, Hieu-Nghia Nguyen, Thanh-Hieu Bui, Thien B. Nguyen-Tat, Vuong M. Ngo

    Abstract: This research presents an enhanced approach for precise segmentation of brain tumor masses in magnetic resonance imaging (MRI) using an advanced 3D-UNet model combined with a Context Transformer (CoT). By architectural expansion CoT, the proposed model extends its architecture to a 3D format, integrates it smoothly with the base model to utilize the complex contextual information found in MRI scan… ▽ More

    Submitted 11 July, 2024; originally announced July 2024.

    Comments: 6 pages, 7 figures

  23. arXiv:2407.07917  [pdf, other

    cs.CR cs.AI cs.CV cs.LG

    Non-Cooperative Backdoor Attacks in Federated Learning: A New Threat Landscape

    Authors: Tuan Nguyen, Dung Thuy Nguyen, Khoa D Doan, Kok-Seng Wong

    Abstract: Despite the promise of Federated Learning (FL) for privacy-preserving model training on distributed data, it remains susceptible to backdoor attacks. These attacks manipulate models by embedding triggers (specific input patterns) in the training data, forcing misclassification as predefined classes during deployment. Traditional single-trigger attacks and recent work on cooperative multiple-trigge… ▽ More

    Submitted 5 July, 2024; originally announced July 2024.

  24. arXiv:2407.07472  [pdf, other

    cs.SE cs.AI

    Rectifier: Code Translation with Corrector via LLMs

    Authors: Xin Yin, Chao Ni, Tien N. Nguyen, Shaohua Wang, Xiaohu Yang

    Abstract: Software migration is garnering increasing attention with the evolution of software and society. Early studies mainly relied on handcrafted translation rules to translate between two languages, the translation process is error-prone and time-consuming. In recent years, researchers have begun to explore the use of pre-trained large language models (LLMs) in code translation. However, code translati… ▽ More

    Submitted 10 July, 2024; originally announced July 2024.

    Comments: arXiv admin note: text overlap with arXiv:2308.03109, arXiv:2302.03908 by other authors

  25. arXiv:2407.07421  [pdf, other

    cs.LG cs.AI cs.CR cs.DC

    Federated PCA on Grassmann Manifold for IoT Anomaly Detection

    Authors: Tung-Anh Nguyen, Long Tan Le, Tuan Dung Nguyen, Wei Bao, Suranga Seneviratne, Choong Seon Hong, Nguyen H. Tran

    Abstract: With the proliferation of the Internet of Things (IoT) and the rising interconnectedness of devices, network security faces significant challenges, especially from anomalous activities. While traditional machine learning-based intrusion detection systems (ML-IDS) effectively employ supervised learning methods, they possess limitations such as the requirement for labeled data and challenges with hi… ▽ More

    Submitted 10 July, 2024; originally announced July 2024.

    Comments: Accepted for publication at IEEE/ACM Transactions on Networking

    Journal ref: IEEE/ACM Transactions on Networking On page(s): 1-16 Print ISSN: 1063-6692 Online ISSN: 1558-2566 Digital Object Identifier: 10.1109/TNET.2024.3423780

  26. arXiv:2407.07369  [pdf, ps, other

    math.ST math.AP math.PR

    Viscosity estimation for 2D pipe flows I. Construction, consistency, asymptotic normality

    Authors: Thi Hien Nguyen, Armen Shirikyan

    Abstract: We consider the motion of incompressible viscous fluid in a rectangle, imposing the periodicity condition in one direction and the no-slip boundary condition in the other. Assuming that the flow is subject to an external random force, white in time and regular in space, we construct an estimator for the viscosity using only observations of the enstrophy. The goal of the paper is to prove that the… ▽ More

    Submitted 10 July, 2024; originally announced July 2024.

    MSC Class: 35Q30; 37L55; 62M05; 76D06

  27. arXiv:2407.07360  [pdf, other

    cs.CV cs.LG

    Towards a text-based quantitative and explainable histopathology image analysis

    Authors: Anh Tien Nguyen, Trinh Thi Le Vuong, Jin Tae Kwak

    Abstract: Recently, vision-language pre-trained models have emerged in computational pathology. Previous works generally focused on the alignment of image-text pairs via the contrastive pre-training paradigm. Such pre-trained models have been applied to pathology image classification in zero-shot learning or transfer learning fashion. Herein, we hypothesize that the pre-trained vision-language models can be… ▽ More

    Submitted 10 July, 2024; originally announced July 2024.

    Comments: MICCAI 2024 - Early acceptance (Top 11%)

  28. arXiv:2407.06826  [pdf, other

    cs.AI

    VRDSynth: Synthesizing Programs for Multilingual Visually Rich Document Information Extraction

    Authors: Thanh-Dat Nguyen, Tung Do-Viet, Hung Nguyen-Duy, Tuan-Hai Luu, Hung Le, Bach Le, Patanamon, Thongtanunam

    Abstract: Businesses need to query visually rich documents (VRDs) like receipts, medical records, and insurance forms to make decisions. Existing techniques for extracting entities from VRDs struggle with new layouts or require extensive pre-training data. We introduce VRDSynth, a program synthesis method to automatically extract entity relations from multilingual VRDs without pre-training data. To capture… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

    Comments: Accepted in ISSTA'24

  29. arXiv:2407.06581  [pdf, other

    cs.AI cs.CV

    Vision language models are blind

    Authors: Pooyan Rahmanzadehgervi, Logan Bolton, Mohammad Reza Taesiri, Anh Totti Nguyen

    Abstract: While large language models with vision capabilities (VLMs), e.g., GPT-4o and Gemini 1.5 Pro, are powering various image-text applications and scoring high on many vision-understanding benchmarks, we find that they are surprisingly still struggling with low-level vision tasks that are easy to humans. Specifically, on BlindTest, our suite of 7 very simple tasks such as identifying (a) whether two c… ▽ More

    Submitted 25 July, 2024; v1 submitted 9 July, 2024; originally announced July 2024.

  30. arXiv:2407.06142  [pdf, ps, other

    cs.NI eess.SY math.OC

    Delay-Aware Robust Edge Network Hardening Under Decision-Dependent Uncertainty

    Authors: Jiaming Cheng, Duong Thuy Anh Nguyen, Ni Trieu, Duong Tung Nguyen

    Abstract: Edge computing promises to offer low-latency and ubiquitous computation to numerous devices at the network edge. For delay-sensitive applications, link delays can have a direct impact on service quality. These delays can fluctuate drastically over time due to various factors such as network congestion, changing traffic conditions, cyberattacks, component failures, and natural disasters. Thus, it i… ▽ More

    Submitted 8 July, 2024; originally announced July 2024.

    Comments: 14 pages, 18 figures

  31. arXiv:2407.06045  [pdf, other

    cs.CV

    OpenCIL: Benchmarking Out-of-Distribution Detection in Class-Incremental Learning

    Authors: Wenjun Miao, Guansong Pang, Trong-Tung Nguyen, Ruohang Fang, Jin Zheng, Xiao Bai

    Abstract: Class incremental learning (CIL) aims to learn a model that can not only incrementally accommodate new classes, but also maintain the learned knowledge of old classes. Out-of-distribution (OOD) detection in CIL is to retain this incremental learning ability, while being able to reject unknown samples that are drawn from different distributions of the learned classes. This capability is crucial to… ▽ More

    Submitted 9 July, 2024; v1 submitted 8 July, 2024; originally announced July 2024.

  32. arXiv:2407.05469  [pdf, other

    cs.CV

    Smart Camera Parking System With Auto Parking Spot Detection

    Authors: Tuan T. Nguyen, Mina Sartipi

    Abstract: Given the rising urban population and the consequential rise in traffic congestion, the implementation of smart parking systems has emerged as a critical matter of concern. Smart parking solutions use cameras, sensors, and algorithms like computer vision to find available parking spaces. This method improves parking place recognition, reduces traffic and pollution, and optimizes travel time. In re… ▽ More

    Submitted 7 July, 2024; originally announced July 2024.

  33. arXiv:2407.05452  [pdf, other

    cs.CV

    Semantic Segmentation for Real-World and Synthetic Vehicle's Forward-Facing Camera Images

    Authors: Tuan T. Nguyen, Phan Le, Yasir Hassan, Mina Sartipi

    Abstract: In this paper, we present the submission to the 5th Annual Smoky Mountains Computational Sciences Data Challenge, Challenge 3. This is the solution for semantic segmentation problem in both real-world and synthetic images from a vehicle s forward-facing camera. We concentrate in building a robust model which performs well across various domains of different outdoor situations such as sunny, snowy,… ▽ More

    Submitted 7 July, 2024; originally announced July 2024.

    Comments: 13 pages

  34. arXiv:2407.05205  [pdf, other

    cs.CY cs.AI cs.LG

    The AI Companion in Education: Analyzing the Pedagogical Potential of ChatGPT in Computer Science and Engineering

    Authors: Zhangying He, Thomas Nguyen, Tahereh Miari, Mehrdad Aliasgari, Setareh Rafatirad, Hossein Sayadi

    Abstract: Artificial Intelligence (AI), with ChatGPT as a prominent example, has recently taken center stage in various domains including higher education, particularly in Computer Science and Engineering (CSE). The AI revolution brings both convenience and controversy, offering substantial benefits while lacking formal guidance on their application. The primary objective of this work is to comprehensively… ▽ More

    Submitted 23 April, 2024; originally announced July 2024.

    Comments: conference, 13 pages

  35. arXiv:2407.04992  [pdf, other

    cs.LG cs.AI stat.ME

    Scalable Variational Causal Discovery Unconstrained by Acyclicity

    Authors: Nu Hoang, Bao Duong, Thin Nguyen

    Abstract: Bayesian causal discovery offers the power to quantify epistemic uncertainties among a broad range of structurally diverse causal theories potentially explaining the data, represented in forms of directed acyclic graphs (DAGs). However, existing methods struggle with efficient DAG sampling due to the complex acyclicity constraint. In this study, we propose a scalable Bayesian approach to effective… ▽ More

    Submitted 28 August, 2024; v1 submitted 6 July, 2024; originally announced July 2024.

    Comments: Accepted at ECAI 2024

  36. arXiv:2407.04980  [pdf, other

    cs.LG cs.AI stat.ME

    Enabling Causal Discovery in Post-Nonlinear Models with Normalizing Flows

    Authors: Nu Hoang, Bao Duong, Thin Nguyen

    Abstract: Post-nonlinear (PNL) causal models stand out as a versatile and adaptable framework for modeling intricate causal relationships. However, accurately capturing the invertibility constraint required in PNL models remains challenging in existing studies. To address this problem, we introduce CAF-PoNo (Causal discovery via Normalizing Flows for Post-Nonlinear models), harnessing the power of the norma… ▽ More

    Submitted 28 August, 2024; v1 submitted 6 July, 2024; originally announced July 2024.

    Comments: Acepted at ECAI 2024

  37. arXiv:2407.04489  [pdf, other

    cs.CV

    Dude: Dual Distribution-Aware Context Prompt Learning For Large Vision-Language Model

    Authors: Duy M. H. Nguyen, An T. Le, Trung Q. Nguyen, Nghiem T. Diep, Tai Nguyen, Duy Duong-Tran, Jan Peters, Li Shen, Mathias Niepert, Daniel Sonntag

    Abstract: Prompt learning methods are gaining increasing attention due to their ability to customize large vision-language models to new domains using pre-trained contextual knowledge and minimal training data. However, existing works typically rely on optimizing unified prompt inputs, often struggling with fine-grained classification tasks due to insufficient discriminative attributes. To tackle this, we c… ▽ More

    Submitted 5 July, 2024; originally announced July 2024.

    Comments: Version 1

  38. arXiv:2407.04408  [pdf, ps, other

    eess.SP

    Hybrid Receiver Design for Massive MIMO-OFDM with Low-Resolution ADCs and Oversampling

    Authors: Mengyuan Ma, Nhan Thanh Nguyen, Italo Atzeni, Markku Juntti

    Abstract: Low-resolution analog-to-digital converters (ADCs) and hybrid beamforming have emerged as efficient solutions to reduce power consumption with satisfactory spectral efficiency (SE) in massive multiple-input multiple-output (MIMO) systems. In this paper, we investigate the performance of a hybrid receiver in uplink massive MIMO orthogonal frequency-division multiplexing (OFDM) systems with low-reso… ▽ More

    Submitted 9 August, 2024; v1 submitted 5 July, 2024; originally announced July 2024.

    Comments: 6 pages, 4 figures, submitted to GlobeCom 2024

  39. arXiv:2407.03796  [pdf, ps, other

    eess.SP

    Joint Beamforming Design and Bit Allocation in Massive MIMO with Resolution-Adaptive ADCs

    Authors: Mengyuan Ma, Nhan Thanh Nguyen, Italo Atzeni, Markku Juntti

    Abstract: Low-resolution analog-to-digital converters (ADCs) have emerged as a promising technology for reducing power consumption and complexity in massive multiple-input multiple-output (MIMO) systems while maintaining satisfactory spectral and energy efficiencies (SE/EE). In this work, we first identify the essential properties of optimal quantization and leverage them to derive a closed-form approximati… ▽ More

    Submitted 4 July, 2024; originally announced July 2024.

    Comments: 13 pages, 14 figures

  40. arXiv:2407.03788  [pdf, other

    cs.CV cs.CL

    Meta-optimized Angular Margin Contrastive Framework for Video-Language Representation Learning

    Authors: Thong Nguyen, Yi Bin, Xiaobao Wu, Xinshuai Dong, Zhiyuan Hu, Khoi Le, Cong-Duy Nguyen, See-Kiong Ng, Luu Anh Tuan

    Abstract: Data quality stands at the forefront of deciding the effectiveness of video-language representation learning. However, video-text pairs in previous data typically do not align perfectly with each other, which might lead to video-language representations that do not accurately reflect cross-modal semantics. Moreover, previous data also possess an uneven distribution of concepts, thereby hampering t… ▽ More

    Submitted 19 July, 2024; v1 submitted 4 July, 2024; originally announced July 2024.

    Comments: Accepted to ECCV 2024

  41. arXiv:2407.03665  [pdf, other

    cs.IR cs.AI cs.LG cs.SI stat.ML

    Heterogeneous Hypergraph Embedding for Recommendation Systems

    Authors: Darnbi Sakong, Viet Hung Vu, Thanh Trung Huynh, Phi Le Nguyen, Hongzhi Yin, Quoc Viet Hung Nguyen, Thanh Tam Nguyen

    Abstract: Recent advancements in recommender systems have focused on integrating knowledge graphs (KGs) to leverage their auxiliary information. The core idea of KG-enhanced recommenders is to incorporate rich semantic information for more accurate recommendations. However, two main challenges persist: i) Neglecting complex higher-order interactions in the KG-based user-item network, potentially leading to… ▽ More

    Submitted 4 July, 2024; originally announced July 2024.

  42. arXiv:2407.03611  [pdf, other

    cs.SE cs.AI

    An Empirical Study on Capability of Large Language Models in Understanding Code Semantics

    Authors: Thu-Trang Nguyen, Thanh Trong Vu, Hieu Dinh Vo, Son Nguyen

    Abstract: Large Language Models for Code (code LLMs) have demonstrated remarkable performance across various software engineering (SE) tasks, increasing the application of code LLMs in software development. Despite the success of code LLMs, there remain significant concerns about the actual capabilities and reliability of these models, "whether these models really learn the semantics of code from the traini… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

  43. arXiv:2407.03144  [pdf, other

    cs.CV

    Venomancer: Towards Imperceptible and Target-on-Demand Backdoor Attacks in Federated Learning

    Authors: Son Nguyen, Thinh Nguyen, Khoa D Doan, Kok-Seng Wong

    Abstract: Federated Learning (FL) is a distributed machine learning approach that maintains data privacy by training on decentralized data sources. Similar to centralized machine learning, FL is also susceptible to backdoor attacks, where an attacker can compromise some clients by injecting a backdoor trigger into local models of those clients, leading to the global model's behavior being manipulated as des… ▽ More

    Submitted 11 July, 2024; v1 submitted 3 July, 2024; originally announced July 2024.

  44. arXiv:2407.03110  [pdf, other

    cs.SD cs.AI eess.AS

    A Toolchain for Comprehensive Audio/Video Analysis Using Deep Learning Based Multimodal Approach (A use case of riot or violent context detection)

    Authors: Lam Pham, Phat Lam, Tin Nguyen, Hieu Tang, Alexander Schindler

    Abstract: In this paper, we present a toolchain for a comprehensive audio/video analysis by leveraging deep learning based multimodal approach. To this end, different specific tasks of Speech to Text (S2T), Acoustic Scene Classification (ASC), Acoustic Event Detection (AED), Visual Object Detection (VOD), Image Captioning (IC), and Video Captioning (VC) are conducted and integrated into the toolchain. By co… ▽ More

    Submitted 2 May, 2024; originally announced July 2024.

  45. arXiv:2407.02966  [pdf, other

    physics.comp-ph

    Efficient Forward-Mode Algorithmic Derivatives of Geant4

    Authors: Max Aehle, Xuan Tung Nguyen, Mihály Novák, Tommaso Dorigo, Nicolas R. Gauger, Jan Kieseler, Markus Klute, Vassil Vassilev

    Abstract: We have applied an operator-overloading forward-mode algorithmic differentiation tool to the Monte-Carlo particle simulation toolkit Geant4. Our differentiated version of Geant4 allows computing mean pathwise derivatives of user-defined outputs of Geant4 applications with respect to user-defined inputs. This constitutes a major step towards enabling gradient-based optimization techniques in high-e… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

  46. arXiv:2407.02828  [pdf

    cs.ET quant-ph

    Quantum Serverless Paradigm and Application Development using the QFaaS Framework

    Authors: Hoa T. Nguyen, Bui Binh An Pham, Muhammad Usman, Rajkumar Buyya

    Abstract: Quantum computing has the potential to solve complex problems beyond the capabilities of classical computers. However, its practical use is currently limited due to early-stage quantum software engineering and the constraints of Noisy Intermediate-Scale Quantum (NISQ) devices. To address this issue, this chapter introduces the concept of serverless quantum computing with examples using QFaaS, a pr… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: Guidelines for deploying and using the QFaaS Framework (for the original paper, see https://doi.org/10.1016/j.future.2024.01.018)

  47. arXiv:2407.02748  [pdf, other

    cs.DC cs.ET

    DRLQ: A Deep Reinforcement Learning-based Task Placement for Quantum Cloud Computing

    Authors: Hoa T. Nguyen, Muhammad Usman, Rajkumar Buyya

    Abstract: The quantum cloud computing paradigm presents unique challenges in task placement due to the dynamic and heterogeneous nature of quantum computation resources. Traditional heuristic approaches fall short in adapting to the rapidly evolving landscape of quantum computing. This paper proposes DRLQ, a novel Deep Reinforcement Learning (DRL)-based technique for task placement in quantum cloud computin… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: Accepted paper at IEEE CLOUD 2024 conference

  48. arXiv:2407.02190  [pdf, other

    cs.RO

    I2EKF-LO: A Dual-Iteration Extended Kalman Filter Based LiDAR Odometry

    Authors: Wenlu Yu, Jie Xu, Chengwei Zhao, Lijun Zhao, Thien-Minh Nguyen, Shenghai Yuan, Mingming Bai, Lihua Xie

    Abstract: LiDAR odometry is a pivotal technology in the fields of autonomous driving and autonomous mobile robotics. However, most of the current works focus on nonlinear optimization methods, and still existing many challenges in using the traditional Iterative Extended Kalman Filter (IEKF) framework to tackle the problem: IEKF only iterates over the observation equation, relying on a rough estimate of the… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: Accepted by IROS 2024

  49. arXiv:2407.01987  [pdf, other

    cs.CV

    AHMsys: An Automated HVAC Modeling System for BIM Project

    Authors: Long Hoang Dang, Duy-Hung Nguyen, Thai Quang Le, Thinh Truong Nguyen, Clark Mei, Vu Hoang

    Abstract: This paper presents a novel system, named AHMsys, designed to automate the process of generating 3D Heating, Ventilation, and Air Conditioning (HVAC) models from 2D Computer-Aided Design (CAD) drawings, a key component of Building Information Modeling (BIM). By automatically preprocessing and extracting essential HVAC object information then creating detailed 3D models, our proposed AHMsys signifi… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

  50. arXiv:2407.01963  [pdf, other

    eess.AS

    Towards Unsupervised Speaker Diarization System for Multilingual Telephone Calls Using Pre-trained Whisper Model and Mixture of Sparse Autoencoders

    Authors: Phat Lam, Lam Pham, Tin Nguyen, Hieu Tang, Thinh Pham, Loi Khanh Nguyen, Alexander Schindler

    Abstract: Existing speaker diarization systems heavily rely on large amounts of manually annotated data, which is labor-intensive and challenging to collect in real-world scenarios. Additionally, the language-specific constraint in speaker diarization systems significantly hinders their applicability and scalability in multilingual settings. In this paper, we therefore propose a cluster-based speaker diariz… ▽ More

    Submitted 7 July, 2024; v1 submitted 2 July, 2024; originally announced July 2024.

    Comments: 8 pages, 7 figures