Zum Hauptinhalt springen

Showing 101–150 of 3,810 results for author: Nguyen, T

.
  1. arXiv:2407.01777  [pdf, other

    cs.SD cs.AI eess.AS

    Deepfake Audio Detection Using Spectrogram-based Feature and Ensemble of Deep Learning Models

    Authors: Lam Pham, Phat Lam, Truong Nguyen, Huyen Nguyen, Alexander Schindler

    Abstract: In this paper, we propose a deep learning based system for the task of deepfake audio detection. In particular, the draw input audio is first transformed into various spectrograms using three transformation methods of Short-time Fourier Transform (STFT), Constant-Q Transform (CQT), Wavelet Transform (WT) combined with different auditory-based filters of Mel, Gammatone, linear filters (LF), and dis… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  2. arXiv:2407.01110  [pdf

    cs.CR cs.AI cs.CY cs.LG

    SecGenAI: Enhancing Security of Cloud-based Generative AI Applications within Australian Critical Technologies of National Interest

    Authors: Christoforus Yoga Haryanto, Minh Hieu Vu, Trung Duc Nguyen, Emily Lomempow, Yulia Nurliana, Sona Taheri

    Abstract: The rapid advancement of Generative AI (GenAI) technologies offers transformative opportunities within Australia's critical technologies of national interest while introducing unique security challenges. This paper presents SecGenAI, a comprehensive security framework for cloud-based GenAI applications, with a focus on Retrieval-Augmented Generation (RAG) systems. SecGenAI addresses functional, in… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: 10 pages, 4 figures, 9 tables, submitted to the 2024 11th International Conference on Soft Computing & Machine Intelligence (ISCMI 2024)

  3. arXiv:2407.00895  [pdf, other

    cond-mat.mes-hall

    Large-Amplitude, Easy-Plane Spin-Orbit Torque Oscillators Driven by Out-of-Plane Spin Current: A Micromagnetic Study

    Authors: Daniel Kubler, David A. Smith, Tommy Nguyen, Fernando Ramos-Diaz, Satoru Emori, Vivek P. Amin

    Abstract: Spin torque oscillators are spintronic devices that generate a periodic output signal from a non-periodic input, making them promising candidates for applications like microwave communications and neuromorphic computing. However, traditional spin torque oscillators suffer from a limited precessional cone angle and thermal stability, as well as a need for an applied bias magnetic field. We use micr… ▽ More

    Submitted 30 June, 2024; originally announced July 2024.

  4. arXiv:2407.00710  [pdf, other

    cs.LG stat.ML

    Weighted Missing Linear Discriminant Analysis: An Explainable Approach for Classification with Missing Data

    Authors: Tuan L. Vo, Uyen Dang, Thu Nguyen

    Abstract: As Artificial Intelligence (AI) models are gradually being adopted in real-life applications, the explainability of the model used is critical, especially in high-stakes areas such as medicine, finance, etc. Among the commonly used models, Linear Discriminant Analysis (LDA) is a widely used classification tool that is also explainable thanks to its ability to model class distributions and maximize… ▽ More

    Submitted 30 June, 2024; originally announced July 2024.

  5. arXiv:2407.00609  [pdf, other

    cs.CV cs.LG

    ESGNN: Towards Equivariant Scene Graph Neural Network for 3D Scene Understanding

    Authors: Quang P. M. Pham, Khoi T. N. Nguyen, Lan C. Ngo, Truong Do, Truong Son Hy

    Abstract: Scene graphs have been proven to be useful for various scene understanding tasks due to their compact and explicit nature. However, existing approaches often neglect the importance of maintaining the symmetry-preserving property when generating scene graphs from 3D point clouds. This oversight can diminish the accuracy and robustness of the resulting scene graphs, especially when handling noisy, m… ▽ More

    Submitted 30 June, 2024; originally announced July 2024.

  6. arXiv:2407.00535  [pdf, other

    cs.CE cs.CV

    AI-powered multimodal modeling of personalized hemodynamics in aortic stenosis

    Authors: Caglar Ozturk, Daniel H. Pak, Luca Rosalia, Debkalpa Goswami, Mary E. Robakowski, Raymond McKay, Christopher T. Nguyen, James S. Duncan, Ellen T. Roche

    Abstract: Aortic stenosis (AS) is the most common valvular heart disease in developed countries. High-fidelity preclinical models can improve AS management by enabling therapeutic innovation, early diagnosis, and tailored treatment planning. However, their use is currently limited by complex workflows necessitating lengthy expert-driven manual operations. Here, we propose an AI-powered computational framewo… ▽ More

    Submitted 29 June, 2024; originally announced July 2024.

    Comments: CO and DHP contributed equally to this work. JSD and ETR are corresponding authors

  7. arXiv:2407.00411  [pdf, other

    cs.LG cs.AI

    Explainability of Machine Learning Models under Missing Data

    Authors: Tuan L. Vo, Thu Nguyen, Hugo L. Hammer, Michael A. Riegler, Pal Halvorsen

    Abstract: Missing data is a prevalent issue that can significantly impair model performance and interpretability. This paper briefly summarizes the development of the field of missing data with respect to Explainable Artificial Intelligence and experimentally investigates the effects of various imputation methods on the calculation of Shapley values, a popular technique for interpreting complex machine lear… ▽ More

    Submitted 29 June, 2024; originally announced July 2024.

  8. arXiv:2406.20077  [pdf, other

    cs.CV

    HouseCrafter: Lifting Floorplans to 3D Scenes with 2D Diffusion Model

    Authors: Hieu T. Nguyen, Yiwen Chen, Vikram Voleti, Varun Jampani, Huaizu Jiang

    Abstract: We introduce HouseCrafter, a novel approach that can lift a floorplan into a complete large 3D indoor scene (e.g., a house). Our key insight is to adapt a 2D diffusion model, which is trained on web-scale images, to generate consistent multi-view color (RGB) and depth (D) images across different locations of the scene. Specifically, the RGB-D images are generated autoregressively in a batch-wise m… ▽ More

    Submitted 28 June, 2024; originally announced June 2024.

  9. arXiv:2406.19753  [pdf, other

    cs.LG

    Backdoor Attack in Prompt-Based Continual Learning

    Authors: Trang Nguyen, Anh Tran, Nhat Ho

    Abstract: Prompt-based approaches offer a cutting-edge solution to data privacy issues in continual learning, particularly in scenarios involving multiple data suppliers where long-term storage of private user data is prohibited. Despite delivering state-of-the-art performance, its impressive remembering capability can become a double-edged sword, raising security concerns as it might inadvertently retain p… ▽ More

    Submitted 28 June, 2024; originally announced June 2024.

  10. arXiv:2406.19445  [pdf, other

    hep-ph astro-ph.HE

    X-Ray Constraints on Dark Photon Tridents

    Authors: Tim Linden, Thong T. Q. Nguyen, Tim M. P. Tait

    Abstract: Dark photons that are sufficiently light and/or weakly-interacting represent a compelling vision of dark matter. Dark photon decay into three photons, which we call the dark photon trident, can be the dominant channel when the dark photon mass falls below the electron pair threshold and can produce a significant flux of x-rays. We use 16 years of data from INTEGRAL/SPI to constrain sub-MeV dark ph… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: 4+3 pages, 4 figures. Comments are welcome!

  11. arXiv:2406.18851  [pdf, other

    cs.LG cs.AI physics.chem-ph q-bio.BM q-bio.QM

    LICO: Large Language Models for In-Context Molecular Optimization

    Authors: Tung Nguyen, Aditya Grover

    Abstract: Optimizing black-box functions is a fundamental problem in science and engineering. To solve this problem, many approaches learn a surrogate function that estimates the underlying objective from limited historical evaluations. Large Language Models (LLMs), with their strong pattern-matching capabilities via pretraining on vast amounts of data, stand out as a potential candidate for surrogate model… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

  12. arXiv:2406.17381  [pdf, other

    cs.LG cs.CV

    Forget but Recall: Incremental Latent Rectification in Continual Learning

    Authors: Nghia D. Nguyen, Hieu Trung Nguyen, Ang Li, Hoang Pham, Viet Anh Nguyen, Khoa D. Doan

    Abstract: Intrinsic capability to continuously learn a changing data stream is a desideratum of deep neural networks (DNNs). However, current DNNs suffer from catastrophic forgetting, which hinders remembering past knowledge. To mitigate this issue, existing Continual Learning (CL) approaches either retain exemplars for replay, regularize learning, or allocate dedicated capacity for new tasks. This paper in… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

  13. arXiv:2406.17376  [pdf, other

    cs.SD cs.AI eess.AS

    Temporal-Channel Modeling in Multi-head Self-Attention for Synthetic Speech Detection

    Authors: Duc-Tuan Truong, Ruijie Tao, Tuan Nguyen, Hieu-Thi Luong, Kong Aik Lee, Eng Siong Chng

    Abstract: Recent synthetic speech detectors leveraging the Transformer model have superior performance compared to the convolutional neural network counterparts. This improvement could be due to the powerful modeling ability of the multi-head self-attention (MHSA) in the Transformer model, which learns the temporal relationship of each input token. However, artifacts of synthetic speech can be located in sp… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

    Comments: Accepted by INTERSPEECH 2024

  14. arXiv:2406.16777  [pdf, other

    cs.CL cs.AI

    Blending LLMs into Cascaded Speech Translation: KIT's Offline Speech Translation System for IWSLT 2024

    Authors: Sai Koneru, Thai-Binh Nguyen, Ngoc-Quan Pham, Danni Liu, Zhaolin Li, Alexander Waibel, Jan Niehues

    Abstract: Large Language Models (LLMs) are currently under exploration for various tasks, including Automatic Speech Recognition (ASR), Machine Translation (MT), and even End-to-End Speech Translation (ST). In this paper, we present KIT's offline submission in the constrained + LLM track by incorporating recently proposed techniques that can be added to any cascaded speech translation. Specifically, we inte… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

  15. arXiv:2406.16685  [pdf, other

    cs.CE

    A locking-free isogeometric thin shell formulation based on higher order accurate local strain projection via approximate dual splines

    Authors: Thi-Hoa Nguyen, René R. Hiemstra, Dominik Schillinger

    Abstract: We present a novel isogeometric discretization approach for the Kirchhoff-Love shell formulation based on the Hellinger-Reissner variational principle. For mitigating membrane locking, we discretize the independent strains with spline basis functions that are one degree lower than those used for the displacements. To enable computationally efficient condensation of the independent strains, we firs… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

  16. arXiv:2406.16656  [pdf, ps, other

    cs.IT cs.DM math.CO

    Permutation Codes Correcting Multiple Deletions

    Authors: Shuche Wang, The Nguyen, Yeow Meng Chee, Van Khu Vu

    Abstract: Permutation codes in the Ulam metric, which can correct multiple deletions, have been investigated extensively recently owing to their applications. In this work, we are interested in the maximum size of the permutation codes in the Ulam metric and aim to design permutation codes that can correct multiple deletions with efficient decoding algorithms. We first present an improvement on the Gilbert-… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: 9 pages

  17. arXiv:2406.15749  [pdf, ps, other

    hep-ph

    Decay of CP-even Higgs $H\rightarrow h γγ$ in Two Higgs Doublet Model: (I) one-loop analytic results, ward identity checks

    Authors: Khiem Hong Phan, Dzung Tri Tran, Thanh Huy Nguyen

    Abstract: We present the first analytical expressions for one-loop induced contributions for the decay channels of CP-even Higgs $H\rightarrow h γγ$ with $h$ being standard model-like Higgs boson within the framework of Two Higgs Doublet Model in this paper. One-loop form factors for the decay processes are written in terms of the scalar Passarino-Veltman functions following the general notations of the pac… ▽ More

    Submitted 22 June, 2024; originally announced June 2024.

    Comments: 39 pages, 8 Figures, 9 Tables

    Report number: DTU_2024-03

  18. arXiv:2406.15633  [pdf, other

    cs.SE

    Good things come in three: Generating SO Post Titles with Pre-Trained Models, Self Improvement and Post Ranking

    Authors: Duc Anh Le, Anh M. T. Bui, Phuong T. Nguyen, Davide Di Ruscio

    Abstract: Stack Overflow is a prominent Q and A forum, supporting developers in seeking suitable resources on programming-related matters. Having high-quality question titles is an effective means to attract developers' attention. Unfortunately, this is often underestimated, leaving room for improvement. Research has been conducted, predominantly leveraging pre-trained models to generate titles from code sn… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

    Comments: The paper has been per-reviewed and accepted for publication to the International Symposium on Empirical Software Engineering and Measurement (ESEM 2024)

  19. arXiv:2406.15119  [pdf, other

    cs.SD cs.AI eess.AS

    Speech Emotion Recognition under Resource Constraints with Data Distillation

    Authors: Yi Chang, Zhao Ren, Zhonghao Zhao, Thanh Tam Nguyen, Kun Qian, Tanja Schultz, Björn W. Schuller

    Abstract: Speech emotion recognition (SER) plays a crucial role in human-computer interaction. The emergence of edge devices in the Internet of Things (IoT) presents challenges in constructing intricate deep learning models due to constraints in memory and computational resources. Moreover, emotional speech data often contains private information, raising concerns about privacy leakage during the deployment… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

  20. arXiv:2406.14835  [pdf, other

    cs.CL cs.LG

    ToVo: Toxicity Taxonomy via Voting

    Authors: Tinh Son Luong, Thanh-Thien Le, Thang Viet Doan, Linh Ngo Van, Thien Huu Nguyen, Diep Thi-Ngoc Nguyen

    Abstract: Existing toxic detection models face significant limitations, such as lack of transparency, customization, and reproducibility. These challenges stem from the closed-source nature of their training data and the paucity of explanations for their evaluation mechanism. To address these issues, we propose a dataset creation mechanism that integrates voting and chain-of-thought processes, producing a h… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

  21. arXiv:2406.14784  [pdf, other

    cs.LG stat.OT

    Active Learning for Fair and Stable Online Allocations

    Authors: Riddhiman Bhattacharya, Thanh Nguyen, Will Wei Sun, Mohit Tawarmalani

    Abstract: We explore an active learning approach for dynamic fair resource allocation problems. Unlike previous work that assumes full feedback from all agents on their allocations, we consider feedback from a select subset of agents at each epoch of the online resource allocation process. Despite this restriction, our proposed algorithms provide regret bounds that are sub-linear in number of time-periods f… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

  22. arXiv:2406.14572  [pdf, other

    q-bio.QM cs.AI cs.IR

    Bioptic -- A Target-Agnostic Potency-Based Small Molecules Search Engine

    Authors: Vlad Vinogradov, Ivan Izmailov, Simon Steshin, Kong T. Nguyen

    Abstract: Recent successes in virtual screening have been made possible by large models and extensive chemical libraries. However, combining these elements is challenging: the larger the model, the more expensive it is to run, making ultra-large libraries unfeasible. To address this, we developed a target-agnostic, efficacy-based molecule search model, which allows us to find structurally dissimilar molecul… ▽ More

    Submitted 30 June, 2024; v1 submitted 13 June, 2024; originally announced June 2024.

  23. arXiv:2406.13781  [pdf, other

    cs.LG cs.AI cs.CL cs.CV stat.ML

    A Primal-Dual Framework for Transformers and Neural Networks

    Authors: Tan M. Nguyen, Tam Nguyen, Nhat Ho, Andrea L. Bertozzi, Richard G. Baraniuk, Stanley J. Osher

    Abstract: Self-attention is key to the remarkable success of transformers in sequence modeling tasks including many applications in natural language processing and computer vision. Like neural network layers, these attention mechanisms are often developed by heuristics and experience. To provide a principled framework for constructing attention layers in transformers, we show that the self-attention corresp… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: Accepted to ICLR 2023, 26 pages, 4 figures, 14 tables

  24. arXiv:2406.13770  [pdf, other

    cs.LG cs.AI cs.CL cs.CV stat.ML

    Elliptical Attention

    Authors: Stefan K. Nielsen, Laziz U. Abdullaev, Rachel Teo, Tan M. Nguyen

    Abstract: Pairwise dot-product self-attention is key to the success of transformers that achieve state-of-the-art performance across a variety of applications in language and vision. This dot-product self-attention computes attention weights among the input tokens using Euclidean distance, which makes the model prone to representation collapse and vulnerable to contaminated samples. In this paper, we propos… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: 38 pages, 7 figures, 12 tables

  25. arXiv:2406.13762  [pdf, other

    cs.LG cs.AI cs.CL cs.CV stat.ML

    Unveiling the Hidden Structure of Self-Attention via Kernel Principal Component Analysis

    Authors: Rachel S. Y. Teo, Tan M. Nguyen

    Abstract: The remarkable success of transformers in sequence modeling tasks, spanning various applications in natural language processing and computer vision, is attributed to the critical role of self-attention. Similar to the development of most deep learning models, the construction of these attention mechanisms rely on heuristics and experience. In our work, we derive self-attention from kernel principa… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: 33 pages, 5 figures, 12 tables

  26. arXiv:2406.13725  [pdf, other

    cs.LG cs.AI stat.ML

    Tree-Sliced Wasserstein Distance on a System of Lines

    Authors: Viet-Hoang Tran, Trang Pham, Tho Tran, Tam Le, Tan M. Nguyen

    Abstract: Sliced Wasserstein (SW) distance in Optimal Transport (OT) is widely used in various applications thanks to its statistical effectiveness and computational efficiency. On the other hand, Tree Wassenstein (TW) and Tree-sliced Wassenstein (TSW) are instances of OT for probability measures where its ground cost is a tree metric. TSW also has a low computational complexity, i.e. linear to the number o… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: 33 pages, 6 figures, 2 tables, 4 algorithms

  27. arXiv:2406.13587  [pdf, other

    astro-ph.IM

    The Precursor Small Aperture Telescope (PreSAT) CMB polarimeter

    Authors: Matthew A. Petroff, Zeeshan Ahmed, James J. Bock, Marion Dierickx, Sofia Fatigoni, David C. Goldfinger, Paul K. Grimes, Shawn W. Henderson, Kirit S. Karkare, John M. Kovac, Hien T. Nguyen, Scott N. Paine, Anna R. Polish, Clement Pryke, Thibault Romand, Benjamin L. Schmitt, Abigail G. Vieregg

    Abstract: The search for the polarized imprint of primordial gravitational waves in the cosmic microwave background (CMB) as direct evidence of cosmic inflation requires exquisite sensitivity and control over systematics. The next-generation CMB-S4 project intends to improve upon current-generation experiments by deploying a significantly greater number of highly-sensitive detectors, combined with refined i… ▽ More

    Submitted 10 July, 2024; v1 submitted 19 June, 2024; originally announced June 2024.

    Comments: 12 pages, 4 figures, submitted to Proc. SPIE

  28. Machine Learning Applications of Quantum Computing: A Review

    Authors: Thien Nguyen, Tuomo Sipola, Jari Hautamäki

    Abstract: At the intersection of quantum computing and machine learning, this review paper explores the transformative impact these technologies are having on the capabilities of data processing and analysis, far surpassing the bounds of traditional computational methods. Drawing upon an in-depth analysis of 32 seminal papers, this review delves into the interplay between quantum computing and machine learn… ▽ More

    Submitted 24 June, 2024; v1 submitted 19 June, 2024; originally announced June 2024.

    Comments: Proceedings of the 23rd European Conference on Cyber Warfare and Security (ECCWS 2024)

  29. arXiv:2406.13096  [pdf

    cond-mat.mtrl-sci physics.comp-ph

    Electric field enhances the electronic and diffusion properties of penta-graphene nanoribbons for application in lithium-ion batteries: a first-principles study

    Authors: Thi Nhan Tran, Nguyen Vo Anh Duy, Nguyen Hoang Hieu, Truc Anh Nguyen, Nguyen To Van, Viet Bac Thi Phung, Peter Schall, Minh Triet Dang

    Abstract: Enhancing the electronic and diffusion properties of lithium-ion batteries is crucial for improving the performance of the fast-growing energy storage devices. Recently, fast-charging capability of commercial-like lithium-ion anodes with the least modification of the current manufactoring technology is of great interest. Here we use first principles methods with density functional theory and the c… ▽ More

    Submitted 25 July, 2024; v1 submitted 18 June, 2024; originally announced June 2024.

    Comments: 21 pages, 5 figures

  30. arXiv:2406.12507  [pdf, other

    cs.LG

    Improving the Evaluation and Actionability of Explanation Methods for Multivariate Time Series Classification

    Authors: Davide Italo Serramazza, Thach Le Nguyen, Georgiana Ifrim

    Abstract: Explanation for Multivariate Time Series Classification (MTSC) is an important topic that is under explored. There are very few quantitative evaluation methodologies and even fewer examples of actionable explanation, where the explanation methods are shown to objectively improve specific computational tasks on time series data. In this paper we focus on analyzing InterpretTime, a recent evaluation… ▽ More

    Submitted 12 August, 2024; v1 submitted 18 June, 2024; originally announced June 2024.

  31. arXiv:2406.11982  [pdf, other

    cond-mat.mes-hall physics.app-ph

    Nonlinear photocurrent in quantum materials for broadband photodetection

    Authors: Yulin Shen, Louis Primeau, Jiangxu Li, Tuan-Dung Nguyen, David Mandrus, Yuxuan Cosmi Lin, Yang Zhang

    Abstract: Unlocking the vast potential of optical sensing technology has long been hindered by the challenges of achieving fast, sensitive, and broadband photodetection at ambient temperatures. In this review, we summarize recent progress in the study of nonlinear photocurrent in topological quantum materials, and its application in broadband photodetection without the use of p-n junction based semiconducto… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: Review article, 24 pages + 19 figures

  32. arXiv:2406.11794  [pdf, other

    cs.LG cs.CL

    DataComp-LM: In search of the next generation of training sets for language models

    Authors: Jeffrey Li, Alex Fang, Georgios Smyrnis, Maor Ivgi, Matt Jordan, Samir Gadre, Hritik Bansal, Etash Guha, Sedrick Keh, Kushal Arora, Saurabh Garg, Rui Xin, Niklas Muennighoff, Reinhard Heckel, Jean Mercat, Mayee Chen, Suchin Gururangan, Mitchell Wortsman, Alon Albalak, Yonatan Bitton, Marianna Nezhurina, Amro Abbas, Cheng-Yu Hsieh, Dhruba Ghosh, Josh Gardner , et al. (34 additional authors not shown)

    Abstract: We introduce DataComp for Language Models (DCLM), a testbed for controlled dataset experiments with the goal of improving language models. As part of DCLM, we provide a standardized corpus of 240T tokens extracted from Common Crawl, effective pretraining recipes based on the OpenLM framework, and a broad suite of 53 downstream evaluations. Participants in the DCLM benchmark can experiment with dat… ▽ More

    Submitted 20 June, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

    Comments: Project page: https://www.datacomp.ai/dclm/

  33. arXiv:2406.11233  [pdf, other

    cs.LG cs.AI cs.CL

    Probing the Decision Boundaries of In-context Learning in Large Language Models

    Authors: Siyan Zhao, Tung Nguyen, Aditya Grover

    Abstract: In-context learning is a key paradigm in large language models (LLMs) that enables them to generalize to new tasks and domains by simply prompting these models with a few exemplars without explicit parameter updates. Many attempts have been made to understand in-context learning in LLMs as a function of model scale, pretraining data, and other factors. In this work, we propose a new mechanism to p… ▽ More

    Submitted 24 July, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

    Comments: 18 pages, code at https://github.com/siyan-zhao/ICL_decision_boundary

  34. arXiv:2406.11040  [pdf, other

    hep-ph

    Decays $Z\to e_ae_b$ in a 3-3-1 model with neutral leptons

    Authors: T. T. Hong, L. T. Hue, L. T. T. Phuong, N. H. T. Nha, T. Phong Nguyen

    Abstract: We investigate the 3-3-1 model with neutral leptons (called the 331$NL$ for short) and by that, we will point out that this model can simultaneously explain the lepton flavor violating (LFV) decays of the $Z$ boson $Z \to e_a e_b$, Standard model-like Higgs boson decay $h\to e_be_a$, and the charged leptons $e_b\to e_a γ$ consistent with the recent experimental data. In addition, the numerical res… ▽ More

    Submitted 19 August, 2024; v1 submitted 16 June, 2024; originally announced June 2024.

    Comments: 24 pages, 5 figures

  35. arXiv:2406.09732  [pdf, ps, other

    math.PR cs.GT econ.TH

    Finding pure Nash equilibria in large random games

    Authors: Andrea Collevecchio, Tuan-Minh Nguyen, Ziwen Zhong

    Abstract: Best Response Dynamics (BRD) is a class of strategy updating rules to find Pure Nash Equilibria (PNE) in a game. At each step, a player is randomly picked, and the player switches to a "best response" strategy based on the strategies chosen by others, so that the new strategy profile maximises their payoff. If no such strategy exists, a different player will be chosen randomly. When no player want… ▽ More

    Submitted 16 August, 2024; v1 submitted 14 June, 2024; originally announced June 2024.

    Comments: 20 pages, 4 figures, 1 table

    MSC Class: 91A10; 91A06; 60K35; 60K37

  36. arXiv:2406.09400  [pdf, other

    cs.CV cs.LG

    Yo'LLaVA: Your Personalized Language and Vision Assistant

    Authors: Thao Nguyen, Haotian Liu, Yuheng Li, Mu Cai, Utkarsh Ojha, Yong Jae Lee

    Abstract: Large Multimodal Models (LMMs) have shown remarkable capabilities across a variety of tasks (e.g., image captioning, visual question answering). While broad, their knowledge remains generic (e.g., recognizing a dog), and they are unable to handle personalized subjects (e.g., recognizing a user's pet dog). Human reasoning, in contrast, typically operates within the context of specific subjects in o… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: Project page: https://thaoshibe.github.io/YoLLaVA

  37. A Tangible Multi-Display Toolkit to Support the Collaborative Design Exploration of AV-Pedestrian Interfaces

    Authors: Marius Hoggenmuller, Martin Tomitsch, Callum Parker, Trung Thanh Nguyen, Dawei Zhou, Stewart Worrall, Eduardo Nebot

    Abstract: The advent of cyber-physical systems, such as robots and autonomous vehicles (AVs), brings new opportunities and challenges for the domain of interaction design. Though there is consensus about the value of human-centred development, there is a lack of documented tailored methods and tools for involving multiple stakeholders in design exploration processes. In this paper we present a novel approac… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

  38. arXiv:2406.07460  [pdf, ps, other

    math.PR math.AP

    Existence and asymptotic autonomous robustness of random attractors for three-dimensional stochastic globally modified Navier-Stokes equations on unbounded domains

    Authors: Bui Kim My, Ho Thi Hang, Kush Kinra, Manil T. Mohan, Pham Tri Nguyen

    Abstract: In this article, we discuss the existence and asymptotically autonomous robustness (AAR) (almost surely) of random attractors for 3D stochastic globally modified Navier-Stokes equations (SGMNSE) on Poincaré domains (which may be bounded or unbounded). Our aim is to investigate the existence and AAR of random attractors for 3D SGMNSE when the time-dependent forcing converges to a time-independent f… ▽ More

    Submitted 9 July, 2024; v1 submitted 11 June, 2024; originally announced June 2024.

    Comments: arXiv admin note: text overlap with arXiv:2208.06808

    MSC Class: 37L55; 76D05; 35B41; 37B55; 35B40

  39. arXiv:2406.06863  [pdf, other

    cs.CR cs.AI cs.HC

    Ollabench: Evaluating LLMs' Reasoning for Human-centric Interdependent Cybersecurity

    Authors: Tam n. Nguyen

    Abstract: Large Language Models (LLMs) have the potential to enhance Agent-Based Modeling by better representing complex interdependent cybersecurity systems, improving cybersecurity threat modeling and risk management. However, evaluating LLMs in this context is crucial for legal compliance and effective application development. Existing LLM evaluation frameworks often overlook the human factor and cogniti… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 12 pages, 7 figures, 2 tables The final conference/journal version may have significantly more content updates

    ACM Class: I.2.0; J.4

  40. arXiv:2406.06551  [pdf

    physics.app-ph cond-mat.mes-hall

    A Simple View on Large-Signal Resonant-Tunneling-Diode Dynamics

    Authors: Petr Ourednik, Dinh Tuan Nguyen, Michael Feiginov

    Abstract: We present a model for an accurate description of the large-signal resonant-tunneling-diode (RTD) dynamics, which allows for a simple and intuitive analysis in terms of dynamical trajectories in a phase space. We show that the RTD admittance can be accurately described by a simple RLRC equivalent circuit, which has a universal configuration, but with different circuit parameters in the large- and… ▽ More

    Submitted 27 May, 2024; originally announced June 2024.

    Comments: 2 pages, 2 figures, accepted version

    Journal ref: 2023 48th International Conference on Infrared, Millimeter, and Terahertz Waves (IRMMW-THz), Montreal, QC, Canada, 2023, pp. 1-2

  41. arXiv:2406.06409  [pdf, ps, other

    math.OC

    On the structure of the value function of optimal exit time problems

    Authors: Piermarco Cannarsa, Marco Mazzola, Khai T. Nguyen

    Abstract: In this paper, we study an optimal exit time problem with general running and terminal costs and a target $\mathcal{S}\subset\mathbb{R}^d$ having an inner ball property for a nonlinear control system that satisfies mild controllability assumptions. In particular, Petrov's condition at the boundary of $\mathcal{S}$ is not required and the value function $V$ may fail to be locally Lipschitz. In such… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 50 pages

    MSC Class: 49N60; 49N05; 49J52; 49E30

  42. arXiv:2406.06239  [pdf, other

    cs.CV

    I-MPN: Inductive Message Passing Network for Efficient Human-in-the-Loop Annotation of Mobile Eye Tracking Data

    Authors: Hoang H. Le, Duy M. H. Nguyen, Omair Shahzad Bhatti, Laszlo Kopacsi, Thinh P. Ngo, Binh T. Nguyen, Michael Barz, Daniel Sonntag

    Abstract: Comprehending how humans process visual information in dynamic settings is crucial for psychology and designing user-centered interactions. While mobile eye-tracking systems combining egocentric video and gaze signals can offer valuable insights, manual analysis of these recordings is time-intensive. In this work, we present a novel human-centered learning algorithm designed for automated object r… ▽ More

    Submitted 7 July, 2024; v1 submitted 10 June, 2024; originally announced June 2024.

    Comments: Updated version

  43. arXiv:2406.05615  [pdf, other

    cs.CL

    Video-Language Understanding: A Survey from Model Architecture, Model Training, and Data Perspectives

    Authors: Thong Nguyen, Yi Bin, Junbin Xiao, Leigang Qu, Yicong Li, Jay Zhangjie Wu, Cong-Duy Nguyen, See-Kiong Ng, Luu Anh Tuan

    Abstract: Humans use multiple senses to comprehend the environment. Vision and language are two of the most vital senses since they allow us to easily communicate our thoughts and perceive the world around us. There has been a lot of interest in creating video-language understanding systems with human-like senses since a video-language pair can mimic both our linguistic medium and visual environment with te… ▽ More

    Submitted 1 July, 2024; v1 submitted 8 June, 2024; originally announced June 2024.

    Comments: Accepted at ACL 2024 (Findings)

  44. arXiv:2406.05349  [pdf, other

    cs.CV

    Blurry-Consistency Segmentation Framework with Selective Stacking on Differential Interference Contrast 3D Breast Cancer Spheroid

    Authors: Thanh-Huy Nguyen, Thi Kim Ngan Ngo, Mai Anh Vu, Ting-Yuan Tu

    Abstract: The ability of three-dimensional (3D) spheroid modeling to study the invasive behavior of breast cancer cells has drawn increased attention. The deep learning-based image processing framework is very effective at speeding up the cell morphological analysis process. Out-of-focus photos taken while capturing 3D cells under several z-slices, however, could negatively impact the deep learning model. I… ▽ More

    Submitted 8 June, 2024; originally announced June 2024.

  45. arXiv:2406.04994  [pdf, other

    stat.ME

    Unguided structure learning of DAGs for count data

    Authors: Thi Kim Hue Nguyen, Monica Chiogna, Davide Risso

    Abstract: Mainly motivated by the problem of modelling directional dependence relationships for multivariate count data in high-dimensional settings, we present a new algorithm, called learnDAG, for learning the structure of directed acyclic graphs (DAGs). In particular, the proposed algorithm tackled the problem of learning DAGs from observational data in two main steps: (i) estimation of candidate parent… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

  46. arXiv:2406.03699  [pdf, other

    cs.CL

    M-QALM: A Benchmark to Assess Clinical Reading Comprehension and Knowledge Recall in Large Language Models via Question Answering

    Authors: Anand Subramanian, Viktor Schlegel, Abhinav Ramesh Kashyap, Thanh-Tung Nguyen, Vijay Prakash Dwivedi, Stefan Winkler

    Abstract: There is vivid research on adapting Large Language Models (LLMs) to perform a variety of tasks in high-stakes domains such as healthcare. Despite their popularity, there is a lack of understanding of the extent and contributing factors that allow LLMs to recall relevant knowledge and combine it with presented information in the clinical and biomedical domain: a fundamental pre-requisite for succes… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: Accepted at ACL 2024 (Findings)

  47. arXiv:2406.02555  [pdf, ps, other

    eess.AS cs.CL

    PhoWhisper: Automatic Speech Recognition for Vietnamese

    Authors: Thanh-Thien Le, Linh The Nguyen, Dat Quoc Nguyen

    Abstract: We introduce PhoWhisper in five versions for Vietnamese automatic speech recognition. PhoWhisper's robustness is achieved through fine-tuning the Whisper model on an 844-hour dataset that encompasses diverse Vietnamese accents. Our experimental study demonstrates state-of-the-art performances of PhoWhisper on benchmark Vietnamese ASR datasets. We have open-sourced PhoWhisper at: https://github.com… ▽ More

    Submitted 27 March, 2024; originally announced June 2024.

    Comments: Accepted to ICLR 2024 Tiny Papers Track

  48. arXiv:2406.02440  [pdf, ps, other

    math.CO math.AC math.AG

    Simplicial complexes and matroids with vanishing $T^2$

    Authors: Alexandru Constantinescu, Patricia Klein, Thai Thanh Nguyen, Anurag Singh, Lorenzo Venturello

    Abstract: We investigate quotients by radical monomial ideals for which $T^2$, the second cotangent cohomology module, vanishes. The dimension of the graded components of $T^2$, and thus their vanishing, depends only on the combinatorics of the corresponding simplicial complex. We give both a complete characterization and a full list of one dimensional complexes with $T^2=0$. We characterize the graded comp… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: 13 pages

  49. arXiv:2406.02317  [pdf, other

    cs.LG cs.AI stat.ML

    Generative Conditional Distributions by Neural (Entropic) Optimal Transport

    Authors: Bao Nguyen, Binh Nguyen, Hieu Trung Nguyen, Viet Anh Nguyen

    Abstract: Learning conditional distributions is challenging because the desired outcome is not a single distribution but multiple distributions that correspond to multiple instances of the covariates. We introduce a novel neural entropic optimal transport method designed to effectively learn generative models of conditional distributions, particularly in scenarios characterized by limited sample sizes. Our… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: 15 pages, 8 figures

  50. arXiv:2406.01029  [pdf, other

    cs.CV

    CYCLO: Cyclic Graph Transformer Approach to Multi-Object Relationship Modeling in Aerial Videos

    Authors: Trong-Thuan Nguyen, Pha Nguyen, Xin Li, Jackson Cothren, Alper Yilmaz, Khoa Luu

    Abstract: Video scene graph generation (VidSGG) has emerged as a transformative approach to capturing and interpreting the intricate relationships among objects and their temporal dynamics in video sequences. In this paper, we introduce the new AeroEye dataset that focuses on multi-object relationship modeling in aerial videos. Our AeroEye dataset features various drone scenes and includes a visually compre… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.