Zum Hauptinhalt springen

Showing 1–50 of 231 results for author: Nguyen, D

Searching in archive eess. Search in all archives.
.
  1. arXiv:2408.10665  [pdf, other

    eess.IV cs.LG

    End-to-end learned Lossy Dynamic Point Cloud Attribute Compression

    Authors: Dat Thanh Nguyen, Daniel Zieger, Marc Stamminger, Andre Kaup

    Abstract: Recent advancements in point cloud compression have primarily emphasized geometry compression while comparatively fewer efforts have been dedicated to attribute compression. This study introduces an end-to-end learned dynamic lossy attribute coding approach, utilizing an efficient high-dimensional convolution to capture extensive inter-point dependencies. This enables the efficient projection of a… ▽ More

    Submitted 20 August, 2024; originally announced August 2024.

    Comments: 6 pages, accepted for presentation at 2024 IEEE International Conference on Image Processing (ICIP) 2024

  2. arXiv:2407.20249  [pdf, other

    cs.LG eess.SP

    Revisiting the Disequilibrium Issues in Tackling Heart Disease Classification Tasks

    Authors: Thao Hoang, Linh Nguyen, Khoi Do, Duong Nguyen, Viet Dung Nguyen

    Abstract: In the field of heart disease classification, two primary obstacles arise. Firstly, existing Electrocardiogram (ECG) datasets consistently demonstrate imbalances and biases across various modalities. Secondly, these time-series data consist of diverse lead signals, causing Convolutional Neural Networks (CNNs) to become overfitting to the one with higher power, hence diminishing the performance of… ▽ More

    Submitted 19 July, 2024; originally announced July 2024.

  3. arXiv:2407.20247  [pdf, other

    eess.SP cs.AI cs.LG

    How Homogenizing the Channel-wise Magnitude Can Enhance EEG Classification Model?

    Authors: Huyen Ngo, Khoi Do, Duong Nguyen, Viet Dung Nguyen, Lan Dang

    Abstract: A significant challenge in the electroencephalogram EEG lies in the fact that current data representations involve multiple electrode signals, resulting in data redundancy and dominant lead information. However extensive research conducted on EEG classification focuses on designing model architectures without tackling the underlying issues. Otherwise, there has been a notable gap in addressing dat… ▽ More

    Submitted 19 July, 2024; originally announced July 2024.

  4. arXiv:2407.16803  [pdf, other

    cs.CV cs.AI cs.HC cs.LG eess.SP

    Fusion and Cross-Modal Transfer for Zero-Shot Human Action Recognition

    Authors: Abhi Kamboj, Anh Duy Nguyen, Minh Do

    Abstract: Despite living in a multi-sensory world, most AI models are limited to textual and visual interpretations of human motion and behavior. Inertial measurement units (IMUs) provide a salient signal to understand human motion; however, they are challenging to use due to their uninterpretability and scarcity of their data. We investigate a method to transfer knowledge between visual and inertial modali… ▽ More

    Submitted 23 July, 2024; originally announced July 2024.

  5. arXiv:2407.09534  [pdf, other

    eess.IV

    DFS-based fast crack detection

    Authors: Vsevolod Chernyshev, Vitalii Makogin, Duc Nguyen, Evgeny Spodarev

    Abstract: In this paper, we propose an fast method for crack detection in 3D computed tomography (CT) images. Our approach combines the Maximal Hessian Entry filter and a Deep-First Search algorithm-based technique to strike a balance between computational complexity and accuracy. Experimental results demonstrate the effectiveness of our approach in detecting the crack structure with predefined misclassific… ▽ More

    Submitted 25 June, 2024; originally announced July 2024.

  6. arXiv:2407.06142  [pdf, ps, other

    cs.NI eess.SY math.OC

    Delay-Aware Robust Edge Network Hardening Under Decision-Dependent Uncertainty

    Authors: Jiaming Cheng, Duong Thuy Anh Nguyen, Ni Trieu, Duong Tung Nguyen

    Abstract: Edge computing promises to offer low-latency and ubiquitous computation to numerous devices at the network edge. For delay-sensitive applications, link delays can have a direct impact on service quality. These delays can fluctuate drastically over time due to various factors such as network congestion, changing traffic conditions, cyberattacks, component failures, and natural disasters. Thus, it i… ▽ More

    Submitted 8 July, 2024; originally announced July 2024.

    Comments: 14 pages, 18 figures

  7. arXiv:2406.11220  [pdf, other

    eess.SP

    No Analog Combiner TTD-based Hybrid Precoding for Multi-User Sub-THz Communications

    Authors: Dang Qua Nguyen, Alexei Ashikhmin, Hong Yang, Taejoon Kim

    Abstract: We address the design and optimization of real-world-suitable hybrid precoders for multi-user wideband sub-terahertz (sub-THz) communications. We note that the conventional fully connected true-time delay (TTD)-based architecture is impractical because there is no room for the required large number of analog signal combiners in the circuit board. Additionally, analog signal combiners incur signifi… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  8. arXiv:2406.02879  [pdf, ps, other

    math.PR cs.CV eess.IV math.NA stat.CO

    Second-order differential operators, stochastic differential equations and Brownian motions on embedded manifolds

    Authors: Du Nguyen, Stefan Sommer

    Abstract: We specify the conditions when a manifold M embedded in an inner product space E is an invariant manifold of a stochastic differential equation (SDE) on E, linking it with the notion of second-order differential operators on M. When M is given a Riemannian metric, we derive a simple formula for the Laplace-Beltrami operator in terms of the gradient and Hessian on E and construct the Riemannian Bro… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

    MSC Class: 65C30; 65L20; 65C20; 60J65; 58J65

  9. arXiv:2406.02555  [pdf, ps, other

    eess.AS cs.CL

    PhoWhisper: Automatic Speech Recognition for Vietnamese

    Authors: Thanh-Thien Le, Linh The Nguyen, Dat Quoc Nguyen

    Abstract: We introduce PhoWhisper in five versions for Vietnamese automatic speech recognition. PhoWhisper's robustness is achieved through fine-tuning the Whisper model on an 844-hour dataset that encompasses diverse Vietnamese accents. Our experimental study demonstrates state-of-the-art performances of PhoWhisper on benchmark Vietnamese ASR datasets. We have open-sourced PhoWhisper at: https://github.com… ▽ More

    Submitted 27 March, 2024; originally announced June 2024.

    Comments: Accepted to ICLR 2024 Tiny Papers Track

  10. arXiv:2405.16664  [pdf

    eess.SP physics.med-ph

    Deep learning improved autofocus for motion artifact reduction and its application in quantitative susceptibility mapping

    Authors: Chao Li, Jinwei Zhang, Hang Zhang, Jiahao Li, Pascal Spincemaille, Thanh D. Nguyen, Yi Wang

    Abstract: Purpose: To develop a pipeline for motion artifact correction in mGRE and quantitative susceptibility mapping (QSM). Methods: Deep learning is integrated with autofocus to improve motion artifact suppression, which is applied QSM of patients with Parkinson's disease (PD). The estimation of affine motion parameters in the autofocus method depends on signal-to-noise ratio and lacks accuracy when dat… ▽ More

    Submitted 26 May, 2024; originally announced May 2024.

  11. arXiv:2405.02994  [pdf, other

    eess.SY

    Extended State Observer for Mismatch Disturbances Using Taylor Approximation of the Integral

    Authors: Cuong Duc Nguyen

    Abstract: The development of disturbance estimators using extended state observers (ESOs) typically assumes that the system is observable. This paper introduces an improved method for systems that are initially unobservable, leveraging Taylor expansion to approximate the integral of disturbance dynamics. A new extended system is formulated based on this approximation, enabling the design of an observer that… ▽ More

    Submitted 5 May, 2024; originally announced May 2024.

  12. arXiv:2405.00712  [pdf, other

    eess.SP cs.LG

    SoK: Behind the Accuracy of Complex Human Activity Recognition Using Deep Learning

    Authors: Duc-Anh Nguyen, Nhien-An Le-Khac

    Abstract: Human Activity Recognition (HAR) is a well-studied field with research dating back to the 1980s. Over time, HAR technologies have evolved significantly from manual feature extraction, rule-based algorithms, and simple machine learning models to powerful deep learning models, from one sensor type to a diverse array of sensing modalities. The scope has also expanded from recognising a limited set of… ▽ More

    Submitted 3 May, 2024; v1 submitted 25 April, 2024; originally announced May 2024.

  13. arXiv:2403.17392  [pdf, other

    cs.RO eess.SY nlin.AO

    Natural-artificial hybrid swarm: Cyborg-insect group navigation in unknown obstructed soft terrain

    Authors: Yang Bai, Phuoc Thanh Tran Ngoc, Huu Duoc Nguyen, Duc Long Le, Quang Huy Ha, Kazuki Kai, Yu Xiang See To, Yaosheng Deng, Jie Song, Naoki Wakamiya, Hirotaka Sato, Masaki Ogura

    Abstract: Navigating multi-robot systems in complex terrains has always been a challenging task. This is due to the inherent limitations of traditional robots in collision avoidance, adaptation to unknown environments, and sustained energy efficiency. In order to overcome these limitations, this research proposes a solution by integrating living insects with miniature electronic controllers to enable roboti… ▽ More

    Submitted 27 March, 2024; v1 submitted 26 March, 2024; originally announced March 2024.

  14. arXiv:2403.16595  [pdf, other

    cs.HC cs.RO eess.SY

    The Adaptive Workplace: Orchestrating Architectural Services around the Wellbeing of Individual Occupants

    Authors: Andrew Vande Moere, Sara Arko, Alena Safrova Drasilova, Tomáš Ondráček, Ilaria Pigliautile, Benedetta Pioppi, Anna Laura Pisello, Jakub Prochazka, Paula Acuna Roncancio, Davide Schaumann, Marcel Schweiker, Binh Vinh Duc Nguyen

    Abstract: As the academic consortia members of the EU Horizon project SONATA ("Situation-aware OrchestratioN of AdapTive Architecture"), we respond to the workshop call for "Office Wellbeing by Design: Don't Stand for Anything Less" by proposing the "Adaptive Workplace" concept. In essence, our vision aims to adapt a workplace to the ever-changing needs of individual occupants, instead of that occupants are… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

  15. arXiv:2403.11104  [pdf, other

    eess.SY

    Deep Neural Network NMPC for Computationally Tractable Optimal Power Management of Hybrid Electric Vehicle

    Authors: Suyong Park, Duc Giap Nguyen, Jinrak Park, Dohee Kim, Jeong Soo Eo, Kyoungseok Han

    Abstract: This study presents a method for deep neural network nonlinear model predictive control (DNN-MPC) to reduce computational complexity, and we show its practical utility through its application in optimizing the energy management of hybrid electric vehicles (HEVs). For optimal power management of HEVs, we first design the online NMPC to collect the data set, and the deep neural network is trained to… ▽ More

    Submitted 17 March, 2024; originally announced March 2024.

    Comments: 6 pages, 10 figures, 3 tables, 2024 ACC conference (accepted)

  16. arXiv:2403.08371  [pdf, other

    eess.SP

    User-Centric Beam Selection and Precoding Design for Coordinated Multiple-Satellite Systems

    Authors: Vu Nguyen Ha, Duy H. N. Nguyen, Juan C. -M. Duncan, Jorge L. Gonzalez-Rios, Juan A. Vasquez, Geoffrey Eappen, Luis M. Garces-Socarras, Rakesh Palisetty, Symeon Chatzinotas, Bjorn Ottersten

    Abstract: This paper introduces a joint optimization framework for user-centric beam selection and linear precoding (LP) design in a coordinated multiple-satellite (CoMSat) system, employing a Digital-Fourier-Transform-based (DFT) beamforming (BF) technique. Regarding serving users at their target SINRs and minimizing the total transmit power, the scheme aims to efficiently determine satellites for users to… ▽ More

    Submitted 13 March, 2024; originally announced March 2024.

  17. arXiv:2402.03648  [pdf, other

    eess.SP cs.LG

    Multilinear Kernel Regression and Imputation via Manifold Learning

    Authors: Duc Thien Nguyen, Konstantinos Slavakis

    Abstract: This paper introduces a novel nonparametric framework for data imputation, coined multilinear kernel regression and imputation via the manifold assumption (MultiL-KRIM). Motivated by manifold learning, MultiL-KRIM models data features as a point cloud located in or close to a user-unknown smooth manifold embedded in a reproducing kernel Hilbert space. Unlike typical manifold-learning routes, which… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

  18. arXiv:2402.00238  [pdf, other

    cs.LG eess.IV q-bio.QM

    CNN-FL for Biotechnology Industry Empowered by Internet-of-BioNano Things and Digital Twins

    Authors: Mohammad, Jamshidi, Dinh Thai Hoang, Diep N. Nguyen

    Abstract: Digital twins (DTs) are revolutionizing the biotechnology industry by enabling sophisticated digital representations of biological assets, microorganisms, drug development processes, and digital health applications. However, digital twinning at micro and nano scales, particularly in modeling complex entities like bacteria, presents significant challenges in terms of requiring advanced Internet of… ▽ More

    Submitted 31 January, 2024; originally announced February 2024.

  19. arXiv:2401.12488  [pdf

    eess.IV cs.CV

    An Automated Real-Time Approach for Image Processing and Segmentation of Fluoroscopic Images and Videos Using a Single Deep Learning Network

    Authors: Viet Dung Nguyen, Michael T. LaCour, Richard D. Komistek

    Abstract: Image segmentation in total knee arthroplasty is crucial for precise preoperative planning and accurate implant positioning, leading to improved surgical outcomes and patient satisfaction. The biggest challenges of image segmentation in total knee arthroplasty include accurately delineating complex anatomical structures, dealing with image artifacts and noise, and developing robust algorithms that… ▽ More

    Submitted 24 May, 2024; v1 submitted 23 January, 2024; originally announced January 2024.

  20. arXiv:2401.10032  [pdf, other

    eess.AS cs.AI eess.SP

    FreGrad: Lightweight and Fast Frequency-aware Diffusion Vocoder

    Authors: Tan Dat Nguyen, Ji-Hoon Kim, Youngjoon Jang, Jaehun Kim, Joon Son Chung

    Abstract: The goal of this paper is to generate realistic audio with a lightweight and fast diffusion-based vocoder, named FreGrad. Our framework consists of the following three key components: (1) We employ discrete wavelet transform that decomposes a complicated waveform into sub-band wavelets, which helps FreGrad to operate on a simple and concise feature space, (2) We design a frequency-aware dilated co… ▽ More

    Submitted 18 January, 2024; originally announced January 2024.

    Comments: Accepted to ICASSP 2024

  21. The Smooth Trajectory Estimator for LMB Filters

    Authors: Hoa Van Nguyen, Tran Thien Dat Nguyen, Changbeom Shim, Marzhar Anuar

    Abstract: This paper proposes a smooth-trajectory estimator for the labelled multi-Bernoulli (LMB) filter by exploiting the special structure of the generalised labelled multi-Bernoulli (GLMB) filter. We devise a simple and intuitive approach to store the best association map when approximating the GLMB random finite set (RFS) to the LMB RFS. In particular, we construct a smooth-trajectory estimator (i.e.,… ▽ More

    Submitted 1 January, 2024; originally announced January 2024.

    Comments: 6 pages, 5 figures. Presented at The 12th IEEE International Conference on Control, Automation and Information Sciences (ICCAIS 2023), Nov 2023, Hanoi, Vietnam

  22. arXiv:2312.16835  [pdf, other

    eess.IV cs.CV

    RimSet: Quantitatively Identifying and Characterizing Chronic Active Multiple Sclerosis Lesion on Quantitative Susceptibility Maps

    Authors: Hang Zhang, Thanh D. Nguyen, Jinwei Zhang, Renjiu Hu, Susan A. Gauthier, Yi Wang

    Abstract: Background: Rim+ lesions in multiple sclerosis (MS), detectable via Quantitative Susceptibility Mapping (QSM), correlate with increased disability. Existing literature lacks quantitative analysis of these lesions. We introduce RimSet for quantitative identification and characterization of rim+ lesions on QSM. Methods: RimSet combines RimSeg, an unsupervised segmentation method using level-set meth… ▽ More

    Submitted 28 December, 2023; originally announced December 2023.

    Comments: 13 pages, 7 figures, 4 tables

  23. arXiv:2312.10543  [pdf, other

    q-bio.NC eess.SP

    Study of cognitive component of auditory attention to natural speech events

    Authors: Nhan D. T. Nguyen, Kaare Mikkelsen, Preben Kidmose

    Abstract: Event-related potentials (ERP) have been used to address a wide range of research questions in neuroscience and cognitive psychology including selective auditory attention. The recent progress in auditory attention decoding (AAD) methods is based on algorithms that find a relation between the audio envelope and the neurophysiological response. The most popular approach is based on the reconstructi… ▽ More

    Submitted 19 December, 2023; v1 submitted 16 December, 2023; originally announced December 2023.

    Comments: 15 pages, 11 figures

  24. arXiv:2312.07011  [pdf, ps, other

    cs.IT eess.SP

    Securing MIMO Wiretap Channel with Learning-Based Friendly Jamming under Imperfect CSI

    Authors: Bui Minh Tuan, Diep N. Nguyen, Nguyen Linh Trung, Van-Dinh Nguyen, Nguyen Van Huynh, Dinh Thai Hoang, Marwan Krunz, Eryk Dutkiewicz

    Abstract: Wireless communications are particularly vulnerable to eavesdropping attacks due to their broadcast nature. To effectively deal with eavesdroppers, existing security techniques usually require accurate channel state information (CSI), e.g., for friendly jamming (FJ), and/or additional computing resources at transceivers, e.g., cryptography-based solutions, which unfortunately may not be feasible i… ▽ More

    Submitted 12 December, 2023; originally announced December 2023.

    Comments: 12 pages, 15 figures

  25. arXiv:2312.01777  [pdf, ps, other

    eess.SP cs.IT

    Doubly 1-Bit Quantized Massive MIMO

    Authors: Italo Atzeni, Antti Tölli, Duy H. N. Nguyen, A. Lee Swindlehurst

    Abstract: Enabling communications in the (sub-)THz band will call for massive multiple-input multiple-output (MIMO) arrays at either the transmit- or receive-side, or at both. To scale down the complexity and power consumption when operating across massive frequency and antenna dimensions, a sacrifice in the resolution of the digital-to-analog/analog-to-digital converters (DACs/ADCs) will be inevitable. In… ▽ More

    Submitted 4 December, 2023; originally announced December 2023.

    Comments: Presented at the IEEE Asilomar Conference on Signals, Systems, and Computers 2023

  26. arXiv:2311.15041  [pdf, other

    cs.LG cs.AI eess.SP

    MPCNN: A Novel Matrix Profile Approach for CNN-based Sleep Apnea Classification

    Authors: Hieu X. Nguyen, Duong V. Nguyen, Hieu H. Pham, Cuong D. Do

    Abstract: Sleep apnea (SA) is a significant respiratory condition that poses a major global health challenge. Previous studies have investigated several machine and deep learning models for electrocardiogram (ECG)-based SA diagnoses. Despite these advancements, conventional feature extractions derived from ECG signals, such as R-peaks and RR intervals, may fail to capture crucial information encompassed wit… ▽ More

    Submitted 25 November, 2023; originally announced November 2023.

  27. arXiv:2311.11096  [pdf, other

    eess.IV cs.CV

    On the Out of Distribution Robustness of Foundation Models in Medical Image Segmentation

    Authors: Duy Minh Ho Nguyen, Tan Ngoc Pham, Nghiem Tuong Diep, Nghi Quoc Phan, Quang Pham, Vinh Tong, Binh T. Nguyen, Ngan Hoang Le, Nhat Ho, Pengtao Xie, Daniel Sonntag, Mathias Niepert

    Abstract: Constructing a robust model that can effectively generalize to test samples under distribution shifts remains a significant challenge in the field of medical imaging. The foundational models for vision and language, pre-trained on extensive sets of natural image and text data, have emerged as a promising approach. It showcases impressive learning abilities across different tasks with the need for… ▽ More

    Submitted 18 November, 2023; originally announced November 2023.

    Comments: Advances in Neural Information Processing Systems (NeurIPS) 2023, Workshop on robustness of zero/few-shot learning in foundation models

  28. arXiv:2311.01715  [pdf, other

    cs.SD eess.AS eess.SP

    Acousto-optic reconstruction of exterior sound field based on concentric circle sampling with circular harmonic expansion

    Authors: Phuc Duc Nguyen, Kenji Ishikawa, Noboru Harada, Takehiro Moriya

    Abstract: Acousto-optic sensing provides an alternative approach to traditional microphone arrays by shedding light on the interaction of light with an acoustic field. Sound field reconstruction is a fascinating and advanced technique used in acousto-optics sensing. Current challenges in sound-field reconstruction methods pertain to scenarios in which the sound source is located within the reconstruction ar… ▽ More

    Submitted 3 November, 2023; originally announced November 2023.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  29. arXiv:2310.14506  [pdf, other

    eess.SP cs.DB

    Label Space Partition Selection for Multi-Object Tracking Using Two-Layer Partitioning

    Authors: Ji Youn Lee, Changbeom Shim, Hoa Van Nguyen, Tran Thien Dat Nguyen, Hyunjin Choi, Youngho Kim

    Abstract: Estimating the trajectories of multi-objects poses a significant challenge due to data association ambiguity, which leads to a substantial increase in computational requirements. To address such problems, a divide-and-conquer manner has been employed with parallel computation. In this strategy, distinguished objects that have unique labels are grouped based on their statistical dependencies, the i… ▽ More

    Submitted 22 October, 2023; originally announced October 2023.

    Comments: 6 pages, 4 figures

  30. arXiv:2310.00418  [pdf, other

    eess.IV cs.CV

    MVC: A Multi-Task Vision Transformer Network for COVID-19 Diagnosis from Chest X-ray Images

    Authors: Huyen Tran, Duc Thanh Nguyen, John Yearwood

    Abstract: Medical image analysis using computer-based algorithms has attracted considerable attention from the research community and achieved tremendous progress in the last decade. With recent advances in computing resources and availability of large-scale medical image datasets, many deep learning models have been developed for disease diagnosis from medical images. However, existing techniques focus on… ▽ More

    Submitted 30 September, 2023; originally announced October 2023.

  31. arXiv:2309.16699  [pdf

    cs.RO eess.SY

    Circular-Line Trajectory Tracking Controller for Mobile Robot using Multi-Pixy2 Sensors

    Authors: Xuan Quang Ngo, Tri Duc Tran, Huy Hung Nguyen, Van Dong Nguyen, Van Tu Duong, Tan Tien Nguyen

    Abstract: This study suggests a novel tracking method that employs three Pixy2 sensors to identify the desired line trajectories instead of traditional perceiving means. Firstly, the kinematic model of the mobile robot is derived from the information gathered by three Pixy2 sensors. Secondly, the sliding mode controller is implemented to regulate the tracking error. Finally, simulation results are analyzed… ▽ More

    Submitted 12 August, 2023; originally announced September 2023.

    Comments: 6 pages, 12 figures, the 2023 International Symposium on Electrical and Electronics Engineering, Ho Chi Minh, Viet Nam, 2023

  32. arXiv:2309.15053  [pdf

    eess.IV

    Thalamic nuclei segmentation from T$_1$-weighted MRI: unifying and benchmarking state-of-the-art methods with young and old cohorts

    Authors: Brendan Williams, Dan Nguyen, Julie Vidal, Alzheimer's Disease Neuroimaging Initiative, Manojkumar Saranathan

    Abstract: The thalamus and its constituent nuclei are critical for a broad range of cognitive and sensorimotor processes, and implicated in many neurological and neurodegenerative conditions. However, the functional involvement and specificity of thalamic nuclei in human neuroimaging is underappreciated and not well studied due, in part, to technical challenges of accurately identifying and segmenting nucle… ▽ More

    Submitted 26 September, 2023; originally announced September 2023.

    Comments: 10 figures, 4 tables, 3 supplemental figures, 2 supplemental tables

  33. Double RIS-Assisted MIMO Systems Over Spatially Correlated Rician Fading Channels and Finite Scatterers

    Authors: Ha An Le, Trinh Van Chien, Van Duc Nguyen, Wan Choi

    Abstract: This paper investigates double RIS-assisted MIMO communication systems over Rician fading channels with finite scatterers, spatial correlation, and the existence of a double-scattering link between the transceiver. First, the statistical information is driven in closed form for the aggregated channels, unveiling various influences of the system and environment on the average channel power gains. N… ▽ More

    Submitted 8 September, 2023; originally announced September 2023.

    Comments: 15 pages, 9 figures, accepted by IEEE Transactions on Communications

  34. arXiv:2309.03317  [pdf, other

    cs.IT eess.SP

    Sub-Array Selection in Full-Duplex Massive MIMO for Enhanced Self-Interference Suppression

    Authors: Mobeen Mahmood, Asil Koc, Duc Tuong Nguyen, Robert Morawski, Tho Le-Ngoc

    Abstract: This study considers a novel full-duplex (FD) massive multiple-input multiple-output (mMIMO) system using hybrid beamforming (HBF) architecture, which allows for simultaneous uplink (UL) and downlink (DL) transmission over the same frequency band. Particularly, our objective is to mitigate the strong self-interference (SI) solely on the design of UL and DL RF beamforming stages jointly with sub-ar… ▽ More

    Submitted 6 September, 2023; originally announced September 2023.

    Comments: This paper has been accepted for publication in IEEE Globecom 2023

  35. arXiv:2308.11557  [pdf, other

    eess.IV cs.CV

    Open Set Synthetic Image Source Attribution

    Authors: Shengbang Fang, Tai D. Nguyen, Matthew C. Stamm

    Abstract: AI-generated images have become increasingly realistic and have garnered significant public attention. While synthetic images are intriguing due to their realism, they also pose an important misinformation threat. To address this new threat, researchers have developed multiple algorithms to detect synthetic images and identify their source generators. However, most existing source attribution tech… ▽ More

    Submitted 22 August, 2023; originally announced August 2023.

  36. arXiv:2308.09603  [pdf, ps, other

    eess.SY

    A Convergence Predictor Model for Consensus-based Decentralised Energy Markets

    Authors: Parikshit Pareek, L. P. Mohasha Isuru Sampath, Hung D. Nguyen, Eddy Y. S. Foo

    Abstract: This letter introduces a convergence prediction model (CPM) for decentralized market clearing mechanisms. The CPM serves as a tool to detect potential cyber-attacks that affect the convergence of the consensus mechanism during ongoing market clearing operations. In this study, we propose a successively elongating Bayesian logistic regression approach to model the probability of convergence of real… ▽ More

    Submitted 18 August, 2023; originally announced August 2023.

  37. arXiv:2307.10017  [pdf, ps, other

    math.DG cs.RO eess.SY math-ph

    Geometry in global coordinates in mechanics and optimal transport

    Authors: Du Nguyen

    Abstract: For a manifold embedded in an inner product space, we express geometric quantities such as {\it Hamilton vector fields, affine and Levi-Civita connections, curvature} in global coordinates. Instead of coordinate indices, the global formulas for most quantities are expressed as {\it operator-valued} expressions, using an {\it affine projection} to the tangent bundle. For a submersion image of an em… ▽ More

    Submitted 19 July, 2023; originally announced July 2023.

    MSC Class: 53C05; 53C42; 70H05; 70H45; 70H33; 53D05; 53Z30; 53Z50

  38. arXiv:2307.01062  [pdf, other

    cs.RO eess.SY

    A Data-Driven Approach to Geometric Modeling of Systems with Low-Bandwidth Actuator Dynamics

    Authors: Siming Deng, Junning Liu, Bibekananda Datta, Aishwarya Pantula, David H. Gracias, Thao D. Nguyen, Brian A. Bittner, Noah J. Cowan

    Abstract: It is challenging to perform system identification on soft robots due to their underactuated, high-dimensional dynamics. In this work, we present a data-driven modeling framework, based on geometric mechanics (also known as gauge theory) that can be applied to systems with low-bandwidth control of the system's internal configuration. This method constructs a series of connected models comprising a… ▽ More

    Submitted 3 October, 2023; v1 submitted 3 July, 2023; originally announced July 2023.

    Comments: 9 pages, 6 figures

  39. arXiv:2306.12925  [pdf, other

    cs.CL cs.AI cs.SD eess.AS stat.ML

    AudioPaLM: A Large Language Model That Can Speak and Listen

    Authors: Paul K. Rubenstein, Chulayuth Asawaroengchai, Duc Dung Nguyen, Ankur Bapna, Zalán Borsos, Félix de Chaumont Quitry, Peter Chen, Dalia El Badawy, Wei Han, Eugene Kharitonov, Hannah Muckenhirn, Dirk Padfield, James Qin, Danny Rozenberg, Tara Sainath, Johan Schalkwyk, Matt Sharifi, Michelle Tadmor Ramanovich, Marco Tagliasacchi, Alexandru Tudor, Mihajlo Velimirović, Damien Vincent, Jiahui Yu, Yongqiang Wang, Vicky Zayats , et al. (5 additional authors not shown)

    Abstract: We introduce AudioPaLM, a large language model for speech understanding and generation. AudioPaLM fuses text-based and speech-based language models, PaLM-2 [Anil et al., 2023] and AudioLM [Borsos et al., 2022], into a unified multimodal architecture that can process and generate text and speech with applications including speech recognition and speech-to-speech translation. AudioPaLM inherits the… ▽ More

    Submitted 22 June, 2023; originally announced June 2023.

    Comments: Technical report

  40. arXiv:2306.01159  [pdf, other

    quant-ph eess.SY

    Quantum-based Distributed Algorithms for Edge Node Placement and Workload Allocation

    Authors: Duong The Do, Ni Trieu, Duong Tung Nguyen

    Abstract: Edge computing is a promising technology that offers a superior user experience and enables various innovative Internet of Things applications. In this paper, we present a mixed-integer linear programming (MILP) model for optimal edge server placement and workload allocation, which is known to be NP-hard. To this end, we explore the possibility of addressing this computationally challenging proble… ▽ More

    Submitted 1 June, 2023; originally announced June 2023.

  41. arXiv:2305.19709  [pdf, other

    cs.CL cs.SD eess.AS

    XPhoneBERT: A Pre-trained Multilingual Model for Phoneme Representations for Text-to-Speech

    Authors: Linh The Nguyen, Thinh Pham, Dat Quoc Nguyen

    Abstract: We present XPhoneBERT, the first multilingual model pre-trained to learn phoneme representations for the downstream text-to-speech (TTS) task. Our XPhoneBERT has the same model architecture as BERT-base, trained using the RoBERTa pre-training approach on 330M phoneme-level sentences from nearly 100 languages and locales. Experimental results show that employing XPhoneBERT as an input phoneme encod… ▽ More

    Submitted 31 May, 2023; originally announced May 2023.

    Comments: In Proceedings of INTERSPEECH 2023 (to appear)

  42. arXiv:2305.08754  [pdf, other

    cs.IT eess.SP

    On the Stability of Approximate Message Passing with Independent Measurement Ensembles

    Authors: Dang Qua Nguyen, Taejoon Kim

    Abstract: Approximate message passing (AMP) is a scalable, iterative approach to signal recovery. For structured random measurement ensembles, including independent and identically distributed (i.i.d.) Gaussian and rotationally-invariant matrices, the performance of AMP can be characterized by a scalar recursion called state evolution (SE). The pseudo-Lipschitz (polynomial) smoothness is conventionally assu… ▽ More

    Submitted 25 June, 2023; v1 submitted 15 May, 2023; originally announced May 2023.

  43. arXiv:2305.05165  [pdf

    eess.SY math.OC

    Assessing the optimal contributions of renewables and carbon capture and storage toward carbon neutrality by 2050

    Authors: Dinh Hoa Nguyen, Andrew Chapman, Takeshi Tsuji

    Abstract: Building on the carbon reduction targets agreed in the Paris Agreements, many nations have renewed their efforts toward achieving carbon neutrality by the year 2050. In line with this ambitious goal, nations are seeking to understand the appropriate combination of technologies which will enable the required reductions in such a way that they are appealing to investors. Around the globe, solar and… ▽ More

    Submitted 9 May, 2023; originally announced May 2023.

  44. arXiv:2305.03844  [pdf

    eess.IV cs.CV

    Physics-based network fine-tuning for robust quantitative susceptibility mapping from high-pass filtered phase

    Authors: Jinwei Zhang, Alexey Dimov, Chao Li, Hang Zhang, Thanh D. Nguyen, Pascal Spincemaille, Yi Wang

    Abstract: Purpose: To improve the generalization ability of convolutional neural network (CNN) based prediction of quantitative susceptibility mapping (QSM) from high-pass filtered phase (HPFP) image. Methods: The proposed network addresses two common generalization issues that arise when using a pre-trained network to predict QSM from HPFP: a) data with unseen voxel sizes, and b) data with unknown high-pas… ▽ More

    Submitted 5 May, 2023; originally announced May 2023.

  45. Bearing-Based Network Localization Under Randomized Gossip Protocol

    Authors: Nhat-Minh Le-Phan, Minh Hoang Trinh, Phuoc Doan Nguyen

    Abstract: In this paper, we consider a randomized gossip algorithm for the bearing-based network localization problem. Let each sensor node be able to obtain the bearing vectors and communicate its position estimates with several neighboring agents. Each update involves two agents, and the update sequence follows a stochastic process. Under the assumption that the network is infinitesimally bearing rigid an… ▽ More

    Submitted 17 January, 2024; v1 submitted 27 April, 2023; originally announced April 2023.

    Comments: preprint, 6 pages, 2 figures. Published in the Proceeding of the 12th International Conference on Control, Automation and Information Sciences (ICCAIS). arXiv admin note: text overlap with arXiv:2303.14733

  46. The Bjøntegaard Bible -- Why your Way of Comparing Video Codecs May Be Wrong

    Authors: Christian Herglotz, Hannah Och, Anna Meyer, Geetha Ramasubbu, Lena Eichermüller, Matthias Kränzler, Fabian Brand, Kristian Fischer, Dat Thanh Nguyen, Andy Regensky, André Kaup

    Abstract: In this paper, we provide an in-depth assessment on the Bjøntegaard Delta. We construct a large data set of video compression performance comparisons using a diverse set of metrics including PSNR, VMAF, bitrate, and processing energies. These metrics are evaluated for visual data types such as classic perspective video, 360$^\circ$ video, point clouds, and screen content. As compression technology… ▽ More

    Submitted 22 December, 2023; v1 submitted 25 April, 2023; originally announced April 2023.

    Comments: 21 pages, 14 figures

  47. arXiv:2304.11476  [pdf

    eess.IV

    Maximum Spherical Mean Value (mSMV) Filtering for Whole Brain Quantitative Susceptibility Mapping

    Authors: Alexandra G. Roberts, Dominick J. Romano, Mert Şişman, Alexey V. Dimov, Pascal Spincemaille, Thanh D. Nguyen, Ilhami Kovanlikaya, Susan A. Gauthier, Yi Wang

    Abstract: To develop a tissue field filtering algorithm, called maximum Spherical Mean Value (mSMV), for reducing shadow artifacts in quantitative susceptibility mapping (QSM) of the brain without requiring brain tissue erosion.Residual background field is a major source of shadow artifacts in QSM. The mSMV algorithm filters large field values near the border, where the maximum value of the harmonic backgro… ▽ More

    Submitted 27 November, 2023; v1 submitted 22 April, 2023; originally announced April 2023.

    Comments: 12 pages, 5 figures

  48. mcLARO: Multi-Contrast Learned Acquisition and Reconstruction Optimization for simultaneous quantitative multi-parametric mapping

    Authors: Jinwei Zhang, Thanh D. Nguyen, Eddy Solomon, Chao Li, Qihao Zhang, Jiahao Li, Hang Zhang, Pascal Spincemaille, Yi Wang

    Abstract: Purpose: To develop a method for rapid sub-millimeter T1, T2, T2* and QSM mapping in a single scan using multi-contrast Learned Acquisition and Reconstruction Optimization (mcLARO). Methods: A pulse sequence was developed by interleaving inversion recovery and T2 magnetization preparations and single-echo and multi-echo gradient echo acquisitions, which sensitized k-space data to T1, T2, T2* and… ▽ More

    Submitted 6 April, 2023; originally announced April 2023.

    Journal ref: Magn Reson Med. 2024; 91: 344-356

  49. arXiv:2304.03041  [pdf, other

    eess.SP cs.LG eess.IV

    Multi-Linear Kernel Regression and Imputation in Data Manifolds

    Authors: Duc Thien Nguyen, Konstantinos Slavakis

    Abstract: This paper introduces an efficient multi-linear nonparametric (kernel-based) approximation framework for data regression and imputation, and its application to dynamic magnetic-resonance imaging (dMRI). Data features are assumed to reside in or close to a smooth manifold embedded in a reproducing kernel Hilbert space. Landmark points are identified to describe concisely the point cloud of features… ▽ More

    Submitted 6 April, 2023; originally announced April 2023.

  50. Randomized Matrix Weighted Consensus

    Authors: Nhat-Minh Le-Phan, Minh Hoang Trinh, Phuoc Doan Nguyen

    Abstract: In this paper, randomized gossip-type matrix-weighted consensus algorithms are proposed for both leaderless and leader-follower topologies. First, we introduce the notion of expected matrix-weighted network, which captures the multi-dimensional interactions between any two agents in a probabilistic sense. Under some mild assumptions on the distribution of the expected matrix weights and the upper… ▽ More

    Submitted 6 February, 2024; v1 submitted 26 March, 2023; originally announced March 2023.

    Comments: 32 pages, 6 figures, preprint