Showing 1–50 of 82 results for author: Pham, L

Searching in archive cs.
  1. arXiv:2407.06697  [pdf, other]

    cs.LG

    Certified Continual Learning for Neural Network Regression

    Authors: Long H. Pham, Jun Sun

    Abstract: On the one hand, there has been considerable progress on neural network verification in recent years, which makes certifying neural networks a possibility. On the other hand, neural networks in practice are often re-trained over time to cope with new data distribution or for solving different tasks (a.k.a. continual learning). Once re-trained, the verified correctness of the neural network is like… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

  2. arXiv:2407.03110  [pdf, other]

    cs.SD cs.AI eess.AS

    A Toolchain for Comprehensive Audio/Video Analysis Using Deep Learning Based Multimodal Approach (A use case of riot or violent context detection)

    Authors: Lam Pham, Phat Lam, Tin Nguyen, Hieu Tang, Alexander Schindler

    Abstract: In this paper, we present a toolchain for a comprehensive audio/video analysis by leveraging deep learning based multimodal approach. To this end, different specific tasks of Speech to Text (S2T), Acoustic Scene Classification (ASC), Acoustic Event Detection (AED), Visual Object Detection (VOD), Image Captioning (IC), and Video Captioning (VC) are conducted and integrated into the toolchain. By co… ▽ More

    Submitted 2 May, 2024; originally announced July 2024.

  3. arXiv:2407.01777  [pdf, other]

    cs.SD cs.AI eess.AS

    Deepfake Audio Detection Using Spectrogram-based Feature and Ensemble of Deep Learning Models

    Authors: Lam Pham, Phat Lam, Truong Nguyen, Huyen Nguyen, Alexander Schindler

    Abstract: In this paper, we propose a deep learning based system for the task of deepfake audio detection. In particular, the raw input audio is first transformed into various spectrograms using three transformation methods, Short-time Fourier Transform (STFT), Constant-Q Transform (CQT), and Wavelet Transform (WT), combined with different auditory-based filters of Mel, Gammatone, linear filters (LF), and dis… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  4. arXiv:2405.14781  [pdf, other]

    cs.CR cs.AI

    Unified Neural Backdoor Removal with Only Few Clean Samples through Unlearning and Relearning

    Authors: Nay Myat Min, Long H. Pham, Jun Sun

    Abstract: The application of deep neural network models in various security-critical applications has raised significant security concerns, particularly the risk of backdoor attacks. Neural backdoors pose a serious security threat as they allow attackers to maliciously alter model behavior. While many defenses have been explored, existing approaches are often bounded by model-specific constraints, or necess… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

  5. arXiv:2405.09330  [pdf, other]

    cs.SE

    BARO: Robust Root Cause Analysis for Microservices via Multivariate Bayesian Online Change Point Detection

    Authors: Luan Pham, Huong Ha, Hongyu Zhang

    Abstract: Detecting failures and identifying their root causes promptly and accurately is crucial for ensuring the availability of microservice systems. A typical failure troubleshooting pipeline for microservices consists of two phases: anomaly detection and root cause analysis. While various existing works on root cause analysis require accurate anomaly detection, there is no guarantee of accurate estimat… ▽ More

    Submitted 15 May, 2024; originally announced May 2024.

    Comments: This paper has been accepted to FSE'24

  6. arXiv:2405.06870  [pdf, other]

    cs.IT

    Noise-Tolerant Codebooks for Semi-Quantitative Group Testing: Application to Spatial Genomics

    Authors: Kok Hao Chen, Duc Tu Dao, Han Mao Kiah, Van Long Phuoc Pham, Eitan Yaakobi

    Abstract: Motivated by applications in spatial genomics, we revisit group testing (Dorfman~1943) and propose the class of $λ$-{\sf ADD}-codes, studying such codes with certain distance $d$ and codelength $n$. When $d$ is constant, we provide explicit code constructions with rates close to $1/2$. When $d$ is proportional to $n$, we provide a GV-type lower bound whose rates are efficiently computable. Upper b… ▽ More

    Submitted 10 May, 2024; originally announced May 2024.

    Comments: To appear in ISIT 2024 Proceedings

  7. arXiv:2403.00379  [pdf, other]

    eess.AS cs.SD

    The Impact of Frequency Bands on Acoustic Anomaly Detection of Machines using Deep Learning Based Model

    Authors: Tin Nguyen, Lam Pham, Phat Lam, Dat Ngo, Hieu Tang, Alexander Schindler

    Abstract: In this paper, we propose a deep learning based model for Acoustic Anomaly Detection of Machines, the task for detecting abnormal machines by analysing the machine sound. By conducting extensive experiments, we indicate that multiple techniques of pseudo audios, audio segment, data augmentation, Mahalanobis distance, and narrow frequency bands, which mainly focus on feature engineering, are effect… ▽ More

    Submitted 1 March, 2024; originally announced March 2024.

  8. arXiv:2401.17571  [pdf, other]

    eess.IV cs.CV

    Is Registering Raw Tagged-MR Enough for Strain Estimation in the Era of Deep Learning?

    Authors: Zhangxing Bian, Ahmed Alshareef, Shuwen Wei, Junyu Chen, Yuli Wang, Jonghye Woo, Dzung L. Pham, Jiachen Zhuo, Aaron Carass, Jerry L. Prince

    Abstract: Magnetic Resonance Imaging with tagging (tMRI) has long been utilized for quantifying tissue motion and strain during deformation. However, a phenomenon known as tag fading, a gradual decrease in tag visibility over time, often complicates post-processing. The first contribution of this study is to model tag fading by considering the interplay between $T_1$ relaxation and the repeated application… ▽ More

    Submitted 30 January, 2024; originally announced January 2024.

    Comments: Accepted to SPIE Medical Imaging 2024 (oral)

  9. arXiv:2401.15854  [pdf, other]

    cs.CL

    LSTM-based Deep Neural Network With A Focus on Sentence Representation for Sequential Sentence Classification in Medical Scientific Abstracts

    Authors: Phat Lam, Lam Pham, Tin Nguyen, Hieu Tang, Michael Seidl, Medina Andresel, Alexander Schindler

    Abstract: The Sequential Sentence Classification task within the domain of medical abstracts, termed as SSC, involves the categorization of sentences into pre-defined headings based on their roles in conveying critical information in the abstract. In the SSC task, sentences are sequentially related to each other. For this reason, the role of sentence embeddings is crucial for capturing both the semantic inf… ▽ More

    Submitted 31 May, 2024; v1 submitted 28 January, 2024; originally announced January 2024.

    Comments: Submitted to FedCSIS 2024

  10. arXiv:2401.11487  [pdf, other]

    cs.CL cs.CY

    Towards Better Inclusivity: A Diverse Tweet Corpus of English Varieties

    Authors: Nhi Pham, Lachlan Pham, Adam L. Meyers

    Abstract: The prevalence of social media presents a growing opportunity to collect and analyse examples of English varieties. Whilst usage of these varieties was - and, in many cases, still is - used only in spoken contexts or hard-to-access private messages, social media sites like Twitter provide a platform for users to communicate informally in a scrapeable format. Notably, Indian English (Hinglish), Sin… ▽ More

    Submitted 21 January, 2024; originally announced January 2024.

    Comments: 10 pages (including limitations, references and appendices), 2 figures

  11. arXiv:2312.16717  [pdf, other]

    cs.CV cs.LG eess.IV

    Landslide Detection and Segmentation Using Remote Sensing Images and Deep Neural Network

    Authors: Cam Le, Lam Pham, Jasmin Lampert, Matthias Schlögl, Alexander Schindler

    Abstract: Knowledge about historic landslide event occurrence is important for supporting disaster risk reduction strategies. Building upon findings from 2022 Landslide4Sense Competition, we propose a deep neural network based system for landslide detection and segmentation from multisource remote sensing image input. We use a U-Net trained with Cross Entropy loss as baseline model. We then improve the U-Ne… ▽ More

    Submitted 27 December, 2023; originally announced December 2023.

  12. arXiv:2312.12414  [pdf, ps, other]

    cs.DB cs.AI cs.LG

    Translating Natural Language Queries to SQL Using the T5 Model

    Authors: Albert Wong, Lien Pham, Young Lee, Shek Chan, Razel Sadaya, Youry Khmelevsky, Mathias Clement, Florence Wing Yau Cheng, Joe Mahony, Michael Ferri

    Abstract: This paper presents the development process of a natural language to SQL model using the T5 model as the basis. The models, developed in August 2022 for an online transaction processing system and a data warehouse, have a 73\% and 84\% exact match accuracy respectively. These models, in conjunction with other work completed in the research project, were implemented for several companies and used s… ▽ More

    Submitted 12 December, 2023; originally announced December 2023.

  13. arXiv:2312.01460  [pdf, other]

    eess.IV cs.CV

    Towards an accurate and generalizable multiple sclerosis lesion segmentation model using self-ensembled lesion fusion

    Authors: Jinwei Zhang, Lianrui Zuo, Blake E. Dewey, Samuel W. Remedios, Dzung L. Pham, Aaron Carass, Jerry L. Prince

    Abstract: Automatic multiple sclerosis (MS) lesion segmentation using multi-contrast magnetic resonance (MR) images provides improved efficiency and reproducibility compared to manual delineation. Current state-of-the-art automatic MS lesion segmentation methods utilize modified U-Net-like architectures. However, in the literature, dedicated architecture modifications were always required to maximize their… ▽ More

    Submitted 3 December, 2023; originally announced December 2023.

  14. arXiv:2311.09231  [pdf, other]

    cs.CY cs.AI

    Key Factors Affecting European Reactions to AI in European Full and Flawed Democracies

    Authors: Long Pham, Barry O'Sullivan, Tai Tan Mai

    Abstract: This study examines the key factors that affect European reactions to artificial intelligence (AI) in the context of both full and flawed democracies in Europe. Analysing a dataset of 4,006 respondents, categorised into full democracies and flawed democracies based on the Democracy Index developed by the Economist Intelligence Unit (EIU), this research identifies crucial factors that shape Europea… ▽ More

    Submitted 4 October, 2023; originally announced November 2023.

    Comments: IJCAI DemocrAI 2023 - The 2nd International Workshop on Democracy and AI in conjunction with IJCAI 2023, Macau

  15. arXiv:2310.11477  [pdf, other]

    cs.LG cs.AI

    Robust-MBFD: A Robust Deep Learning System for Motor Bearing Faults Detection Using Multiple Deep Learning Training Strategies and A Novel Double Loss Function

    Authors: Khoa Tran, Lam Pham, Hai-Canh Vu

    Abstract: This paper presents a comprehensive analysis of motor bearing fault detection (MBFD), which involves the task of identifying faults in a motor bearing based on its vibration. To this end, we first propose and evaluate various machine learning based systems for the MBFD task. Furthermore, we propose three deep learning based systems for the MBFD task, each of which explores one of the following tra… ▽ More

    Submitted 17 October, 2023; originally announced October 2023.

  16. arXiv:2309.06157  [pdf, other]

    cs.LG cs.AI

    Robust-MBDL: A Robust Multi-branch Deep Learning Based Model for Remaining Useful Life Prediction and Operational Condition Identification of Rotating Machines

    Authors: Khoa Tran, Hai-Canh Vu, Lam Pham, Nassim Boudaoud

    Abstract: In this paper, a Robust Multi-branch Deep learning-based system for remaining useful life (RUL) prediction and condition operations (CO) identification of rotating machines is proposed. In particular, the proposed system comprises main components: (1) an LSTM-Autoencoder to denoise the vibration data; (2) a feature extraction to generate time-domain, frequency-domain, and time-frequency based feat… ▽ More

    Submitted 14 December, 2023; v1 submitted 12 September, 2023; originally announced September 2023.

  17. arXiv:2309.01261  [pdf, other]

    cs.PL cs.DC cs.LO

    Worst-Case Input Generation for Concurrent Programs under Non-Monotone Resource Metrics

    Authors: Long Pham, Jan Hoffmann

    Abstract: Worst-case input generation aims to automatically generate inputs that exhibit the worst-case performance of programs. It has several applications, and can, for example, detect vulnerabilities to denial-of-service attacks. However, it is non-trivial to generate worst-case inputs for concurrent programs, particularly for resources like memory where the peak cost depends on how processes are schedul… ▽ More

    Submitted 3 September, 2023; originally announced September 2023.

  18. arXiv:2308.09979  [pdf, ps, other]

    cs.CY cs.AI

    Artificial Intelligence across Europe: A Study on Awareness, Attitude and Trust

    Authors: Teresa Scantamburlo, Atia Cortés, Francesca Foffano, Cristian Barrué, Veronica Distefano, Long Pham, Alessandro Fabris

    Abstract: This paper presents the results of an extensive study investigating the opinions on Artificial Intelligence (AI) of a sample of 4,006 European citizens from eight distinct countries (France, Germany, Italy, Netherlands, Poland, Romania, Spain, and Sweden). The aim of the study is to gain a better understanding of people's views and perceptions within the European context, which is already marked b… ▽ More

    Submitted 19 August, 2023; originally announced August 2023.

  19. arXiv:2307.02289  [pdf, other]

    cs.CR cs.SE

    Fuzzing with Quantitative and Adaptive Hot-Bytes Identification

    Authors: Tai D. Nguyen, Long H. Pham, Jun Sun

    Abstract: Fuzzing has emerged as a powerful technique for finding security bugs in complicated real-world applications. American fuzzy lop (AFL), a leading fuzzing tool, has demonstrated its powerful bug finding ability through a vast number of reported CVEs. However, its random mutation strategy is unable to generate test inputs that satisfy complicated branching conditions (e.g., magic-byte comparisons, c… ▽ More

    Submitted 5 July, 2023; originally announced July 2023.

  20. arXiv:2306.14929  [pdf, other]

    cs.SD eess.AS

    A Deep Learning Architecture with Spatio-Temporal Focusing for Detecting Respiratory Anomalies

    Authors: Dat Ngo, Lam Pham, Huy Phan, Minh Tran, Delaram Jarchi

    Abstract: This paper presents a deep learning system applied for detecting anomalies from respiratory sound recordings. Our system initially performs audio feature extraction using Continuous Wavelet transformation. This transformation converts the respiratory sound input into a two-dimensional spectrogram where both spectral and temporal features are presented. Then, our proposed deep learning architecture… ▽ More

    Submitted 25 June, 2023; originally announced June 2023.

    Comments: arXiv admin note: text overlap with arXiv:2303.04104

  21. arXiv:2305.09463  [pdf, other]

    cs.SD cs.AI eess.AS

    Low-complexity deep learning frameworks for acoustic scene classification using teacher-student scheme and multiple spectrograms

    Authors: Lam Pham, Dat Ngo, Cam Le, Anahid Jalali, Alexander Schindler

    Abstract: In this technical report, a low-complexity deep learning system for acoustic scene classification (ASC) is presented. The proposed system comprises two main phases: (Phase I) Training a teacher network; and (Phase II) training a student network using distilled knowledge from the teacher. In the first phase, the teacher, which presents a large footprint model, is trained. After training the teacher… ▽ More

    Submitted 16 May, 2023; originally announced May 2023.

    Comments: arXiv admin note: text overlap with arXiv:2206.06057

  22. arXiv:2305.07274  [pdf, other]

    cs.IT

    Deletion Correcting Codes for Efficient DNA Synthesis

    Authors: Johan Chrisnata, Han Mao Kiah, Van Long Phuoc Pham

    Abstract: The synthesis of DNA strands remains the most costly part of the DNA storage system. Thus, to make DNA storage system more practical, the time and materials used in the synthesis process have to be optimized. We consider the most common type of synthesis process where multiple DNA strands are synthesized in parallel from a common alternating supersequence, one nucleotide at a time. The synthesis t… ▽ More

    Submitted 12 May, 2023; originally announced May 2023.

    Comments: A shorter version of this paper will be presented at ISIT 2023

  23. arXiv:2305.01476  [pdf, other]

    cs.SD cs.MM eess.AS

    Deep Learning Based Multimodal with Two-phase Training Strategy for Daily Life Video Classification

    Authors: Lam Pham, Trang Le, Cam Le, Dat Ngo, Weissenfeld Axel, Alexander Schindler

    Abstract: In this paper, we present a deep learning based multimodal system for classifying daily life videos. To train the system, we propose a two-phase training strategy. In the first training phase (Phase I), we extract the audio and visual (image) data from the original video. We then train the audio data and the visual data with independent deep learning based models. After the training processes, we… ▽ More

    Submitted 30 April, 2023; originally announced May 2023.

  24. arXiv:2303.04104  [pdf, other]

    cs.SD cs.LG eess.AS q-bio.QM

    An Inception-Residual-Based Architecture with Multi-Objective Loss for Detecting Respiratory Anomalies

    Authors: Dat Ngo, Lam Pham, Huy Phan, Minh Tran, Delaram Jarchi, Sefki Kolozali

    Abstract: This paper presents a deep learning system applied for detecting anomalies from respiratory sound recordings. Initially, our system begins with audio feature extraction using Gammatone and Continuous Wavelet transformation. This step aims to transform the respiratory sound input into a two-dimensional spectrogram where both spectral and temporal features are presented. Then, our proposed system in… ▽ More

    Submitted 19 June, 2023; v1 submitted 7 March, 2023; originally announced March 2023.

  25. arXiv:2302.13028  [pdf, other]

    cs.CV cs.AI cs.LG

    A Light-weight Deep Learning Model for Remote Sensing Image Classification

    Authors: Lam Pham, Cam Le, Dat Ngo, Anh Nguyen, Jasmin Lampert, Alexander Schindler, Ian McLoughlin

    Abstract: In this paper, we present a high-performance and light-weight deep learning model for Remote Sensing Image Classification (RSIC), the task of identifying the aerial scene of a remote sensing image. To this end, we first evaluate various benchmark convolutional neural network (CNN) architectures: MobileNet V1/V2, ResNet 50/151V2, InceptionV3/InceptionResNetV2, EfficientNet B0/B7, DenseNet 121/201, C… ▽ More

    Submitted 25 February, 2023; originally announced February 2023.

  26. arXiv:2211.02820  [pdf, other]

    cs.CV cs.LG eess.IV

    A Robust and Low Complexity Deep Learning Model for Remote Sensing Image Classification

    Authors: Cam Le, Lam Pham, Nghia NVN, Truong Nguyen, Le Hong Trang

    Abstract: In this paper, we present a robust and low complexity deep learning model for Remote Sensing Image Classification (RSIC), the task of identifying the scene of a remote sensing image. In particular, we firstly evaluate different low complexity and benchmark deep neural networks: MobileNetV1, MobileNetV2, NASNetMobile, and EfficientNetB0, which present the number of trainable parameters lower than 5… ▽ More

    Submitted 12 December, 2022; v1 submitted 5 November, 2022; originally announced November 2022.

    Comments: 8 pages

  27. arXiv:2210.08610  [pdf, other]

    cs.SD cs.AI eess.AS

    Robust, General, and Low Complexity Acoustic Scene Classification Systems and An Effective Visualization for Presenting a Sound Scene Context

    Authors: Lam Pham, Dusan Salovic, Anahid Jalali, Alexander Schindler, Khoa Tran, Canh Vu, Phu X. Nguyen

    Abstract: In this paper, we present a comprehensive analysis of Acoustic Scene Classification (ASC), the task of identifying the scene of an audio recording from its acoustic signature. In particular, we firstly propose an inception-based and low footprint ASC model, referred to as the ASC baseline. The proposed ASC baseline is then compared with benchmark and high-complexity network architectures of Mobile… ▽ More

    Submitted 16 October, 2022; originally announced October 2022.

  28. arXiv:2209.09327  [pdf, ps, other]

    cs.PL cs.SE

    S2TD: a Separation Logic Verifier that Supports Reasoning of the Absence and Presence of Bugs

    Authors: Quang Loc Le, Jun Sun, Long H. Pham, Shengchao Qin

    Abstract: Heap-manipulating programs are known to be challenging to reason about. We present a novel verifier for heap-manipulating programs called S2TD, which encodes programs systematically in the form of Constrained Horn Clauses (CHC) using a novel extension of separation logic (SL) with recursive predicates and dangling predicates. S2TD actively explores cyclic proofs to address the path explosion probl… ▽ More

    Submitted 19 September, 2022; originally announced September 2022.

    Comments: 24 pages

    MSC Class: 68N15

  29. arXiv:2209.02611  [pdf, other]

    eess.IV cs.CV

    Deep filter bank regression for super-resolution of anisotropic MR brain images

    Authors: Samuel W. Remedios, Shuo Han, Yuan Xue, Aaron Carass, Trac D. Tran, Dzung L. Pham, Jerry L. Prince

    Abstract: In 2D multi-slice magnetic resonance (MR) acquisition, the through-plane signals are typically of lower resolution than the in-plane signals. While contemporary super-resolution (SR) methods aim to recover the underlying high-resolution volume, the estimated high-frequency information is implicit via end-to-end data-driven training rather than being explicitly stated and sought. To address this, w… ▽ More

    Submitted 6 September, 2022; originally announced September 2022.

  30. arXiv:2206.13392  [pdf, ps, other]

    cs.CV cs.AI cs.LG

    Remote Sensing Image Classification using Transfer Learning and Attention Based Deep Neural Network

    Authors: Lam Pham, Khoa Tran, Dat Ngo, Jasmin Lampert, Alexander Schindler

    Abstract: The task of remote sensing image scene classification (RSISC), which aims at classifying remote sensing images into groups of semantic categories based on their contents, has taken an important role in a wide range of applications such as urban planning, natural hazards detection, environment monitoring, vegetation mapping, or geospatial object detection. During the past years, research community… ▽ More

    Submitted 20 June, 2022; originally announced June 2022.

  31. arXiv:2206.06057  [pdf, ps, other]

    cs.SD cs.LG eess.AS

    Low-complexity deep learning frameworks for acoustic scene classification

    Authors: Lam Pham, Dat Ngo, Anahid Jalali, Alexander Schindler

    Abstract: In this report, we present low-complexity deep learning frameworks for acoustic scene classification (ASC). The proposed frameworks can be separated into four main steps: Front-end spectrogram extraction, online data augmentation, back-end classification, and late fusion of predicted probabilities. In particular, we initially transform audio recordings into Mel, Gammatone, and CQT spectrograms. N… ▽ More

    Submitted 13 June, 2022; originally announced June 2022.

  32. arXiv:2205.06992  [pdf, other]

    cs.CR cs.LG

    Verifying Neural Networks Against Backdoor Attacks

    Authors: Long H. Pham, Jun Sun

    Abstract: Neural networks have achieved state-of-the-art performance in solving many problems, including many applications in safety/security-critical systems. Researchers also discovered multiple security issues associated with neural networks. One of them is backdoor attacks, i.e., a neural network may be embedded with a backdoor such that a target output is almost always generated in the presence of a tr… ▽ More

    Submitted 14 May, 2022; originally announced May 2022.

  33. arXiv:2204.09274  [pdf, other]

    cs.SE cs.AI cs.LG

    Causality-based Neural Network Repair

    Authors: Bing Sun, Jun Sun, Hong Long Pham, Jie Shi

    Abstract: Neural networks have had discernible achievements in a wide range of applications. The wide-spread adoption also raises the concern of their dependability and reliability. Similar to traditional decision-making programs, neural networks can have defects that need to be repaired. The defects may cause unsafe behaviors, raise security concerns or unjust societal impacts. In this work, we address the… ▽ More

    Submitted 7 July, 2022; v1 submitted 20 April, 2022; originally announced April 2022.

  34. arXiv:2203.12314  [pdf, other]

    cs.SD cs.LG eess.AS

    Wider or Deeper Neural Network Architecture for Acoustic Scene Classification with Mismatched Recording Devices

    Authors: Lam Pham, Khoa Dinh, Dat Ngo, Hieu Tang, Alexander Schindler

    Abstract: In this paper, we present a robust and low complexity system for Acoustic Scene Classification (ASC), the task of identifying the scene of an audio recording. We first construct an ASC baseline system in which a novel inception-residual-based network architecture is proposed to deal with the mismatched recording device issue. To further improve the performance but still satisfy the low complexity… ▽ More

    Submitted 23 March, 2022; originally announced March 2022.

    Comments: This paper was submitted to INTERSPEECH 2022

  35. arXiv:2202.05626  [pdf, other]

    cs.SD eess.AS

    Audio-Based Deep Learning Frameworks for Detecting COVID-19

    Authors: Dat Ngo, Lam Pham, Truong Hoang, Sefki Kolozali, Delaram Jarchi

    Abstract: This paper evaluates a wide range of audio-based deep learning frameworks applied to the breathing, cough, and speech sounds for detecting COVID-19. In general, the audio recording inputs are transformed into low-level spectrogram features, then they are fed into pre-trained deep learning models to extract high-level embedding features. Next, the dimension of these high-level embedding features ar… ▽ More

    Submitted 2 March, 2022; v1 submitted 10 February, 2022; originally announced February 2022.

  36. arXiv:2201.04742  [pdf, other]

    cs.RO

    nuReality: A VR environment for research of pedestrian and autonomous vehicle interactions

    Authors: Paul Schmitt, Nicholas Britten, JiHyun Jeong, Amelia Coffey, Kevin Clark, Shweta Sunil Kothawade, Elena Corina Grigore, Adam Khaw, Christopher Konopka, Linh Pham, Kim Ryan, Christopher Schmitt, Aryaman Pandya, Emilio Frazzoli

    Abstract: We present nuReality, a virtual reality 'VR' environment designed to test the efficacy of vehicular behaviors to communicate intent during interactions between autonomous vehicles 'AVs' and pedestrians at urban intersections. In this project we focus on expressive behaviors as a means for pedestrians to readily recognize the underlying intent of the AV's movements. VR is an ideal tool to use to te… ▽ More

    Submitted 12 January, 2022; originally announced January 2022.

  37. arXiv:2201.03054  [pdf, ps, other]

    cs.SD eess.AS

    An Ensemble of Deep Learning Frameworks Applied For Predicting Respiratory Anomalies

    Authors: Lam Pham, Dat Ngo, Truong Hoang, Alexander Schindler, Ian McLoughlin

    Abstract: In this paper, we evaluate various deep learning frameworks for detecting respiratory anomalies from input audio recordings. To this end, we firstly transform audio respiratory cycles collected from patients into spectrograms where both temporal and spectral features are presented, referred to as the front-end feature extraction. We then feed the spectrograms into back-end deep learning networks f… ▽ More

    Submitted 9 January, 2022; originally announced January 2022.

  38. arXiv:2112.09172  [pdf, ps, other]

    cs.CV cs.LG eess.IV

    An Audio-Visual Dataset and Deep Learning Frameworks for Crowded Scene Classification

    Authors: Lam Pham, Dat Ngo, Phu X. Nguyen, Truong Hoang, Alexander Schindler

    Abstract: This paper presents a task of audio-visual scene classification (SC) where input videos are classified into one of five real-life crowded scenes: 'Riot', 'Noise-Street', 'Firework-Event', 'Music-Event', and 'Sport-Atmosphere'. To this end, we firstly collect an audio-visual dataset (videos) of these five crowded contexts from Youtube (in-the-wild scenes). Then, a wide range of deep learning framew… ▽ More

    Submitted 16 December, 2021; originally announced December 2021.

  39. arXiv:2111.08817  [pdf, other]

    cs.AI

    Compressive Features in Offline Reinforcement Learning for Recommender Systems

    Authors: Hung Nguyen, Minh Nguyen, Long Pham, Jennifer Adorno Nieves

    Abstract: In this paper, we develop a recommender system for a game that suggests potential items to players based on their interactive behaviors to maximize revenue for the game provider. Our approach is built on a reinforcement learning-based technique and is trained on an offline data set that is publicly available on an IEEE Big Data Cup challenge. The limitation of the offline data set and the curse of… ▽ More

    Submitted 16 November, 2021; originally announced November 2021.

  40. arXiv:2111.04255  [pdf, ps, other]

    cs.IT math.CO

    Sequence Reconstruction Problem for Deletion Channels: A Complete Asymptotic Solution

    Authors: Van Long Phuoc Pham, Keshav Goyal, Han Mao Kiah

    Abstract: Transmit a codeword $x$, that belongs to an $(\ell-1)$-deletion-correcting code of length $n$, over a $t$-deletion channel for some $1\le \ell\le t<n$. Levenshtein, in 2001, proposed the problem of determining $N(n,\ell,t)+1$, the minimum number of distinct channel outputs required to uniquely reconstruct $x$. Prior to this work, $N(n,\ell,t)$ is known only when $\ell\in\{1,2\}$. Here, we provide… ▽ More

    Submitted 7 November, 2021; originally announced November 2021.

    MSC Class: 94B99

  41. arXiv:2110.06323  [pdf, other]

    cs.SD eess.AS

    An Annihilating Filter-Based DOA Estimation for Uniform Linear Array

    Authors: Son Phan, Lam Pham

    Abstract: In this paper, we propose a new method to design an annihilating filter (AF) for direction-of-arrival (DOA) estimation of multiple snapshots within a uniform linear array. To evaluate the proposed method, we firstly design a DOA estimation using multiple signal classification (MUSIC) algorithm, referred to as the MUSIC baseline. We then compare the proposed method with the MUSIC baseline in two e… ▽ More

    Submitted 12 October, 2021; originally announced October 2021.

  42. A Cough-based deep learning framework for detecting COVID-19

    Authors: Truong Hoang, Lam Pham, Dat Ngo, Hoang D. Nguyen

    Abstract: This paper presents a deep learning framework for detecting COVID-19 positive subjects from their cough sounds. In particular, the proposed approach comprises two main steps. In the first step, we generate a feature representing the cough sound by combining an embedding extracted from a pre-trained model and handcrafted features extracted from the raw audio recording, referred to as the front-end fea… ▽ More

    Submitted 30 September, 2022; v1 submitted 7 October, 2021; originally announced October 2021.

    Comments: COVID-19, EMBC-2022, DiCOVA, top 2nd, benchmark on Spec > 0.95%

    MSC Class: 92-05; 68Txx ACM Class: J.3; I.5.4; I.5.2; H.5.5; C.3; K.5

    Journal ref: EMBC 44 (2022) 3422-3425

  43. arXiv:2107.09268  [pdf, ps, other]

    cs.SD eess.AS

    Robust Deep Learning Frameworks for Acoustic Scene and Respiratory Sound Classification

    Authors: Lam Pham

    Abstract: This thesis focuses on the task of acoustic scene classification (ASC), and then applies the techniques developed for ASC to a real-life application of detecting respiratory disease. To deal with ASC challenges, this thesis addresses three main factors that directly affect the performance of an ASC system. Firstly, this thesis explores input features by making use of multiple spectrog… ▽ More

    Submitted 20 July, 2021; originally announced July 2021.

  44. arXiv:2106.06840  [pdf, ps, other]

    cs.SD eess.AS

    Deep Learning Frameworks Applied For Audio-Visual Scene Classification

    Authors: Lam Pham, Alexander Schindler, Mina Schütz, Jasmin Lampert, Sven Schlarb, Ross King

    Abstract: In this paper, we present deep learning frameworks for audio-visual scene classification (SC) and indicate how individual visual and audio features as well as their combination affect SC performance. Our extensive experiments, which are conducted on DCASE (IEEE AASP Challenge on Detection and Classification of Acoustic Scenes and Events) Task 1B development dataset, achieve the best classification…

    Submitted 12 June, 2021; originally announced June 2021.

    Comments: 6 pages

  45. arXiv:2106.06838  [pdf, ps, other]

    cs.SD eess.AS

    A Low-Compexity Deep Learning Framework For Acoustic Scene Classification

    Authors: Lam Pham, Hieu Tang, Anahid Jalali, Alexander Schindler, Ross King

    Abstract: In this paper, we present a low-complexity deep learning framework for acoustic scene classification (ASC). The proposed framework can be separated into three main steps: front-end spectrogram extraction, back-end classification, and late fusion of predicted probabilities. First, we use Mel filter, Gammatone filter and Constant Q Transform (CQT) to transform the raw audio signal into spectrograms, w…

    Submitted 12 June, 2021; originally announced June 2021.

  46. arXiv:2104.02523  [pdf, other]

    cs.LG

    An Analysis of State-of-the-art Activation Functions For Supervised Deep Neural Network

    Authors: Anh Nguyen, Khoa Pham, Dat Ngo, Thanh Ngo, Lam Pham

    Abstract: This paper provides an analysis of state-of-the-art activation functions with respect to supervised classification with deep neural networks. These activation functions comprise the Rectified Linear Unit (ReLU), Exponential Linear Unit (ELU), Scaled Exponential Linear Unit (SELU), Gaussian Error Linear Unit (GELU), and the Inverse Square Root Linear Unit (ISRLU). To evaluate, experiments over two dee…

    Submitted 5 April, 2021; originally announced April 2021.

    Comments: 6 pages, 5 figures

  47. arXiv:2104.01161  [pdf, ps, other]

    cs.SD eess.AS

    An Audio-Based Deep Learning Framework For BBC Television Programme Classification

    Authors: Lam Pham, Chris Baume, Qiuqiang Kong, Tassadaq Hussain, Wenwu Wang, Mark Plumbley

    Abstract: This paper proposes a deep learning framework for classification of BBC television programmes using audio. The audio is first transformed into spectrograms, which are fed into a pre-trained Convolutional Neural Network (CNN), obtaining predicted probabilities of sound events occurring in the audio recording. Statistics for the predicted probabilities and detected sound events are then calculated…

    Submitted 11 February, 2022; v1 submitted 2 April, 2021; originally announced April 2021.

  48. Contrast Adaptive Tissue Classification by Alternating Segmentation and Synthesis

    Authors: Dzung L. Pham, Yi-Yu Chou, Blake E. Dewey, Daniel S. Reich, John A. Butman, Snehashis Roy

    Abstract: Deep learning approaches to the segmentation of magnetic resonance images have shown significant promise in automating the quantitative analysis of brain images. However, a continuing challenge has been their sensitivity to the variability of acquisition protocols. Attempting to segment images that have different contrast properties from those within the training data generally leads to significantl…

    Submitted 3 March, 2021; originally announced March 2021.

    Comments: 10 pages. MICCAI SASHIMI Workshop 2021

  49. arXiv:2103.02420  [pdf, ps, other]

    cs.SD cs.LG eess.AS

    Multi-view Audio and Music Classification

    Authors: Huy Phan, Huy Le Nguyen, Oliver Y. Chén, Lam Pham, Philipp Koch, Ian McLoughlin, Alfred Mertins

    Abstract: We propose in this work a multi-view learning approach for audio and music classification. Considering four typical low-level representations (i.e. different views) commonly used for audio and music recognition tasks, the proposed multi-view network consists of four subnetworks, each handling one input type. The learned embeddings in the subnetworks are then concatenated to form the multi-view emb…

    Submitted 3 March, 2021; originally announced March 2021.

    Comments: Accepted to ICASSP 2021

  50. arXiv:2101.01917  [pdf, other]

    cs.CR cs.SE

    sGUARD: Towards Fixing Vulnerable Smart Contracts Automatically

    Authors: Tai D. Nguyen, Long H. Pham, Jun Sun

    Abstract: Smart contracts are distributed, self-enforcing programs executing on top of blockchain networks. They have the potential to revolutionize many industries such as financial institutes and supply chains. However, smart contracts are subject to code-based vulnerabilities, which cast a shadow on their applications. As smart contracts are unpatchable (due to the immutability of blockchain), it is essen…

    Submitted 6 January, 2021; originally announced January 2021.

    Comments: Published in IEEE S&P 2021