Zum Hauptinhalt springen

Showing 1–50 of 105 results for author: Dang, T

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.14415  [pdf, other

    cs.CV cs.LG

    LoG-VMamba: Local-Global Vision Mamba for Medical Image Segmentation

    Authors: Trung Dinh Quoc Dang, Huy Hoang Nguyen, Aleksei Tiulpin

    Abstract: Mamba, a State Space Model (SSM), has recently shown competitive performance to Convolutional Neural Networks (CNNs) and Transformers in Natural Language Processing and general sequence modeling. Various attempts have been made to adapt Mamba to Computer Vision tasks, including medical image segmentation (MIS). Vision Mamba (VM)-based networks are particularly attractive due to their ability to ac… ▽ More

    Submitted 26 August, 2024; originally announced August 2024.

    Comments: 20 pages

  2. Exploring Large-Scale Language Models to Evaluate EEG-Based Multimodal Data for Mental Health

    Authors: Yongquan Hu, Shuning Zhang, Ting Dang, Hong Jia, Flora D. Salim, Wen Hu, Aaron J. Quigley

    Abstract: Integrating physiological signals such as electroencephalogram (EEG), with other data such as interview audio, may offer valuable multimodal insights into psychological states or neurological disorders. Recent advancements with Large Language Models (LLMs) position them as prospective ``health agents'' for mental health assessment. However, current research predominantly focus on single data modal… ▽ More

    Submitted 14 August, 2024; originally announced August 2024.

    Comments: 6 pages; UbiComp Companion '24, Companion of the 2024 ACM International Joint Conference on Pervasive and Ubiquitous Computing, October 5--9, 2024}{Melbourne, VIC, Australia

  3. arXiv:2407.21344  [pdf, other

    cs.AI

    Dual-Constrained Dynamical Neural ODEs for Ambiguity-aware Continuous Emotion Prediction

    Authors: Jingyao Wu, Ting Dang, Vidhyasaharan Sethu, Eliathamby Ambikairajah

    Abstract: There has been a significant focus on modelling emotion ambiguity in recent years, with advancements made in representing emotions as distributions to capture ambiguity. However, there has been comparatively less effort devoted to the consideration of temporal dependencies in emotion distributions which encodes ambiguity in perceived emotions that evolve smoothly over time. Recognizing the benefit… ▽ More

    Submitted 31 July, 2024; originally announced July 2024.

    Comments: This paper has been accepted at INTERSPEECh 2024

  4. arXiv:2407.08261  [pdf, other

    cs.RO

    The OPNV Data Collection: A Dataset for Infrastructure-Supported Perception Research with Focus on Public Transportation

    Authors: Marcel Vosshans, Alexander Baumann, Matthias Drueppel, Omar Ait-Aider, Ralf Woerner, Youcef Mezouar, Thao Dang, Markus Enzweiler

    Abstract: This paper we present our vision and ongoing work for a novel dataset designed to advance research into the interoperability of intelligent vehicles and infrastructure, specifically aimed at enhancing cooperative perception and interaction in the realm of public transportation. Unlike conventional datasets centered on ego-vehicle data, this approach encompasses both a stationary sensor tower and a… ▽ More

    Submitted 11 July, 2024; originally announced July 2024.

  5. arXiv:2406.02897  [pdf, other

    cs.SD eess.AS

    LiveSpeech: Low-Latency Zero-shot Text-to-Speech via Autoregressive Modeling of Audio Discrete Codes

    Authors: Trung Dang, David Aponte, Dung Tran, Kazuhito Koishida

    Abstract: Prior works have demonstrated zero-shot text-to-speech by using a generative language model on audio tokens obtained via a neural audio codec. It is still challenging, however, to adapt them to low-latency scenarios. In this paper, we present LiveSpeech - a fully autoregressive language model-based approach for zero-shot text-to-speech, enabling low-latency streaming of the output audio. To allow… ▽ More

    Submitted 10 June, 2024; v1 submitted 4 June, 2024; originally announced June 2024.

  6. arXiv:2405.16815  [pdf, other

    cs.CV

    Image-level Regression for Uncertainty-aware Retinal Image Segmentation

    Authors: Trung Dang, Huy Hoang Nguyen, Aleksei Tiulpin

    Abstract: Accurate retinal vessel (RV) segmentation is a crucial step in the quantitative assessment of retinal vasculature, which is needed for the early detection of retinal diseases and other conditions. Numerous studies have been conducted to tackle the problem of segmenting vessels automatically using a pixel-wise classification approach. The common practice of creating ground truth labels is to catego… ▽ More

    Submitted 25 July, 2024; v1 submitted 27 May, 2024; originally announced May 2024.

    Comments: 13 pages

  7. arXiv:2405.16813  [pdf, other

    cs.CV

    SiNGR: Brain Tumor Segmentation via Signed Normalized Geodesic Transform Regression

    Authors: Trung Dang, Huy Hoang Nguyen, Aleksei Tiulpin

    Abstract: One of the primary challenges in brain tumor segmentation arises from the uncertainty of voxels close to tumor boundaries. However, the conventional process of generating ground truth segmentation masks fails to treat such uncertainties properly. Those "hard labels" with 0s and 1s conceptually influenced the majority of prior studies on brain image segmentation. As a result, tumor segmentation is… ▽ More

    Submitted 22 August, 2024; v1 submitted 27 May, 2024; originally announced May 2024.

    Comments: Accepted as a conference paper at MICCAI 2024

  8. arXiv:2405.10134  [pdf, other

    cs.RO cs.AI

    Towards Consistent and Explainable Motion Prediction using Heterogeneous Graph Attention

    Authors: Tobias Demmler, Andreas Tamke, Thao Dang, Karsten Haug, Lars Mikelsons

    Abstract: In autonomous driving, accurately interpreting the movements of other road users and leveraging this knowledge to forecast future trajectories is crucial. This is typically achieved through the integration of map data and tracked trajectories of various agents. Numerous methodologies combine this information into a singular embedding for each agent, which is then utilized to predict future behavio… ▽ More

    Submitted 16 May, 2024; originally announced May 2024.

  9. arXiv:2405.08654  [pdf, other

    cs.LG cs.AI cs.CV eess.IV

    Can we Defend Against the Unknown? An Empirical Study About Threshold Selection for Neural Network Monitoring

    Authors: Khoi Tran Dang, Kevin Delmas, Jérémie Guiochet, Joris Guérin

    Abstract: With the increasing use of neural networks in critical systems, runtime monitoring becomes essential to reject unsafe predictions during inference. Various techniques have emerged to establish rejection scores that maximize the separability between the distributions of safe and unsafe predictions. The efficacy of these approaches is mostly evaluated using threshold-agnostic metrics, such as the ar… ▽ More

    Submitted 21 May, 2024; v1 submitted 14 May, 2024; originally announced May 2024.

    Comments: 13 pages, 5 figures, 6 tables. To appear in the proceedings of the 40th Conference on Uncertainty in Artificial Intelligence (UAI 2024)

  10. arXiv:2404.14679  [pdf, ps, other

    cs.GT

    A Multi-Dimensional Online Contention Resolution Scheme for Revenue Maximization

    Authors: Shuchi Chawla, Dimitris Christou, Trung Dang, Zhiyi Huang, Gregory Kehne, Rojin Rezvan

    Abstract: We study multi-buyer multi-item sequential item pricing mechanisms for revenue maximization with the goal of approximating a natural fractional relaxation -- the ex ante optimal revenue. We assume that buyers' values are subadditive but make no assumptions on the value distributions. While the optimal revenue, and therefore also the ex ante benchmark, is inapproximable by any simple mechanism in t… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

    Comments: 39 pages

  11. arXiv:2404.00399  [pdf, other

    cs.CL cs.AI cs.LG

    Aurora-M: The First Open Source Multilingual Language Model Red-teamed according to the U.S. Executive Order

    Authors: Taishi Nakamura, Mayank Mishra, Simone Tedeschi, Yekun Chai, Jason T Stillerman, Felix Friedrich, Prateek Yadav, Tanmay Laud, Vu Minh Chien, Terry Yue Zhuo, Diganta Misra, Ben Bogin, Xuan-Son Vu, Marzena Karpinska, Arnav Varma Dantuluri, Wojciech Kusa, Tommaso Furlanello, Rio Yokota, Niklas Muennighoff, Suhas Pai, Tosin Adewumi, Veronika Laippala, Xiaozhe Yao, Adalberto Junior, Alpay Ariyak , et al. (20 additional authors not shown)

    Abstract: Pretrained language models underpin several AI applications, but their high computational cost for training limits accessibility. Initiatives such as BLOOM and StarCoder aim to democratize access to pretrained models for collaborative community development. However, such existing models face challenges: limited multilingual capabilities, continual pretraining causing catastrophic forgetting, where… ▽ More

    Submitted 23 April, 2024; v1 submitted 30 March, 2024; originally announced April 2024.

    Comments: Preprint

  12. arXiv:2403.16593  [pdf, other

    cs.RO eess.SY

    Counter-example guided Imitation Learning of Feedback Controllers from Temporal Logic Specifications

    Authors: Thao Dang, Alexandre Donzé, Inzemamul Haque, Nikolaos Kekatos, Indranil Saha

    Abstract: We present a novel method for imitation learning for control requirements expressed using Signal Temporal Logic (STL). More concretely we focus on the problem of training a neural network to imitate a complex controller. The learning process is guided by efficient data aggregation based on counter-examples and a coverage measure. Moreover, we introduce a method to evaluate the performance of the l… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

  13. arXiv:2403.09579  [pdf, other

    cs.SD cs.LG eess.AS

    uaMix-MAE: Efficient Tuning of Pretrained Audio Transformers with Unsupervised Audio Mixtures

    Authors: Afrina Tabassum, Dung Tran, Trung Dang, Ismini Lourentzou, Kazuhito Koishida

    Abstract: Masked Autoencoders (MAEs) learn rich low-level representations from unlabeled data but require substantial labeled data to effectively adapt to downstream tasks. Conversely, Instance Discrimination (ID) emphasizes high-level semantics, offering a potential solution to alleviate annotation requirements in MAEs. Although combining these two approaches can address downstream tasks with limited label… ▽ More

    Submitted 14 March, 2024; originally announced March 2024.

    Comments: 5 pages, 6 figures, 4 tables. To appear in ICASSP'2024

  14. arXiv:2403.08843  [pdf, other

    cs.AI

    Fuzzy Fault Trees Formalized

    Authors: Thi Kim Nhung Dang, Milan Lopuhaä-Zwakenberg, Mariëlle Stoelinga

    Abstract: Fault tree analysis is a vital method of assessing safety risks. It helps to identify potential causes of accidents, assess their likelihood and severity, and suggest preventive measures. Quantitative analysis of fault trees is often done via the dependability metrics that compute the system's failure behaviour over time. However, the lack of precise data is a major obstacle to quantitative analys… ▽ More

    Submitted 13 March, 2024; originally announced March 2024.

    Comments: 14 pages

  15. arXiv:2402.12179  [pdf, other

    cs.CV cs.AI cs.CY

    Examining Monitoring System: Detecting Abnormal Behavior In Online Examinations

    Authors: Dinh An Ngo, Thanh Dat Nguyen, Thi Le Chi Dang, Huy Hoan Le, Ton Bao Ho, Vo Thanh Khang Nguyen, Truong Thanh Hung Nguyen

    Abstract: Cheating in online exams has become a prevalent issue over the past decade, especially during the COVID-19 pandemic. To address this issue of academic dishonesty, our "Exam Monitoring System: Detecting Abnormal Behavior in Online Examinations" is designed to assist proctors in identifying unusual student behavior. Our system demonstrates high accuracy and speed in detecting cheating in real-time s… ▽ More

    Submitted 19 February, 2024; originally announced February 2024.

  16. arXiv:2402.11872  [pdf, other

    cs.RO

    Real-time 3D Semantic Scene Perception for Egocentric Robots with Binocular Vision

    Authors: K. Nguyen, T. Dang, M. Huber

    Abstract: Perceiving a three-dimensional (3D) scene with multiple objects while moving indoors is essential for vision-based mobile cobots, especially for enhancing their manipulation tasks. In this work, we present an end-to-end pipeline with instance segmentation, feature matching, and point-set registration for egocentric robots with binocular vision, and demonstrate the robot's grasping capability throu… ▽ More

    Submitted 19 February, 2024; originally announced February 2024.

  17. arXiv:2402.02655  [pdf, other

    cs.CL

    VlogQA: Task, Dataset, and Baseline Models for Vietnamese Spoken-Based Machine Reading Comprehension

    Authors: Thinh Phuoc Ngo, Khoa Tran Anh Dang, Son T. Luu, Kiet Van Nguyen, Ngan Luu-Thuy Nguyen

    Abstract: This paper presents the development process of a Vietnamese spoken language corpus for machine reading comprehension (MRC) tasks and provides insights into the challenges and opportunities associated with using real-world data for machine reading comprehension tasks. The existing MRC corpora in Vietnamese mainly focus on formal written documents such as Wikipedia articles, online newspapers, or te… ▽ More

    Submitted 6 April, 2024; v1 submitted 4 February, 2024; originally announced February 2024.

    Comments: To appear as the main conference paper at EACL 2024

  18. arXiv:2401.12346  [pdf, other

    cs.CR

    Fuzzy quantitative attack tree analysis

    Authors: Thi Kim Nhung Dang, Milan Lopuhaä-Zwakenberg, Mariëlle Stoelinga

    Abstract: Attack trees are important for security, as they help to identify weaknesses and vulnerabilities in a system. Quantitative attack tree analysis supports a number security metrics, which formulate important KPIs such as the shortest, most likely and cheapest attacks. A key bottleneck in quantitative analysis is that the values are usually not known exactly, due to insufficient data and/or lack of… ▽ More

    Submitted 22 January, 2024; originally announced January 2024.

    Comments: 23 pages, 6 figures, FASE2024

  19. arXiv:2311.12784  [pdf, ps, other

    math.ST cs.IT cs.LG stat.ML

    Optimality in Mean Estimation: Beyond Worst-Case, Beyond Sub-Gaussian, and Beyond $1+α$ Moments

    Authors: Trung Dang, Jasper C. H. Lee, Maoyuan Song, Paul Valiant

    Abstract: There is growing interest in improving our algorithmic understanding of fundamental statistical problems such as mean estimation, driven by the goal of understanding the limits of what we can extract from valuable data. The state of the art results for mean estimation in $\mathbb{R}$ are 1) the optimal sub-Gaussian mean estimator by [LV22], with the tight sub-Gaussian constant for all distribution… ▽ More

    Submitted 21 November, 2023; originally announced November 2023.

    Comments: 27 pages, to appear in NeurIPS 2023. Abstract shortened to fit arXiv limit

  20. arXiv:2309.11983  [pdf, other

    cs.LG

    Variational Connectionist Temporal Classification for Order-Preserving Sequence Modeling

    Authors: Zheng Nan, Ting Dang, Vidhyasaharan Sethu, Beena Ahmed

    Abstract: Connectionist temporal classification (CTC) is commonly adopted for sequence modeling tasks like speech recognition, where it is necessary to preserve order between the input and target sequences. However, CTC is only applied to deterministic sequence models, where the latent space is discontinuous and sparse, which in turn makes them less capable of handling data variability when compared to vari… ▽ More

    Submitted 14 December, 2023; v1 submitted 21 September, 2023; originally announced September 2023.

    Comments: 5 pages, 3 figures, conference

  21. arXiv:2309.10740  [pdf, other

    cs.SD cs.LG cs.MM eess.AS

    ConsistencyTTA: Accelerating Diffusion-Based Text-to-Audio Generation with Consistency Distillation

    Authors: Yatong Bai, Trung Dang, Dung Tran, Kazuhito Koishida, Somayeh Sojoudi

    Abstract: Diffusion models are instrumental in text-to-audio (TTA) generation. Unfortunately, they suffer from slow inference due to an excessive number of queries to the underlying denoising network per generation. To address this bottleneck, we introduce ConsistencyTTA, a framework requiring only a single non-autoregressive network query, thereby accelerating TTA by hundreds of times. We achieve so by pro… ▽ More

    Submitted 24 June, 2024; v1 submitted 19 September, 2023; originally announced September 2023.

  22. arXiv:2307.05953  [pdf, ps, other

    cs.GT

    Reward Selection with Noisy Observations

    Authors: Kamyar Azizzadenesheli, Trung Dang, Aranyak Mehta, Alexandros Psomas, Qian Zhang

    Abstract: We study a fundamental problem in optimization under uncertainty. There are $n$ boxes; each box $i$ contains a hidden reward $x_i$. Rewards are drawn i.i.d. from an unknown distribution $\mathcal{D}$. For each box $i$, we see $y_i$, an unbiased estimate of its reward, which is drawn from a Normal distribution with known standard deviation $σ_i$ (and an unknown mean $x_i$). Our task is to select a… ▽ More

    Submitted 12 July, 2023; originally announced July 2023.

  23. arXiv:2306.04853  [pdf, other

    cs.RO

    ExtPerFC: An Efficient 2D and 3D Perception Hardware-Software Framework for Mobile Cobot

    Authors: Tuan Dang, Khang Nguyen, Manfred Huber

    Abstract: As the reliability of the robot's perception correlates with the number of integrated sensing modalities to tackle uncertainty, a practical solution to manage these sensors from different computers, operate them simultaneously, and maintain their real-time performance on the existing robotic system with minimal effort is needed. In this work, we present an end-to-end software-hardware framework, n… ▽ More

    Submitted 7 June, 2023; originally announced June 2023.

  24. arXiv:2301.08642  [pdf, other

    cs.NI eess.SP

    Optimal multiple FSO transceiver configuration for using on High-altitude platforms

    Authors: Dieu Linh Truong, The Ngoc Dang

    Abstract: Free-space optical (FSO) communication requires light of sight (LoS) between the transmitter and the receiver. For long-distance communication, many research projects have been conducted towards using a network composed of high-altitude platforms (HAPs) flying at an elevation of 20 km to carry intermediate FSO transceivers that forward data between ground stations. The clear environment at high el… ▽ More

    Submitted 20 January, 2023; originally announced January 2023.

    Comments: Submitted to an IEEE journal

  25. arXiv:2301.05176  [pdf, other

    cs.DC

    Workload Failure Prediction for Data Centers

    Authors: Jie Li, Rui Wang, Ghazanfar Ali, Tommy Dang, Alan Sill, Yong Chen

    Abstract: Failed workloads that consumed significant computational resources in time and space affect the efficiency of data centers significantly and thus limit the amount of scientific work that can be achieved. While the computational power has increased significantly over the years, detection and prediction of workload failures have lagged far behind and will become increasingly critical as the system s… ▽ More

    Submitted 12 January, 2023; originally announced January 2023.

  26. arXiv:2210.16485  [pdf, other

    cs.CV cs.MS

    IM: An R-Package for Computation of Image Moments and Moment Invariants

    Authors: Allison Irvine, Tan Dang, M. Murat Dundar, Bartek Rajwa

    Abstract: Moment invariants are well-established and effective shape descriptors for image classification. In this report, we introduce a package for R-language, named IM, that implements the calculation of moments for images and allows the reconstruction of images from moments within an object-oriented framework. Several types of moments may be computed using the IM library, including discrete and continuo… ▽ More

    Submitted 28 October, 2022; originally announced October 2022.

    Comments: Apr 2014, technical report

    ACM Class: I.4.7

  27. arXiv:2210.05610  [pdf, other

    cs.CL cs.AI

    MTet: Multi-domain Translation for English and Vietnamese

    Authors: Chinh Ngo, Trieu H. Trinh, Long Phan, Hieu Tran, Tai Dang, Hieu Nguyen, Minh Nguyen, Minh-Thang Luong

    Abstract: We introduce MTet, the largest publicly available parallel corpus for English-Vietnamese translation. MTet consists of 4.2M high-quality training sentence pairs and a multi-domain test set refined by the Vietnamese research community. Combining with previous works on English-Vietnamese translation, we grow the existing parallel dataset to 6.2M sentence pairs. We also release the first pretrained m… ▽ More

    Submitted 19 October, 2022; v1 submitted 11 October, 2022; originally announced October 2022.

  28. arXiv:2210.05598  [pdf, other

    cs.CL cs.AI

    Enriching Biomedical Knowledge for Low-resource Language Through Large-Scale Translation

    Authors: Long Phan, Tai Dang, Hieu Tran, Trieu H. Trinh, Vy Phan, Lam D. Chau, Minh-Thang Luong

    Abstract: Biomedical data and benchmarks are highly valuable yet very limited in low-resource languages other than English such as Vietnamese. In this paper, we make use of a state-of-the-art translation model in English-Vietnamese to translate and produce both pretrained as well as supervised data in the biomedical domains. Thanks to such large-scale translation, we introduce ViPubmedT5, a pretrained Encod… ▽ More

    Submitted 29 January, 2023; v1 submitted 11 October, 2022; originally announced October 2022.

  29. arXiv:2210.03527  [pdf, other

    cs.HC cs.AI

    Do We Need Explainable AI in Companies? Investigation of Challenges, Expectations, and Chances from Employees' Perspective

    Authors: Katharina Weitz, Chi Tai Dang, Elisabeth André

    Abstract: Companies' adoption of artificial intelligence (AI) is increasingly becoming an essential element of business success. However, using AI poses new requirements for companies and their employees, including transparency and comprehensibility of AI systems. The field of Explainable AI (XAI) aims to address these issues. Yet, the current research primarily consists of laboratory studies, and there is… ▽ More

    Submitted 2 June, 2023; v1 submitted 7 October, 2022; originally announced October 2022.

    Comments: Project report

    ACM Class: K.4.3; J.4; I.2

  30. arXiv:2209.14264  [pdf, other

    cs.LG

    A Multi-scale Graph Signature for Persistence Diagrams based on Return Probabilities of Random Walks

    Authors: Chau Pham, Trung Dang, Peter Chin

    Abstract: Persistence diagrams (PDs), often characterized as sets of death and birth of homology class, have been known for providing a topological representation of a graph structure, which is often useful in machine learning tasks. Prior works rely on a single graph signature to construct PDs. In this paper, we explore the use of a family of multi-scale graph signatures to enhance the robustness of topolo… ▽ More

    Submitted 28 September, 2022; originally announced September 2022.

  31. arXiv:2209.12140  [pdf, other

    cs.HC cs.GR q-bio.BM

    Modie Viewer: Protein Beasts and How to View Them

    Authors: Huyen N. Nguyen, Caleb Trujillo, Tommy Dang

    Abstract: Understanding chemical modifications on proteins opens up further possibilities for research on rare diseases. This work proposes visualization approaches using two-dimensional (2D) and three-dimensional (3D) visual representations to analyze and gain insights into protein modifications. In this work, we present the application of Modie Viewer as an attempt to address the Bio+MedVis Challenge at I… ▽ More

    Submitted 25 September, 2022; originally announced September 2022.

    Comments: 5 pages, 5 figures, Bio+MedVis Challenge @ IEEE VIS 2022

    ACM Class: H.5.2; J.3; D.2.2

  32. arXiv:2209.11856  [pdf, other

    cs.HC cs.GR

    WordStream Maker: A Lightweight End-to-end Visualization Platform for Qualitative Time-series Data

    Authors: Huyen N. Nguyen, Tommy Dang, Kathleen A. Bowe

    Abstract: Whether it is in the form of transcribed conversations, blog posts, or tweets, qualitative data provides a reader with rich insight into both the overarching trends as well as the diversity of human ideas expressed through text. Handling and analyzing large amounts of qualitative data, however, is difficult, often requiring multiple time-intensive perusals in order to identify patterns. This diffi… ▽ More

    Submitted 23 September, 2022; originally announced September 2022.

    Comments: 4 pages, 4 figures. In NLVIZ: Exploring Research Opportunities for Natural Language, Text, and Data Visualization, IEEE VIS 2022

    ACM Class: H.5.2; D.2.2; I.3.6; I.3.8; H.5.3

  33. arXiv:2208.11688  [pdf, other

    cs.HC

    VisFCAC: An Interactive Family Clinical Attribute Comparison

    Authors: Jake Gonzalez, Ngan V. T. Nguyen, Tommy Dang

    Abstract: This paper presents VisFCAC, a visual analysis system that displays family structures along with clinical attribute of family members to effectively uncover patterns related to suicide deaths for submission to the BioVis 2020 Data Challenge. VisFCAC facilitates pattern tracing to offer insight on potential clinical attributes that might connect suicide deaths while also attempting to offer insight… ▽ More

    Submitted 24 August, 2022; originally announced August 2022.

  34. arXiv:2208.07536  [pdf, other

    cs.NI eess.SP

    Dependency Tasks Offloading and Communication Resource Allocation in Collaborative UAVs Networks: A Meta-Heuristic Approach

    Authors: Loc X. Nguyen, Yan Kyaw Tun, Tri Nguyen Dang, Yu Min Park, Zhu Han, Choong Seon Hong

    Abstract: In recent years, unmanned aerial vehicles (UAVs) assisted mobile edge computing systems have been exploited by researchers as a promising solution for providing computation services to mobile users outside of terrestrial infrastructure coverage. However, it remains challenging for the standalone MEC-enabled UAVs in order to meet the computation requirement of numerous mobile users due to the limit… ▽ More

    Submitted 16 August, 2022; originally announced August 2022.

    Comments: 14 pages, 9 figures

  35. arXiv:2207.04914  [pdf, other

    cs.RO

    Team CERBERUS Wins the DARPA Subterranean Challenge: Technical Overview and Lessons Learned

    Authors: Marco Tranzatto, Mihir Dharmadhikari, Lukas Bernreiter, Marco Camurri, Shehryar Khattak, Frank Mascarich, Patrick Pfreundschuh, David Wisth, Samuel Zimmermann, Mihir Kulkarni, Victor Reijgwart, Benoit Casseau, Timon Homberger, Paolo De Petris, Lionel Ott, Wayne Tubby, Gabriel Waibel, Huan Nguyen, Cesar Cadena, Russell Buchanan, Lorenz Wellhausen, Nikhil Khedekar, Olov Andersson, Lintong Zhang, Takahiro Miki , et al. (11 additional authors not shown)

    Abstract: This article presents the CERBERUS robotic system-of-systems, which won the DARPA Subterranean Challenge Final Event in 2021. The Subterranean Challenge was organized by DARPA with the vision to facilitate the novel technologies necessary to reliably explore diverse underground environments despite the grueling challenges they present for robotic autonomy. Due to their geometric complexity, degrad… ▽ More

    Submitted 11 July, 2022; originally announced July 2022.

  36. arXiv:2207.04186  [pdf, other

    cs.CV

    A Study on Self-Supervised Object Detection Pretraining

    Authors: Trung Dang, Simon Kornblith, Huy Thong Nguyen, Peter Chin, Maryam Khademi

    Abstract: In this work, we study different approaches to self-supervised pretraining of object detection models. We first design a general framework to learn a spatially consistent dense representation from an image, by randomly sampling and projecting boxes to each augmented view and maximizing the similarity between corresponding box features. We study existing design choices in the literature, such as bo… ▽ More

    Submitted 10 August, 2022; v1 submitted 8 July, 2022; originally announced July 2022.

  37. arXiv:2206.09004  [pdf, other

    cs.FL cs.AI cs.LG

    Towards Efficient Active Learning of PDFA

    Authors: Franz Mayr, Sergio Yovine, Federico Pan, Nicolas Basset, Thao Dang

    Abstract: We propose a new active learning algorithm for PDFA based on three main aspects: a congruence over states which takes into account next-symbol probability distributions, a quantization that copes with differences in distributions, and an efficient tree-based data structure. Experiments showed significant performance gains with respect to reference implementations.

    Submitted 17 June, 2022; originally announced June 2022.

    Comments: 11 pages, 7 figures, workshop paper

  38. arXiv:2206.02628  [pdf, other

    cs.IR cs.AI cs.CL

    HYCEDIS: HYbrid Confidence Engine for Deep Document Intelligence System

    Authors: Bao-Sinh Nguyen, Quang-Bach Tran, Tuan-Anh Nguyen Dang, Duc Nguyen, Hung Le

    Abstract: Measuring the confidence of AI models is critical for safely deploying AI in real-world industrial systems. One important application of confidence measurement is information extraction from scanned documents. However, there exists no solution to provide reliable confidence score for current state-of-the-art deep-learning-based information extractors. In this paper, we propose a complete and novel… ▽ More

    Submitted 10 October, 2022; v1 submitted 1 June, 2022; originally announced June 2022.

    Comments: Document Intelligence @ KDD 2021 Workshop

  39. SymForce: Symbolic Computation and Code Generation for Robotics

    Authors: Hayk Martiros, Aaron Miller, Nathan Bucki, Bradley Solliday, Ryan Kennedy, Jack Zhu, Tung Dang, Dominic Pattison, Harrison Zheng, Teo Tomic, Peter Henry, Gareth Cross, Josiah VanderMey, Alvin Sun, Samuel Wang, Kristen Holtz

    Abstract: We present SymForce, a library for fast symbolic computation, code generation, and nonlinear optimization for robotics applications like computer vision, motion planning, and controls. SymForce combines the development speed and flexibility of symbolic math with the performance of autogenerated, highly optimized code in C++ or any target runtime language. SymForce provides geometry and camera type… ▽ More

    Submitted 6 May, 2022; v1 submitted 16 April, 2022; originally announced April 2022.

    Comments: 10 pages, 5 figures. RSS 2022

  40. Identifying Scenarios in Field Data to Enable Validation of Highly Automated Driving Systems

    Authors: Christian Reichenbächer, Maximilian Rasch, Zafer Kayatas, Florian Wirthmüller, Jochen Hipp, Thao Dang, Oliver Bringmann

    Abstract: Scenario-based approaches for the validation of highly automated driving functions are based on the search for safety-critical characteristics of driving scenarios using software-in-the-loop simulations. This search requires information about the shape and probability of scenarios in real-world traffic. The scope of this work is to develop a method that identifies redefined logical driving scenari… ▽ More

    Submitted 4 May, 2022; v1 submitted 7 March, 2022; originally announced March 2022.

    Comments: 9 pages, 5 figures

    Journal ref: In Proceedings of the 8th International Conference on Vehicle Technology and Intelligent Transport Systems - VEHITS, 134-142, 2022

  41. Survivable Free Space Optical Mesh Network using High-Altitude Platforms

    Authors: Dieu Linh Truong, Xuan Vuong Dang, The Ngoc Dang

    Abstract: Free space optical (FSO) communication refers to the information transmission technology based on the propagation of optical signals in space. FSO communication requires that the transmitter and receiver directly see each other. High-altitude platforms (HAPs) have been proposed for carrying FSO transceivers in the stratosphere. A multihop HAP network with FSO links can relay traffic between ground… ▽ More

    Submitted 14 February, 2022; originally announced February 2022.

    ACM Class: C.2.1

  42. Transformer-based Approaches for Legal Text Processing

    Authors: Ha-Thanh Nguyen, Minh-Phuong Nguyen, Thi-Hai-Yen Vuong, Minh-Quan Bui, Minh-Chau Nguyen, Tran-Binh Dang, Vu Tran, Le-Minh Nguyen, Ken Satoh

    Abstract: In this paper, we introduce our approaches using Transformer-based models for different problems of the COLIEE 2021 automatic legal text processing competition. Automated processing of legal documents is a challenging task because of the characteristics of legal documents as well as the limitation of the amount of data. With our detailed experiments, we found that Transformer-based pretrained lang… ▽ More

    Submitted 13 February, 2022; originally announced February 2022.

    Comments: arXiv admin note: substantial text overlap with arXiv:2106.13405

  43. arXiv:2201.07067  [pdf, other

    cs.RO

    CERBERUS: Autonomous Legged and Aerial Robotic Exploration in the Tunnel and Urban Circuits of the DARPA Subterranean Challenge

    Authors: Marco Tranzatto, Frank Mascarich, Lukas Bernreiter, Carolina Godinho, Marco Camurri, Shehryar Khattak, Tung Dang, Victor Reijgwart, Johannes Loeje, David Wisth, Samuel Zimmermann, Huan Nguyen, Marius Fehr, Lukas Solanka, Russell Buchanan, Marko Bjelonic, Nikhil Khedekar, Mathieu Valceschini, Fabian Jenelten, Mihir Dharmadhikari, Timon Homberger, Paolo De Petris, Lorenz Wellhausen, Mihir Kulkarni, Takahiro Miki , et al. (16 additional authors not shown)

    Abstract: Autonomous exploration of subterranean environments constitutes a major frontier for robotic systems as underground settings present key challenges that can render robot autonomy hard to achieve. This has motivated the DARPA Subterranean Challenge, where teams of robots search for objects of interest in various underground environments. In response, the CERBERUS system-of-systems is presented as a… ▽ More

    Submitted 18 January, 2022; originally announced January 2022.

    Comments: 50 pages, 25 figures. Accepted at Field Robotics, 2021

  44. arXiv:2201.01232  [pdf

    cs.SD cs.LG eess.AS

    Exploring Longitudinal Cough, Breath, and Voice Data for COVID-19 Progression Prediction via Sequential Deep Learning: Model Development and Validation

    Authors: Ting Dang, Jing Han, Tong Xia, Dimitris Spathis, Erika Bondareva, Chloë Siegele-Brown, Jagmohan Chauhan, Andreas Grammenos, Apinan Hasthanasombat, Andres Floto, Pietro Cicuta, Cecilia Mascolo

    Abstract: Recent work has shown the potential of using audio data (eg, cough, breathing, and voice) in the screening for COVID-19. However, these approaches only focus on one-off detection and detect the infection given the current audio sample, but do not monitor disease progression in COVID-19. Limited exploration has been put forward to continuously monitor COVID-19 progression, especially recovery, thro… ▽ More

    Submitted 22 June, 2022; v1 submitted 4 January, 2022; originally announced January 2022.

    Comments: Updated title. Revised format according to journal requirements

  45. arXiv:2112.04424  [pdf, other

    cs.SD cs.LG eess.AS

    Training Robust Zero-Shot Voice Conversion Models with Self-supervised Features

    Authors: Trung Dang, Dung Tran, Peter Chin, Kazuhito Koishida

    Abstract: Unsupervised Zero-Shot Voice Conversion (VC) aims to modify the speaker characteristic of an utterance to match an unseen target speaker without relying on parallel training data. Recently, self-supervised learning of speech representation has been shown to produce useful linguistic units without using transcripts, which can be directly passed to a VC model. In this paper, we showed that high-qual… ▽ More

    Submitted 10 February, 2022; v1 submitted 8 December, 2021; originally announced December 2021.

  46. arXiv:2111.05062  [pdf, other

    cs.LG

    Look back, look around: a systematic analysis of effective predictors for new outlinks in focused Web crawling

    Authors: Thi Kim Nhung Dang, Doina Bucur, Berk Atil, Guillaume Pitel, Frank Ruis, Hamidreza Kadkhodaei, Nelly Litvak

    Abstract: Small and medium enterprises rely on detailed Web analytics to be informed about their market and competition. Focused crawlers meet this demand by crawling and indexing specific parts of the Web. Critically, a focused crawler must quickly find new pages that have not yet been indexed. Since a new page can be discovered only by following a new outlink, predicting new outlinks is very relevant in p… ▽ More

    Submitted 15 November, 2022; v1 submitted 9 November, 2021; originally announced November 2021.

    Comments: 23 pages, 15 figures, 4 tables, uses arxiv.sty, added new title, heuristic features and their results added, figures 7, 14, and 15 updated, accepted version

  47. arXiv:2111.00556  [pdf, other

    cs.LG cs.CL cs.CR

    Revealing and Protecting Labels in Distributed Training

    Authors: Trung Dang, Om Thakkar, Swaroop Ramaswamy, Rajiv Mathews, Peter Chin, Françoise Beaufays

    Abstract: Distributed learning paradigms such as federated learning often involve transmission of model updates, or gradients, over a network, thereby avoiding transmission of private data. However, it is possible for sensitive information about the training data to be revealed from such gradients. Prior works have demonstrated that labels can be revealed analytically from the last layer of certain models (… ▽ More

    Submitted 31 October, 2021; originally announced November 2021.

  48. arXiv:2110.09978  [pdf, other

    cs.AI

    What is Learned in Knowledge Graph Embeddings?

    Authors: Michael R. Douglas, Michael Simkin, Omri Ben-Eliezer, Tianqi Wu, Peter Chin, Trung V. Dang, Andrew Wood

    Abstract: A knowledge graph (KG) is a data structure which represents entities and relations as the vertices and edges of a directed graph with edge types. KGs are an important primitive in modern machine learning and artificial intelligence. Embedding-based models, such as the seminal TransE [Bordes et al., 2013] and the recent PairRE [Chao et al., 2020] are among the most popular and successful approaches… ▽ More

    Submitted 19 October, 2021; originally announced October 2021.

    Comments: 16 pages

    ACM Class: I.2.4

  49. arXiv:2109.05349  [pdf, other

    cs.CL

    HYDRA -- Hyper Dependency Representation Attentions

    Authors: Ha-Thanh Nguyen, Vu Tran, Tran-Binh Dang, Minh-Quan Bui, Minh-Phuong Nguyen, Le-Minh Nguyen

    Abstract: Attention is all we need as long as we have enough data. Even so, it is sometimes not easy to determine how much data is enough while the models are becoming larger and larger. In this paper, we propose HYDRA heads, lightweight pretrained linguistic self-attention heads to inject knowledge into transformer models without pretraining them again. Our approach is a balanced paradigm between leaving t… ▽ More

    Submitted 11 September, 2021; originally announced September 2021.

  50. hEARt: Motion-resilient Heart Rate Monitoring with In-ear Microphones

    Authors: Kayla-Jade Butkow, Ting Dang, Andrea Ferlini, Dong Ma, Cecilia Mascolo

    Abstract: With the soaring adoption of in-ear wearables, the research community has started investigating suitable in-ear heart rate (HR) detection systems. HR is a key physiological marker of cardiovascular health and physical fitness. Continuous and reliable HR monitoring with wearable devices has therefore gained increasing attention in recent years. Existing HR detection systems in wearables mainly rely… ▽ More

    Submitted 10 January, 2023; v1 submitted 20 August, 2021; originally announced August 2021.

    Journal ref: 2023 IEEE International Conference on Pervasive Computing and Communications (PerCom)