Showing 1–50 of 286 results for author: Tran, H

Searching in archive cs.
  1. arXiv:2407.12094  [pdf, other]

    cs.CL

    Identifying Speakers in Dialogue Transcripts: A Text-based Approach Using Pretrained Language Models

    Authors: Minh Nguyen, Franck Dernoncourt, Seunghyun Yoon, Hanieh Deilamsalehy, Hao Tan, Ryan Rossi, Quan Hung Tran, Trung Bui, Thien Huu Nguyen

    Abstract: We introduce an approach to identifying speaker names in dialogue transcripts, a crucial task for enhancing content accessibility and searchability in digital media archives. Despite the advancements in speech recognition, the task of text-based speaker identification (SpeakerID) has received limited attention, lacking large-scale, diverse datasets for effective model training. Addressing these ga…

    Submitted 16 July, 2024; originally announced July 2024.

    Comments: accepted to INTERSPEECH 2024

  2. arXiv:2407.07421  [pdf, other]

    cs.LG cs.AI cs.CR cs.DC

    Federated PCA on Grassmann Manifold for IoT Anomaly Detection

    Authors: Tung-Anh Nguyen, Long Tan Le, Tuan Dung Nguyen, Wei Bao, Suranga Seneviratne, Choong Seon Hong, Nguyen H. Tran

    Abstract: With the proliferation of the Internet of Things (IoT) and the rising interconnectedness of devices, network security faces significant challenges, especially from anomalous activities. While traditional machine learning-based intrusion detection systems (ML-IDS) effectively employ supervised learning methods, they possess limitations such as the requirement for labeled data and challenges with hi…

    Submitted 10 July, 2024; originally announced July 2024.

    Comments: Accepted for publication at IEEE/ACM Transactions on Networking

    Journal ref: IEEE/ACM Transactions on Networking On page(s): 1-16 Print ISSN: 1063-6692 Online ISSN: 1558-2566 Digital Object Identifier: 10.1109/TNET.2024.3423780

  3. arXiv:2407.02419  [pdf, other]

    quant-ph cs.LG stat.ML

    Quantum Curriculum Learning

    Authors: Quoc Hoan Tran, Yasuhiro Endo, Hirotaka Oshima

    Abstract: Quantum machine learning (QML) requires significant quantum resources to achieve quantum advantage. Research should prioritize both the efficient design of quantum architectures and the development of learning strategies to optimize resource usage. We propose a framework called quantum curriculum learning (Q-CurL) for quantum data, where the curriculum introduces simpler tasks or data to the learn…

    Submitted 11 July, 2024; v1 submitted 2 July, 2024; originally announced July 2024.

    Comments: main 6 pages, supplementary materials 8 pages (update the supplementary materials)

  4. arXiv:2407.01825  [pdf, other]

    cs.LG math.OC

    Empirical Tests of Optimization Assumptions in Deep Learning

    Authors: Hoang Tran, Qinzi Zhang, Ashok Cutkosky

    Abstract: There is a significant gap between our theoretical understanding of optimization algorithms used in deep learning and their practical performance. Theoretical development usually focuses on proving convergence guarantees under a variety of different assumptions, which are themselves often chosen based on a rough combination of intuitive match to practice and analytical convenience. The theory/prac…

    Submitted 1 July, 2024; originally announced July 2024.

  5. arXiv:2406.19579  [pdf, ps, other]

    math.OC cs.CR cs.LG

    Private Zeroth-Order Nonsmooth Nonconvex Optimization

    Authors: Qinzi Zhang, Hoang Tran, Ashok Cutkosky

    Abstract: We introduce a new zeroth-order algorithm for private stochastic optimization on nonconvex and nonsmooth objectives. Given a dataset of size $M$, our algorithm ensures $(\alpha,\alpha\rho^2/2)$-Rényi differential privacy and finds a $(\delta,\epsilon)$-stationary point so long as $M=\tilde{\Omega}\left(\frac{d}{\delta\epsilon^3} + \frac{d^{3/2}}{\rho\delta\epsilon^2}\right)$. This matches the optimal complexity of its non-private zeroth-order analog. Nota…

    Submitted 27 June, 2024; originally announced June 2024.

  6. arXiv:2406.18316  [pdf, other]

    quant-ph cs.LG

    Trade-off between Gradient Measurement Efficiency and Expressivity in Deep Quantum Neural Networks

    Authors: Koki Chinzei, Shinichiro Yamano, Quoc Hoan Tran, Yasuhiro Endo, Hirotaka Oshima

    Abstract: Quantum neural networks (QNNs) require an efficient training algorithm to achieve practical quantum advantages. A promising approach is the use of gradient-based optimization algorithms, where gradients are estimated through quantum measurements. However, it is generally difficult to efficiently measure gradients in QNNs because the quantum state collapses upon measurement. In this work, we prove…

    Submitted 26 June, 2024; originally announced June 2024.

    Comments: 32 pages, 11 figures

  7. arXiv:2406.17335  [pdf, other]

    cs.IR cs.LG

    A Thorough Performance Benchmarking on Lightweight Embedding-based Recommender Systems

    Authors: Hung Vinh Tran, Tong Chen, Quoc Viet Hung Nguyen, Zi Huang, Lizhen Cui, Hongzhi Yin

    Abstract: Since the creation of the Web, recommender systems (RSs) have been an indispensable mechanism in information filtering. State-of-the-art RSs primarily depend on categorical features, which are encoded by embedding vectors, resulting in excessively large embedding tables. To prevent over-parameterized embedding tables from harming scalability, both academia and industry have seen increasing efforts in c…

    Submitted 25 June, 2024; originally announced June 2024.

  8. arXiv:2406.09205  [pdf, other]

    cs.CL cs.AI

    ReadCtrl: Personalizing text generation with readability-controlled instruction learning

    Authors: Hieu Tran, Zonghai Yao, Lingxi Li, Hong Yu

    Abstract: Content generation conditioning on users' readability is an important application for personalization. In an era of large language models (LLMs), readability-controlled text generation based on LLMs has become increasingly important. This paper introduces a novel methodology called "Readability-Controlled Instruction Learning (ReadCtrl)," which aims to instruction-tune LLMs to tailor users' reada…

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: 9 pages

  9. arXiv:2406.09128  [pdf, other]

    cs.CL

    CoastTerm: a Corpus for Multidisciplinary Term Extraction in Coastal Scientific Literature

    Authors: Julien Delaunay, Hanh Thi Hong Tran, Carlos-Emiliano González-Gallardo, Georgeta Bordea, Mathilde Ducos, Nicolas Sidere, Antoine Doucet, Senja Pollak, Olivier De Viron

    Abstract: The growing impact of climate change on coastal areas, particularly active but fragile regions, necessitates collaboration among diverse stakeholders and disciplines to formulate effective environmental protection policies. We introduce a novel specialized corpus comprising 2,491 sentences from 410 scientific abstracts concerning coastal areas, for the Automatic Term Extraction (ATE) and Classific…

    Submitted 13 June, 2024; originally announced June 2024.

  10. arXiv:2405.16748  [pdf]

    cs.CV cs.LG

    Hypergraph Laplacian Eigenmaps and Face Recognition Problems

    Authors: Loc Hoang Tran

    Abstract: Face recognition is a very important topic in data science and biometric security research areas. It has multiple applications in military, finance, and retail, to name a few. In this paper, the novel hypergraph Laplacian Eigenmaps will be proposed and combined with the k nearest-neighbor method and/or with the kernel ridge regression method to solve the face recognition problem. Experimental resul…

    Submitted 26 May, 2024; originally announced May 2024.

  11. arXiv:2405.16148  [pdf, other]

    cs.LG

    Accelerating Transformers with Spectrum-Preserving Token Merging

    Authors: Hoai-Chau Tran, Duy M. H. Nguyen, Duy M. Nguyen, Trung-Tin Nguyen, Ngan Le, Pengtao Xie, Daniel Sonntag, James Y. Zou, Binh T. Nguyen, Mathias Niepert

    Abstract: Increasing the throughput of the Transformer architecture, a foundational component used in numerous state-of-the-art models for vision and language tasks (e.g., GPT, LLaVa), is an important problem in machine learning. One recent and effective strategy is to merge token representations within Transformer models, aiming to reduce computational and memory requirements while maintaining accuracy. Pr…

    Submitted 25 May, 2024; originally announced May 2024.

    Comments: Version 1

  12. arXiv:2405.15230  [pdf, other]

    cs.AI cs.LG

    $i$REPO: $i$mplicit Reward Pairwise Difference based Empirical Preference Optimization

    Authors: Long Tan Le, Han Shu, Tung-Anh Nguyen, Choong Seon Hong, Nguyen H. Tran

    Abstract: While astonishingly capable, Large Language Models (LLMs) can sometimes produce outputs that deviate from human expectations. Such deviations necessitate an alignment phase to prevent disseminating untruthful, toxic, or biased information. Traditional alignment methods based on reinforcement learning often struggle with the identified instability, whereas preference optimization methods are limited…

    Submitted 24 May, 2024; originally announced May 2024.

    Comments: Under Review

  13. arXiv:2405.11456  [pdf, other]

    cs.CR

    Biometrics-Based Authenticated Key Exchange with Multi-Factor Fuzzy Extractor

    Authors: Hong Yen Tran, Jiankun Hu, Wen Hu

    Abstract: Existing fuzzy extractors and similar methods provide an effective way for extracting a secret key from a user's biometric data, but are susceptible to impersonation attack: once a valid biometric sample is captured, the scheme is no longer secure. We propose a novel multi-factor fuzzy extractor that integrates both a user's secret (e.g., a password) and a user's biometrics in the generation and r…

    Submitted 19 May, 2024; originally announced May 2024.

    Comments: 17 pages

  14. arXiv:2404.09951  [pdf, other]

    cs.CV

    Unifying Global and Local Scene Entities Modelling for Precise Action Spotting

    Authors: Kim Hoang Tran, Phuc Vuong Do, Ngoc Quoc Ly, Ngan Le

    Abstract: Sports videos pose complex challenges, including cluttered backgrounds, camera angle changes, small action-representing objects, and imbalanced action class distribution. Existing methods for detecting actions in sports videos heavily rely on global features, utilizing a backbone network as a black box that encompasses the entire spatial frame. However, these approaches tend to overlook the nuance…

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: Accepted to IJCNN 2024

  15. arXiv:2404.09275  [pdf, other]

    cs.CV cs.AI cs.CL cs.LG

    TrafficVLM: A Controllable Visual Language Model for Traffic Video Captioning

    Authors: Quang Minh Dinh, Minh Khoi Ho, Anh Quan Dang, Hung Phong Tran

    Abstract: Traffic video description and analysis have received much attention recently due to the growing demand for efficient and reliable urban surveillance systems. Most existing methods only focus on locating traffic event segments, which severely lack descriptive details related to the behaviour and context of all the subjects of interest in the events. In this paper, we present TrafficVLM, a novel mul…

    Submitted 14 April, 2024; originally announced April 2024.

    Journal ref: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, 2024, pp. 7134-7143

  16. arXiv:2404.05393  [pdf, other]

    cs.CV cs.AI

    PAT: Pixel-wise Adaptive Training for Long-tailed Segmentation

    Authors: Khoi Do, Duong Nguyen, Nguyen H. Tran, Viet Dung Nguyen

    Abstract: Beyond class frequency, we recognize the impact of class-wise relationships among various class-specific predictions and the imbalance in label masks on long-tailed segmentation learning. To address these challenges, we propose an innovative Pixel-wise Adaptive Training (PAT) technique tailored for long-tailed segmentation. PAT has two key features: 1) class-wise gradient magnitude homogenization,…

    Submitted 10 July, 2024; v1 submitted 8 April, 2024; originally announced April 2024.

  17. arXiv:2403.03180  [pdf, other]

    math.OC cs.LG

    Shuffling Momentum Gradient Algorithm for Convex Optimization

    Authors: Trang H. Tran, Quoc Tran-Dinh, Lam M. Nguyen

    Abstract: The Stochastic Gradient Descent method (SGD) and its stochastic variants have become methods of choice for solving finite-sum optimization problems arising from machine learning and data science thanks to their ability to handle large-scale applications and big datasets. In the last decades, researchers have made substantial effort to study the theoretical performance of SGD and its shuffling vari…

    Submitted 5 March, 2024; originally announced March 2024.

    Comments: Vietnam Journal of Mathematics (VJOM), Special issue dedicated to Dr. Tamás Terlaky on the occasion of his 70th birthday, 2024

  18. arXiv:2402.17311  [pdf, other]

    cs.CL

    SKT5SciSumm -- A Hybrid Generative Approach for Multi-Document Scientific Summarization

    Authors: Huy Quoc To, Hung-Nghiep Tran, André Greiner-Petter, Felix Beierle, Akiko Aizawa

    Abstract: Summarization for scientific text has shown significant benefits both for the research community and human society. Given the fact that the nature of scientific text is distinctive and the input of the multi-document summarization task is substantially long, the task requires sufficient embedding generation and text truncation without losing important information. To tackle these issues, in this p…

    Submitted 27 February, 2024; originally announced February 2024.

  19. arXiv:2402.14874  [pdf, other]

    cs.CL cs.AI cs.LG

    Distillation Contrastive Decoding: Improving LLMs Reasoning with Contrastive Decoding and Distillation

    Authors: Phuc Phan, Hieu Tran, Long Phan

    Abstract: We propose a straightforward approach called Distillation Contrastive Decoding (DCD) to enhance the reasoning capabilities of Large Language Models (LLMs) during inference. In contrast to previous approaches that relied on smaller amateur models or analysis of hidden state differences, DCD employs Contrastive Chain-of-thought Prompting and advanced distillation techniques, including Dropout and Qu…

    Submitted 21 February, 2024; originally announced February 2024.

    Comments: Under Review

  20. arXiv:2402.13822  [pdf, other]

    cs.CV

    MSTAR: Multi-Scale Backbone Architecture Search for Timeseries Classification

    Authors: Tue M. Cao, Nhat H. Tran, Hieu H. Pham, Hung T. Nguyen, Le P. Nguyen

    Abstract: Most of the previous approaches to Time Series Classification (TSC) highlight the significance of receptive fields and frequencies while overlooking the time resolution. Hence, they unavoidably suffer from scalability issues, as they integrate an extensive range of receptive fields into classification models. Other methods, while having a better adaptation for large datasets, require manual design an…

    Submitted 21 February, 2024; originally announced February 2024.

  21. arXiv:2402.11739  [pdf, other]

    eess.SY cs.LG

    A Transition System Abstraction Framework for Neural Network Dynamical System Models

    Authors: Yejiang Yang, Zihao Mo, Hoang-Dung Tran, Weiming Xiang

    Abstract: This paper proposes a transition system abstraction framework for neural network dynamical system models to enhance the model interpretability, with applications to complex dynamical systems such as human behavior learning and verification. To begin with, the localized working zone will be segmented into multiple localized partitions under the data-driven Maximum Entropy (ME) partitioning method.…

    Submitted 18 February, 2024; originally announced February 2024.

    Comments: ACC 2024

  22. How good are my search strings? Reflections on using an existing review as a quasi-gold standard

    Authors: Huynh Khanh Vi Tran, Jürgen Börstler, Nauman Bin Ali, Michael Unterkalmsteiner

    Abstract: Background: Systematic literature studies (SLS) have become a core research methodology in Evidence-based Software Engineering (EBSE). Search completeness, i.e., finding all relevant papers on the topic of interest, has been recognized as one of the most commonly discussed validity issues of SLSs. Aim: This study aims at raising awareness on the issues related to search string construction and on se…

    Submitted 16 February, 2024; originally announced February 2024.

    Journal ref: e Informatica Softw. Eng. J. 16(1) (2022)

  23. arXiv:2402.10765  [pdf, other]

    cs.LG cs.AI

    Policy Learning for Off-Dynamics RL with Deficient Support

    Authors: Linh Le Pham Van, Hung The Tran, Sunil Gupta

    Abstract: Reinforcement Learning (RL) can effectively learn complex policies. However, learning these policies often demands extensive trial-and-error interactions with the environment. In many real-world scenarios, this approach is not practical due to the high costs of data collection and safety concerns. As a result, a common strategy is to transfer a policy trained in a low-cost, rapid source simulator…

    Submitted 16 February, 2024; originally announced February 2024.

    Comments: Accepted by AAMAS 2024 as a full paper

  24. Assessing test artifact quality -- A tertiary study

    Authors: Huynh Khanh Vi Tran, Michael Unterkalmsteiner, Jürgen Börstler, Nauman bin Ali

    Abstract: Context: Modern software development increasingly relies on software testing for an ever more frequent delivery of high quality software. This puts high demands on the quality of the central artifacts in software testing, test suites and test cases. Objective: We aim to develop a comprehensive model for capturing the dimensions of test case/suite quality, which are relevant for a variety of perspe…

    Submitted 14 February, 2024; originally announced February 2024.

    Journal ref: Information and Software Technology 139 (2021): 106620

  25. arXiv:2402.03243  [pdf, other]

    cs.LG

    PINN-BO: A Black-box Optimization Algorithm using Physics-Informed Neural Networks

    Authors: Dat Phan-Trong, Hung The Tran, Alistair Shilton, Sunil Gupta

    Abstract: Black-box optimization is a powerful approach for discovering global optima in noisy and expensive black-box functions, a problem widely encountered in real-world scenarios. Recently, there has been a growing interest in leveraging domain knowledge to enhance the efficacy of machine learning methods. Partial Differential Equations (PDEs) often provide an effective means for elucidating the fundame…

    Submitted 5 February, 2024; originally announced February 2024.

  26. arXiv:2402.02345  [pdf, other]

    cs.LG cs.AI cs.CV stat.ML

    Stereographic Spherical Sliced Wasserstein Distances

    Authors: Huy Tran, Yikun Bai, Abihith Kothapalli, Ashkan Shahbazi, Xinran Liu, Rocio Diaz Martin, Soheil Kolouri

    Abstract: Comparing spherical probability distributions is of great interest in various fields, including geology, medical domains, computer vision, and deep representation learning. The utility of optimal transport-based distances, such as the Wasserstein distance, for comparing probability measures has spurred active research in developing computationally efficient variations of these distances for spheri…

    Submitted 9 June, 2024; v1 submitted 4 February, 2024; originally announced February 2024.

    Comments: Published at ICML 2024 (Spotlight). Project page: https://abi-kothapalli.github.io/s3w/

  27. arXiv:2402.02006  [pdf, other]

    cs.LG

    PresAIse, A Prescriptive AI Solution for Enterprises

    Authors: Wei Sun, Scott McFaddin, Linh Ha Tran, Shivaram Subramanian, Kristjan Greenewald, Yeshi Tenzin, Zack Xue, Youssef Drissi, Markus Ettl

    Abstract: Prescriptive AI represents a transformative shift in decision-making, offering causal insights and actionable recommendations. Despite its huge potential, enterprise adoption often faces several challenges. The first challenge is caused by the limitations of observational data for accurate causal inference which is typically a prerequisite for good decision-making. The second pertains to the inter…

    Submitted 12 February, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

    Comments: 14 pages

  28. arXiv:2401.15952  [pdf, other]

    cs.LG cs.AI cs.CV

    A Class-aware Optimal Transport Approach with Higher-Order Moment Matching for Unsupervised Domain Adaptation

    Authors: Tuan Nguyen, Van Nguyen, Trung Le, He Zhao, Quan Hung Tran, Dinh Phung

    Abstract: Unsupervised domain adaptation (UDA) aims to transfer knowledge from a labeled source domain to an unlabeled target domain. In this paper, we introduce a novel approach called class-aware optimal transport (OT), which measures the OT distance between a distribution over the source class-conditional distributions and a mixture of source and target data distribution. Our class-aware OT leverages a c…

    Submitted 29 January, 2024; originally announced January 2024.

    Comments: 18 pages

  29. arXiv:2401.05367  [pdf, other]

    eess.SP cs.LG

    Context-Aware Stress Monitoring using Wearable and Mobile Technologies in Everyday Settings

    Authors: Seyed Amir Hossein Aqajari, Sina Labbaf, Phuc Hoang Tran, Brenda Nguyen, Milad Asgari Mehrabadi, Marco Levorato, Nikil Dutt, Amir M. Rahmani

    Abstract: Daily monitoring of stress is a critical component of maintaining optimal physical and mental health. Physiological signals and contextual information have recently emerged as promising indicators for detecting instances of heightened stress. Nonetheless, developing a real-time monitoring system that utilizes both physiological and contextual data to anticipate stress levels in everyday settings w…

    Submitted 14 December, 2023; originally announced January 2024.

  30. arXiv:2312.15561  [pdf, other]

    cs.CL cs.AI

    README: Bridging Medical Jargon and Lay Understanding for Patient Education through Data-Centric NLP

    Authors: Zonghai Yao, Nandyala Siddharth Kantu, Guanghao Wei, Hieu Tran, Zhangqi Duan, Sunjae Kwon, Zhichao Yang, README annotation team, Hong Yu

    Abstract: The advancement in healthcare has shifted focus toward patient-centric approaches, particularly in self-care and patient education, facilitated by access to Electronic Health Records (EHR). However, medical jargon in EHRs poses significant challenges in patient comprehension. To address this, we introduce a new task of automatically generating lay definitions, aiming to simplify complex medical te…

    Submitted 16 June, 2024; v1 submitted 24 December, 2023; originally announced December 2023.

  31. arXiv:2312.09445  [pdf, other]

    eess.SP cs.CV cs.LG

    IncepSE: Leveraging InceptionTime's performance with Squeeze and Excitation mechanism in ECG analysis

    Authors: Tue Minh Cao, Nhat Hong Tran, Le Phi Nguyen, Hieu Huy Pham, Hung Thanh Nguyen

    Abstract: Our study focuses on the potential for modifications of Inception-like architecture within the electrocardiogram (ECG) domain. To this end, we introduce IncepSE, a novel network characterized by strategic architectural incorporation that leverages the strengths of both InceptionTime and channel attention mechanisms. Furthermore, we propose a training setup that employs stabilization techniques tha…

    Submitted 16 November, 2023; originally announced December 2023.

  32. arXiv:2311.17449  [pdf, other]

    cs.CV

    Weakly-semi-supervised object detection in remotely sensed imagery

    Authors: Ji Hun Wang, Jeremy Irvin, Beri Kohen Behar, Ha Tran, Raghav Samavedam, Quentin Hsu, Andrew Y. Ng

    Abstract: Deep learning for detecting objects in remotely sensed imagery can enable new technologies for important applications including mitigating climate change. However, these models often require large datasets labeled with bounding box annotations which are expensive to curate, prohibiting the development of models for new tasks and geographies. To address this challenge, we develop weakly-semi-superv…

    Submitted 29 November, 2023; originally announced November 2023.

    Comments: Tackling Climate Change with Machine Learning at NeurIPS 2023

  33. arXiv:2311.12290  [pdf, other]

    cs.LG

    A Supervised Contrastive Learning Pretrain-Finetune Approach for Time Series

    Authors: Trang H. Tran, Lam M. Nguyen, Kyongmin Yeo, Nam Nguyen, Roman Vaculin

    Abstract: Foundation models have recently gained attention within the field of machine learning thanks to their efficiency in broad data processing. While researchers have attempted to extend this success to time series models, the main challenge is effectively extracting representations and transferring knowledge from pretraining datasets to the target finetuning dataset. To tackle this issue, we introduce a…

    Submitted 20 November, 2023; originally announced November 2023.

  34. arXiv:2311.12282  [pdf, other]

    math.NA cs.LG

    Orthogonally weighted $\ell_{2,1}$ regularization for rank-aware joint sparse recovery: algorithm and analysis

    Authors: Armenak Petrosyan, Konstantin Pieper, Hoang Tran

    Abstract: We propose and analyze an efficient algorithm for solving the joint sparse recovery problem using a new regularization-based method, named orthogonally weighted $\ell_{2,1}$ ($\mathit{ow}\ell_{2,1}$), which is specifically designed to take into account the rank of the solution matrix. This method has applications in feature extraction, matrix column selection, and dictionary learning, and it is di…

    Submitted 20 November, 2023; originally announced November 2023.

  35. SniffyArt: The Dataset of Smelling Persons

    Authors: Mathias Zinnen, Azhar Hussian, Hang Tran, Prathmesh Madhu, Andreas Maier, Vincent Christlein

    Abstract: Smell gestures play a crucial role in the investigation of past smells in the visual arts yet their automated recognition poses significant challenges. This paper introduces the SniffyArt dataset, consisting of 1941 individuals represented in 441 historical artworks. Each person is annotated with a tightly fitting bounding box, 17 pose keypoints, and a gesture label. By integrating these annotatio…

    Submitted 20 November, 2023; originally announced November 2023.

    Comments: 10 pages, 8 figures

    Journal ref: Proceedings of the 5th Workshop on analySis, Understanding and proMotion of heritAge Contents. 2023. S. 49-58

  36. arXiv:2311.04292  [pdf, other]

    cs.CL

    Aspect-based Meeting Transcript Summarization: A Two-Stage Approach with Weak Supervision on Sentence Classification

    Authors: Zhongfen Deng, Seunghyun Yoon, Trung Bui, Franck Dernoncourt, Quan Hung Tran, Shuaiqi Liu, Wenting Zhao, Tao Zhang, Yibo Wang, Philip S. Yu

    Abstract: Aspect-based meeting transcript summarization aims to produce multiple summaries, each focusing on one aspect of content in a meeting transcript. It is challenging as sentences related to different aspects can mingle together, and those relevant to a specific aspect can be scattered throughout the long transcript of a meeting. The traditional summarization methods produce one summary mixing inform…

    Submitted 7 November, 2023; originally announced November 2023.

    Comments: Accepted by 2023 IEEE International Conference on Big Data

  37. arXiv:2310.19975  [pdf, other]

    cs.CL cs.AI

    BioInstruct: Instruction Tuning of Large Language Models for Biomedical Natural Language Processing

    Authors: Hieu Tran, Zhichao Yang, Zonghai Yao, Hong Yu

    Abstract: To enhance the performance of large language models (LLMs) in biomedical natural language processing (BioNLP) by introducing a domain-specific instruction dataset and examining its impact when combined with multi-task learning principles. We created the BioInstruct, comprising 25,005 instructions to instruction-tune LLMs (LLaMA 1 & 2, 7B & 13B version). The instructions were created by prompting th…

    Submitted 6 June, 2024; v1 submitted 30 October, 2023; originally announced October 2023.

    Comments: This article has been accepted for publication in Journal of the American Medical Informatics Association Published by Oxford University Press. https://academic.oup.com/jamia/advance-article-abstract/doi/10.1093/jamia/ocae122/7687618

  38. arXiv:2310.00418  [pdf, other]

    eess.IV cs.CV

    MVC: A Multi-Task Vision Transformer Network for COVID-19 Diagnosis from Chest X-ray Images

    Authors: Huyen Tran, Duc Thanh Nguyen, John Yearwood

    Abstract: Medical image analysis using computer-based algorithms has attracted considerable attention from the research community and achieved tremendous progress in the last decade. With recent advances in computing resources and availability of large-scale medical image datasets, many deep learning models have been developed for disease diagnosis from medical images. However, existing techniques focus on…

    Submitted 30 September, 2023; originally announced October 2023.

  39. arXiv:2310.00258  [pdf, other]

    cs.CV

    NAYER: Noisy Layer Data Generation for Efficient and Effective Data-free Knowledge Distillation

    Authors: Minh-Tuan Tran, Trung Le, Xuan-May Le, Mehrtash Harandi, Quan Hung Tran, Dinh Phung

    Abstract: Data-Free Knowledge Distillation (DFKD) has made significant recent strides by transferring knowledge from a teacher neural network to a student neural network without accessing the original data. Nonetheless, existing approaches encounter a significant challenge when attempting to generate samples from random noise inputs, which inherently lack meaningful information. Consequently, these models s…

    Submitted 21 March, 2024; v1 submitted 30 September, 2023; originally announced October 2023.

    Comments: Accepted at CVPR 2024

  40. arXiv:2309.16960  [pdf, other]

    cs.AI

    On Generating Explanations for Reinforcement Learning Policies: An Empirical Study

    Authors: Mikihisa Yuasa, Huy T. Tran, Ramavarapu S. Sreenivas

    Abstract: Understanding a reinforcement learning policy, which guides state-to-action mappings to maximize rewards, necessitates an accompanying explanation for human comprehension. In this paper, we introduce a set of linear temporal logic (LTL) formulae designed to provide explanations for policies, and an algorithm for searching through those formulae for the one that best explains a gi…

    Submitted 5 March, 2024; v1 submitted 28 September, 2023; originally announced September 2023.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  41. Test-Case Quality -- Understanding Practitioners' Perspectives

    Authors: Huynh Khanh Vi Tran, Nauman Bin Ali, Jürgen Börstler, Michael Unterkalmsteiner

    Abstract: Background: Test-case quality has always been one of the major concerns in software testing. To improve test-case quality, it is important to better understand how practitioners perceive the quality of test-cases. Objective: Motivated by that need, we investigated how practitioners define test-case quality and which aspects of test-cases are important for quality assessment. Method: We conducted s…

    Submitted 28 September, 2023; originally announced September 2023.

    Comments: PROFES 2019: 37-52

  42. arXiv:2309.16396  [pdf, other]

    cs.CL

    A Comprehensive Survey of Document-level Relation Extraction (2016-2023)

    Authors: Julien Delaunay, Hanh Thi Hong Tran, Carlos-Emiliano González-Gallardo, Georgeta Bordea, Nicolas Sidere, Antoine Doucet

    Abstract: Document-level relation extraction (DocRE) is an active area of research in natural language processing (NLP) concerned with identifying and extracting relationships between entities beyond sentence boundaries. Compared to the more traditional sentence-level relation extraction, DocRE provides a broader context for analysis and is more challenging because it involves identifying relationships that…

    Submitted 12 October, 2023; v1 submitted 28 September, 2023; originally announced September 2023.

    ACM Class: A.1

  43. arXiv:2309.15787  [pdf, other]

    cs.CV cs.LG

    Partial Transport for Point-Cloud Registration

    Authors: Yikun Bai, Huy Tran, Steven B. Damelin, Soheil Kolouri

    Abstract: Point cloud registration plays a crucial role in various fields, including robotics, computer graphics, and medical imaging. This process involves determining spatial relationships between different sets of points, typically within a 3D space. In real-world scenarios, complexities arise from non-rigid movements and partial visibility, such as occlusions or sensor noise, making non-rigid registrati…

    Submitted 27 September, 2023; originally announced September 2023.

  44. arXiv:2309.15659  [pdf, other]

    cs.LG cs.DC

    Federated Deep Equilibrium Learning: A Compact Shared Representation for Edge Communication Efficiency

    Authors: Long Tan Le, Tuan Dung Nguyen, Tung-Anh Nguyen, Choong Seon Hong, Nguyen H. Tran

    Abstract: Federated Learning (FL) is a prominent distributed learning paradigm facilitating collaboration among nodes within an edge network to co-train a global model without centralizing data. By shifting computation to the network edge, FL offers robust and responsive edge-AI solutions and enhances privacy-preservation. However, deploying deep FL models within edge environments is often hindered by commun…

    Submitted 27 September, 2023; originally announced September 2023.

  45. arXiv:2309.10150  [pdf, other]

    cs.RO cs.AI cs.LG

    Q-Transformer: Scalable Offline Reinforcement Learning via Autoregressive Q-Functions

    Authors: Yevgen Chebotar, Quan Vuong, Alex Irpan, Karol Hausman, Fei Xia, Yao Lu, Aviral Kumar, Tianhe Yu, Alexander Herzog, Karl Pertsch, Keerthana Gopalakrishnan, Julian Ibarz, Ofir Nachum, Sumedh Sontakke, Grecia Salazar, Huong T Tran, Jodilyn Peralta, Clayton Tan, Deeksha Manjunath, Jaspiar Singht, Brianna Zitkovich, Tomas Jackson, Kanishka Rao, Chelsea Finn, Sergey Levine

    Abstract: In this work, we present a scalable reinforcement learning method for training multi-task policies from large offline datasets that can leverage both human demonstrations and autonomously collected data. Our method uses a Transformer to provide a scalable representation for Q-functions trained via offline temporal difference backups. We therefore refer to the method as Q-Transformer. By discretizi…

    Submitted 17 October, 2023; v1 submitted 18 September, 2023; originally announced September 2023.

    Comments: See website at https://qtransformer.github.io

  46. arXiv:2309.00585  [pdf, other]

    cs.LG cond-mat.soft

    PolyGET: Accelerating Polymer Simulations by Accurate and Generalizable Forcefield with Equivariant Transformer

    Authors: Rui Feng, Huan Tran, Aubrey Toland, Binghong Chen, Qi Zhu, Rampi Ramprasad, Chao Zhang

    Abstract: Polymer simulation with both accuracy and efficiency is a challenging task. Machine learning (ML) forcefields have been developed to achieve both the accuracy of ab initio methods and the efficiency of empirical force fields. However, existing ML force fields are usually limited to single-molecule settings, and their simulations are not robust enough. In this paper, we present PolyGET, a new frame…

    Submitted 1 September, 2023; originally announced September 2023.

  47. A Text-based Approach For Link Prediction on Wikipedia Articles

    Authors: Anh Hoang Tran, Tam Minh Nguyen, Son T. Luu

    Abstract: This paper presents our work in the DSAA 2023 Challenge about Link Prediction for Wikipedia Articles. We use traditional machine learning models with POS tags (part-of-speech tags) features extracted from text to train the classification model for predicting whether two nodes have a link. Then, we use these tags to test on various machine learning models. We obtained the results by F1 score at 0.9…

    Submitted 6 November, 2023; v1 submitted 1 September, 2023; originally announced September 2023.

    Comments: Accepted by DSAA 2023 Conference in the DSAA Student Competition Section

  48. arXiv:2308.14759  [pdf, other]

    physics.chem-ph cs.AI cs.LG q-bio.BM

    May the Force be with You: Unified Force-Centric Pre-Training for 3D Molecular Conformations

    Authors: Rui Feng, Qi Zhu, Huan Tran, Binghong Chen, Aubrey Toland, Rampi Ramprasad, Chao Zhang

    Abstract: Recent works have shown the promise of learning pre-trained models for 3D molecular representation. However, existing pre-training models focus predominantly on equilibrium data and largely overlook off-equilibrium conformations. It is challenging to extend these methods to off-equilibrium data because their training objective relies on assumptions of conformations being the local energy minima. W…

    Submitted 23 August, 2023; originally announced August 2023.

  49. arXiv:2308.02344  [pdf, ps, other]

    math.ST cs.LG stat.CO stat.ME stat.ML

    Learning Networks from Gaussian Graphical Models and Gaussian Free Fields

    Authors: Subhro Ghosh, Soumendu Sundar Mukherjee, Hoang-Son Tran, Ujan Gangopadhyay

    Abstract: We investigate the problem of estimating the structure of a weighted network from repeated measurements of a Gaussian Graphical Model (GGM) on the network. In this vein, we consider GGMs whose covariance structures align with the geometry of the weighted network on which they are based. Such GGMs have been of longstanding interest in statistical physics, and are referred to as the Gaussian Free Fi…

    Submitted 4 August, 2023; originally announced August 2023.

  50. arXiv:2307.15818  [pdf, other]

    cs.RO cs.CL cs.CV cs.LG

    RT-2: Vision-Language-Action Models Transfer Web Knowledge to Robotic Control

    Authors: Anthony Brohan, Noah Brown, Justice Carbajal, Yevgen Chebotar, Xi Chen, Krzysztof Choromanski, Tianli Ding, Danny Driess, Avinava Dubey, Chelsea Finn, Pete Florence, Chuyuan Fu, Montse Gonzalez Arenas, Keerthana Gopalakrishnan, Kehang Han, Karol Hausman, Alexander Herzog, Jasmine Hsu, Brian Ichter, Alex Irpan, Nikhil Joshi, Ryan Julian, Dmitry Kalashnikov, Yuheng Kuang, Isabel Leal, et al. (29 additional authors not shown)

    Abstract: We study how vision-language models trained on Internet-scale data can be incorporated directly into end-to-end robotic control to boost generalization and enable emergent semantic reasoning. Our goal is to enable a single end-to-end trained model to both learn to map robot observations to actions and enjoy the benefits of large-scale pretraining on language and vision-language data from the web.…

    Submitted 28 July, 2023; originally announced July 2023.

    Comments: Website: https://robotics-transformer.github.io/