Zum Hauptinhalt springen

Showing 1–30 of 30 results for author: Hy, T

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.05822  [pdf, other

    cs.LG cs.CV

    Sampling Foundational Transformer: A Theoretical Perspective

    Authors: Viet Anh Nguyen, Minh Lenhat, Khoa Nguyen, Duong Duc Hieu, Dao Huu Hung, Truong Son Hy

    Abstract: The versatility of self-attention mechanism earned transformers great success in almost all data modalities, with limitations on the quadratic complexity and difficulty of training. To apply transformers across different data modalities, practitioners have to make specific clever data-modality-dependent constructions. In this paper, we propose Sampling Foundational Transformer (SFT) that can work… ▽ More

    Submitted 17 August, 2024; v1 submitted 11 August, 2024; originally announced August 2024.

  2. arXiv:2408.05391  [pdf, other

    cs.LG

    SAMSA: Efficient Transformer for Many Data Modalities

    Authors: Minh Lenhat, Viet Anh Nguyen, Khoa Nguyen, Duong Duc Hieu, Dao Huu Hung, Truong Son Hy

    Abstract: The versatility of self-attention mechanism earned transformers great success in almost all data modalities, with limitations on the quadratic complexity and difficulty of training. Efficient transformers, on the other hand, often rely on clever data-modality-dependent construction to get over the quadratic complexity of transformers. This greatly hinders their applications on different data modal… ▽ More

    Submitted 18 August, 2024; v1 submitted 9 August, 2024; originally announced August 2024.

  3. arXiv:2408.04174  [pdf, other

    cs.CL cs.AI cs.IR cs.LG cs.SD eess.AS

    wav2graph: A Framework for Supervised Learning Knowledge Graph from Speech

    Authors: Khai Le-Duc, Quy-Anh Dang, Tan-Hanh Pham, Truong-Son Hy

    Abstract: Knowledge graphs (KGs) enhance the performance of large language models (LLMs) and search engines by providing structured, interconnected data that improves reasoning and context-awareness. However, KGs only focus on text data, thereby neglecting other modalities such as speech. In this work, we introduce wav2graph, the first framework for supervised learning knowledge graph from speech data. Our… ▽ More

    Submitted 7 August, 2024; originally announced August 2024.

    Comments: Preprint, 32 pages

  4. arXiv:2407.21054  [pdf, other

    cs.CL cs.AI cs.LG cs.SD eess.AS

    Sentiment Reasoning for Healthcare

    Authors: Khai Le-Duc, Khai-Nguyen Nguyen, Bach Phan Tat, Duy Le, Jerry Ngo, Long Vo-Dang, Anh Totti Nguyen, Truong-Son Hy

    Abstract: Transparency in AI decision-making is crucial in healthcare due to the severe consequences of errors, and this is important for building trust among AI and users in sentiment analysis task. Incorporating reasoning capabilities helps Large Language Models (LLMs) understand human emotions within broader contexts, handle nuanced and ambiguous language, and infer underlying sentiments that may not be… ▽ More

    Submitted 24 July, 2024; originally announced July 2024.

    Comments: Preprint, 18 pages

  5. arXiv:2407.12064  [pdf, other

    eess.IV cs.CL cs.CV cs.LG cs.MM

    LiteGPT: Large Vision-Language Model for Joint Chest X-ray Localization and Classification Task

    Authors: Khai Le-Duc, Ryan Zhang, Ngoc Son Nguyen, Tan-Hanh Pham, Anh Dao, Ba Hung Ngo, Anh Totti Nguyen, Truong-Son Hy

    Abstract: Vision-language models have been extensively explored across a wide range of tasks, achieving satisfactory performance; however, their application in medical imaging remains underexplored. In this work, we propose a unified framework - LiteGPT - for the medical imaging. We leverage multiple pre-trained visual encoders to enrich information and enhance the performance of vision-language models. To… ▽ More

    Submitted 15 July, 2024; originally announced July 2024.

    Comments: Preprint, 19 pages

  6. arXiv:2407.00609  [pdf, other

    cs.CV cs.LG

    ESGNN: Towards Equivariant Scene Graph Neural Network for 3D Scene Understanding

    Authors: Quang P. M. Pham, Khoi T. N. Nguyen, Lan C. Ngo, Truong Do, Truong Son Hy

    Abstract: Scene graphs have been proven to be useful for various scene understanding tasks due to their compact and explicit nature. However, existing approaches often neglect the importance of maintaining the symmetry-preserving property when generating scene graphs from 3D point clouds. This oversight can diminish the accuracy and robustness of the resulting scene graphs, especially when handling noisy, m… ▽ More

    Submitted 30 June, 2024; originally announced July 2024.

  7. arXiv:2406.15888  [pdf, other

    cs.CL cs.AI cs.LG cs.SD eess.AS

    Real-time Speech Summarization for Medical Conversations

    Authors: Khai Le-Duc, Khai-Nguyen Nguyen, Long Vo-Dang, Truong-Son Hy

    Abstract: In doctor-patient conversations, identifying medically relevant information is crucial, posing the need for conversation summarization. In this work, we propose the first deployable real-time speech summarization system for real-world applications in industry, which generates a local summary after every N speech utterances within a conversation and a global summary after the end of a conversation.… ▽ More

    Submitted 22 June, 2024; originally announced June 2024.

    Comments: Interspeech 2024

  8. arXiv:2406.13337  [pdf, other

    eess.AS cs.CL cs.LG cs.SD

    Medical Spoken Named Entity Recognition

    Authors: Khai Le-Duc, David Thulke, Hung-Phong Tran, Long Vo-Dang, Khai-Nguyen Nguyen, Truong-Son Hy, Ralf Schlüter

    Abstract: Spoken Named Entity Recognition (NER) aims to extracting named entities from speech and categorizing them into types like person, location, organization, etc. In this work, we present VietMed-NER - the first spoken NER dataset in the medical domain. To our best knowledge, our real-world dataset is the largest spoken NER dataset in the world in terms of the number of entity types, featuring 18 dist… ▽ More

    Submitted 20 July, 2024; v1 submitted 19 June, 2024; originally announced June 2024.

    Comments: Preprint, 41 pages

  9. arXiv:2406.00469  [pdf, other

    cs.LG cs.NE

    Learning to Solve Multiresolution Matrix Factorization by Manifold Optimization and Evolutionary Metaheuristics

    Authors: Truong Son Hy, Thieu Khang, Risi Kondor

    Abstract: Multiresolution Matrix Factorization (MMF) is unusual amongst fast matrix factorization algorithms in that it does not make a low rank assumption. This makes MMF especially well suited to modeling certain types of graphs with complex multiscale or hierarchical strucutre. While MMF promises to yields a useful wavelet basis, finding the factorization itself is hard, and existing greedy methods tend… ▽ More

    Submitted 17 August, 2024; v1 submitted 1 June, 2024; originally announced June 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2111.01940

  10. arXiv:2404.00477  [pdf, other

    cs.LG cs.AR

    DE-HNN: An effective neural model for Circuit Netlist representation

    Authors: Zhishang Luo, Truong Son Hy, Puoya Tabaghi, Donghyeon Koh, Michael Defferrard, Elahe Rezaei, Ryan Carey, Rhett Davis, Rajeev Jain, Yusu Wang

    Abstract: The run-time for optimization tools used in chip design has grown with the complexity of designs to the point where it can take several days to go through one design cycle which has become a bottleneck. Designers want fast tools that can quickly give feedback on a design. Using the input and output data of the tools from past designs, one can attempt to build a machine learning model that predicts… ▽ More

    Submitted 16 April, 2024; v1 submitted 30 March, 2024; originally announced April 2024.

  11. arXiv:2402.04821  [pdf, other

    cs.LG

    E(3)-Equivariant Mesh Neural Networks

    Authors: Thuan Trang, Nhat Khang Ngo, Daniel Levy, Thieu N. Vo, Siamak Ravanbakhsh, Truong Son Hy

    Abstract: Triangular meshes are widely used to represent three-dimensional objects. As a result, many recent works have address the need for geometric deep learning on 3D mesh. However, we observe that the complexities in many of these architectures does not translate to practical performance, and simple deep models for geometric graphs are competitive in practice. Motivated by this observation, we minimall… ▽ More

    Submitted 18 February, 2024; v1 submitted 7 February, 2024; originally announced February 2024.

  12. arXiv:2310.15543  [pdf, other

    cs.LG

    Symmetry-preserving graph attention network to solve routing problems at multiple resolutions

    Authors: Cong Dao Tran, Thong Bach, Truong Son Hy

    Abstract: Travelling Salesperson Problems (TSPs) and Vehicle Routing Problems (VRPs) have achieved reasonable improvement in accuracy and computation time with the adaptation of Machine Learning (ML) methods. However, none of the previous works completely respects the symmetries arising from TSPs and VRPs including rotation, translation, permutation, and scaling. In this work, we introduce the first-ever co… ▽ More

    Submitted 19 November, 2023; v1 submitted 24 October, 2023; originally announced October 2023.

  13. arXiv:2310.15516  [pdf, other

    cs.LG

    Graph Attention-based Deep Reinforcement Learning for solving the Chinese Postman Problem with Load-dependent costs

    Authors: Truong Son Hy, Cong Dao Tran

    Abstract: Recently, Deep reinforcement learning (DRL) models have shown promising results in solving routing problems. However, most DRL solvers are commonly proposed to solve node routing problems, such as the Traveling Salesman Problem (TSP). Meanwhile, there has been limited research on applying neural methods to arc routing problems, such as the Chinese Postman Problem (CPP), since they often feature ir… ▽ More

    Submitted 2 March, 2024; v1 submitted 24 October, 2023; originally announced October 2023.

  14. arXiv:2310.14549  [pdf, other

    cs.LG

    Multimodal Graph Learning for Modeling Emerging Pandemics with Big Data

    Authors: Khanh-Tung Tran, Truong Son Hy, Lili Jiang, Xuan-Son Vu

    Abstract: Accurate forecasting and analysis of emerging pandemics play a crucial role in effective public health management and decision-making. Traditional approaches primarily rely on epidemiological data, overlooking other valuable sources of information that could act as sensors or indicators of pandemic patterns. In this paper, we propose a novel framework called MGL4MEP that integrates temporal graph… ▽ More

    Submitted 23 October, 2023; originally announced October 2023.

  15. arXiv:2309.16685  [pdf, other

    q-bio.BM cs.LG

    Target-aware Variational Auto-encoders for Ligand Generation with Multimodal Protein Representation Learning

    Authors: Nhat Khang Ngo, Truong Son Hy

    Abstract: Without knowledge of specific pockets, generating ligands based on the global structure of a protein target plays a crucial role in drug discovery as it helps reduce the search space for potential drug-like candidates in the pipeline. However, contemporary methods require optimizing tailored networks for each protein, which is arduous and costly. To address this issue, we introduce TargetVAE, a ta… ▽ More

    Submitted 2 August, 2023; originally announced September 2023.

  16. arXiv:2306.17090  [pdf, other

    cs.LG

    Sparsity exploitation via discovering graphical models in multi-variate time-series forecasting

    Authors: Ngoc-Dung Do, Truong Son Hy, Duy Khuong Nguyen

    Abstract: Graph neural networks (GNNs) have been widely applied in multi-variate time-series forecasting (MTSF) tasks because of their capability in capturing the correlations among different time-series. These graph-based learning approaches improve the forecasting performance by discovering and understanding the underlying graph structures, which represent the data correlation. When the explicit prior gra… ▽ More

    Submitted 29 June, 2023; originally announced June 2023.

  17. arXiv:2306.12545  [pdf, other

    physics.flu-dyn cs.LG

    Neural Multigrid Memory For Computational Fluid Dynamics

    Authors: Duc Minh Nguyen, Minh Chau Vu, Tuan Anh Nguyen, Tri Huynh, Nguyen Tri Nguyen, Truong Son Hy

    Abstract: Turbulent flow simulation plays a crucial role in various applications, including aircraft and ship design, industrial process optimization, and weather prediction. In this paper, we propose an advanced data-driven method for simulating turbulent flow, representing a significant improvement over existing approaches. Our methodology combines the strengths of Video Prediction Transformer (VPTR) (Ye… ▽ More

    Submitted 24 June, 2023; v1 submitted 21 June, 2023; originally announced June 2023.

    Comments: arXiv admin note: text overlap with arXiv:1911.08655 by other authors

  18. arXiv:2305.07731  [pdf, other

    cs.LG physics.soc-ph

    Predicting COVID-19 pandemic by spatio-temporal graph neural networks: A New Zealand's study

    Authors: Viet Bach Nguyen, Truong Son Hy, Long Tran-Thanh, Nhung Nghiem

    Abstract: Modeling and simulations of pandemic dynamics play an essential role in understanding and addressing the spreading of highly infectious diseases such as COVID-19. In this work, we propose a novel deep learning architecture named Attention-based Multiresolution Graph Neural Networks (ATMGNN) that learns to combine the spatial graph information, i.e. geographical data, with the temporal information,… ▽ More

    Submitted 12 May, 2023; originally announced May 2023.

  19. arXiv:2302.08680  [pdf, other

    cs.LG

    Modeling Polypharmacy and Predicting Drug-Drug Interactions using Deep Generative Models on Multimodal Graphs

    Authors: Nhat Khang Ngo, Truong Son Hy, Risi Kondor

    Abstract: Latent representations of drugs and their targets produced by contemporary graph autoencoder models have proved useful in predicting many types of node-pair interactions on large networks, including drug-drug, drug-target, and target-target interactions. However, most existing approaches model either the node's latent spaces in which node distributions are rigid or do not effectively capture the i… ▽ More

    Submitted 16 February, 2023; originally announced February 2023.

    Comments: arXiv admin note: substantial text overlap with arXiv:2209.09941

  20. Multiresolution Graph Transformers and Wavelet Positional Encoding for Learning Hierarchical Structures

    Authors: Nhat Khang Ngo, Truong Son Hy, Risi Kondor

    Abstract: Contemporary graph learning algorithms are not well-defined for large molecules since they do not consider the hierarchical interactions among the atoms, which are essential to determine the molecular properties of macromolecules. In this work, we propose Multiresolution Graph Transformers (MGT), the first graph transformer architecture that can learn to represent large molecules at multiple scale… ▽ More

    Submitted 21 July, 2023; v1 submitted 16 February, 2023; originally announced February 2023.

  21. arXiv:2302.08643  [pdf, other

    cs.LG

    Fast Temporal Wavelet Graph Neural Networks

    Authors: Duc Thien Nguyen, Manh Duc Tuan Nguyen, Truong Son Hy, Risi Kondor

    Abstract: Spatio-temporal signals forecasting plays an important role in numerous domains, especially in neuroscience and transportation. The task is challenging due to the highly intricate spatial structure, as well as the non-linear temporal dynamics of the network. To facilitate reliable and timely forecast for the human brain and traffic networks, we propose the Fast Temporal Wavelet Graph Neural Networ… ▽ More

    Submitted 28 October, 2023; v1 submitted 16 February, 2023; originally announced February 2023.

    Comments: arXiv admin note: text overlap with arXiv:2111.01940

  22. arXiv:2301.11956  [pdf, other

    cs.LG

    On the Connection Between MPNN and Graph Transformer

    Authors: Chen Cai, Truong Son Hy, Rose Yu, Yusu Wang

    Abstract: Graph Transformer (GT) recently has emerged as a new paradigm of graph learning algorithms, outperforming the previously popular Message Passing Neural Network (MPNN) on multiple benchmarks. Previous work (Kim et al., 2022) shows that with proper position embedding, GT can approximate MPNN arbitrarily well, implying that GT is at least as powerful as MPNN. In this paper, we study the inverse conne… ▽ More

    Submitted 20 June, 2023; v1 submitted 27 January, 2023; originally announced January 2023.

    Comments: ICML 2023

  23. arXiv:2301.10439  [pdf, other

    cs.CL cs.LG

    ViDeBERTa: A powerful pre-trained language model for Vietnamese

    Authors: Cong Dao Tran, Nhut Huy Pham, Anh Nguyen, Truong Son Hy, Tu Vu

    Abstract: This paper presents ViDeBERTa, a new pre-trained monolingual language model for Vietnamese, with three versions - ViDeBERTa_xsmall, ViDeBERTa_base, and ViDeBERTa_large, which are pre-trained on a large-scale corpus of high-quality and diverse Vietnamese texts using DeBERTa architecture. Although many successful pre-trained language models based on Transformer have been widely proposed for the Engl… ▽ More

    Submitted 10 February, 2023; v1 submitted 25 January, 2023; originally announced January 2023.

  24. arXiv:2209.09941  [pdf, other

    q-bio.BM cs.LG

    Predicting Drug-Drug Interactions using Deep Generative Models on Graphs

    Authors: Nhat Khang Ngo, Truong Son Hy, Risi Kondor

    Abstract: Latent representations of drugs and their targets produced by contemporary graph autoencoder-based models have proved useful in predicting many types of node-pair interactions on large networks, including drug-drug, drug-target, and target-target interactions. However, most existing approaches model the node's latent spaces in which node distributions are rigid and disjoint; these limitations hind… ▽ More

    Submitted 30 October, 2022; v1 submitted 14 September, 2022; originally announced September 2022.

  25. arXiv:2205.15479  [pdf, other

    cs.SE cs.AI cs.PL

    HierarchyNet: Learning to Summarize Source Code with Heterogeneous Representations

    Authors: Minh Huynh Nguyen, Nghi D. Q. Bui, Truong Son Hy, Long Tran-Thanh, Tien N. Nguyen

    Abstract: We propose a novel method for code summarization utilizing Heterogeneous Code Representations (HCRs) and our specially designed HierarchyNet. HCRs effectively capture essential code features at lexical, syntactic, and semantic levels by abstracting coarse-grained code elements and incorporating fine-grained program elements in a hierarchical structure. Our HierarchyNet method processes each layer… ▽ More

    Submitted 9 May, 2023; v1 submitted 30 May, 2022; originally announced May 2022.

  26. arXiv:2205.14831  [pdf, other

    cs.LG cs.AI cs.SI physics.soc-ph

    Temporal Multiresolution Graph Neural Networks For Epidemic Prediction

    Authors: Truong Son Hy, Viet Bach Nguyen, Long Tran-Thanh, Risi Kondor

    Abstract: In this paper, we introduce Temporal Multiresolution Graph Neural Networks (TMGNN), the first architecture that both learns to construct the multiscale and multiresolution graph structures and incorporates the time-series signals to capture the temporal changes of the dynamic graphs. We have applied our proposed model to the task of predicting future spreading of epidemic and pandemic based on the… ▽ More

    Submitted 28 June, 2022; v1 submitted 29 May, 2022; originally announced May 2022.

  27. arXiv:2111.01940  [pdf, other

    cs.LG

    Learning Multiresolution Matrix Factorization and its Wavelet Networks on Graphs

    Authors: Truong Son Hy, Risi Kondor

    Abstract: Multiresolution Matrix Factorization (MMF) is unusual amongst fast matrix factorization algorithms in that it does not make a low rank assumption. This makes MMF especially well suited to modeling certain types of graphs with complex multiscale or hierarchical strucutre. While MMF promises to yields a useful wavelet basis, finding the factorization itself is hard, and existing greedy methods tend… ▽ More

    Submitted 2 November, 2021; originally announced November 2021.

  28. arXiv:2106.00967  [pdf, other

    cs.LG cs.SI physics.chem-ph

    Multiresolution Equivariant Graph Variational Autoencoder

    Authors: Truong Son Hy, Risi Kondor

    Abstract: In this paper, we propose Multiresolution Equivariant Graph Variational Autoencoders (MGVAE), the first hierarchical generative model to learn and generate graphs in a multiresolution and equivariant manner. At each resolution level, MGVAE employs higher order message passing to encode the graph while learning to partition it into mutually exclusive clusters and coarsening into a lower resolution… ▽ More

    Submitted 28 June, 2022; v1 submitted 2 June, 2021; originally announced June 2021.

  29. arXiv:2004.03990  [pdf, other

    cs.LG stat.ML

    The general theory of permutation equivarant neural networks and higher order graph variational encoders

    Authors: Erik Henning Thiede, Truong Son Hy, Risi Kondor

    Abstract: Previous work on symmetric group equivariant neural networks generally only considered the case where the group acts by permuting the elements of a single vector. In this paper we derive formulae for general permutation equivariant layers, including the case where the layer acts on matrices by permuting their rows and columns simultaneously. This case arises naturally in graph learning and relatio… ▽ More

    Submitted 8 April, 2020; originally announced April 2020.

  30. arXiv:1906.04015  [pdf, ps, other

    physics.comp-ph cs.LG stat.ML

    Cormorant: Covariant Molecular Neural Networks

    Authors: Brandon Anderson, Truong-Son Hy, Risi Kondor

    Abstract: We propose Cormorant, a rotationally covariant neural network architecture for learning the behavior and properties of complex many-body physical systems. We apply these networks to molecular systems with two goals: learning atomic potential energy surfaces for use in Molecular Dynamics simulations, and learning ground state properties of molecules calculated by Density Functional Theory. Some of… ▽ More

    Submitted 25 November, 2019; v1 submitted 6 June, 2019; originally announced June 2019.