Showing 1–19 of 19 results for author: Duong, B

Searching in archive cs.
  1. arXiv:2408.13448  [pdf, other]

    cs.LG stat.ME stat.ML

    ALIAS: DAG Learning with Efficient Unconstrained Policies

    Authors: Bao Duong, Hung Le, Thin Nguyen

    Abstract: Recently, reinforcement learning (RL) has proved a promising alternative for conventional local heuristics in score-based approaches to learning directed acyclic causal graphs (DAGs) from observational data. However, the intricate acyclicity constraint still challenges the efficient exploration of the vast space of DAGs in existing methods. In this study, we introduce ALIAS (reinforced dAg Learnin…

    Submitted 26 August, 2024; v1 submitted 23 August, 2024; originally announced August 2024.

  2. arXiv:2407.04992  [pdf, other]

    cs.LG cs.AI stat.ME

    Scalable Variational Causal Discovery Unconstrained by Acyclicity

    Authors: Nu Hoang, Bao Duong, Thin Nguyen

    Abstract: Bayesian causal discovery offers the power to quantify epistemic uncertainties among a broad range of structurally diverse causal theories potentially explaining the data, represented in forms of directed acyclic graphs (DAGs). However, existing methods struggle with efficient DAG sampling due to the complex acyclicity constraint. In this study, we propose a scalable Bayesian approach to effective…

    Submitted 28 August, 2024; v1 submitted 6 July, 2024; originally announced July 2024.

    Comments: Accepted at ECAI 2024

  3. arXiv:2407.04980  [pdf, other]

    cs.LG cs.AI stat.ME

    Enabling Causal Discovery in Post-Nonlinear Models with Normalizing Flows

    Authors: Nu Hoang, Bao Duong, Thin Nguyen

    Abstract: Post-nonlinear (PNL) causal models stand out as a versatile and adaptable framework for modeling intricate causal relationships. However, accurately capturing the invertibility constraint required in PNL models remains challenging in existing studies. To address this problem, we introduce CAF-PoNo (Causal discovery via Normalizing Flows for Post-Nonlinear models), harnessing the power of the norma…

    Submitted 28 August, 2024; v1 submitted 6 July, 2024; originally announced July 2024.

    Comments: Accepted at ECAI 2024

  4. arXiv:2404.06824  [pdf, other]

    cs.LG

    Error Mitigation for TDoA UWB Indoor Localization using Unsupervised Machine Learning

    Authors: Phuong Bich Duong, Ben Van Herbruggen, Arne Broering, Adnan Shahid, Eli De Poorter

    Abstract: Indoor positioning systems based on Ultra-wideband (UWB) technology are gaining recognition for their ability to provide cm-level localization accuracy. However, these systems often encounter challenges caused by dense multi-path fading, leading to positioning errors. To address this issue, in this letter, we propose a novel methodology for unsupervised anchor node selection using deep embedded cl…

    Submitted 10 April, 2024; originally announced April 2024.

    Comments: 5 pages, 3 figures, 3 tables, 14 references

    ACM Class: I.2.1

  5. arXiv:2312.10102  [pdf, other]

    stat.ML cs.LG

    Robust Estimation of Causal Heteroscedastic Noise Models

    Authors: Quang-Duy Tran, Bao Duong, Phuoc Nguyen, Thin Nguyen

    Abstract: Distinguishing the cause and effect from bivariate observational data is the foundational problem that finds applications in many scientific disciplines. One solution to this problem is assuming that cause and effect are generated from a structural causal model, enabling identification of the causal direction after estimating the model in each direction. The heteroscedastic noise model is a type o…

    Submitted 14 December, 2023; originally announced December 2023.

    Comments: Accepted at the 2024 SIAM International Conference on Data Mining (SDM24)

  6. arXiv:2310.18598  [pdf, other]

    cs.LG cs.CV

    Domain Generalisation via Risk Distribution Matching

    Authors: Toan Nguyen, Kien Do, Bao Duong, Thin Nguyen

    Abstract: We propose a novel approach for domain generalisation (DG) leveraging risk distributions to characterise domains, thereby achieving domain invariance. In our findings, risk distributions effectively highlight differences between training domains and reveal their inherent complexities. In testing, we may observe similar, or potentially intensifying in magnitude, divergences between risk distributio…

    Submitted 28 October, 2023; originally announced October 2023.

    Comments: Accepted at 2024 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV 2024)

  7. arXiv:2309.01392  [pdf, ps, other]

    cs.LG stat.ML

    Differentiable Bayesian Structure Learning with Acyclicity Assurance

    Authors: Quang-Duy Tran, Phuoc Nguyen, Bao Duong, Thin Nguyen

    Abstract: Score-based approaches in the structure learning task are thriving because of their scalability. Continuous relaxation has been the key reason for this advancement. Despite achieving promising outcomes, most of these methods are still struggling to ensure that the graphs generated from the latent space are acyclic by minimizing a defined score. There has also been another trend of permutation-base…

    Submitted 6 September, 2023; v1 submitted 4 September, 2023; originally announced September 2023.

    Comments: Accepted as a regular paper (9.37%) at the 23rd IEEE International Conference on Data Mining (ICDM 2023)

  8. arXiv:2307.07973  [pdf, other]

    cs.LG stat.ME

    Heteroscedastic Causal Structure Learning

    Authors: Bao Duong, Thin Nguyen

    Abstract: Heretofore, learning the directed acyclic graphs (DAGs) that encode the cause-effect relationships embedded in observational data is a computationally challenging problem. A recent trend of studies has shown that it is possible to recover the DAGs with polynomial time complexity under the equal variances assumption. However, this prohibits the heteroscedasticity of the noise, which allows for more…

    Submitted 16 July, 2023; originally announced July 2023.

    Comments: Accepted at the 26th European Conference on Artificial Intelligence (ECAI 2023)

  9. Causal Inference via Style Transfer for Out-of-distribution Generalisation

    Authors: Toan Nguyen, Kien Do, Duc Thanh Nguyen, Bao Duong, Thin Nguyen

    Abstract: Out-of-distribution (OOD) generalisation aims to build a model that can generalise well on an unseen target domain using knowledge from multiple source domains. To this end, the model should seek the causal dependence between inputs and labels, which may be determined by the semantics of inputs and remain invariant across domains. However, statistical or non-causal methods often cannot capture thi…

    Submitted 10 June, 2023; v1 submitted 6 December, 2022; originally announced December 2022.

    Comments: In Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD 23), August 6-10, 2023, Long Beach, CA, USA. ACM, New York, NY, USA, 19 pages

    Journal ref: In Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD 23), August 6-10, 2023, Long Beach, CA, USA. ACM, New York, NY, USA, 19 pages

  10. arXiv:2211.10856  [pdf, other]

    cs.LG cs.IT

    Diffeomorphic Information Neural Estimation

    Authors: Bao Duong, Thin Nguyen

    Abstract: Mutual Information (MI) and Conditional Mutual Information (CMI) are multi-purpose tools from information theory that are able to naturally measure the statistical dependencies between random variables, thus they are usually of central interest in several statistical and machine learning tasks, such as conditional independence testing and representation learning. However, estimating CMI, or even M…

    Submitted 19 November, 2022; originally announced November 2022.

    Comments: Accepted at the Thirty-Seventh AAAI Conference on Artificial Intelligence (AAAI 2023)

  11. arXiv:2210.15247  [pdf, other]

    cs.LG cs.MM

    A few-shot learning approach with domain adaptation for personalized real-life stress detection in close relationships

    Authors: Kexin Feng, Jacqueline B. Duong, Kayla E. Carta, Sierra Walters, Gayla Margolin, Adela C. Timmons, Theodora Chaspari

    Abstract: We design a metric learning approach that aims to address computational challenges that yield from modeling human outcomes from ambulatory real-life data. The proposed metric learning is based on a Siamese neural network (SNN) that learns the relative difference between pairs of samples from a target user and non-target users, thus being able to address the scarcity of labelled data from the targe…

    Submitted 27 October, 2022; originally announced October 2022.

  12. arXiv:2209.01547  [pdf, other]

    cs.LG stat.ML

    Conditional Independence Testing via Latent Representation Learning

    Authors: Bao Duong, Thin Nguyen

    Abstract: Detecting conditional independencies plays a key role in several statistical and machine learning tasks, especially in causal discovery algorithms. In this study, we introduce LCIT (Latent representation based Conditional Independence Test), a novel non-parametric method for conditional independence testing based on representation learning. Our main contribution involves proposing a generative fram…

    Submitted 4 September, 2022; originally announced September 2022.

    Comments: Accepted as a regular paper at the 22nd IEEE International Conference on Data Mining (ICDM 2022)

  13. arXiv:2207.12086  [pdf, other]

    cs.LG cs.AI

    Efficient Classification with Counterfactual Reasoning and Active Learning

    Authors: Azhar Mohammed, Dang Nguyen, Bao Duong, Thin Nguyen

    Abstract: Data augmentation is one of the most successful techniques to improve the classification accuracy of machine learning models in computer vision. However, applying data augmentation to tabular data is a challenging problem since it is hard to generate synthetic samples with labels. In this paper, we propose an efficient classifier with a novel data augmentation technique for tabular data. Our metho…

    Submitted 25 July, 2022; originally announced July 2022.

  14. arXiv:1810.04334  [pdf, other]

    cs.DC

    GraphMP: I/O-Efficient Big Graph Analytics on a Single Commodity Machine

    Authors: Peng Sun, Yonggang Wen, Ta Nguyen Binh Duong, Xiaokui Xiao

    Abstract: Recent studies showed that single-machine graph processing systems can be as highly competitive as cluster-based approaches on large-scale problems. While several out-of-core graph processing systems and computation models have been proposed, the high disk I/O overhead could significantly reduce performance in many practical cases. In this paper, we propose GraphMP to tackle big graph analytics on…

    Submitted 18 February, 2019; v1 submitted 9 October, 2018; originally announced October 2018.

    Comments: arXiv admin note: substantial text overlap with arXiv:1707.02557

  15. arXiv:1707.02557  [pdf, other]

    cs.DC

    GraphMP: An Efficient Semi-External-Memory Big Graph Processing System on a Single Machine

    Authors: Peng Sun, Yonggang Wen, Ta Nguyen Binh Duong, Xiaokui Xiao

    Abstract: Recent studies showed that single-machine graph processing systems can be as highly competitive as cluster-based approaches on large-scale problems. While several out-of-core graph processing systems and computation models have been proposed, the high disk I/O overhead could significantly reduce performance in many practical cases. In this paper, we propose GraphMP to tackle big graph analytics on…

    Submitted 9 July, 2017; originally announced July 2017.

  16. arXiv:1705.05595  [pdf, other]

    cs.DC

    GraphH: High Performance Big Graph Analytics in Small Clusters

    Authors: Peng Sun, Yonggang Wen, Ta Nguyen Binh Duong, Xiaokui Xiao

    Abstract: It is common for real-world applications to analyze big graphs using distributed graph processing systems. Popular in-memory systems require an enormous amount of resources to handle big graphs. While several out-of-core approaches have been proposed for processing big graphs on disk, the high disk I/O overhead could significantly reduce performance. In this paper, we propose GraphH to enable high…

    Submitted 7 August, 2017; v1 submitted 16 May, 2017; originally announced May 2017.

  17. Towards Distributed Machine Learning in Shared Clusters: A Dynamically-Partitioned Approach

    Authors: Peng Sun, Yonggang Wen, Ta Nguyen Binh Duong, Shengen Yan

    Abstract: Many cluster management systems (CMSs) have been proposed to share a single cluster with multiple distributed computing systems. However, none of the existing approaches can handle distributed machine learning (ML) workloads given the following criteria: high resource utilization, fair resource allocation and low sharing overhead. To solve this problem, we propose a new CMS named Dorm, incorporati…

    Submitted 21 April, 2017; originally announced April 2017.

  18. Proposal of algorithms for navigation and obstacles avoidance of autonomous mobile robot

    Authors: T. T. Hoang, D. T. Hiep, P. M. Duong, N. T. T. Van, B. G. Duong, T. Q. Vinh

    Abstract: This paper presents algorithms to navigate and avoid obstacles for an in-door autonomous mobile robot. A laser range finder is used to obtain 3D images of the environment. A new algorithm, namely 3D-to-2D image pressure and barriers detection (IPaBD), is proposed to create a 2D global map from the 3D images. This map is basic to design the trajectory. A tracking controller is developed to control…

    Submitted 28 November, 2016; originally announced November 2016.

    Comments: In 2013 8th IEEE Conference on Industrial Electronics and Applications (ICIEA)

  19. MetaFlow: a Scalable Metadata Lookup Service for Distributed File Systems in Data Centers

    Authors: Peng Sun, Yonggang Wen, Ta Nguyen Binh Duong, Haiyong Xie

    Abstract: In large-scale distributed file systems, efficient metadata operations are critical since most file operations have to interact with metadata servers first. In existing distributed hash table (DHT) based metadata management systems, the lookup service could be a performance bottleneck due to its significant CPU overhead. Our investigations showed that the lookup service could reduce system throu…

    Submitted 10 November, 2016; v1 submitted 4 November, 2016; originally announced November 2016.

    Comments: in IEEE Transactions on Big Data 2016