Zum Hauptinhalt springen

Showing 1–14 of 14 results for author: Dang, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.02623  [pdf, other

    cs.CV

    YOWOv3: An Efficient and Generalized Framework for Human Action Detection and Recognition

    Authors: Duc Manh Nguyen Dang, Viet Hang Duong, Jia Ching Wang, Nhan Bui Duc

    Abstract: In this paper, we propose a new framework called YOWOv3, which is an improved version of YOWOv2, designed specifically for the task of Human Action Detection and Recognition. This framework is designed to facilitate extensive experimentation with different configurations and supports easy customization of various components within the model, reducing efforts required for understanding and modifyin… ▽ More

    Submitted 8 August, 2024; v1 submitted 5 August, 2024; originally announced August 2024.

  2. arXiv:2401.07326  [pdf, other

    eess.IV cs.CV

    Beyond Traditional Approaches: Multi-Task Network for Breast Ultrasound Diagnosis

    Authors: Dat T. Chung, Minh-Anh Dang, Mai-Anh Vu, Minh T. Nguyen, Thanh-Huy Nguyen, Vinh Q. Dinh

    Abstract: Breast Ultrasound plays a vital role in cancer diagnosis as a non-invasive approach with cost-effective. In recent years, with the development of deep learning, many CNN-based approaches have been widely researched in both tumor localization and cancer classification tasks. Even though previous single models achieved great performance in both tasks, these methods have some limitations in inference… ▽ More

    Submitted 14 January, 2024; originally announced January 2024.

    Comments: 7 pages, 3 figures

  3. arXiv:2311.12908  [pdf, other

    cs.CV cs.AI cs.GR cs.LG

    Diffusion Model Alignment Using Direct Preference Optimization

    Authors: Bram Wallace, Meihua Dang, Rafael Rafailov, Linqi Zhou, Aaron Lou, Senthil Purushwalkam, Stefano Ermon, Caiming Xiong, Shafiq Joty, Nikhil Naik

    Abstract: Large language models (LLMs) are fine-tuned using human comparison data with Reinforcement Learning from Human Feedback (RLHF) methods to make them better aligned with users' preferences. In contrast to LLMs, human preference learning has not been widely explored in text-to-image diffusion models; the best existing approach is to fine-tune a pretrained model using carefully curated high quality im… ▽ More

    Submitted 21 November, 2023; originally announced November 2023.

  4. arXiv:2305.00567  [pdf, other

    cs.LG cs.AI

    Scaling Pareto-Efficient Decision Making Via Offline Multi-Objective RL

    Authors: Baiting Zhu, Meihua Dang, Aditya Grover

    Abstract: The goal of multi-objective reinforcement learning (MORL) is to learn policies that simultaneously optimize multiple competing objectives. In practice, an agent's preferences over the objectives may not be known apriori, and hence, we require policies that can generalize to arbitrary preferences at test time. In this work, we propose a new data-driven setup for offline MORL, where we wish to learn… ▽ More

    Submitted 30 April, 2023; originally announced May 2023.

    Comments: Published in ICLR 2023

  5. arXiv:2304.07438  [pdf, other

    cs.CL cs.AI

    Tractable Control for Autoregressive Language Generation

    Authors: Honghua Zhang, Meihua Dang, Nanyun Peng, Guy Van den Broeck

    Abstract: Despite the success of autoregressive large language models in text generation, it remains a major challenge to generate text that satisfies complex constraints: sampling from the conditional distribution ${\Pr}(\text{text} | α)$ is intractable for even the simplest lexical constraints $α$. To overcome this challenge, we propose to use tractable probabilistic models (TPMs) to impose lexical constr… ▽ More

    Submitted 15 November, 2023; v1 submitted 14 April, 2023; originally announced April 2023.

  6. arXiv:2211.12551  [pdf, other

    cs.LG cs.AI

    Sparse Probabilistic Circuits via Pruning and Growing

    Authors: Meihua Dang, Anji Liu, Guy Van den Broeck

    Abstract: Probabilistic circuits (PCs) are a tractable representation of probability distributions allowing for exact and efficient computation of likelihoods and marginals. There has been significant recent progress on improving the scale and expressiveness of PCs. However, PC training performance plateaus as model size increases. We discover that most capacity in existing large PC structures is wasted: fu… ▽ More

    Submitted 22 November, 2022; originally announced November 2022.

    Comments: 36th Conference on Neural Information Processing Systems (NeurIPS 2022)

  7. arXiv:2109.09026  [pdf, other

    cs.SD cs.HC cs.LG eess.AS

    Hybrid Data Augmentation and Deep Attention-based Dilated Convolutional-Recurrent Neural Networks for Speech Emotion Recognition

    Authors: Nhat Truong Pham, Duc Ngoc Minh Dang, Sy Dzung Nguyen

    Abstract: Speech emotion recognition (SER) has been one of the significant tasks in Human-Computer Interaction (HCI) applications. However, it is hard to choose the optimal features and deal with imbalance labeled data. In this article, we investigate hybrid data augmentation (HDA) methods to generate and balance data based on traditional and generative adversarial networks (GAN) methods. To evaluate the ef… ▽ More

    Submitted 18 September, 2021; originally announced September 2021.

    Comments: 12 pages, 16 figures, 6 tables

  8. arXiv:2109.01999  [pdf, other

    eess.IV cs.CV cs.MM

    Image Compression with Recurrent Neural Network and Generalized Divisive Normalization

    Authors: Khawar Islam, L. Minh Dang, Sujin Lee, Hyeonjoon Moon

    Abstract: Image compression is a method to remove spatial redundancy between adjacent pixels and reconstruct a high-quality image. In the past few years, deep learning has gained huge attention from the research community and produced promising image reconstruction results. Therefore, recent methods focused on developing deeper and more complex networks, which significantly increased network complexity. In… ▽ More

    Submitted 5 September, 2021; originally announced September 2021.

    Comments: Accpeted at IEEE CVPR Workshop

    Report number: 10.1109/CVPRW53098.2021.00209

  9. arXiv:2107.12920  [pdf, other

    cs.CL

    Emotion Stimulus Detection in German News Headlines

    Authors: Bao Minh Doan Dang, Laura Oberländer, Roman Klinger

    Abstract: Emotion stimulus extraction is a fine-grained subtask of emotion analysis that focuses on identifying the description of the cause behind an emotion expression from a text passage (e.g., in the sentence "I am happy that I passed my exam" the phrase "passed my exam" corresponds to the stimulus.). Previous work mainly focused on Mandarin and English, with no resources or models for German. We fill t… ▽ More

    Submitted 16 May, 2022; v1 submitted 27 July, 2021; originally announced July 2021.

    Comments: KONVENS 2021, published at https://aclanthology.org/2021.konvens-1.7/ Please cite by using https://aclanthology.org/2021.konvens-1.7.bib

  10. arXiv:2009.09031  [pdf, other

    cs.LG cs.AI stat.ML

    Group Fairness by Probabilistic Modeling with Latent Fair Decisions

    Authors: YooJung Choi, Meihua Dang, Guy Van den Broeck

    Abstract: Machine learning systems are increasingly being used to make impactful decisions such as loan applications and criminal justice risk assessments, and as such, ensuring fairness of these systems is critical. This is often challenging as the labels in the data are biased. This paper studies learning fair probability distributions from biased data by explicitly modeling a latent variable that represe… ▽ More

    Submitted 16 December, 2020; v1 submitted 18 September, 2020; originally announced September 2020.

  11. arXiv:2007.10867  [pdf, other

    cs.CV

    GarNet++: Improving Fast and Accurate Static3D Cloth Draping by Curvature Loss

    Authors: Erhan Gundogdu, Victor Constantin, Shaifali Parashar, Amrollah Seifoddini, Minh Dang, Mathieu Salzmann, Pascal Fua

    Abstract: In this paper, we tackle the problem of static 3D cloth draping on virtual human bodies. We introduce a two-stream deep network model that produces a visually plausible draping of a template cloth on virtual 3D bodies by extracting features from both the body and garment shapes. Our network learns to mimic a Physics-Based Simulation (PBS) method while requiring two orders of magnitude less computa… ▽ More

    Submitted 20 July, 2020; originally announced July 2020.

    Comments: Accepted to be published in IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), July 2020. arXiv admin note: text overlap with arXiv:1811.10983

  12. arXiv:2007.09331  [pdf, other

    cs.LG cs.AI

    Strudel: Learning Structured-Decomposable Probabilistic Circuits

    Authors: Meihua Dang, Antonio Vergari, Guy Van den Broeck

    Abstract: Probabilistic circuits (PCs) represent a probability distribution as a computational graph. Enforcing structural properties on these graphs guarantees that several inference scenarios become tractable. Among these properties, structured decomposability is a particularly appealing one: it enables the efficient and exact computations of the probability of complex logical formulas, and can be used to… ▽ More

    Submitted 2 September, 2020; v1 submitted 18 July, 2020; originally announced July 2020.

    Comments: 12 pages, 3 figures, to be published on PGM2020 (The 10th International Conference on Probabilistic Graphical Models)

    ACM Class: I.2.6

  13. arXiv:1811.10983  [pdf, other

    cs.CV

    GarNet: A Two-Stream Network for Fast and Accurate 3D Cloth Draping

    Authors: Erhan Gundogdu, Victor Constantin, Amrollah Seifoddini, Minh Dang, Mathieu Salzmann, Pascal Fua

    Abstract: While Physics-Based Simulation (PBS) can accurately drape a 3D garment on a 3D body, it remains too costly for real-time applications, such as virtual try-on. By contrast, inference in a deep network, requiring a single forward pass, is much faster. Taking advantage of this, we propose a novel architecture to fit a 3D garment template to a 3D body. Specifically, we build upon the recent progress i… ▽ More

    Submitted 21 August, 2019; v1 submitted 27 November, 2018; originally announced November 2018.

    Comments: Accepted to ICCV 2019

  14. arXiv:1808.01166  [pdf, ps, other

    cs.DC

    ViPIOS - VIenna Parallel Input Output System: Language, Compiler and Advanced Data Structure Support for Parallel I/O Operations

    Authors: Erich Schikuta, Helmut Wanek, Heinz Stockinger, Kurt Stockinger, Thomas Fürle, Oliver Jorns, Christoph Löffelhardt, Peter Brezany, Minh Dang, Thomas Mück

    Abstract: For an increasing number of data intensive scientific applications, parallel I/O concepts are a major performance issue. Tackling this issue, we develop an input/output system designed for highly efficient, scalable and conveniently usable parallel I/O on distributed memory systems. The main focus of this research is the parallel I/O runtime system support provided for software-generated programs… ▽ More

    Submitted 3 August, 2018; originally announced August 2018.

    Comments: 210 pages