Showing 1–4 of 4 results for author: Dang, L H

  1. arXiv:2407.01987

    cs.CV

    AHMsys: An Automated HVAC Modeling System for BIM Project

    Authors: Long Hoang Dang, Duy-Hung Nguyen, Thai Quang Le, Thinh Truong Nguyen, Clark Mei, Vu Hoang

    Abstract: This paper presents a novel system, named AHMsys, designed to automate the process of generating 3D Heating, Ventilation, and Air Conditioning (HVAC) models from 2D Computer-Aided Design (CAD) drawings, a key component of Building Information Modeling (BIM). By automatically preprocessing and extracting essential HVAC object information then creating detailed 3D models, our proposed AHMsys signifi…

    Submitted 2 July, 2024; originally announced July 2024.

  2. arXiv:2407.01983

    cs.CV

    SADL: An Effective In-Context Learning Method for Compositional Visual QA

    Authors: Long Hoang Dang, Thao Minh Le, Vuong Le, Tu Minh Phuong, Truyen Tran

    Abstract: Large vision-language models (LVLMs) offer a novel capability for performing in-context learning (ICL) in Visual QA. When prompted with a few demonstrations of image-question-answer triplets, LVLMs have demonstrated the ability to discern underlying patterns and transfer this latent knowledge to answer new questions about unseen images without the need for expensive supervised fine-tuning. However…

    Submitted 2 July, 2024; originally announced July 2024.

  3. arXiv:2106.13432

    cs.CV

    Hierarchical Object-oriented Spatio-Temporal Reasoning for Video Question Answering

    Authors: Long Hoang Dang, Thao Minh Le, Vuong Le, Truyen Tran

    Abstract: Video Question Answering (Video QA) is a powerful testbed to develop new AI capabilities. This task necessitates learning to reason about objects, relations, and events across visual and linguistic domains in space-time. High-level reasoning demands lifting from associative visual pattern recognition to symbol-like manipulation over objects, their behavior and interactions. Toward reaching this go…

    Submitted 25 August, 2021; v1 submitted 25 June, 2021; originally announced June 2021.

    Comments: Accepted by IJCAI 2021. Please cite the conference version.

  4. arXiv:2104.05166

    cs.CV

    Object-Centric Representation Learning for Video Question Answering

    Authors: Long Hoang Dang, Thao Minh Le, Vuong Le, Truyen Tran

    Abstract: Video question answering (Video QA) presents a powerful testbed for human-like intelligent behaviors. The task demands new capabilities to integrate video processing, language understanding, binding abstract linguistic concepts to concrete visual artifacts, and deliberative reasoning over spacetime. Neural networks offer a promising approach to reach this potential through learning from examples r…

    Submitted 8 July, 2021; v1 submitted 11 April, 2021; originally announced April 2021.

    Comments: Accepted by IJCNN 2021