Skip to main content

Showing 1–19 of 19 results for author: Zuo, H

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.02751  [pdf, other

    cs.CL cs.AI

    Emotion and Intent Joint Understanding in Multimodal Conversation: A Benchmarking Dataset

    Authors: Rui Liu, Haolin Zuo, Zheng Lian, Xiaofen Xing, Björn W. Schuller, Haizhou Li

    Abstract: Emotion and Intent Joint Understanding in Multimodal Conversation (MC-EIU) aims to decode the semantic information manifested in a multimodal conversational history, while inferring the emotions and intents simultaneously for the current utterance. MC-EIU is enabling technology for many human-computer interfaces. However, there is a lack of available datasets in terms of annotation, modality, lang… ▽ More

    Submitted 4 July, 2024; v1 submitted 2 July, 2024; originally announced July 2024.

    Comments: 26 pages, 8 figures, 12 tables, NeurIPS 2024 Dataset and Benchmark Track

  2. arXiv:2405.16464  [pdf, other

    cs.RO cs.CV

    Multi-Modal UAV Detection, Classification and Tracking Algorithm -- Technical Report for CVPR 2024 UG2 Challenge

    Authors: Tianchen Deng, Yi Zhou, Wenhua Wu, Mingrui Li, Jingwei Huang, Shuhong Liu, Yanzeng Song, Hao Zuo, Yanbo Wang, Yutao Yue, Hesheng Wang, Weidong Chen

    Abstract: This technical report presents the 1st winning model for UG2+, a task in CVPR 2024 UAV Tracking and Pose-Estimation Challenge. This challenge faces difficulties in drone detection, UAV-type classification and 2D/3D trajectory estimation in extreme weather conditions with multi-modal sensor information, including stereo vision, various Lidars, Radars, and audio arrays. Leveraging this information… ▽ More

    Submitted 26 May, 2024; originally announced May 2024.

    Comments: Accepted by CVPR 2024 workshop. The 1st winning model in CVPR 2024 UG2+ challenge. The code and configuration of our method are available at https://github.com/dtc111111/Multi-Modal-UAV

  3. arXiv:2405.12684  [pdf, other

    stat.ML cs.LG

    Model Free Prediction with Uncertainty Assessment

    Authors: Yuling Jiao, Lican Kang, Jin Liu, Heng Peng, Heng Zuo

    Abstract: Deep nonparametric regression, characterized by the utilization of deep neural networks to learn target functions, has emerged as a focus of research attention in recent years. Despite considerable progress in understanding convergence rates, the absence of asymptotic properties hinders rigorous statistical inference. To address this gap, we propose a novel framework that transforms the deep estim… ▽ More

    Submitted 16 June, 2024; v1 submitted 21 May, 2024; originally announced May 2024.

  4. arXiv:2404.13309  [pdf, ps, other

    stat.ML cs.LG

    Latent Schr{ö}dinger Bridge Diffusion Model for Generative Learning

    Authors: Yuling Jiao, Lican Kang, Huazhen Lin, Jin Liu, Heng Zuo

    Abstract: This paper aims to conduct a comprehensive theoretical analysis of current diffusion models. We introduce a novel generative learning methodology utilizing the Schr{ö}dinger bridge diffusion model in latent space as the framework for theoretical exploration in this domain. Our approach commences with the pre-training of an encoder-decoder architecture using data originating from a distribution tha… ▽ More

    Submitted 20 April, 2024; originally announced April 2024.

  5. arXiv:2403.11186  [pdf, other

    cs.CV

    NetTrack: Tracking Highly Dynamic Objects with a Net

    Authors: Guangze Zheng, Shijie Lin, Haobo Zuo, Changhong Fu, Jia Pan

    Abstract: The complex dynamicity of open-world objects presents non-negligible challenges for multi-object tracking (MOT), often manifested as severe deformations, fast motion, and occlusions. Most methods that solely depend on coarse-grained object cues, such as boxes and the overall appearance of the object, are susceptible to degradation due to distorted internal relationships of dynamic objects. To addr… ▽ More

    Submitted 17 March, 2024; originally announced March 2024.

    Comments: Accepted by CVPR 2024

  6. arXiv:2403.05090  [pdf, other

    cs.RO

    OCEAN: An Openspace Collision-free Trajectory Planner for Autonomous Parking Based on ADMM

    Authors: Dongxu Wang, Yanbin Lu, Weilong Liu, Hao Zuo, Jiade Xin, Xiang Long, Yuncheng Jiang

    Abstract: In this paper, we propose an Openspace Collision-freE trAjectory plaNner (OCEAN) for autonomous parking. OCEAN is an optimization-based trajectory planner accelerated by Alternating Direction Method of Multiplier (ADMM) with enhanced computational efficiency and robustness, and is suitable for all scenes with few dynamic obstacles. Starting from a hierarchical optimization-based collision avoidanc… ▽ More

    Submitted 8 March, 2024; originally announced March 2024.

    Comments: 8 pages,5 figures

  7. arXiv:2401.08099  [pdf, other

    cs.CV cs.AI cs.GR

    Inpainting Normal Maps for Lightstage data

    Authors: Hancheng Zuo, Bernard Tiddeman

    Abstract: This study introduces a novel method for inpainting normal maps using a generative adversarial network (GAN). Normal maps, often derived from a lightstage, are crucial in performance capture but can have obscured areas due to movement (e.g., by arms, hair, or props). Inpainting fills these missing areas with plausible data. Our approach extends previous general image inpainting techniques, employi… ▽ More

    Submitted 15 January, 2024; originally announced January 2024.

    Comments: 8 pages, 4 figures, CGVC Conference, The Eurographics Association

    ACM Class: I.2.6; I.4.5

    Journal ref: Computer Graphics and Visual Computing (CGVC), 2023, pp. 45-52

  8. arXiv:2312.02203  [pdf, other

    q-bio.NC cs.LG

    Learning High-Order Relationships of Brain Regions

    Authors: Weikang Qiu, Huangrui Chu, Selena Wang, Haolan Zuo, Xiaoxiao Li, Yize Zhao, Rex Ying

    Abstract: Discovering reliable and informative relationships among brain regions from functional magnetic resonance imaging (fMRI) signals is essential in phenotypic predictions. Most of the current methods fail to accurately characterize those interactions because they only focus on pairwise connections and overlook the high-order relationships of brain regions. We propose that these high-order relationshi… ▽ More

    Submitted 8 June, 2024; v1 submitted 2 December, 2023; originally announced December 2023.

    Comments: Accepted at ICML 2024, Camera Ready Version

  9. arXiv:2311.16114  [pdf

    cs.CV cs.AI cs.LG

    Learning Noise-Robust Joint Representation for Multimodal Emotion Recognition under Incomplete Data Scenarios

    Authors: Qi Fan, Haolin Zuo, Rui Liu, Zheng Lian, Guanglai Gao

    Abstract: Multimodal emotion recognition (MER) in practical scenarios is significantly challenged by the presence of missing or incomplete data across different modalities. To overcome these challenges, researchers have aimed to simulate incomplete conditions during the training phase to enhance the system's overall robustness. Traditional methods have often involved discarding data or substituting data seg… ▽ More

    Submitted 7 May, 2024; v1 submitted 21 September, 2023; originally announced November 2023.

  10. arXiv:2307.01024  [pdf, other

    cs.CV

    SAM-DA: UAV Tracks Anything at Night with SAM-Powered Domain Adaptation

    Authors: Changhong Fu, Liangliang Yao, Haobo Zuo, Guangze Zheng, Jia Pan

    Abstract: Domain adaptation (DA) has demonstrated significant promise for real-time nighttime unmanned aerial vehicle (UAV) tracking. However, the state-of-the-art (SOTA) DA still lacks the potential object with accurate pixel-level location and boundary to generate the high-quality target domain training sample. This key issue constrains the transfer learning of the real-time daytime SOTA trackers for chal… ▽ More

    Submitted 24 March, 2024; v1 submitted 3 July, 2023; originally announced July 2023.

  11. arXiv:2303.04525  [pdf, other

    cs.CV cs.RO

    Continuity-Aware Latent Interframe Information Mining for Reliable UAV Tracking

    Authors: Changhong Fu, Mutian Cai, Sihang Li, Kunhan Lu, Haobo Zuo, Chongjun Liu

    Abstract: Unmanned aerial vehicle (UAV) tracking is crucial for autonomous navigation and has broad applications in robotic automation fields. However, reliable UAV tracking remains a challenging task due to various difficulties like frequent occlusion and aspect ratio change. Additionally, most of the existing work mainly focuses on explicit information to improve tracking performance, ignoring potential i… ▽ More

    Submitted 8 March, 2023; originally announced March 2023.

    Comments: 2023 IEEE International Conference on Robotics and Automation (ICRA)

  12. arXiv:2211.03075  [pdf

    cond-mat.supr-con cs.LG

    Prediction of superconducting properties of materials based on machine learning models

    Authors: Jie Hu, Yongquan Jiang, Yang Yan, Houchen Zuo

    Abstract: The application of superconducting materials is becoming more and more widespread. Traditionally, the discovery of new superconducting materials relies on the experience of experts and a large number of "trial and error" experiments, which not only increases the cost of experiments but also prolongs the period of discovering new superconducting materials. In recent years, machine learning has been… ▽ More

    Submitted 6 November, 2022; originally announced November 2022.

  13. arXiv:2210.15364  [pdf, other

    cs.SD cs.AI eess.AS

    Explicit Intensity Control for Accented Text-to-speech

    Authors: Rui Liu, Haolin Zuo, De Hu, Guanglai Gao, Haizhou Li

    Abstract: Accented text-to-speech (TTS) synthesis seeks to generate speech with an accent (L2) as a variant of the standard version (L1). How to control the intensity of accent in the process of TTS is a very interesting research direction, and has attracted more and more attention. Recent work design a speaker-adversarial loss to disentangle the speaker and accent information, and then adjust the loss weig… ▽ More

    Submitted 27 October, 2022; originally announced October 2022.

    Comments: 5 pages, 3 figures. Submitted to ICASSP 2023. arXiv admin note: text overlap with arXiv:2209.10804

  14. arXiv:2210.15359  [pdf, other

    cs.CV cs.AI

    Exploiting modality-invariant feature for robust multimodal emotion recognition with missing modalities

    Authors: Haolin Zuo, Rui Liu, Jinming Zhao, Guanglai Gao, Haizhou Li

    Abstract: Multimodal emotion recognition leverages complementary information across modalities to gain performance. However, we cannot guarantee that the data of all modalities are always present in practice. In the studies to predict the missing data across modalities, the inherent difference between heterogeneous modalities, namely the modality gap, presents a challenge. To address this, we propose to use… ▽ More

    Submitted 27 October, 2022; originally announced October 2022.

    Comments: 5 pages, 3 figures, 1 table. Submitted to ICASSP 2023. We release the code at: https://github.com/ZhuoYulang/IF-MMIN

  15. WikiLink: an encyclopedia-based semantic network for design innovation

    Authors: Haoyu Zuo, Qianzhi Jing, Tianqi Song, Huiting Liu, Lingyun Sun, Peter Childs, Liuqing Chen

    Abstract: Data-driven design and innovation is a process to reuse and provide valuable and useful information. However, existing semantic networks for design innovation is built on data source restricted to technological and scientific information. Besides, existing studies build the edges of a semantic network only on either statistical or semantic relationships, which is less likely to make full use of th… ▽ More

    Submitted 30 August, 2022; originally announced August 2022.

    Comments: 20 pages, 11 figures

    Journal ref: Journal of Intelligence 10, no. 4 (2022): 103

  16. arXiv:2206.08011  [pdf

    cond-mat.mtrl-sci cs.LG

    Hardness prediction of age-hardening aluminum alloy based on ensemble learning

    Authors: Houchen Zuo, Yongquan Jiang, Yan Yang, Baoying Liu, Jie Hu

    Abstract: With the rapid development of artificial intelligence, the combination of material database and machine learning has driven the progress of material informatics. Because aluminum alloy is widely used in many fields, so it is significant to predict the properties of aluminum alloy. In this thesis, the data of Al-Cu-Mg-X (X: Zn, Zr, etc.) alloy are used to input the composition, aging conditions (ti… ▽ More

    Submitted 16 June, 2022; originally announced June 2022.

  17. arXiv:2109.09394  [pdf

    cond-mat.mtrl-sci cs.LG physics.comp-ph

    Prediction of properties of metal alloy materials based on machine learning

    Authors: Houchen Zuo, Yongquan Jiang, Yan Yang, Jie Hu

    Abstract: Density functional theory and its optimization algorithm are the main methods to calculate the properties in the field of materials. Although the calculation results are accurate, it costs a lot of time and money. In order to alleviate this problem, we intend to use machine learning to predict material properties. In this paper, we conduct experiments on atomic volume, atomic energy and atomic for… ▽ More

    Submitted 20 September, 2021; originally announced September 2021.

  18. arXiv:2108.11899  [pdf

    cs.IR

    Patent-KG: Patent Knowledge Graph Use for Engineering Design

    Authors: Haoyu Zuo, Yuan Yin, Peter Childs

    Abstract: To facilitate knowledge reuse in engineering design, several dataset approaches have been proposed and applied by designers. This paper builds a patent-based knowledge graph, patent-KG, to represent the knowledge facts in patents for engineering design. The arising patent-KG approach proposes a new unsupervised mechanism to extract knowledge facts in a patent, by searching the attention graph in l… ▽ More

    Submitted 16 September, 2021; v1 submitted 26 August, 2021; originally announced August 2021.

    Comments: delete the weapon examples

  19. arXiv:1402.0060  [pdf, ps, other

    cs.IT math.CO

    On Classification of Toric Surface Codes of Low Dimension

    Authors: Xue Luo, Stephen S. -T. Yau, Mingyi Zhang, Huaiqing Zuo

    Abstract: This work is a natural continuation of our previous work \cite{yz}. In this paper, we give a complete classification of toric surface codes of dimension less than or equal to 6, except a special pair, $C_{P_6^{(4)}}$ and $C_{P_6^{(5)}}$ over $\mathbb{F}_8$. Also, we give an example, $C_{P_6^{(5)}}$ and $C_{P_6^{(6)}}$ over $\mathbb{F}_7$, to illustrate that two monomially equivalent toric codes ca… ▽ More

    Submitted 13 September, 2014; v1 submitted 1 February, 2014; originally announced February 2014.

    Comments: 18 pages, 4 figures, 8 tables

    Journal ref: Finite Fields Appl., Vol. 33, pp. 90-102, 2015