Zum Hauptinhalt springen

Showing 1–8 of 8 results for author: Huo, F

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.02032  [pdf, other

    cs.CV cs.AI

    Self-Introspective Decoding: Alleviating Hallucinations for Large Vision-Language Models

    Authors: Fushuo Huo, Wenchao Xu, Zhong Zhang, Haozhao Wang, Zhicheng Chen, Peilin Zhao

    Abstract: While Large Vision-Language Models (LVLMs) have rapidly advanced in recent years, the prevalent issue known as the `hallucination' problem has emerged as a significant bottleneck, hindering their real-world deployments. Existing methods mitigate this issue mainly from two perspectives: One approach leverages extra knowledge like robust instruction tuning LVLMs with curated datasets or employing au… ▽ More

    Submitted 4 August, 2024; originally announced August 2024.

  2. arXiv:2401.00403  [pdf, other

    cs.LG cs.CV cs.MM

    Overcome Modal Bias in Multi-modal Federated Learning via Balanced Modality Selection

    Authors: Yunfeng Fan, Wenchao Xu, Haozhao Wang, Fushuo Huo, Jinyu Chen, Song Guo

    Abstract: Selecting proper clients to participate in each federated learning (FL) round is critical to effectively harness a broad range of distributed data. Existing client selection methods simply consider the mining of distributed uni-modal data, yet, their effectiveness may diminish in multi-modal FL (MFL) as the modality imbalance problem not only impedes the collaborative local training but also leads… ▽ More

    Submitted 28 July, 2024; v1 submitted 31 December, 2023; originally announced January 2024.

    Comments: Accepted by ECCV24, 23 pages

  3. arXiv:2305.01239  [pdf, other

    cs.CV cs.AI

    DRPT: Disentangled and Recurrent Prompt Tuning for Compositional Zero-Shot Learning

    Authors: Xiaocheng Lu, Ziming Liu, Song Guo, Jingcai Guo, Fushuo Huo, Sikai Bai, Tao Han

    Abstract: Compositional Zero-shot Learning (CZSL) aims to recognize novel concepts composed of known knowledge without training samples. Standard CZSL either identifies visual primitives or enhances unseen composed entities, and as a result, entanglement between state and object primitives cannot be fully utilized. Admittedly, vision-language models (VLMs) could naturally cope with CZSL through tuning promp… ▽ More

    Submitted 2 May, 2023; originally announced May 2023.

  4. arXiv:2303.10891  [pdf, other

    cs.CV

    Non-Exemplar Online Class-incremental Continual Learning via Dual-prototype Self-augment and Refinement

    Authors: Fushuo Huo, Wenchao Xu, Jingcai Guo, Haozhao Wang, Yunfeng Fan, Song Guo

    Abstract: This paper investigates a new, practical, but challenging problem named Non-exemplar Online Class-incremental continual Learning (NO-CL), which aims to preserve the discernibility of base classes without buffering data examples and efficiently learn novel classes continuously in a single-pass (i.e., online) data stream. The challenges of this task are mainly two-fold: (1) Both base and novel class… ▽ More

    Submitted 15 December, 2023; v1 submitted 20 March, 2023; originally announced March 2023.

  5. arXiv:2211.12417  [pdf, other

    cs.CV

    ProCC: Progressive Cross-primitive Compatibility for Open-World Compositional Zero-Shot Learning

    Authors: Fushuo Huo, Wenchao Xu, Song Guo, Jingcai Guo, Haozhao Wang, Ziming Liu, Xiaocheng Lu

    Abstract: Open-World Compositional Zero-shot Learning (OW-CZSL) aims to recognize novel compositions of state and object primitives in images with no priors on the compositional space, which induces a tremendously large output space containing all possible state-object compositions. Existing works either learn the joint compositional state-object embedding or predict simple primitives with separate classifi… ▽ More

    Submitted 15 December, 2023; v1 submitted 19 November, 2022; originally announced November 2022.

  6. arXiv:2209.01760  [pdf, other

    eess.IV cs.CV

    REQA: Coarse-to-fine Assessment of Image Quality to Alleviate the Range Effect

    Authors: Bingheng Li, Fushuo Huo

    Abstract: Blind image quality assessment (BIQA) of user generated content (UGC) suffers from the range effect which indicates that on the overall quality range, mean opinion score (MOS) and predicted MOS (pMOS) are well correlated; focusing on a particular range, the correlation is lower. The reason for the range effect is that the predicted deviations both in a wide range and in a narrow range destroy the… ▽ More

    Submitted 26 June, 2023; v1 submitted 5 September, 2022; originally announced September 2022.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  7. arXiv:2203.03483  [pdf, other

    cs.CV

    Towards Unbiased Multi-label Zero-Shot Learning with Pyramid and Semantic Attention

    Authors: Ziming Liu, Song Guo, Jingcai Guo, Yuanyuan Xu, Fushuo Huo

    Abstract: Multi-label zero-shot learning extends conventional single-label zero-shot learning to a more realistic scenario that aims at recognizing multiple unseen labels of classes for each input sample. Existing works usually exploit attention mechanism to generate the correlation among different labels. However, most of them are usually biased on several major classes while neglect most of the minor clas… ▽ More

    Submitted 7 March, 2022; originally announced March 2022.

  8. arXiv:1106.4728  [pdf, other

    cs.IT

    Large Zero Autocorrelation Zone of Golay Sequences and $4^q$-QAM Golay Complementary Sequences

    Authors: Guang Gong, Fei Huo, Yang Yang

    Abstract: Sequences with good correlation properties have been widely adopted in modern communications, radar and sonar applications. In this paper, we present our new findings on some constructions of single $H$-ary Golay sequence and $4^q$-QAM Golay complementary sequence with a large zero autocorrelation zone, where $H\ge 2$ is an arbitrary even integer and $q\ge 2$ is an arbitrary integer. Those new res… ▽ More

    Submitted 22 June, 2011; originally announced June 2011.