Skip to main content

Showing 1–19 of 19 results for author: Wang, F L

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.13445  [pdf, other

    cs.CV cs.AI

    Lost in UNet: Improving Infrared Small Target Detection by Underappreciated Local Features

    Authors: Wuzhou Quan, Wei Zhao, Weiming Wang, Haoran Xie, Fu Lee Wang, Mingqiang Wei

    Abstract: Many targets are often very small in infrared images due to the long-distance imaging meachnism. UNet and its variants, as popular detection backbone networks, downsample the local features early and cause the irreversible loss of these local features, leading to both the missed and false detection of small targets in infrared images. We propose HintU, a novel network to recover the local features… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

  2. arXiv:2406.05000  [pdf, other

    cs.CV

    AttnDreamBooth: Towards Text-Aligned Personalized Text-to-Image Generation

    Authors: Lianyu Pang, Jian Yin, Baoquan Zhao, Feize Wu, Fu Lee Wang, Qing Li, Xudong Mao

    Abstract: Recent advances in text-to-image models have enabled high-quality personalized image synthesis of user-provided concepts with flexible textual control. In this work, we analyze the limitations of two primary techniques in text-to-image personalization: Textual Inversion and DreamBooth. When integrating the learned concept into new prompts, Textual Inversion tends to overfit the concept, while Drea… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

  3. arXiv:2403.04443  [pdf, other

    cs.CV

    FriendNet: Detection-Friendly Dehazing Network

    Authors: Yihua Fan, Yongzhen Wang, Mingqiang Wei, Fu Lee Wang, Haoran Xie

    Abstract: Adverse weather conditions often impair the quality of captured images, inevitably inducing cutting-edge object detection models for advanced driver assistance systems (ADAS) and autonomous driving. In this paper, we raise an intriguing question: can the combination of image restoration and object detection enhance detection performance in adverse weather conditions? To answer it, we propose an ef… ▽ More

    Submitted 7 March, 2024; originally announced March 2024.

    Comments: 13 pages, 8 figures, 6 tables

  4. arXiv:2312.12148  [pdf, other

    cs.CL

    Parameter-Efficient Fine-Tuning Methods for Pretrained Language Models: A Critical Review and Assessment

    Authors: Lingling Xu, Haoran Xie, Si-Zhao Joe Qin, Xiaohui Tao, Fu Lee Wang

    Abstract: With the continuous growth in the number of parameters of transformer-based pretrained language models (PLMs), particularly the emergence of large language models (LLMs) with billions of parameters, many natural language processing (NLP) tasks have demonstrated remarkable success. However, the enormous size and computational demands of these models pose significant challenges for adapting them to… ▽ More

    Submitted 19 December, 2023; originally announced December 2023.

    Comments: 20 pages, 4 figures

  5. arXiv:2312.04891  [pdf, other

    cs.CV

    Cross-BERT for Point Cloud Pretraining

    Authors: Xin Li, Peng Li, Zeyong Wei, Zhe Zhu, Mingqiang Wei, Junhui Hou, Liangliang Nan, Jing Qin, Haoran Xie, Fu Lee Wang

    Abstract: Introducing BERT into cross-modal settings raises difficulties in its optimization for handling multiple modalities. Both the BERT architecture and training objective need to be adapted to incorporate and model information from different modalities. In this paper, we address these challenges by exploring the implicit semantic and geometric correlations between 2D and 3D data of the same objects/sc… ▽ More

    Submitted 8 December, 2023; originally announced December 2023.

  6. arXiv:2306.06843  [pdf, other

    cs.CL

    Recurrent Attention Networks for Long-text Modeling

    Authors: Xianming Li, Zongxi Li, Xiaotian Luo, Haoran Xie, Xing Lee, Yingbin Zhao, Fu Lee Wang, Qing Li

    Abstract: Self-attention-based models have achieved remarkable progress in short-text mining. However, the quadratic computational complexities restrict their application in long text processing. Prior works have adopted the chunking strategy to divide long documents into chunks and stack a self-attention backbone with the recurrent structure to extract semantic representation. Such an approach disables par… ▽ More

    Submitted 11 June, 2023; originally announced June 2023.

  7. arXiv:2303.14075  [pdf, other

    cs.CV

    Search By Image: Deeply Exploring Beneficial Features for Beauty Product Retrieval

    Authors: Mingqiang Wei, Qian Sun, Haoran Xie, Dong Liang, Fu Lee Wang

    Abstract: Searching by image is popular yet still challenging due to the extensive interference arose from i) data variations (e.g., background, pose, visual angle, brightness) of real-world captured images and ii) similar images in the query dataset. This paper studies a practically meaningful problem of beauty product retrieval (BPR) by neural networks. We broadly extract different types of image features… ▽ More

    Submitted 24 March, 2023; originally announced March 2023.

  8. arXiv:2211.01664  [pdf, other

    cs.CV

    PointSee: Image Enhances Point Cloud

    Authors: Lipeng Gu, Xuefeng Yan, Peng Cui, Lina Gong, Haoran Xie, Fu Lee Wang, Jin Qin, Mingqiang Wei

    Abstract: There is a trend to fuse multi-modal information for 3D object detection (3OD). However, the challenging problems of low lightweightness, poor flexibility of plug-and-play, and inaccurate alignment of features are still not well-solved, when designing multi-modal fusion newtorks. We propose PointSee, a lightweight, flexible and effective multi-modal fusion solution to facilitate various 3OD networ… ▽ More

    Submitted 3 November, 2022; originally announced November 2022.

  9. arXiv:2210.15913  [pdf, other

    cs.CV

    GeoGCN: Geometric Dual-domain Graph Convolution Network for Point Cloud Denoising

    Authors: Zhaowei Chen, Peng Li, Zeyong Wei, Honghua Chen, Haoran Xie, Mingqiang Wei, Fu Lee Wang

    Abstract: We propose GeoGCN, a novel geometric dual-domain graph convolution network for point cloud denoising (PCD). Beyond the traditional wisdom of PCD, to fully exploit the geometric information of point clouds, we define two kinds of surface normals, one is called Real Normal (RN), and the other is Virtual Normal (VN). RN preserves the local details of noisy point clouds while VN avoids the global shap… ▽ More

    Submitted 28 October, 2022; originally announced October 2022.

  10. arXiv:2209.01746  [pdf, other

    cs.CV cs.GR

    SPCNet: Stepwise Point Cloud Completion Network

    Authors: Fei Hu, Honghua Chen, Xuequan Lu, Zhe Zhu, Jun Wang, Weiming Wang, Fu Lee Wang, Mingqiang Wei

    Abstract: How will you repair a physical object with large missings? You may first recover its global yet coarse shape and stepwise increase its local details. We are motivated to imitate the above physical repair procedure to address the point cloud completion task. We propose a novel stepwise point cloud completion network (SPCNet) for various 3D models with large missings. SPCNet has a hierarchical botto… ▽ More

    Submitted 4 September, 2022; originally announced September 2022.

  11. arXiv:2209.01373  [pdf, other

    cs.CV

    TogetherNet: Bridging Image Restoration and Object Detection Together via Dynamic Enhancement Learning

    Authors: Yongzhen Wang, Xuefeng Yan, Kaiwen Zhang, Lina Gong, Haoran Xie, Fu Lee Wang, Mingqiang Wei

    Abstract: Adverse weather conditions such as haze, rain, and snow often impair the quality of captured images, causing detection networks trained on normal images to generalize poorly in these scenarios. In this paper, we raise an intriguing question - if the combination of image restoration and object detection, can boost the performance of cutting-edge detectors in adverse weather conditions. To answer it… ▽ More

    Submitted 3 September, 2022; originally announced September 2022.

    Comments: 12 pages, 9 figures

  12. arXiv:2209.00977  [pdf, other

    cs.CV

    Contrastive Semantic-Guided Image Smoothing Network

    Authors: Jie Wang, Yongzhen Wang, Yidan Feng, Lina Gong, Xuefeng Yan, Haoran Xie, Fu Lee Wang, Mingqiang Wei

    Abstract: Image smoothing is a fundamental low-level vision task that aims to preserve salient structures of an image while removing insignificant details. Deep learning has been explored in image smoothing to deal with the complex entanglement of semantic structures and trivial details. However, current methods neglect two important facts in smoothing: 1) naive pixel-level regression supervised by the limi… ▽ More

    Submitted 2 September, 2022; originally announced September 2022.

  13. arXiv:2208.14796  [pdf, other

    cs.CV

    3DLG-Detector: 3D Object Detection via Simultaneous Local-Global Feature Learning

    Authors: Baian Chen, Liangliang Nan, Haoran Xie, Dening Lu, Fu Lee Wang, Mingqiang Wei

    Abstract: Capturing both local and global features of irregular point clouds is essential to 3D object detection (3OD). However, mainstream 3D detectors, e.g., VoteNet and its variants, either abandon considerable local features during pooling operations or ignore many global features in the whole scene context. This paper explores new modules to simultaneously learn local-global features of scene point clo… ▽ More

    Submitted 31 August, 2022; originally announced August 2022.

  14. arXiv:2208.13414  [pdf, other

    cs.CV

    PV-RCNN++: Semantical Point-Voxel Feature Interaction for 3D Object Detection

    Authors: Peng Wu, Lipeng Gu, Xuefeng Yan, Haoran Xie, Fu Lee Wang, Gary Cheng, Mingqiang Wei

    Abstract: Large imbalance often exists between the foreground points (i.e., objects) and the background points in outdoor LiDAR point clouds. It hinders cutting-edge detectors from focusing on informative areas to produce accurate 3D object detection results. This paper proposes a novel object detection network by semantical point-voxel feature interaction, dubbed PV-RCNN++. Unlike most of existing methods,… ▽ More

    Submitted 29 August, 2022; originally announced August 2022.

    Comments: 18 pages, 9 figures

  15. GeoSegNet: Point Cloud Semantic Segmentation via Geometric Encoder-Decoder Modeling

    Authors: Chen Chen, Yisen Wang, Honghua Chen, Xuefeng Yan, Dayong Ren, Yanwen Guo, Haoran Xie, Fu Lee Wang, Mingqiang Wei

    Abstract: Semantic segmentation of point clouds, aiming to assign each point a semantic category, is critical to 3D scene understanding.Despite of significant advances in recent years, most of existing methods still suffer from either the object-level misclassification or the boundary-level ambiguity. In this paper, we present a robust semantic segmentation network by deeply exploring the geometry of point… ▽ More

    Submitted 14 July, 2022; originally announced July 2022.

  16. arXiv:2205.01871  [pdf, other

    cs.CV

    UCL-Dehaze: Towards Real-world Image Dehazing via Unsupervised Contrastive Learning

    Authors: Yongzhen Wang, Xuefeng Yan, Fu Lee Wang, Haoran Xie, Wenhan Yang, Mingqiang Wei, Jing Qin

    Abstract: While the wisdom of training an image dehazing model on synthetic hazy data can alleviate the difficulty of collecting real-world hazy/clean image pairs, it brings the well-known domain shift problem. From a different yet new perspective, this paper explores contrastive learning with an adversarial training effort to leverage unpaired real-world hazy and clean images, thus bridging the gap between… ▽ More

    Submitted 3 May, 2022; originally announced May 2022.

    Comments: 14 pages, 9 figures, 9 tables

  17. Semi-MoreGAN: A New Semi-supervised Generative Adversarial Network for Mixture of Rain Removal

    Authors: Yiyang Shen, Yongzhen Wang, Mingqiang Wei, Honghua Chen, Haoran Xie, Gary Cheng, Fu Lee Wang

    Abstract: Rain is one of the most common weather which can completely degrade the image quality and interfere with the performance of many computer vision tasks, especially under heavy rain conditions. We observe that: (i) rain is a mixture of rain streaks and rainy haze; (ii) the scene depth determines the intensity of rain streaks and the transformation into the rainy haze; (iii) most existing deraining m… ▽ More

    Submitted 1 September, 2022; v1 submitted 28 April, 2022; originally announced April 2022.

    Comments: 18 pages

  18. arXiv:2203.00258  [pdf, other

    cs.CV

    When A Conventional Filter Meets Deep Learning: Basis Composition Learning on Image Filters

    Authors: Fu Lee Wang, Yidan Feng, Haoran Xie, Gary Cheng, Mingqiang Wei

    Abstract: Image filters are fast, lightweight and effective, which make these conventional wisdoms preferable as basic tools in vision tasks. In practical scenarios, users have to tweak parameters multiple times to obtain satisfied results. This inconvenience heavily discounts the efficiency and user experience. We propose basis composition learning on single image filters to automatically determine their o… ▽ More

    Submitted 1 March, 2022; originally announced March 2022.

    Comments: 14 pages, 10 figures

  19. arXiv:2111.14094  [pdf, other

    cs.CL cs.AI

    Topic Driven Adaptive Network for Cross-Domain Sentiment Classification

    Authors: Yicheng Zhu, Yiqiao Qiu, Qingyuan Wu, Fu Lee Wang, Yanghui Rao

    Abstract: Cross-domain sentiment classification has been a hot spot these years, which aims to learn a reliable classifier using labeled data from a source domain and evaluate it on a target domain. In this vein, most approaches utilized domain adaptation that maps data from different domains into a common feature space. To further improve the model performance, several methods targeted to mine domain-speci… ▽ More

    Submitted 6 September, 2022; v1 submitted 28 November, 2021; originally announced November 2021.