Showing 1–32 of 32 results for author: Qiao, F

Searching in archive cs.
  1. arXiv:2408.12505 [pdf, other]

    math.OC cs.LG

    Stochastic Compositional Minimax Optimization with Provable Convergence Guarantees

    Authors: Yuyang Deng, Fuli Qiao, Mehrdad Mahdavi

    Abstract: Stochastic compositional minimax problems are prevalent in machine learning, yet only limited results have been established on the convergence of this class of problems. In this paper, we propose a formal definition of the stochastic compositional minimax problem, which involves optimizing a minimax loss with a compositional structure either in primal, dual, or both primal and dual variables. We introduc…

    Submitted 22 August, 2024; originally announced August 2024.

  2. arXiv:2408.01653 [pdf, other]

    cs.CV

    MCPDepth: Omnidirectional Depth Estimation via Stereo Matching from Multi-Cylindrical Panoramas

    Authors: Feng Qiao, Zhexiao Xiong, Xinge Zhu, Yuexin Ma, Qiumeng He, Nathan Jacobs

    Abstract: We introduce Multi-Cylindrical Panoramic Depth Estimation (MCPDepth), a two-stage framework for omnidirectional depth estimation via stereo matching between multiple cylindrical panoramas. MCPDepth uses cylindrical panoramas for initial stereo matching and then fuses the resulting depth maps across views. A circular attention module is employed to overcome the distortion along the vertical axis. M…

    Submitted 2 August, 2024; originally announced August 2024.

  3. arXiv:2407.12240 [pdf, other]

    cs.LG cs.CV

    Adaptive Cascading Network for Continual Test-Time Adaptation

    Authors: Kien X. Nguyen, Fengchun Qiao, Xi Peng

    Abstract: We study the problem of continual test-time adaptation, where the goal is to adapt a source pre-trained model to a sequence of unlabelled target domains at test time. Existing methods on test-time training suffer from several limitations: (1) Mismatch between the feature extractor and classifier; (2) Interference between the main and self-supervised tasks; (3) Lack of the ability to quickly adapt to…

    Submitted 16 July, 2024; originally announced July 2024.

    ACM Class: I.5.1; I.5.2

  4. arXiv:2406.07400 [pdf, other]

    cs.LG cs.LO

    Guiding LLM Temporal Logic Generation with Explicit Separation of Data and Control

    Authors: William Murphy, Nikolaus Holzer, Nathan Koenig, Leyi Cui, Raven Rothkopf, Feitong Qiao, Mark Santolucito

    Abstract: Temporal logics are powerful tools that are widely used for the synthesis and verification of reactive systems. The recent progress on Large Language Models (LLMs) has the potential to make the process of writing such specifications more accessible. However, writing specifications in temporal logics remains challenging for all but the most expert users. A key question in using LLMs for temporal lo…

    Submitted 11 June, 2024; originally announced June 2024.

  5. arXiv:2402.15632 [pdf, other]

    cs.SE

    Statically Inferring Usage Bounds for Infrastructure as Code

    Authors: Feitong Qiao, Aryana Mohammadi, Jürgen Cito, Mark Santolucito

    Abstract: Infrastructure as Code (IaC) has enabled cloud customers to have more agility in creating and modifying complex deployments of cloud-provisioned resources. By writing a configuration in IaC languages such as CloudFormation, users can declaratively specify their infrastructure and CloudFormation will handle the creation of the resources. However, understanding the complexity of IaC deployments has…

    Submitted 23 February, 2024; originally announced February 2024.

  6. arXiv:2310.08820 [pdf, other]

    cs.CV

    Learning to Adapt SAM for Segmenting Cross-domain Point Clouds

    Authors: Xidong Peng, Runnan Chen, Feng Qiao, Lingdong Kong, Youquan Liu, Tai Wang, Xinge Zhu, Yuexin Ma

    Abstract: Unsupervised domain adaptation (UDA) in 3D segmentation tasks presents a formidable challenge, primarily stemming from the sparse and unordered nature of point cloud data. Especially for LiDAR point clouds, the domain discrepancy becomes obvious across varying capture scenes, fluctuating weather conditions, and the diverse array of LiDAR devices in use. While previous UDA methodologies have often…

    Submitted 13 December, 2023; v1 submitted 12 October, 2023; originally announced October 2023.

  7. arXiv:2309.01842 [pdf, other]

    cs.CV

    StereoFlowGAN: Co-training for Stereo and Flow with Unsupervised Domain Adaptation

    Authors: Zhexiao Xiong, Feng Qiao, Yu Zhang, Nathan Jacobs

    Abstract: We introduce a novel training strategy for stereo matching and optical flow estimation that utilizes image-to-image translation between synthetic and real image domains. Our approach enables the training of models that excel in real image scenarios while relying solely on ground-truth information from synthetic images. To facilitate task-agnostic domain adaptation and the training of task-specific…

    Submitted 4 September, 2023; originally announced September 2023.

    Comments: Accepted by BMVC 2023

  8. arXiv:2307.13943 [pdf, other]

    cs.LG

    Topology-aware Robust Optimization for Out-of-distribution Generalization

    Authors: Fengchun Qiao, Xi Peng

    Abstract: Out-of-distribution (OOD) generalization is a challenging machine learning problem yet highly desirable in many high-stakes applications. Existing methods suffer from overly pessimistic modeling with low generalization confidence. As generalizing to arbitrary test distributions is impossible, we hypothesize that further structure on the topology of distributions is crucial in developing strong OOD…

    Submitted 25 July, 2023; originally announced July 2023.

    Comments: In ICLR 2023 (17 pages including appendix). The source code and pre-trained models are publicly available at: https://github.com/joffery/TRO

    Journal ref: International Conference on Learning Representations 2023

  9. arXiv:2304.05821 [pdf, other]

    cs.CV

    DUFormer: Solving Power Line Detection Task in Aerial Images using Semantic Segmentation

    Authors: Deyu An, Qiang Zhang, Jianshu Chao, Ting Li, Feng Qiao, Yong Deng, Zhenpeng Bian

    Abstract: Unmanned aerial vehicles (UAVs) are frequently used for inspecting power lines and capturing high-resolution aerial images. However, detecting power lines in aerial images is difficult, as the foreground data (i.e., power lines) is small and the background information is abundant. To tackle this problem, we introduce DUFormer, a semantic segmentation algorithm explicitly designed to detect power lines…

    Submitted 31 August, 2023; v1 submitted 12 April, 2023; originally announced April 2023.

  10. arXiv:2303.16390 [pdf, other]

    cs.LG

    Are Data-driven Explanations Robust against Out-of-distribution Data?

    Authors: Tang Li, Fengchun Qiao, Mengmeng Ma, Xi Peng

    Abstract: As black-box models increasingly power high-stakes applications, a variety of data-driven explanation methods have been introduced. Meanwhile, machine learning models are constantly challenged by distributional shifts. A question naturally arises: Are data-driven explanations robust against out-of-distribution data? Our empirical results show that even when it predicts correctly, the model might sti…

    Submitted 28 March, 2023; originally announced March 2023.

    Comments: In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2023

  11. arXiv:2211.16988 [pdf, other]

    cs.CV

    QuadFormer: Quadruple Transformer for Unsupervised Domain Adaptation in Power Line Segmentation of Aerial Images

    Authors: Pratyaksh Prabhav Rao, Feng Qiao, Weide Zhang, Yiliang Xu, Yong Deng, Guangbin Wu, Qiang Zhang

    Abstract: Accurate segmentation of power lines in aerial images is essential to ensure the flight safety of aerial vehicles. Acquiring high-quality ground truth annotations for training a deep learning model is a laborious process. Therefore, algorithms that can transfer knowledge from labelled synthetic data to unlabelled real images are in high demand. This process is studied in Unsupervised do…

    Submitted 28 November, 2022; originally announced November 2022.

  12. arXiv:2204.01026 [pdf, other]

    cs.CV

    STCrowd: A Multimodal Dataset for Pedestrian Perception in Crowded Scenes

    Authors: Peishan Cong, Xinge Zhu, Feng Qiao, Yiming Ren, Xidong Peng, Yuenan Hou, Lan Xu, Ruigang Yang, Dinesh Manocha, Yuexin Ma

    Abstract: Accurately detecting and tracking pedestrians in 3D space is challenging due to large variations in rotations, poses and scales. The situation becomes even worse for dense crowds with severe occlusions. However, existing benchmarks either only provide 2D annotations, or have limited 3D annotations with low-density pedestrian distribution, making it difficult to build a reliable pedestrian percepti…

    Submitted 3 April, 2022; originally announced April 2022.

    Comments: Accepted at CVPR 2022

  13. arXiv:2203.13046 [pdf, other]

    cs.CV

    Facial Action Unit Recognition With Multi-models Ensembling

    Authors: Wenqiang Jiang, Yannan Wu, Fengsheng Qiao, Liyu Meng, Yuanyuan Deng, Chuanhe Liu

    Abstract: The Affective Behavior Analysis in-the-wild (ABAW) 2022 Competition gives Affective Computing a large boost. In this paper, we present our method for the AU challenge in this competition. We use an improved IResnet100 as the backbone. We then train on the AU dataset in Aff-Wild2 starting from three models pretrained on our private AU dataset, our private expression dataset, and Glint360K, respectively. Finally, we ensemble the res…

    Submitted 24 March, 2022; originally announced March 2022.

  14. arXiv:2108.02888 [pdf, other]

    cs.CV

    Out-of-Domain Generalization from a Single Source: An Uncertainty Quantification Approach

    Authors: Xi Peng, Fengchun Qiao, Long Zhao

    Abstract: We are concerned with a worst-case scenario in model generalization, in the sense that a model aims to perform well on many unseen domains while there is only one single domain available for training. We propose Meta-Learning based Adversarial Domain Augmentation to solve this Out-of-Domain generalization problem. The key idea is to leverage adversarial training to create "fictitious" yet "challen…

    Submitted 16 June, 2022; v1 submitted 5 August, 2021; originally announced August 2021.

    Comments: 13 pages, 11 figures, accepted by IEEE Transactions on Pattern Analysis and Machine Intelligence. arXiv admin note: substantial text overlap with arXiv:2003.13216

  15. arXiv:2103.12579 [pdf, other]

    cs.CV

    MetaSAug: Meta Semantic Augmentation for Long-Tailed Visual Recognition

    Authors: Shuang Li, Kaixiong Gong, Chi Harold Liu, Yulin Wang, Feng Qiao, Xinjing Cheng

    Abstract: Real-world training data usually exhibits a long-tailed distribution, where several majority classes have a significantly larger number of samples than the remaining minority classes. This imbalance degrades the performance of typical supervised learning algorithms designed for balanced training sets. In this paper, we address this issue by augmenting minority classes with a recently proposed implic…

    Submitted 7 April, 2021; v1 submitted 23 March, 2021; originally announced March 2021.

    Comments: Accepted at CVPR 2021

  16. arXiv:2103.07531 [pdf, other]

    cs.CV

    Uncertainty-guided Model Generalization to Unseen Domains

    Authors: Fengchun Qiao, Xi Peng

    Abstract: We study a worst-case scenario in generalization: Out-of-domain generalization from a single source. The goal is to learn a robust model from a single source and expect it to generalize over many unknown distributions. This challenging problem has seldom been investigated, and existing solutions suffer from various limitations. In this paper, we propose a new solution. The key idea is to augment…

    Submitted 12 March, 2021; originally announced March 2021.

    Comments: In CVPR 2021 (13 pages including supplementary material)

  17. arXiv:2012.00234 [pdf, other]

    cs.CV cs.RO

    RaP-Net: A Region-wise and Point-wise Weighting Network to Extract Robust Features for Indoor Localization

    Authors: Dongjiang Li, Jinyu Miao, Xuesong Shi, Yuxin Tian, Qiwei Long, Tianyu Cai, Ping Guo, Hongfei Yu, Wei Yang, Haosong Yue, Qi Wei, Fei Qiao

    Abstract: Feature extraction plays an important role in visual localization. Unreliable features on dynamic objects or repetitive regions interfere with feature matching and pose a serious challenge to indoor localization. To address the problem, we propose a novel network, RaP-Net, to simultaneously predict region-wise invariability and point-wise reliability, and then extract features by considering both of th…

    Submitted 22 August, 2021; v1 submitted 30 November, 2020; originally announced December 2020.

    Comments: IROS 2021

  18. arXiv:2008.05416 [pdf, other]

    cs.CV cs.RO

    DXSLAM: A Robust and Efficient Visual SLAM System with Deep Features

    Authors: Dongjiang Li, Xuesong Shi, Qiwei Long, Shenghui Liu, Wei Yang, Fangshi Wang, Qi Wei, Fei Qiao

    Abstract: A robust and efficient Simultaneous Localization and Mapping (SLAM) system is essential for robot autonomy. For visual SLAM algorithms, though the theoretical framework has been well established for most aspects, feature extraction and association are still empirically designed in most cases, and can be vulnerable in complex environments. This paper shows that feature extraction with deep convoluti…

    Submitted 12 August, 2020; originally announced August 2020.

    Comments: 8 pages, 5 figures, to be published in IROS 2020

  19. arXiv:2003.13216 [pdf, other]

    cs.CV

    Learning to Learn Single Domain Generalization

    Authors: Fengchun Qiao, Long Zhao, Xi Peng

    Abstract: We are concerned with a worst-case scenario in model generalization, in the sense that a model aims to perform well on many unseen domains while there is only one single domain available for training. We propose a new method named adversarial domain augmentation to solve this Out-of-Distribution (OOD) generalization problem. The key idea is to leverage adversarial training to create "fictitious" y…

    Submitted 30 March, 2020; originally announced March 2020.

    Comments: In CVPR 2020 (13 pages including supplementary material). The source code and pre-trained models are publicly available at: https://github.com/joffery/M-ADA

  20. arXiv:1911.06487 [pdf, other]

    cs.CV cs.LG cs.RO stat.ML

    OpenLORIS-Object: A Robotic Vision Dataset and Benchmark for Lifelong Deep Learning

    Authors: Qi She, Fan Feng, Xinyue Hao, Qihan Yang, Chuanlin Lan, Vincenzo Lomonaco, Xuesong Shi, Zhengwei Wang, Yao Guo, Yimin Zhang, Fei Qiao, Rosa H. M. Chan

    Abstract: The recent breakthroughs in computer vision have benefited from the availability of large representative datasets (e.g. ImageNet and COCO) for training. Yet, robotic vision poses unique challenges for applying visual algorithms developed from these standard computer vision datasets due to their implicit assumption over non-varying distributions for a fixed set of tasks. Fully retraining models eac…

    Submitted 6 March, 2020; v1 submitted 15 November, 2019; originally announced November 2019.

    Comments: 7 pages, 7 figures, 4 tables

  21. arXiv:1911.05603 [pdf, other]

    cs.RO cs.CV

    Are We Ready for Service Robots? The OpenLORIS-Scene Datasets for Lifelong SLAM

    Authors: Xuesong Shi, Dongjiang Li, Pengpeng Zhao, Qinbin Tian, Yuxin Tian, Qiwei Long, Chunhao Zhu, Jingwei Song, Fei Qiao, Le Song, Yangquan Guo, Zhigang Wang, Yimin Zhang, Baoxing Qin, Wei Yang, Fangshi Wang, Rosa H. M. Chan, Qi She

    Abstract: Service robots should be able to operate autonomously in dynamic and daily changing environments over an extended period of time. While Simultaneous Localization And Mapping (SLAM) is one of the most fundamental problems for robotic autonomy, most existing SLAM works are evaluated with data sequences that are recorded in a short period of time. In real-world deployment, there can be out-of-sight s…

    Submitted 13 March, 2020; v1 submitted 13 November, 2019; originally announced November 2019.

    Comments: To be published in ICRA 2020; 7 pages, 3 figures; v2 fixed a number in Table III

  22. Interactive Hand Pose Estimation: Boosting accuracy in localizing extended finger joints

    Authors: Cairong Zhang, Guijin Wang, Hengkai Guo, Xinghao Chen, Fei Qiao, Huazhong Yang

    Abstract: Accurate 3D hand pose estimation plays an important role in Human Machine Interaction (HMI). In practical HMI, the joints of extended fingers, especially the corresponding fingertips, are much more important than other joints. We propose a novel method to refine the joint locations of extended fingers after obtaining a rough hand pose estimate. It first detects which fingers are stretching out, t…

    Submitted 25 July, 2018; v1 submitted 2 April, 2018; originally announced April 2018.

    Comments: Original publication available on https://doi.org/10.2352/ISSN.2470-1173.2018.2.VIPC-251

    Journal ref: Electronic Imaging, Visual Information Processing and Communication IX (2018), pp. 251-1-251-6(6)

  23. arXiv:1802.01822 [pdf, other]

    cs.CV

    Geometry-Contrastive GAN for Facial Expression Transfer

    Authors: Fengchun Qiao, Naiming Yao, Zirui Jiao, Zhihao Li, Hui Chen, Hongan Wang

    Abstract: In this paper, we propose a Geometry-Contrastive Generative Adversarial Network (GC-GAN) for transferring continuous emotions across different subjects. Given an input face with a certain emotion and a target facial expression from another subject, GC-GAN can generate an identity-preserving face with the target expression. Geometry information is introduced into cGANs as continuous conditions to gui…

    Submitted 22 October, 2018; v1 submitted 6 February, 2018; originally announced February 2018.

  24. Region Ensemble Network: Improving Convolutional Network for Hand Pose Estimation

    Authors: Hengkai Guo, Guijin Wang, Xinghao Chen, Cairong Zhang, Fei Qiao, Huazhong Yang

    Abstract: Hand pose estimation from monocular depth images is an important and challenging problem for human-computer interaction. Recently, deep convolutional networks (ConvNet) with sophisticated designs have been employed to address it, but the improvement over traditional methods is not so apparent. To improve the performance of direct 3D coordinate regression, we propose a tree-structured Region Ensemb…

    Submitted 8 May, 2017; v1 submitted 8 February, 2017; originally announced February 2017.

    Comments: Accepted to ICIP 2017. Project: https://github.com/guohengkai/region-ensemble-network

  25. arXiv:1603.01954 [pdf]

    cs.ET

    A Real-Time and Energy-Efficient Implementation of Difference-of-Gaussian with Flexible Thin-Film Transistors

    Authors: Nan Wu, Zheyu Liu, Fei Qiao, Xiaojun Guo, Qi Wei, Yuan Xie, Huazhong Yang

    Abstract: With many advantageous features, such as softness and better biocompatibility, flexible electronic devices have developed rapidly and attracted increasing attention. Many current applications of flexible devices are sensors and drivers, while almost none target complex computation, since flexible devices have lower electron mobility, simple structure and large process variation.…

    Submitted 7 March, 2016; originally announced March 2016.

  26. arXiv:1503.02354 [pdf, other]

    cs.AR

    A General Scheme for Noise-Tolerant Logic Design Based on Probabilistic and DCVS Approaches

    Authors: Xinghua Yang, Fei Qiao, Qi Wei, Huazhong Yang

    Abstract: In this paper, a general circuit scheme for noise-tolerant logic design based on Markov Random Field theory and the Differential Cascode Voltage Switch (DCVS) technique is proposed, which is an extension of the work in [1-3], [4]. A block with only four transistors has been successfully inserted into the original circuit scheme from [3] and extensive simulation results show that our proposed design can o…

    Submitted 8 March, 2015; originally announced March 2015.

    Comments: 4 pages, 10 figures

  27. arXiv:1408.2289 [pdf]

    cs.CV

    Physical Computing With No Clock to Implement the Gaussian Pyramid of SIFT Algorithm

    Authors: Yi Li, Qi Wei, Fei Qiao, Huazhong Yang

    Abstract: Physical computing is a technology that exploits the nature of electronic devices and circuit topology to carry out computing tasks. In this paper, we propose an active circuit network to implement a multi-scale Gaussian filter, which is also called a Gaussian Pyramid in image preprocessing. Various kinds of methods have been tried to accelerate the key stage in image feature extracting algorithm these ye…

    Submitted 10 August, 2014; originally announced August 2014.

    Comments: 6

  28. arXiv:1405.5948 [pdf]

    cs.MM

    Low-complexity video encoder for smart eyes based on underdetermined blind signal separation

    Authors: Jing Liu, Fei Qiao, Zhijian Ou, Huazhong Yang

    Abstract: This paper presents a low-complexity video coding method based on Underdetermined Blind Signal Separation (UBSS). The detailed coding framework is designed. Three key techniques are proposed to enhance the compression ratio and the quality of the decoded frames. The experiments validate that the proposed method requires 30 ms less encoding time than DISCOVER. The simulation shows that this new method…

    Submitted 22 May, 2014; originally announced May 2014.

  29. arXiv:1311.1419 [pdf]

    cs.MM

    Increasing Compression Ratio of Low Complexity Compressive Sensing Video Encoder with Application-Aware Configurable Mechanism

    Authors: Shuang Yu, Fei Qiao, Li Luo, Huazhong Yang

    Abstract: With the development of embedded video acquisition nodes and wireless video surveillance systems, traditional video coding methods can no longer meet the demands for lower computational complexity and tight power consumption. Therefore, a low-complexity compressive sensing video encoder framework with an application-aware configurable mechanism is proposed in this paper, where novel encoding metho…

    Submitted 6 November, 2013; originally announced November 2013.

    Comments: 5 pages with 6 figures and 1 table, conference

  30. arXiv:1310.3356 [pdf]

    cs.AR

    A Novel Reconfigurable Computing Architecture for Image Signal Processing Using Circuit-Switched NoC and Synchronous Dataflow Model

    Authors: Feitian Li, Fei Qiao, Qi Wei, Huazhong Yang

    Abstract: In this paper, a novel reconfigurable architecture is proposed for multifunctional image signal processing systems. A circuit-switched NoC is used to provide interconnection because the non-TMD links ensure fixed throughput, which is a desirable behavior for computationally intensive image processing algorithms compared with packet-switched NoC. Image processing algorithms are modeled as synchronous…

    Submitted 12 October, 2013; originally announced October 2013.

    Comments: ISQED 2014, 6 pages, 7 figures

  31. arXiv:1205.4572 [pdf]

    cs.MM

    A Novel Video Compression Approach Based on Underdetermined Blind Source Separation

    Authors: Jing Liu, Fei Qiao, Qi Wei, Huazhong Yang

    Abstract: This paper develops a new video compression approach based on underdetermined blind source separation. Underdetermined blind source separation, which can be used to efficiently enhance the video compression ratio, is combined with various off-the-shelf codecs in this paper. Combined with MPEG-2, the video compression ratio can be improved by slightly more than 33%. As for combining with H.264, 4X~12X mo…

    Submitted 21 May, 2012; originally announced May 2012.

    Comments: 4 pages with 4 figures and 1 table

    ACM Class: H.5.1; H.4.3

  32. arXiv:0910.3736 [pdf, other]

    cs.AR

    A Fault-tolerant Structure for Reliable Multi-core Systems Based on Hardware-Software Co-design

    Authors: Bingbing Xia, Fei Qiao, Huazhong Yang, Hui Wang

    Abstract: To cope with soft errors and make full use of the multi-core system, this paper presents an efficient fault-tolerant hardware and software co-designed architecture for multi-core systems. With a small number of test patterns, it uses less than 33% hardware resources compared with the traditional hardware redundancy (TMR) and takes less than 50% time compared with the traditio…

    Submitted 20 October, 2009; originally announced October 2009.

    Comments: 7 pages, 5 figures