
Showing 1–50 of 75 results for author: Fan, C

Searching in archive eess.
  1. arXiv:2408.05641  [pdf, other]

    eess.AS

    Towards a Quantitative Analysis of Coarticulation with a Phoneme-to-Articulatory Model

    Authors: Chaofei Fan, Jaimie M. Henderson, Chris Manning, Francis R. Willett

    Abstract: Prior coarticulation studies focus mainly on limited phonemic sequences and specific articulators, providing only approximate descriptions of the temporal extent and magnitude of coarticulation. This paper is an initial attempt to comprehensively investigate coarticulation. We leverage existing Electromagnetic Articulography (EMA) datasets to develop and train a phoneme-to-articulatory (P2A) model…

    Submitted 10 August, 2024; originally announced August 2024.

    Comments: To be published in Interspeech 2024

  2. arXiv:2407.13895  [pdf, other]

    eess.AS

    Improving Robustness and Clinical Applicability of Respiratory Sound Classification via Audio Enhancement

    Authors: Jing-Tong Tzeng, Jeng-Lin Li, Huan-Yu Chen, Chun-Hsiang Huang, Chi-Hsin Chen, Cheng-Yi Fan, Edward Pei-Chuan Huang, Chi-Chun Lee

    Abstract: Deep learning techniques have shown promising results in the automatic classification of respiratory sounds. However, accurately distinguishing these sounds in real-world noisy conditions poses challenges for clinical deployment. Additionally, predicting signals with only background noise could undermine user trust in the system. In this study, we propose an audio enhancement (AE) pipeline as a pr…

    Submitted 18 July, 2024; originally announced July 2024.

    Comments: The following article has been submitted to The Journal of the Acoustical Society of America (JASA). After it is published, it will be found at https://pubs.aip.org/asa/jasa

  3. arXiv:2407.08481  [pdf, other]

    eess.IV cs.CV

    SliceMamba with Neural Architecture Search for Medical Image Segmentation

    Authors: Chao Fan, Hongyuan Yu, Yan Huang, Liang Wang, Zhenghan Yang, Xibin Jia

    Abstract: Despite the progress made in Mamba-based medical image segmentation models, existing methods utilizing unidirectional or multi-directional feature scanning mechanisms struggle to effectively capture dependencies between neighboring positions, limiting the discriminant representation learning of local features. These local features are crucial for medical image segmentation as they provide critical…

    Submitted 19 August, 2024; v1 submitted 11 July, 2024; originally announced July 2024.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  4. arXiv:2407.05726  [pdf, other]

    cs.CV eess.IV

    Gait Patterns as Biomarkers: A Video-Based Approach for Classifying Scoliosis

    Authors: Zirui Zhou, Junhao Liang, Zizhao Peng, Chao Fan, Fengwei An, Shiqi Yu

    Abstract: Scoliosis presents significant diagnostic challenges, particularly in adolescents, where early detection is crucial for effective treatment. Traditional diagnostic and follow-up methods, which rely on physical examinations and radiography, face limitations due to the need for clinical expertise and the risk of radiation exposure, thus restricting their use for widespread early screening. In respon…

    Submitted 23 August, 2024; v1 submitted 8 July, 2024; originally announced July 2024.

    Comments: Accepted to MICCAI 2024

  5. arXiv:2406.09664  [pdf, other]

    cs.SD eess.AS

    Frequency-mix Knowledge Distillation for Fake Speech Detection

    Authors: Cunhang Fan, Shunbo Dong, Jun Xue, Yujie Chen, Jiangyan Yi, Zhao Lv

    Abstract: In telephony scenarios, the fake speech detection (FSD) task of combating speech spoofing attacks is challenging. Data augmentation (DA) methods are considered an effective means of addressing the FSD task in telephony scenarios and are typically divided into time-domain and frequency-domain stages. While each has its advantages, both can result in information loss. To tackle this issue, we propose a novel DA…

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: Accepted by Interspeech 2024

  6. arXiv:2406.06086  [pdf, other]

    cs.SD eess.AS

    RawBMamba: End-to-End Bidirectional State Space Model for Audio Deepfake Detection

    Authors: Yujie Chen, Jiangyan Yi, Jun Xue, Chenglong Wang, Xiaohui Zhang, Shunbo Dong, Siding Zeng, Jianhua Tao, Lv Zhao, Cunhang Fan

    Abstract: Fake artefacts for discriminating between bonafide and fake audio can exist in both short- and long-range segments. Therefore, combining local and global feature information can effectively discriminate between bonafide and fake audio. This paper proposes an end-to-end bidirectional state space model, named RawBMamba, to capture both short- and long-range discriminative information for audio deepf…

    Submitted 18 June, 2024; v1 submitted 10 June, 2024; originally announced June 2024.

    Comments: Accepted by Interspeech 2024

  7. arXiv:2405.18554  [pdf, other]

    cs.LG cs.RO eess.SY

    Scalable Surrogate Verification of Image-based Neural Network Control Systems using Composition and Unrolling

    Authors: Feiyang Cai, Chuchu Fan, Stanley Bak

    Abstract: Verifying the safety of neural network control systems that use images as input is a difficult problem because, from a given system state, there is no known way to mathematically model what images are possible in the real world. We build on recent work that considers a surrogate verification approach, training a conditional generative adversarial network (cGAN) as an image generator in place of the re…

    Submitted 28 May, 2024; originally announced May 2024.

  8. arXiv:2404.10343  [pdf, other]

    cs.CV eess.IV

    The Ninth NTIRE 2024 Efficient Super-Resolution Challenge Report

    Authors: Bin Ren, Yawei Li, Nancy Mehta, Radu Timofte, Hongyuan Yu, Cheng Wan, Yuxin Hong, Bingnan Han, Zhuoyuan Wu, Yajun Zou, Yuqing Liu, Jizhe Li, Keji He, Chao Fan, Heng Zhang, Xiaolin Zhang, Xuanwu Yin, Kunlong Zuo, Bohao Liao, Peizhe Xia, Long Peng, Zhibo Du, Xin Di, Wangkai Li, Yang Wang, et al. (109 additional authors not shown)

    Abstract: This paper provides a comprehensive review of the NTIRE 2024 challenge, focusing on efficient single-image super-resolution (ESR) solutions and their outcomes. The task of this challenge is to super-resolve an input image with a magnification factor of x4 based on pairs of low and corresponding high-resolution images. The primary objective is to develop networks that optimize various aspects such…

    Submitted 25 June, 2024; v1 submitted 16 April, 2024; originally announced April 2024.

    Comments: The report paper of NTIRE2024 Efficient Super-resolution, accepted by CVPRW2024

  9. arXiv:2311.13714  [pdf, other]

    cs.RO cs.MA eess.SY math.OC

    Learning Safe Control for Multi-Robot Systems: Methods, Verification, and Open Challenges

    Authors: Kunal Garg, Songyuan Zhang, Oswin So, Charles Dawson, Chuchu Fan

    Abstract: In this survey, we review the recent advances in control design methods for robotic multi-agent systems (MAS), focussing on learning-based methods with safety considerations. We start by reviewing various notions of safety and liveness properties, and modeling frameworks used for problem formulation of MAS. Then we provide a comprehensive review of learning-based methods for safe control design fo…

    Submitted 22 November, 2023; originally announced November 2023.

    Comments: Submitted to Annual Reviews in Control

  10. arXiv:2311.13014  [pdf, other]

    eess.SY

    Neural Graph Control Barrier Functions Guided Distributed Collision-avoidance Multi-agent Control

    Authors: Songyuan Zhang, Kunal Garg, Chuchu Fan

    Abstract: We consider the problem of designing distributed collision-avoidance multi-agent control in large-scale environments with potentially moving obstacles, where a large number of agents are required to maintain safety using only local information and reach their goals. This paper addresses the problem of collision avoidance, scalability, and generalizability by introducing graph control barrier funct…

    Submitted 21 November, 2023; originally announced November 2023.

    Comments: 20 pages, 10 figures; Accepted by 7th Conference on Robot Learning (CoRL 2023)

  11. Dual-Branch Knowledge Distillation for Noise-Robust Synthetic Speech Detection

    Authors: Cunhang Fan, Mingming Ding, Jianhua Tao, Ruibo Fu, Jiangyan Yi, Zhengqi Wen, Zhao Lv

    Abstract: Most research in synthetic speech detection (SSD) focuses on improving performance on standard noise-free datasets. However, in actual situations, noise interference is usually present, causing significant performance degradation in SSD systems. To improve noise robustness, this paper proposes a dual-branch knowledge distillation synthetic speech detection (DKDSSD) method. Specifically, a parallel…

    Submitted 16 April, 2024; v1 submitted 13 October, 2023; originally announced October 2023.

  12. arXiv:2310.06956  [pdf, other]

    eess.SY

    Adversarial optimization leads to over-optimistic security-constrained dispatch, but sampling can help

    Authors: Charles Dawson, Chuchu Fan

    Abstract: To ensure safe, reliable operation of the electrical grid, we must be able to predict and mitigate likely failures. This need motivates the classic security-constrained AC optimal power flow (SCOPF) problem. SCOPF is commonly solved using adversarial optimization, where the dispatcher and an adversary take turns optimizing a robust dispatch and adversarial attack, respectively. We show that advers…

    Submitted 10 October, 2023; originally announced October 2023.

    Comments: Accepted at NAPS 2023

  13. arXiv:2309.09108  [pdf, other]

    cs.RO eess.SY math.OC

    Neural Network-based Fault Detection and Identification for Quadrotors using Dynamic Symmetry

    Authors: Kunal Garg, Chuchu Fan

    Abstract: Autonomous robotic systems, such as quadrotors, are susceptible to actuator faults, and for the safe operation of such systems, timely detection and isolation of these faults is essential. Neural networks can be used for verification of actuator performance via online actuator fault detection with high accuracy. In this paper, we develop a novel model-free fault detection and isolation (FDI) frame…

    Submitted 16 September, 2023; originally announced September 2023.

    Comments: Accepted for 2023 Allerton Conference on Communication, Control, & Computing

  14. arXiv:2309.08052  [pdf, other]

    cs.RO eess.SY

    A Bayesian approach to breaking things: efficiently predicting and repairing failure modes via sampling

    Authors: Charles Dawson, Chuchu Fan

    Abstract: Before autonomous systems can be deployed in safety-critical applications, we must be able to understand and verify the safety of these systems. For cases where the risk or cost of real-world testing is prohibitive, we propose a simulation-based framework for a) predicting ways in which an autonomous system is likely to fail and b) automatically adjusting the system's design to preemptively mitiga…

    Submitted 14 September, 2023; originally announced September 2023.

    Comments: To appear at the 2023 Conference on Robot Learning (CoRL)

  15. arXiv:2309.07147  [pdf, other]

    eess.SP cs.HC cs.LG cs.MM cs.SD eess.AS

    DGSD: Dynamical Graph Self-Distillation for EEG-Based Auditory Spatial Attention Detection

    Authors: Cunhang Fan, Hongyu Zhang, Wei Huang, Jun Xue, Jianhua Tao, Jiangyan Yi, Zhao Lv, Xiaopei Wu

    Abstract: Auditory Attention Detection (AAD) aims to detect the target speaker from brain signals in a multi-speaker environment. Although EEG-based AAD methods have shown promising results in recent years, current approaches primarily rely on traditional convolutional neural networks designed for processing Euclidean data like images. This makes it challenging to handle EEG signals, which possess non-Euclidean…

    Submitted 7 September, 2023; originally announced September 2023.

  16. Spatial Reconstructed Local Attention Res2Net with F0 Subband for Fake Speech Detection

    Authors: Cunhang Fan, Jun Xue, Jianhua Tao, Jiangyan Yi, Chenglong Wang, Chengshi Zheng, Zhao Lv

    Abstract: The rhythm of bonafide speech is often difficult to replicate, which causes the fundamental frequency (F0) of synthetic speech to differ significantly from that of real speech. The F0 feature is therefore expected to contain discriminative information for the fake speech detection (FSD) task. In this paper, we propose a novel F0 subband for FSD. In addition, to effectively model the F0 sub…

    Submitted 8 July, 2024; v1 submitted 19 August, 2023; originally announced August 2023.

    Comments: Accepted by Neural Networks

  17. arXiv:2306.15389  [pdf, other]

    cs.SD cs.LG eess.AS

    Multi-perspective Information Fusion Res2Net with RandomSpecmix for Fake Speech Detection

    Authors: Shunbo Dong, Jun Xue, Cunhang Fan, Kang Zhu, Yujie Chen, Zhao Lv

    Abstract: In this paper, we propose the multi-perspective information fusion (MPIF) Res2Net with random Specmix for fake speech detection (FSD). The main purpose of this system is to improve the model's ability to learn precise forgery information for the FSD task in low-quality scenarios. The task of random Specmix, a data augmentation method, is to improve the generalization ability of the model and enhance the mode…

    Submitted 27 June, 2023; originally announced June 2023.

    Comments: Accepted by DADA2023

  18. arXiv:2306.08722  [pdf, other]

    eess.SY

    Learning to Stabilize High-dimensional Unknown Systems Using Lyapunov-guided Exploration

    Authors: Songyuan Zhang, Chuchu Fan

    Abstract: Designing stabilizing controllers is a fundamental challenge in autonomous systems, particularly for high-dimensional, nonlinear systems that can hardly be accurately modeled with differential equations. Lyapunov theory offers a solution for stabilizing control systems; still, current methods relying on Lyapunov functions require access to complete dynamics or samples of system executions thro…

    Submitted 16 May, 2024; v1 submitted 14 June, 2023; originally announced June 2023.

    Comments: 32 pages, 7 figures; Accepted by the 6th Annual Conference on Learning for Dynamics and Control (L4DC 2024)

  19. arXiv:2303.14564  [pdf, other]

    eess.SY

    Compositional Neural Certificates for Networked Dynamical Systems

    Authors: Songyuan Zhang, Yumeng Xiu, Guannan Qu, Chuchu Fan

    Abstract: Developing stable controllers for large-scale networked dynamical systems is crucial but has long been challenging due to two key obstacles: certifiability and scalability. In this paper, we present a general framework to solve these challenges using compositional neural certificates based on ISS (Input-to-State Stability) Lyapunov functions. Specifically, we treat a large networked dynamical syst…

    Submitted 11 April, 2023; v1 submitted 25 March, 2023; originally announced March 2023.

    Comments: 25 pages, 8 figures; Accepted by 5th Annual Learning for Dynamics & Control Conference (L4DC) 2023

  20. arXiv:2303.10327  [pdf, other]

    cs.RO cs.LG eess.SY

    Hybrid Systems Neural Control with Region-of-Attraction Planner

    Authors: Yue Meng, Chuchu Fan

    Abstract: Hybrid systems are prevalent in robotics. However, ensuring the stability of hybrid systems is challenging due to sophisticated continuous and discrete dynamics. A system with all its system modes stable can still be unstable. Hence special treatments are required at mode switchings to stabilize the system. In this work, we propose a hierarchical, neural network (NN)-based method to control genera…

    Submitted 18 March, 2023; originally announced March 2023.

    Comments: Accepted to L4DC2023

  21. arXiv:2303.01211  [pdf, other]

    cs.SD cs.LG cs.MM eess.AS

    Learning From Yourself: A Self-Distillation Method for Fake Speech Detection

    Authors: Jun Xue, Cunhang Fan, Jiangyan Yi, Chenglong Wang, Zhengqi Wen, Dan Zhang, Zhao Lv

    Abstract: In this paper, we propose a novel self-distillation method for fake speech detection (FSD), which can significantly improve the performance of FSD without increasing the model complexity. For FSD, some fine-grained information is very important, such as spectrogram defects, mute segments, and so on, which are often perceived by shallow networks. However, shallow networks have much noise, which can…

    Submitted 2 March, 2023; originally announced March 2023.

    Comments: Accepted by ICASSP 2023

  22. arXiv:2302.11719  [pdf, other]

    cs.RO eess.SY

    Shield Model Predictive Path Integral: A Computationally Efficient Robust MPC Approach Using Control Barrier Functions

    Authors: Ji Yin, Charles Dawson, Chuchu Fan, Panagiotis Tsiotras

    Abstract: Model Predictive Path Integral (MPPI) control is a type of sampling-based model predictive control that simulates thousands of trajectories and uses these trajectories to synthesize optimal controls on-the-fly. In practice, however, MPPI encounters problems limiting its application. For instance, it has been observed that MPPI tends to make poor decisions if unmodeled dynamics or environmental dis…

    Submitted 22 February, 2023; originally announced February 2023.

    Comments: 8 pages, 7 figures. Submitted to RA-L for review

  23. arXiv:2211.06073  [pdf, other]

    cs.SD cs.CL eess.AS

    SceneFake: An Initial Dataset and Benchmarks for Scene Fake Audio Detection

    Authors: Jiangyan Yi, Chenglong Wang, Jianhua Tao, Chu Yuan Zhang, Cunhang Fan, Zhengkun Tian, Haoxin Ma, Ruibo Fu

    Abstract: Many datasets have been designed to further the development of fake audio detection. However, fake utterances in previous datasets are mostly generated by altering timbre, prosody, linguistic content or channel noise of original audio. These datasets leave out a scenario, in which the acoustic scene of an original audio is manipulated with a forged one. It will pose a major threat to our society i…

    Submitted 4 April, 2024; v1 submitted 11 November, 2022; originally announced November 2022.

    Comments: Accepted by Pattern Recognition, 1 April 2024

  24. Density Planner: Minimizing Collision Risk in Motion Planning with Dynamic Obstacles using Density-based Reachability

    Authors: Laura Lützow, Yue Meng, Andres Chavez Armijos, Chuchu Fan

    Abstract: Uncertainty is prevalent in robotics. Due to measurement noise and complex dynamics, we cannot estimate the exact system and environment state. Since conservative motion planners are not guaranteed to find a safe control strategy in a crowded, uncertain environment, we propose a density-based method. Our approach uses a neural network and the Liouville equation to learn the density evolution for a…

    Submitted 27 February, 2023; v1 submitted 5 October, 2022; originally announced October 2022.

    Comments: ©2023 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works

    Journal ref: 2023 IEEE International Conference on Robotics and Automation (ICRA)

  25. arXiv:2209.12270  [pdf, other]

    cs.RO eess.SY

    Barrier functions enable safety-conscious force-feedback control

    Authors: Charles Dawson, Austin Garrett, Falk Pollok, Yang Zhang, Chuchu Fan

    Abstract: In order to be effective partners for humans, robots must become increasingly comfortable with making contact with their environment. Unfortunately, it is hard for robots to distinguish between "just enough" and "too much" force: some force is required to accomplish the task but too much might damage equipment or injure humans. Traditional approaches to designing compliant force-feedback contr…

    Submitted 25 September, 2022; originally announced September 2022.

  26. arXiv:2209.12266  [pdf, other]

    cs.RO eess.SY

    Enforcing safety for vision-based controllers via Control Barrier Functions and Neural Radiance Fields

    Authors: Mukun Tong, Charles Dawson, Chuchu Fan

    Abstract: To navigate complex environments, robots must increasingly use high-dimensional visual feedback (e.g. images) for control. However, relying on high-dimensional image data to make control decisions raises important questions; particularly, how might we prove the safety of a visual-feedback controller? Control barrier functions (CBFs) are powerful tools for certifying the safety of feedback controll…

    Submitted 28 February, 2023; v1 submitted 25 September, 2022; originally announced September 2022.

    Comments: Accepted to ICRA 2023

  27. arXiv:2209.08073  [pdf, other]

    cs.RO cs.AI eess.SY

    Case Studies for Computing Density of Reachable States for Safe Autonomous Motion Planning

    Authors: Yue Meng, Zeng Qiu, Md Tawhid Bin Waez, Chuchu Fan

    Abstract: The density of the reachable states can help in understanding the risk of safety-critical systems, especially in situations where worst-case reachability is too conservative. Recent work provides a data-driven approach to compute the density distribution of autonomous systems' forward reachable states online. In this paper, we study the use of such an approach in combination with model predictive control for ve…

    Submitted 16 September, 2022; originally announced September 2022.

    Comments: NASA Formal Methods 2022

  28. arXiv:2208.09618  [pdf, other]

    cs.SD cs.AI eess.AS

    Fully Automated End-to-End Fake Audio Detection

    Authors: Chenglong Wang, Jiangyan Yi, Jianhua Tao, Haiyang Sun, Xun Chen, Zhengkun Tian, Haoxin Ma, Cunhang Fan, Ruibo Fu

    Abstract: The existing fake audio detection systems often rely on expert experience to design the acoustic features or manually design the hyperparameters of the network structure. However, artificial adjustment of the parameters can have a relatively obvious influence on the results. It is almost impossible to manually set the best set of parameters. Therefore this paper proposes a fully automated end-toen…

    Submitted 20 August, 2022; originally announced August 2022.

  29. arXiv:2208.03051  [pdf, other]

    cs.CV cs.CL cs.SD eess.AS eess.IV

    Hybrid Multimodal Feature Extraction, Mining and Fusion for Sentiment Analysis

    Authors: Jia Li, Ziyang Zhang, Junjie Lang, Yueqi Jiang, Liuwei An, Peng Zou, Yangyang Xu, Sheng Gao, Jie Lin, Chunxiao Fan, Xiao Sun, Meng Wang

    Abstract: In this paper, we present our solutions for the Multimodal Sentiment Analysis Challenge (MuSe) 2022, which includes MuSe-Humor, MuSe-Reaction and MuSe-Stress Sub-challenges. The MuSe 2022 focuses on humor detection, emotional reactions and multimodal emotional stress utilizing different modalities and data sets. In our work, different kinds of multimodal features are extracted, including acoustic,…

    Submitted 12 August, 2022; v1 submitted 5 August, 2022; originally announced August 2022.

    Comments: 8 pages, 2 figures, to appear in MuSe 2022 (ACM MM2022 co-located workshop)

  30. arXiv:2208.01214  [pdf, other]

    cs.SD cs.LG eess.AS

    Audio Deepfake Detection Based on a Combination of F0 Information and Real Plus Imaginary Spectrogram Features

    Authors: Jun Xue, Cunhang Fan, Zhao Lv, Jianhua Tao, Jiangyan Yi, Chengshi Zheng, Zhengqi Wen, Minmin Yuan, Shegang Shao

    Abstract: Recently, pioneer research works have proposed a large number of acoustic features (log power spectrogram, linear frequency cepstral coefficients, constant Q cepstral coefficients, etc.) for audio deepfake detection, obtaining good performance, and showing that different subbands have different contributions to audio deepfake detection. However, this lacks an explanation of the specific informatio…

    Submitted 1 August, 2022; originally announced August 2022.

  31. arXiv:2206.02748  [pdf, other]

    eess.IV cs.CV cs.LG

    Compound Multi-branch Feature Fusion for Real Image Restoration

    Authors: Chi-Mao Fan, Tsung-Jung Liu, Kuan-Hsien Liu

    Abstract: Image restoration is a challenging, ill-posed problem and a long-standing issue. However, most learning-based restoration methods target a single degradation type, which means they lack generalization. In this paper, we propose a multi-branch restoration model inspired by the Human Visual System (i.e., Retinal Ganglion Cells) which can achieve multiple res…

    Submitted 2 June, 2022; originally announced June 2022.

  32. arXiv:2205.02758  [pdf, other]

    physics.soc-ph eess.SY

    Quantitative Measures for Integrating Resilience into Transportation Planning Practice: Study in Texas

    Authors: Cheng-Chun Lee, Akhil Rajput, Chia-Wei Hsu, Chao Fan, Faxi Yuan, Shangjia Dong, Amir Esmalian, Hamed Farahmand, Flavia Ioana Patrascu, Chia-Fu Liu, Bo Li, Junwei Ma, Ali Mostafavi

    Abstract: The objective of this study is to propose a system-level framework with quantitative measures to assess the resilience of road networks. The framework proposed in this paper can help transportation agencies incorporate resilience considerations into project development proactively and to understand the resilience performance of current road networks effectively. This study identified and implement…

    Submitted 5 May, 2022; v1 submitted 4 April, 2022; originally announced May 2022.

  33. arXiv:2203.02038  [pdf, other]

    cs.RO eess.SY

    Robust Counterexample-guided Optimization for Planning from Differentiable Temporal Logic

    Authors: Charles Dawson, Chuchu Fan

    Abstract: Signal temporal logic (STL) provides a powerful, flexible framework for specifying complex autonomy tasks; however, existing methods for planning based on STL specifications have difficulty scaling to long-horizon tasks and are not robust to external disturbances. In this paper, we present an algorithm for finding robust plans that satisfy STL specifications. Our method alternates between local op…

    Submitted 3 March, 2022; originally announced March 2022.

  34. arXiv:2203.01645  [pdf, other]

    eess.IV cs.AI cs.CV cs.LG cs.MM

    Selective Residual M-Net for Real Image Denoising

    Authors: Chi-Mao Fan, Tsung-Jung Liu, Kuan-Hsien Liu

    Abstract: Image restoration is a low-level vision task whose goal is to restore degraded images to noise-free images. With the success of deep neural networks, convolutional neural networks have surpassed traditional restoration methods and become the mainstream in the computer vision area. To advance the performance of denoising algorithms, we propose a blind real image denoising network (SRMNet) by employing a…

    Submitted 3 March, 2022; originally announced March 2022.

    Comments: arXiv admin note: text overlap with arXiv:2203.01296

  35. arXiv:2203.01296  [pdf, other]

    eess.IV cs.AI cs.CV cs.MM

    Half Wavelet Attention on M-Net+ for Low-Light Image Enhancement

    Authors: Chi-Mao Fan, Tsung-Jung Liu, Kuan-Hsien Liu

    Abstract: Low-Light Image Enhancement is a computer vision task which brightens dark images to an appropriate level. It can also be seen as an ill-posed problem in the image restoration domain. With the success of deep neural networks, convolutional neural networks have surpassed traditional algorithm-based methods and become the mainstream in the computer vision area. To advance the performance of enh…

    Submitted 2 March, 2022; originally announced March 2022.

  36. SUNet: Swin Transformer UNet for Image Denoising

    Authors: Chi-Mao Fan, Tsung-Jung Liu, Kuan-Hsien Liu

    Abstract: Image restoration is a challenging, ill-posed problem and a long-standing issue. In the past few years, convolutional neural networks (CNNs) have almost dominated computer vision and achieved considerable success in different levels of vision tasks, including image restoration. However, recently the Swin Transformer-based model also shows impressive performance, even surpasses t…

    Submitted 28 February, 2022; originally announced February 2022.

  37. arXiv:2202.11762  [pdf, other]

    cs.RO eess.SY

    Safe Control with Learned Certificates: A Survey of Neural Lyapunov, Barrier, and Contraction methods

    Authors: Charles Dawson, Sicun Gao, Chuchu Fan

    Abstract: Learning-enabled control systems have demonstrated impressive empirical performance on challenging control problems in robotics, but this performance comes at the cost of reduced transparency and lack of guarantees on the safety or stability of the learned controllers. In recent years, new techniques have emerged to provide these guarantees by learning certificates alongside control policies -- th…

    Submitted 20 December, 2022; v1 submitted 23 February, 2022; originally announced February 2022.

    Comments: Accepted at IEEE Transactions on Robotics. Supplementary code available at https://github.com/MIT-REALM/neural_clbf

  38. arXiv:2202.08433  [pdf, ps, other]

    cs.SD cs.LG eess.AS

    ADD 2022: the First Audio Deep Synthesis Detection Challenge

    Authors: Jiangyan Yi, Ruibo Fu, Jianhua Tao, Shuai Nie, Haoxin Ma, Chenglong Wang, Tao Wang, Zhengkun Tian, Xiaohui Zhang, Ye Bai, Cunhang Fan, Shan Liang, Shiming Wang, Shuai Zhang, Xinrui Yan, Le Xu, Zhengqi Wen, Haizhou Li, Zheng Lian, Bin Liu

    Abstract: Audio deepfake detection is an emerging topic, which was included in the ASVspoof 2021. However, the recent shared tasks have not covered many real-life and challenging scenarios. The first Audio Deep synthesis Detection challenge (ADD) was motivated to fill in the gap. The ADD 2022 includes three tracks: low-quality fake audio detection (LF), partially fake audio detection (PF) and audio fake gam…

    Submitted 2 July, 2024; v1 submitted 16 February, 2022; originally announced February 2022.

    Comments: Accepted by ICASSP 2022

  39. arXiv:2201.05247  [pdf, other]

    cs.RO cs.MA eess.SY

    Multi-agent Motion Planning from Signal Temporal Logic Specifications

    Authors: Dawei Sun, Jingkai Chen, Sayan Mitra, Chuchu Fan

    Abstract: We tackle the challenging problem of multi-agent cooperative motion planning for complex tasks described using signal temporal logic (STL), where robots can have nonlinear and nonholonomic dynamics. Existing methods in multi-agent motion planning, especially those based on discrete abstractions and model predictive control (MPC), suffer from limited scalability with respect to the complexity of th…

    Submitted 13 January, 2022; originally announced January 2022.

    Comments: Accepted to IEEE Robotics and Automation Letters (RA-L)

  40. arXiv:2201.01918  [pdf, other]

    cs.LG cs.AI eess.SY

    SABLAS: Learning Safe Control for Black-box Dynamical Systems

    Authors: Zengyi Qin, Dawei Sun, Chuchu Fan

    Abstract: Control certificates based on barrier functions have been a powerful tool to generate probably safe control policies for dynamical systems. However, existing methods based on barrier certificates are normally for white-box systems with differentiable dynamics, which makes them inapplicable to many practical applications where the system is a black-box and cannot be accurately modeled. On the other…

    Submitted 8 January, 2022; v1 submitted 5 January, 2022; originally announced January 2022.

    Comments: IEEE Robotics and Automation Letters, 2022

  41. arXiv:2201.00932  [pdf, other]

    cs.RO eess.SY

    Learning Safe, Generalizable Perception-based Hybrid Control with Certificates

    Authors: Charles Dawson, Bethany Lowenkamp, Dylan Goff, Chuchu Fan

    Abstract: Many robotic tasks require high-dimensional sensors such as cameras and Lidar to navigate complex environments, but developing certifiably safe feedback controllers around these sensors remains a challenging open problem, particularly when learning is involved. Previous works have proved the safety of perception-feedback controllers by separating the perception and control subsystems and making st…

    Submitted 3 January, 2022; originally announced January 2022.

    Comments: Accepted for publication in RA-L

  42. arXiv:2112.08232  [pdf]

    eess.IV cs.CV cs.LG

    RA V-Net: Deep learning network for automated liver segmentation

    Authors: Zhiqi Lee, Sumin Qi, Chongchong Fan, Ziwei Xie

    Abstract: Accurate segmentation of the liver is a prerequisite for the diagnosis of disease. Automated segmentation is an important application of computer-aided detection and diagnosis of liver disease. In recent years, automated processing of medical images has seen breakthroughs. However, the low contrast of abdominal scan CT images and the complexity of liver morphology make accurate automatic segment…

    Submitted 15 December, 2021; v1 submitted 15 December, 2021; originally announced December 2021.

  43. arXiv:2110.10965  [pdf, other]

    eess.IV cs.CV

    2020 CATARACTS Semantic Segmentation Challenge

    Authors: Imanol Luengo, Maria Grammatikopoulou, Rahim Mohammadi, Chris Walsh, Chinedu Innocent Nwoye, Deepak Alapatt, Nicolas Padoy, Zhen-Liang Ni, Chen-Chen Fan, Gui-Bin Bian, Zeng-Guang Hou, Heonjin Ha, Jiacheng Wang, Haojie Wang, Dong Guo, Lu Wang, Guotai Wang, Mobarakol Islam, Bharat Giddwani, Ren Hongliang, Theodoros Pissas, Claudio Ravasio, Martin Huber, Jeremy Birch, Joan M. Nunez Do Rio, et al. (15 additional authors not shown)

    Abstract: Surgical scene segmentation is essential for anatomy and instrument localization which can be further used to assess tissue-instrument interactions during a surgical procedure. In 2017, the Challenge on Automatic Tool Annotation for cataRACT Surgery (CATARACTS) released 50 cataract surgery videos accompanied by instrument usage annotations. These annotations included frame-level instrument presenc…

    Submitted 24 February, 2022; v1 submitted 21 October, 2021; originally announced October 2021.

  44. arXiv:2110.00693  [pdf, other]

    cs.LG cs.RO eess.SY math.OC

    A Theoretical Overview of Neural Contraction Metrics for Learning-based Control with Guaranteed Stability

    Authors: Hiroyasu Tsukamoto, Soon-Jo Chung, Jean-Jacques Slotine, Chuchu Fan

    Abstract: This paper presents a theoretical overview of a Neural Contraction Metric (NCM): a neural network model of an optimal contraction metric and corresponding differential Lyapunov function, the existence of which is a necessary and sufficient condition for incremental exponential stability of non-autonomous nonlinear system trajectories. Its innovation lies in providing formal robustness guarantees f…

    Submitted 1 October, 2021; originally announced October 2021.

    Comments: IEEE Conference on Decision and Control (CDC), Preprint Version. Accepted July, 2021

  45. arXiv:2109.06697  [pdf, other]

    eess.SY cs.RO

    Safe Nonlinear Control Using Robust Neural Lyapunov-Barrier Functions

    Authors: Charles Dawson, Zengyi Qin, Sicun Gao, Chuchu Fan

    Abstract: Safety and stability are common requirements for robotic control systems; however, designing safe, stable controllers remains difficult for nonlinear and uncertain models. We develop a model-based learning approach to synthesize robust feedback controllers with safety and stability guarantees. We take inspiration from robust convex optimization and Lyapunov theory to define robust control Lyapunov…

    Submitted 6 October, 2021; v1 submitted 14 September, 2021; originally announced September 2021.

    Comments: Accepted to the 5th Annual Conference on Robot Learning (CoRL 21)

  46. arXiv:2106.12764  [pdf, other]

    cs.LG eess.SY

    Density Constrained Reinforcement Learning

    Authors: Zengyi Qin, Yuxiao Chen, Chuchu Fan

    Abstract: We study constrained reinforcement learning (CRL) from a novel perspective by setting constraints directly on state density functions, rather than the value functions considered by previous works. State density has a clear physical and mathematical interpretation, and is able to express a wide variety of constraints such as resource limits and safety requirements. Density constraints can also avoi…

    Submitted 24 June, 2021; originally announced June 2021.

    Comments: Accepted by ICML, 2021

  47. arXiv:2105.08573  [pdf, other]

    cs.LG cs.CV eess.IV

    Dependent Multi-Task Learning with Causal Intervention for Image Captioning

    Authors: Wenqing Chen, Jidong Tian, Caoyun Fan, Hao He, Yaohui Jin

    Abstract: Recent work on image captioning has mainly followed an extract-then-generate paradigm, pre-extracting a sequence of object-based features and then formulating image captioning as a single sequence-to-sequence task. Although promising, we observed two problems in generated captions: 1) content inconsistency where models would generate contradicting facts; 2) not informative enough where models would m…

    Submitted 18 May, 2021; originally announced May 2021.

    Comments: To be published in IJCAI 2021

  48. arXiv:2105.06270  [pdf, other]

    cs.LG cs.RO eess.SP

    Group Feature Learning and Domain Adversarial Neural Network for aMCI Diagnosis System Based on EEG

    Authors: Chen-Chen Fan, Haiqun Xie, Liang Peng, Hongjun Yang, Zhen-Liang Ni, Guan'an Wang, Yan-Jie Zhou, Sheng Chen, Zhijie Fang, Shuyun Huang, Zeng-Guang Hou

    Abstract: Medical diagnostic robot systems have received increasing attention due to their objectivity and accuracy. The diagnosis of mild cognitive impairment (MCI) is considered an effective means to prevent Alzheimer's disease (AD). Doctors diagnose MCI based on various clinical examinations, which are expensive, and the diagnosis results rely on the knowledge of doctors. Therefore, it is necessary to d…

    Submitted 28 April, 2021; originally announced May 2021.

    Comments: This paper has been accepted by 2021 International Conference on Robotics and Automation (ICRA 2021)

  49. arXiv:2103.02114  [pdf, other]

    cs.GR eess.SY

    A Computational Design and Evaluation Tool for 3D Structures with Planar Surfaces

    Authors: Chang Liu, Wenzhong Yan, Pehuen Moure, Cody Fan, Ankur Mehta

    Abstract: Three-dimensional (3D) structures composed of planar surfaces can be built out of accessible materials using easier fabrication techniques with shorter fabrication time. To better design 3D structures with planar surfaces, realistic models are required to understand and evaluate mechanical behaviors. Existing design tools are either effort-consuming (e.g. finite element analysis) or bounded by assu…

    Submitted 2 March, 2021; originally announced March 2021.

  50. arXiv:2102.08261  [pdf, other]

    cs.RO eess.SY

    Optimal Mixed Discrete-Continuous Planning for Linear Hybrid Systems

    Authors: Jingkai Chen, Brian Williams, Chuchu Fan

    Abstract: Planning in hybrid systems with both discrete and continuous control variables is important for dealing with real-world applications such as extra-planetary exploration and multi-vehicle transportation systems. Meanwhile, generating high-quality solutions given certain hybrid planning specifications is crucial to building high-performance hybrid systems. However, since hybrid planning is challengi…

    Submitted 20 February, 2021; v1 submitted 16 February, 2021; originally announced February 2021.

    Comments: Accepted at HSCC2021. 12 pages, 8 figures, 3 tables