Search | arXiv e-print repository

EmoAttack: Utilizing Emotional Voice Conversion for Speech Backdoor Attacks on Deep Speech Classification Models

Authors: Wenhan Yao, Zedong Xing, Xiarun Chen, Jia Liu, Yongqiang He, Weiping Wen

Abstract: Deep speech classification tasks, mainly including keyword spotting and speaker verification, play a crucial role in speech-based human-computer interaction. Recently, the security of these technologies has been demonstrated to be vulnerable to backdoor attacks. Specifically, in present triggers, speech samples are attacked by noisy disruption and component modification. We suggest that speech backdoor attacks can strategically focus on emotion, a higher-level subjective perceptual attribute inherent in speech. Furthermore, we propose that emotional voice conversion technology can serve as the speech backdoor attack trigger, and the method is called EmoAttack. Based on this, we conducted attack experiments on two speech classification tasks, showing that the EmoAttack method has impactful trigger effectiveness, with a remarkable attack success rate and accuracy variance. Additionally, the ablation experiments found that speech with intensive emotion is more suitable to be targeted for attacks.

Submitted 27 August, 2024; originally announced August 2024.

Comments: Submitted to ICASSP 2025

arXiv:2408.00777 [pdf, other]

CATD: Unified Representation Learning for EEG-to-fMRI Cross-Modal Generation

Authors: Weiheng Yao, Shuqiang Wang

Abstract: Multi-modal neuroimaging analysis is crucial for a comprehensive understanding of brain function and pathology, as it allows for the integration of different imaging techniques, thus overcoming the limitations of individual modalities. However, the high costs and limited availability of certain modalities pose significant challenges. To address these issues, this paper proposed the Condition-Aligned Temporal Diffusion (CATD) framework for end-to-end cross-modal synthesis of neuroimaging, enabling the generation of functional magnetic resonance imaging (fMRI)-detected Blood Oxygen Level Dependent (BOLD) signals from more accessible Electroencephalography (EEG) signals. By constructing Conditionally Aligned Block (CAB), heterogeneous neuroimages are aligned into a potential space, achieving a unified representation that provides the foundation for cross-modal transformation in neuroimaging. The combination with the constructed Dynamic Time-Frequency Segmentation (DTFS) module also enables the use of EEG signals to improve the temporal resolution of BOLD signals, thus augmenting the capture of the dynamic details of the brain. Experimental validation demonstrated the effectiveness of the framework in improving the accuracy of neural activity prediction, identifying abnormal brain regions, and enhancing the temporal resolution of BOLD signals.
The proposed framework establishes a new paradigm for cross-modal synthesis of neuroimaging by unifying heterogeneous neuroimaging data into a potential representation space, showing promise in medical applications such as improving Parkinson's disease prediction and identifying abnormal brain regions.

Submitted 16 July, 2024; originally announced August 2024.

arXiv:2407.02830 [pdf, other]

A Radiometric Correction based Optical Modeling Approach to Removing Reflection Noise in TLS Point Clouds of Urban Scenes

Authors: Li Fang, Tianyu Li, Yanghong Lin, Shudong Zhou, Wei Yao

Abstract: Point clouds are vital in computer vision tasks such as 3D reconstruction, autonomous driving, and robotics. However, TLS-acquired point clouds often contain virtual points from reflective surfaces, causing disruptions. This study presents a reflection noise elimination algorithm for TLS point clouds. Our innovative reflection plane detection algorithm, based on geometry-optical models and physical properties, identifies and categorizes reflection points per optical reflection theory. We've adapted the LSFH feature descriptor to retain reflection features, mitigating interference from symmetrical architectural structures. By incorporating the Hausdorff feature distance, the algorithm enhances resilience to ghosting and deformation, improving virtual point detection accuracy. Extensive experiments on the 3DRN benchmark dataset, featuring diverse urban environments with virtual TLS reflection noise, show our algorithm improves precision and recall rates for 3D points in reflective regions by 57.03% and 31.80%, respectively. Our method achieves a 9.17% better outlier detection rate and 5.65% higher accuracy than leading methods. Access the 3DRN dataset at https://github.com/Tsuiky/3DRN.

Submitted 3 July, 2024; originally announced July 2024.

arXiv:2406.10932 [pdf, other]

Imperceptible Rhythm Backdoor Attacks: Exploring Rhythm Transformation for Embedding Undetectable Vulnerabilities on Speech Recognition

Authors: Wenhan Yao, Jiangkun Yang, Yongqiang He, Jia Liu, Weiping Wen

Abstract: Speech recognition is an essential start ring of human-computer interaction, and recently, deep learning models have achieved excellent success in this task. However, when the model training and private data provider are always separated, some security threats that make deep neural networks (DNNs) abnormal deserve to be researched. In recent years, the typical backdoor attacks have been researched in speech recognition systems. The existing backdoor methods are based on data poisoning. The attacker adds some incorporated changes to benign speech spectrograms or changes the speech components, such as pitch and timbre. As a result, the poisoned data can be detected by human hearing or automatic deep algorithms. To improve the stealthiness of data poisoning, we propose a non-neural and fast algorithm called Random Spectrogram Rhythm Transformation (RSRT) in this paper. The algorithm combines four steps to generate stealthy poisoned utterances. From the perspective of rhythm component transformation, our proposed trigger stretches or squeezes the mel spectrograms and recovers them back to signals. The operation keeps timbre and content unchanged for good stealthiness. Our experiments are conducted on two kinds of speech recognition tasks, including testing the stealthiness of poisoned samples by speaker verification and automatic speech recognition. The results show that our method has excellent effectiveness and stealthiness. The rhythm trigger needs a low poisoning rate and gets a very high attack success rate.

Submitted 21 August, 2024; v1 submitted 16 June, 2024; originally announced June 2024.

arXiv:2404.12771 [pdf]

Phase-space analysis of a two-section InP laser as an all-optical spiking neuron: dependency on control and design parameters

Authors: Lukas Puts, Daan Lenstra, Kevin Williams, Weiming Yao

Abstract: Using a rate-equation model we numerically evaluate the carrier concentration and photon number in an integrated two-section semiconductor laser, and analyse its dynamics in three-dimensional phase space. The simulation comprises compact model descriptions extracted from a commercially-available generic InP technology platform, allowing us to model an applied reverse-bias voltage to the saturable absorber. We use the model to study the influence of the injected gain current, reverse-bias voltage, and cavity mirror reflectivity on the excitable operation state, which is the operation mode desired for the laser to act as an all-optical integrated neuron. We show in phase-space that our model is capable of demonstrating four different operation modes, i.e. cw, self-pulsating and an on-set and excitable mode under optical pulse injection. In addition, we show that lowering the reflectivity of one of the cavity mirrors greatly enhances the control parameter space for excitable operation, enabling more relaxed operation parameter control and lower power consumption of an integrated two-section laser neuron.

Submitted 19 April, 2024; originally announced April 2024.

Comments: 11 pages, 10 figures

arXiv:2403.06197 [pdf, other]

DrFuse: Learning Disentangled Representation for Clinical Multi-Modal Fusion with Missing Modality and Modal Inconsistency

Authors: Wenfang Yao, Kejing Yin, William K. Cheung, Jia Liu, Jing Qin

Abstract: The combination of electronic health records (EHR) and medical images is crucial for clinicians in making diagnoses and forecasting prognosis. Strategically fusing these two data modalities has great potential to improve the accuracy of machine learning models in clinical prediction tasks. However, the asynchronous and complementary nature of EHR and medical images presents unique challenges. Missing modalities due to clinical and administrative factors are inevitable in practice, and the significance of each data modality varies depending on the patient and the prediction target, resulting in inconsistent predictions and suboptimal model performance. To address these challenges, we propose DrFuse to achieve effective clinical multi-modal fusion. It tackles the missing modality issue by disentangling the features shared across modalities and those unique within each modality. Furthermore, we address the modal inconsistency issue via a disease-wise attention layer that produces the patient- and disease-wise weighting for each modality to make the final prediction. We validate the proposed method using real-world large-scale datasets, MIMIC-IV and MIMIC-CXR. Experimental results show that the proposed method significantly outperforms the state-of-the-art models. Our implementation is publicly available at https://github.com/dorothy-yao/drfuse.

Submitted 10 March, 2024; originally announced March 2024.

Comments: Accepted by AAAI-24

arXiv:2311.06307 [pdf]

Synthetic Speaking Children -- Why We Need Them and How to Make Them

Authors: Muhammad Ali Farooq, Dan Bigioi, Rishabh Jain, Wang Yao, Mariam Yiwere, Peter Corcoran

Abstract: Contemporary Human Computer Interaction (HCI) research relies primarily on neural network models for machine vision and speech understanding of a system user. Such models require extensively annotated training datasets for optimal performance and when building interfaces for users from a vulnerable population such as young children, GDPR introduces significant complexities in data collection, management, and processing. Motivated by the training needs of an Edge AI smart toy platform this research explores the latest advances in generative neural technologies and provides a working proof of concept of a controllable data generation pipeline for speech driven facial training data at scale. In this context, we demonstrate how StyleGAN2 can be finetuned to create a gender balanced dataset of children's faces. This dataset includes a variety of controllable factors such as facial expressions, age variations, facial poses, and even speech-driven animations with realistic lip synchronization. By combining generative text to speech models for child voice synthesis and a 3D landmark based talking heads pipeline, we can generate highly realistic, entirely synthetic, talking child video clips. These video clips can provide valuable, and controllable, synthetic training data for neural network models, bridging the gap when real data is scarce or restricted due to privacy regulations.

Submitted 8 November, 2023; originally announced November 2023.

Comments: Presented at SpeD 23

arXiv:2309.11715

Deshadow-Anything: When Segment Anything Model Meets Zero-shot shadow removal

Authors: Xiao Feng Zhang, Tian Yi Song, Jia Wei Yao

Abstract: Segment Anything (SAM), an advanced universal image segmentation model trained on an expansive visual dataset, has set a new benchmark in image segmentation and computer vision. However, it faced challenges when it came to distinguishing between shadows and their backgrounds. To address this, we developed Deshadow-Anything, considering the generalization of large-scale datasets, and we performed Fine-tuning on large-scale datasets to achieve image shadow removal. The diffusion model can diffuse along the edges and textures of an image, helping to remove shadows while preserving the details of the image. Furthermore, we design Multi-Self-Attention Guidance (MSAG) and adaptive input perturbation (DDPM-AIP) to accelerate the iterative training speed of diffusion. Experiments on shadow removal tasks demonstrate that these methods can effectively improve image restoration performance.

Submitted 2 January, 2024; v1 submitted 20 September, 2023; originally announced September 2023.

Comments: it needs revised

arXiv:2309.02937 [pdf, other]

Resilient source seeking with robot swarms

Authors: Antonio Acuaviva, Jesus Bautista, Weijia Yao, Juan Jimenez, Hector Garcia de Marina

Abstract: We present a solution for locating the source, or maximum, of an unknown scalar field using a swarm of mobile robots. Unlike relying on the traditional gradient information, the swarm determines an ascending direction to approach the source with arbitrary precision. The ascending direction is calculated from measurements of the field strength at the robot locations and their relative positions concerning the centroid. Rather than focusing on individual robots, we focus the analysis on the density of robots per unit area to guarantee a more resilient swarm, i.e., the functionality remains even if individuals go missing or are misplaced during the mission. We reinforce the robustness of the algorithm by providing sufficient conditions for the swarm shape so that the ascending direction is almost parallel to the gradient. The swarm can respond to an unexpected environment by morphing its shape and exploiting the existence of multiple ascending directions. Finally, we validate our approach numerically with hundreds of robots. The fact that a large number of robots always calculate an ascending direction compensates for the loss of individuals and mitigates issues arising from the actuator and sensor noises.

Submitted 14 August, 2024; v1 submitted 6 September, 2023; originally announced September 2023.

Comments: 7 pages, CDC 2024, accepted version

arXiv:2308.05305 [pdf, other]

From CNN to Transformer: A Review of Medical Image Segmentation Models

Authors: Wenjian Yao, Jiajun Bai, Wei Liao, Yuheng Chen, Mengjuan Liu, Yao Xie

Abstract: Medical image segmentation is an important step in medical image analysis, especially as a crucial prerequisite for efficient disease diagnosis and treatment. The use of deep learning for image segmentation has become a prevalent trend. The widely adopted approach currently is U-Net and its variants. Additionally, with the remarkable success of pre-trained models in natural language processing tasks, transformer-based models like TransUNet have achieved desirable performance on multiple medical image segmentation datasets. In this paper, we conduct a survey of the most representative four medical image segmentation models in recent years. We theoretically analyze the characteristics of these models and quantitatively evaluate their performance on two benchmark datasets (i.e., Tuberculosis Chest X-rays and ovarian tumors). Finally, we discuss the main challenges and future trends in medical image segmentation. Our work can assist researchers in the related field to quickly establish medical segmentation models tailored to specific regions.

Submitted 9 August, 2023; originally announced August 2023.

Comments: 18 pages, 8 figures

arXiv:2303.14357 [pdf, other]

Dealing With Heterogeneous 3D MR Knee Images: A Federated Few-Shot Learning Method With Dual Knowledge Distillation

Authors: Xiaoxiao He, Chaowei Tan, Bo Liu, Liping Si, Weiwu Yao, Liang Zhao, Di Liu, Qilong Zhangli, Qi Chang, Kang Li, Dimitris N. Metaxas

Abstract: Federated Learning has gained popularity among medical institutions since it enables collaborative training between clients (e.g., hospitals) without aggregating data. However, due to the high cost associated with creating annotations, especially for large 3D image datasets, clinical institutions do not have enough supervised data for training locally. Thus, the performance of the collaborative model is subpar under limited supervision. On the other hand, large institutions have the resources to compile data repositories with high-resolution images and labels. Therefore, individual clients can utilize the knowledge acquired in the public data repositories to mitigate the shortage of private annotated images. In this paper, we propose a federated few-shot learning method with dual knowledge distillation. This method allows joint training with limited annotations across clients without jeopardizing privacy. The supervised learning of the proposed method extracts features from limited labeled data in each client, while the unsupervised data is used to distill both feature and response-based knowledge from a national data repository to further improve the accuracy of the collaborative model and reduce the communication cost. Extensive evaluations are conducted on 3D magnetic resonance knee images from a private clinical dataset. Our proposed method shows superior performance and less training time than other semi-supervised federated learning methods. Codes and additional visualization results are available at https://github.com/hexiaoxiao-cs/fedml-knee.

Submitted 17 April, 2023; v1 submitted 25 March, 2023; originally announced March 2023.

arXiv:2302.11110 [pdf, ps, other]

A Novel Vector-Field-Based Motion Planning Algorithm for 3D Nonholonomic Robots

Authors: Xiaodong He, Weijia Yao, Zhiyong Sun, Zhongkui Li

Abstract: This paper focuses on the motion planning for mobile robots in 3D, which are modelled by 6-DOF rigid body systems with nonholonomic kinematics constraints. We not only specify the target position, but also bring in the requirement of the heading direction at the terminal time, which gives rise to a new and more challenging 3D motion planning problem. The proposed planning algorithm involves a novel velocity vector field (VF) over the workspace, and by following the VF, the robot can be navigated to the destination with the specified heading direction. In order to circumvent potential collisions with obstacles and other robots, a composite VF is designed by composing the navigation VF and an additional VF tangential to the boundary of the dangerous area. Moreover, we propose a priority-based algorithm to deal with the motion coupling issue among multiple robots. Finally, numerical simulations are conducted to verify the theoretical results.

Submitted 8 April, 2023; v1 submitted 21 February, 2023; originally announced February 2023.

arXiv:2301.12213 [pdf, other]

The Domain of Attraction of the Desired Path in Vector-field Guided Path Following

Authors: Weijia Yao, Bohuan Lin, Brian D. O. Anderson, Ming Cao

Abstract: In the vector-field guided path-following problem, a sufficiently smooth vector field is designed such that its integral curves converge to and move along a one-dimensional geometric desired path. The existence of singular points where the vector field vanishes creates a topological obstruction to global convergence to the desired path and some associated topological analysis has been conducted in our previous work. In this paper, we strengthen the result in our previous work by showing that the domain of attraction of the desired path, which is a compact asymptotically stable one-dimensional embedded submanifold of an $n$-dimensional ambient manifold $\mathcal{M}$, is homeomorphic to $\mathbb{R}^{n-1} \times \mathbb{S}^1$, and not just homotopy equivalent to $\mathbb{S}^1$. This result is extended for a $k$-dimensional compact manifold for $k \ge 2$.

Submitted 28 January, 2023; originally announced January 2023.

arXiv:2212.08729 [pdf, other]

Distribution-aware Goal Prediction and Conformant Model-based Planning for Safe Autonomous Driving

Authors: Jonathan Francis, Bingqing Chen, Weiran Yao, Eric Nyberg, Jean Oh

Abstract: The feasibility of collecting a large amount of expert demonstrations has inspired growing research interests in learning-to-drive settings, where models learn by imitating the driving behaviour from experts. However, exclusively relying on imitation can limit agents' generalisability to novel scenarios that are outside the support of the training data. In this paper, we address this challenge by factorising the driving task, based on the intuition that modular architectures are more generalisable and more robust to changes in the environment compared to monolithic, end-to-end frameworks. Specifically, we draw inspiration from the trajectory forecasting community and reformulate the learning-to-drive task as obstacle-aware perception and grounding, distribution-aware goal prediction, and model-based planning. Firstly, we train the obstacle-aware perception module to extract salient representation of the visual context. Then, we learn a multi-modal goal distribution by performing conditional density-estimation using normalising flow. Finally, we ground candidate trajectory predictions to road geometry, and plan the actions based on vehicle dynamics. Under the CARLA simulator, we report state-of-the-art results on the CARNOVEL benchmark.

Submitted 16 December, 2022; originally announced December 2022.

Comments: Accepted: 1st Workshop on Safe Learning for Autonomous Driving, at the International Conference on Machine Learning (ICML 2022); Best Paper Award

arXiv:2209.09478 [pdf, other]

Guiding vector fields for the distributed motion coordination of mobile robots

Authors: Weijia Yao, Hector Garcia de Marina, Zhiyong Sun, Ming Cao

Abstract: We propose coordinating guiding vector fields to achieve two tasks simultaneously with a team of robots: first, the guidance and navigation of multiple robots to possibly different paths or surfaces typically embedded in 2D or 3D; second, their motion coordination while tracking their prescribed paths or surfaces. The motion coordination is defined by desired parametric displacements between robots on the path or surface. Such a desired displacement is achieved by controlling the virtual coordinates, which correspond to the path or surface's parameters, between guiding vector fields. Rigorous mathematical guarantees underpinned by dynamical systems theory and Lyapunov theory are provided for the effective distributed motion coordination and navigation of robots on paths or surfaces from all initial positions. As an example for practical robotic applications, we derive a control algorithm from the proposed coordinating guiding vector fields for a Dubins-car-like model with actuation saturation. Our proposed algorithm is distributed and scalable to an arbitrary number of robots. Furthermore, extensive illustrative simulations and fixed-wing aircraft outdoor experiments validate the effectiveness and robustness of our algorithm.

Submitted 30 October, 2022; v1 submitted 20 September, 2022; originally announced September 2022.

Comments: Evolved paper from arXiv:2103.12372. Accepted to IEEE Transactions on Robotics. Supplementary video: https://www.bilibili.com/video/BV16e4y147xp/

arXiv:2208.04130 [pdf, other]

Reliability Analysis of Complex Multi-State System Based on Universal Generating Function and Bayesian Network

Authors: Xu Liu, Wen Yao, Xiaohu Zheng, Yingchun Xu

Abstract: In the complex multi-state system (MSS), reliability analysis is a significant research topic for equipment design, manufacturing, usage and maintenance. Universal Generating Function (UGF) is an important method in reliability analysis, which efficiently obtains the system reliability by a fast algebraic procedure. However, when structural relationships between subsystems or components are not clear or lack explicit expressions, the UGF method is difficult to use or not applicable at all. Bayesian Network (BN) has a natural advantage in terms of uncertainty inference for relationships without explicit expressions. When the number of components is extremely large, though, it suffers from low efficiency. To overcome the respective defects of UGF and BN, a novel reliability analysis method called UGF-BN is proposed for the complex MSS. In the UGF-BN framework, the UGF method is first used to analyze the large number of bottom components. Then the probability distributions obtained are taken as the input of BN. Finally, the reliability of the complex MSS is modeled by the BN method. This proposed method improves the computational efficiency, especially for an MSS with a large number of bottom components. In addition, aircraft reliability-based design optimization based on the UGF-BN method is further studied with budget constraints on mass, power, and cost. Finally, two cases are used to demonstrate and verify the proposed method.

Submitted 15 June, 2022; originally announced August 2022.

arXiv:2205.12760 [pdf, other]

Guiding Vector Fields for Following Occluded Paths

Authors: Weijia Yao, Bohuan Lin, Brian D. O. Anderson, Ming Cao

Abstract: Accurately following a geometric desired path in a two-dimensional space is a fundamental task for many engineering systems, in particular mobile robots. When the desired path is occluded by obstacles, it is necessary and crucial to temporarily deviate from the path for obstacle/collision avoidance. In this paper, we develop a composite guiding vector field via the use of smooth bump functions, and provide theoretical guarantees that the integral curves of the vector field can follow an arbitrary sufficiently smooth desired path and avoid collision with obstacles of arbitrary shapes. These two behaviors are reactive since path (re)-planning and global map construction are not involved. To deal with the common deadlock problem, we introduce a switching vector field, and the Zeno behavior is excluded. Simulations are conducted to support the theoretical results.

Submitted 25 May, 2022; originally announced May 2022.
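
As a rough illustration of what a guiding vector field is, the sketch below uses a construction common in the path-following literature for a circular desired path: a tangential term (the rotated gradient of the level function) plus a term that pushes toward the path. It deliberately omits the bump functions, obstacles, and switching logic described in the abstract above. The function names, gain, radius, and step size are all assumptions chosen for illustration.

```python
import math


def guiding_vector_field(x, y, radius=1.0, gain=1.0):
    """Vector field whose integral curves converge to a circle.

    The path is the zero level set of phi(x, y) = x^2 + y^2 - radius^2.
    """
    phi = x * x + y * y - radius * radius
    gx, gy = 2.0 * x, 2.0 * y            # gradient of phi
    vx = -gy - gain * phi * gx           # rotated gradient + convergence term
    vy = gx - gain * phi * gy
    return vx, vy


def follow(x, y, steps=5000, dt=0.01):
    """Integrate the normalized field with forward Euler steps."""
    for _ in range(steps):
        vx, vy = guiding_vector_field(x, y)
        norm = math.hypot(vx, vy)
        if norm < 1e-12:                 # singular point: the field vanishes
            break
        x += dt * vx / norm
        y += dt * vy / norm
    return x, y


end = follow(2.0, 0.5)
print(math.hypot(*end))                  # distance from the origin, close to 1
```

The field vanishes at the origin, which is the kind of singular point that motivates the "almost global" convergence statements in the related entries in this listing.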

arXiv:2205.10734 [pdf, other]

Limit Cycles Analysis and Control of Evolutionary Game Dynamics with Environmental Feedback

Authors: Lulu Gong, Weijia Yao, Jian Gao, Ming Cao

Abstract: Recently, an evolutionary game dynamics model taking into account the environmental feedback has been proposed to describe the co-evolution of strategic actions of a population of individuals and the state of the surrounding environment; correspondingly a range of interesting dynamic behaviors have been reported. In this paper, we provide new theoretical insight into such behaviors and discuss control options. Instead of the standard replicator dynamics, we use a more realistic and comprehensive model of replicator-mutator dynamics, to describe the strategic evolution of the population. After integrating the environment feedback, we study the effect of mutations on the resulting closed-loop system dynamics. We prove the conditions for two types of bifurcations, Hopf bifurcation and Heteroclinic bifurcation, both of which result in stable limit cycles. These limit cycles have not been identified in existing works, and we further prove that such limit cycles are in fact persistent in a large parameter space and are almost globally stable. In the end, an intuitive control policy based on incentives is applied, and the effectiveness of this control policy is examined by analysis and simulations.

Submitted 22 May, 2022; originally announced May 2022.

Comments: 19 pages, 6 figures

arXiv:2204.06746 [pdf, other]

Information fusion approach for biomass estimation in a plateau mountainous forest using a synergistic system comprising UAS-based digital camera and LiDAR

Authors: Rong Huang, Wei Yao, Zhong Xu, Lin Cao, Xin Shen

Abstract: Forest land plays a vital role in global climate, ecosystems, farming and human living environments. Therefore, forest biomass estimation methods are necessary to monitor changes in the forest structure and function, which are key data in natural resources research. Although accurate forest biomass measurements are important in forest inventory and assessments, high-density measurements that involve airborne light detection and ranging (LiDAR) at a low flight height in large mountainous areas are highly expensive. The objective of this study was to quantify the aboveground biomass (AGB) of a plateau mountainous forest reserve using a system that synergistically combines an unmanned aircraft system (UAS)-based digital aerial camera and LiDAR to leverage their complementary advantages. In this study, we utilized digital aerial photogrammetry (DAP), which has the unique advantages of speed, high spatial resolution, and low cost, to compensate for the deficiency of forestry inventory using UAS-based LiDAR that requires terrain-following flight for high-resolution data acquisition. Combined with the sparse LiDAR points acquired by using a high-altitude and high-speed UAS for terrain extraction, dense normalized DAP point clouds can be obtained to produce an accurate and high-resolution canopy height model (CHM). Based on the CHM and spectral attributes obtained from multispectral images, we estimated and mapped the AGB of the region of interest with considerable cost efficiency.
Our study supports the development of predictive models for large-scale wall-to-wall AGB mapping by leveraging the complementarity between DAP and LiDAR measurements. This work also reveals the potential of utilizing a UAS-based digital camera and LiDAR synergistically in a plateau mountainous forest area.

Submitted 14 April, 2022; originally announced April 2022.

arXiv:2204.01327 [pdf]

Algorithms for Bayesian network modeling and reliability inference of complex multistate systems: Part II-Dependent systems

Authors: Xiaohu Zheng, Wen Yao, Xiaoqian Chen

Abstract: In using the Bayesian network (BN) to construct the complex multistate system's reliability model as described in Part I, the memory storage requirements of the node probability table (NPT) will exceed the random access memory (RAM) of the computer. However, the proposed inference algorithm of Part I is not suitable for the dependent system. This Part II proposes a novel method for BN reliability modeling and analysis to apply the compression idea to the complex multistate dependent system. In this Part II, the dependent nodes and their parent nodes are equivalent to a block, based on which the multistate joint probability inference algorithm is proposed to calculate the joint probability distribution of a block's all nodes. Then, based on the proposed multistate compression algorithm of Part I, the dependent multistate inference algorithm is proposed for the complex multistate dependent system. The use and accuracy of the proposed algorithms are demonstrated in case 1. Finally, the proposed algorithms are applied to the reliability modeling and analysis of the satellite attitude control system. The results show that both Part I and Part II's proposed algorithms make the reliability modeling and analysis of the complex multistate system feasible.

Submitted 4 April, 2022; originally announced April 2022.

arXiv:2203.15655 [pdf]

Consistency regularization-based Deep Polynomial Chaos Neural Network Method for Reliability Analysis

Authors: Xiaohu Zheng, Wen Yao, Yunyang Zhang, Xiaoya Zhang

Abstract: Polynomial chaos expansion (PCE) is a powerful surrogate model-based reliability analysis method. Generally, a PCE model with a higher expansion order is usually required to obtain an accurate surrogate model for some complex non-linear stochastic systems. However, the high-order PCE increases the number of labeled data required for solving the expansion coefficients. To alleviate this problem, this paper proposes a consistency regularization-based deep polynomial chaos neural network (Deep PCNN) method, including the low-order adaptive PCE model (the auxiliary model) and the high-order polynomial chaos neural network (the main model). The expansion coefficients of the main model are parameterized into the learnable weights of the polynomial chaos neural network, realizing iterative learning of expansion coefficients to obtain more accurate high-order PCE models. The auxiliary model uses a proposed consistency regularization loss function to assist in training the main model. The consistency regularization-based Deep PCNN method can significantly reduce the number of labeled data in constructing a high-order PCE model without losing accuracy by using few labeled data and abundant unlabeled data. A numerical example validates the effectiveness of the consistency regularization-based Deep PCNN method, and then this method is applied to analyze the reliability of two aerospace engineering systems.

Submitted 4 April, 2022; v1 submitted 29 March, 2022; originally announced March 2022.

arXiv:2202.03343 [pdf, other]

Topological Analysis of Vector-Field Guided Path Following on Manifolds

Authors: Weijia Yao, Bohuan Lin, Brian D. O. Anderson, Ming Cao

Abstract: A path-following control algorithm enables a system's trajectories under its guidance to converge to and evolve along a given geometric desired path. There exist various such algorithms, but many of them can only guarantee local convergence to the desired path in its neighborhood. In contrast, the control algorithms using a well-designed guiding vector field can ensure almost global convergence of trajectories to the desired path; here, "almost" means that in some cases, a measure-zero set of trajectories converge to the singular set where the vector field becomes zero (with all other trajectories converging to the desired path). In this paper, we first generalize the guiding vector field from the Euclidean space to a general smooth Riemannian manifold. This generalization can deal with path-following in some abstract configuration space (such as robot arm joint space). Then we show several theoretical results from a topological viewpoint. Specifically, we are motivated by the observation that singular points of the guiding vector field exist in many examples where the desired path is homeomorphic to the unit circle, but it is unknown whether the existence of singular points always holds in general (i.e., is inherent in the topology of the desired path). In the $n$-dimensional Euclidean space, we provide an affirmative answer, and conclude that it is not possible to guarantee global convergence to desired paths that are homeomorphic to the unit circle.
Furthermore, we show that there always exist \emph{non-path-converging trajectories} (i.e., trajectories that do not converge to the desired path) starting from the boundary of a ball containing the desired path in an $n$-dimensional Euclidean space where $n \ge 3$. Examples are provided to illustrate the theoretical results.

Submitted 19 February, 2022; v1 submitted 7 February, 2022; originally announced February 2022.

Comments: 16 pages, 8 figures, TAC

arXiv:2201.04318 [pdf, other]

Knee Cartilage Defect Assessment by Graph Representation and Surface Convolution

Authors: Zixu Zhuang, Liping Si, Sheng Wang, Kai Xuan, Xi Ouyang, Yiqiang Zhan, Zhong Xue, Lichi Zhang, Dinggang Shen, Weiwu Yao, Qian Wang

Abstract: Knee osteoarthritis (OA) is the most common osteoarthritis and a leading cause of disability. Cartilage defects are regarded as major manifestations of knee OA, which are visible by magnetic resonance imaging (MRI). Thus early detection and assessment of knee cartilage defects are important for protecting patients from knee OA. Accordingly, many attempts have been made at knee cartilage defect assessment by applying convolutional neural networks (CNNs) to knee MRI. However, the physiologic characteristics of the cartilage may hinder such efforts: the cartilage is a thin curved layer, implying that only a small portion of voxels in knee MRI can contribute to the cartilage defect assessment; heterogeneous scanning protocols further challenge the feasibility of the CNNs in clinical practice; the CNN-based knee cartilage evaluation results lack interpretability. To address these challenges, we model the cartilage structure and appearance from knee MRI into a graph representation, which is capable of handling highly diverse clinical data. Then, guided by the cartilage graph representation, we design a non-Euclidean deep learning network with the self-attention mechanism to extract local and global cartilage features and to derive the final assessment with a visualized result. Our comprehensive experiments show that the proposed method yields superior performance in knee cartilage defect assessment, along with convenient 3D visualization for interpretability.

Submitted 12 January, 2022; originally announced January 2022.

Comments: 10 pages, 4 figures

arXiv:2108.08298 [pdf, other]

doi 10.1007/s11432-021-3645-4

A Machine Learning Surrogate Modeling Benchmark for Temperature Field Reconstruction of Heat-Source Systems

Authors: Xiaoqian Chen, Zhiqiang Gong, Xiaoyu Zhao, Weien Zhou, Wen Yao

Abstract: Temperature field reconstruction of heat source systems (TFR-HSS) with limited monitoring sensors, which arises in thermal management, plays an important role in real-time health detection systems for electronic equipment in engineering. However, prior methods based on common interpolations usually cannot provide the required reconstruction accuracy. In addition, there exists no public dataset for broad research on reconstruction methods to further boost reconstruction performance and engineering applications. To overcome this problem, this work develops a machine learning modelling benchmark for the TFR-HSS task. First, the TFR-HSS task is mathematically modelled from a real-world engineering problem, and four types of numerical modelling are constructed to transform the problem into discrete mapping forms. Then, this work proposes a set of machine learning modelling methods, including general machine learning methods and deep learning methods, to advance the state of the art in temperature field reconstruction. More importantly, this work develops a novel benchmark dataset, namely the Temperature Field Reconstruction Dataset (TFRD), to evaluate these machine learning modelling methods for the TFR-HSS task. Finally, a performance analysis of typical methods is given on TFRD, which can serve as the baseline results on this benchmark.

Submitted 3 January, 2023; v1 submitted 17 August, 2021; originally announced August 2021.

Journal ref: Science China Information Sciences, 2023

arXiv:2107.12545 [pdf, other]

Double Deep Q-learning Based Real-Time Optimization Strategy for Microgrids

Authors: Hang Shuai, Xiaomeng Ai, Jiakun Fang, Wei Yao, Jinyu Wen

Abstract: The uncertainties from distributed energy resources (DERs) bring significant challenges to the real-time operation of microgrids. In addition, due to the nonlinear constraints in the AC power flow equation and the nonlinearity of the battery storage model, etc., the optimization of the microgrid is a mixed-integer nonlinear programming (MINLP) problem. It is challenging to solve this kind of stochastic nonlinear optimization problem. To address the challenge, this paper proposes a deep reinforcement learning (DRL) based optimization strategy for the real-time operation of the microgrid. Specifically, we construct the detailed operation model for the microgrid and formulate the real-time optimization problem as a Markov Decision Process (MDP). Then, a double deep Q network (DDQN) based architecture is designed to solve the MINLP problem. The proposed approach can learn a near-optimal strategy only from the historical data. The effectiveness of the proposed algorithm is validated by the simulations on a 10-bus microgrid system and a modified IEEE 69-bus microgrid system. The numerical simulation results demonstrate that the proposed approach outperforms several existing methods.

Submitted 26 July, 2021; originally announced July 2021.

Comments: 13 pages, 14 figures. Submitted to IEEE Transactions on Systems, Man, and Cybernetics in Aug. 2019
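
The double deep Q network (DDQN) named in the abstract above differs from a plain deep Q network mainly in how the bootstrap target is formed: the online network selects the next action and the target network evaluates it. The sketch below shows only that target computation on plain Python lists. The function name, arguments, and numbers are hypothetical, and the microgrid model and network architecture of the paper are not reproduced.

```python
def double_dqn_target(reward, next_q_online, next_q_target, gamma=0.99, done=False):
    """Bootstrap target for one transition under double Q-learning.

    next_q_online: Q-values of the next state from the online network.
    next_q_target: Q-values of the next state from the target network.
    """
    if done:
        return reward                    # no bootstrapping at episode end
    # Action selection uses the online network ...
    best_action = max(range(len(next_q_online)), key=next_q_online.__getitem__)
    # ... while its value is read from the target network.
    return reward + gamma * next_q_target[best_action]


# Hypothetical Q-values for a three-action problem.
target = double_dqn_target(1.0, [0.2, 0.9, 0.5], [1.0, 0.4, 2.0], gamma=0.5)
print(target)
```

With these numbers a plain DQN target would bootstrap from the largest target-network value (2.0), whereas the double variant uses the value of the action preferred by the online network (0.4), which is how it reduces overestimation.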

arXiv:2105.02406 [pdf, other]

In the Danger Zone: U-Net Driven Quantile Regression can Predict High-risk SARS-CoV-2 Regions via Pollutant Particulate Matter and Satellite Imagery

Authors: Jacquelyn Shelton, Przemyslaw Polewski, Wei Yao

Abstract: Since the outbreak of COVID-19 policy makers have been relying upon non-pharmacological interventions to control the outbreak. With air pollution as a potential transmission vector there is need to include it in intervention strategies. We propose a U-net driven quantile regression model to predict $PM_{2.5}$ air pollution based on easily obtainable satellite imagery. We demonstrate that our approach can reconstruct $PM_{2.5}$ concentrations on ground-truth data and predict reasonable $PM_{2.5}$ values with their spatial distribution, even for locations where pollution data is unavailable. Such predictions of $PM_{2.5}$ characteristics could crucially advise public policy strategies geared to reduce the transmission of and lethality of COVID-19.

Submitted 5 May, 2021; originally announced May 2021.

Comments: accepted for ICML 2020 Workshop on Healthcare Systems, Population Health, and the Role of Health-Tech
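
Quantile regression, as mentioned in the abstract above, is usually trained with the pinball (quantile) loss, which penalizes under- and over-prediction asymmetrically. The sketch below is a plain-Python version of that standard loss. It is an assumption that the paper uses this exact form, and the sample values are hypothetical.

```python
def pinball_loss(y_true, y_pred, tau):
    """Mean pinball loss for quantile level tau in (0, 1)."""
    total = 0.0
    for y, q in zip(y_true, y_pred):
        diff = y - q
        # Under-prediction costs tau per unit, over-prediction (1 - tau).
        total += tau * diff if diff >= 0 else (tau - 1.0) * diff
    return total / len(y_true)


# Hypothetical PM2.5 readings: predicting too low is penalized more at tau = 0.9.
under = pinball_loss([10.0], [8.0], tau=0.9)
over = pinball_loss([10.0], [12.0], tau=0.9)
print(under, over)
```

Minimizing this loss at a high quantile level such as 0.9 pushes the model toward upper-range estimates, which fits the stated goal of flagging high-risk regions.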

arXiv:2104.03412 [pdf, ps, other]

Leaderless collective motions in affine formation control

Authors: Hector Garcia de Marina, Juan Jimenez Castellanos, Weijia Yao

Abstract: This paper proposes a novel distributed technique to induce collective motions in affine formation control. Instead of the traditional leader-follower strategy, we propose modifying the original weights that build the Laplacian matrix so that a designed steady-state motion of the desired shape emerges from the agents' local interactions. The proposed technique allows a rich collection of collective motions such as rotations around the centroid, translations, scalings, and shearings of a reference shape. These motions can be applied in useful collective behaviors such as \emph{shaped} consensus (the rendezvous with a particular shape), escorting one of the team agents, or area coverage. We prove the global stability and effectiveness of our proposed technique rigorously, and we provide some illustrative numerical simulations.

Submitted 7 April, 2021; originally announced April 2021.

Comments: 6 pages, submitted to CDC 2021

arXiv:2103.12372 [pdf, other]

Distributed coordinated path following using guiding vector fields

Authors: Weijia Yao, Hector Garcia de Marina, Zhiyong Sun, Ming Cao

Abstract: It is essential in many applications to impose a scalable coordinated motion control on a large group of mobile robots, which is efficient in tasks requiring repetitive execution, such as environmental monitoring. In this paper, we design a guiding vector field to guide multiple robots to follow possibly different desired paths while coordinating their motions. The vector field uses a path parameter as a virtual coordinate that is communicated among neighboring robots. Then, the virtual coordinate is utilized to control the relative parametric displacement between robots along the paths. This enables us to design a saturated control algorithm for a Dubins-car-like model. The algorithm is distributed, scalable, and applicable for any smooth paths in an $n$-dimensional configuration space, and global convergence is guaranteed. Simulations with up to fifty robots and outdoor experiments with fixed-wing aircraft validate the theoretical results.

Submitted 9 April, 2021; v1 submitted 23 March, 2021; originally announced March 2021.

Comments: Accepted to 2021 IEEE International Conference on Robotics and Automation (ICRA)

arXiv:2012.15391 [pdf, other]

Generalized Operating Procedure for Deep Learning: an Unconstrained Optimal Design Perspective

Authors: Shen Chen, Mingwei Zhang, Jiamin Cui, Wei Yao

Abstract: Deep learning (DL) has brought about remarkable breakthroughs in processing images, video, and speech due to its efficacy in extracting highly abstract representations and learning very complex functions. However, an operating procedure for applying it to real use cases is seldom reported. In this paper, we intend to address this problem by presenting a generalized operating procedure for DL from the perspective of unconstrained optimal design, which is motivated by a simple intention to remove the barrier to using DL, especially for scientists or engineers who are new to it but eager to use it. Our proposed procedure contains seven steps: project/problem statement, data collection, architecture design, initialization of parameters, defining the loss function, computing optimal parameters, and inference. Following this procedure, we build a multi-stream end-to-end speaker verification system, in which the input speech utterance is processed by multiple parallel streams within different frequency ranges, so that the acoustic modeling can be more robust owing to the diversity of features. Trained with the VoxCeleb dataset, our experimental results verify the effectiveness of our proposed operating procedure, and also show that our multi-stream framework outperforms the single-stream baseline with a 20% relative reduction in minimum decision cost function (minDCF).

Submitted 30 December, 2020; originally announced December 2020.

Comments: 5 pages, 4 figures, 1 table

arXiv:2012.11159 [pdf, other]

Multi-stream Convolutional Neural Network with Frequency Selection for Robust Speaker Verification

Authors: Wei Yao, Shen Chen, Jiamin Cui, Yaolin Lou

Abstract: Speaker verification aims to verify whether an input speech corresponds to the claimed speaker, and conventionally, this kind of system is deployed based on single-stream scenario, wherein the feature extractor operates in full frequency range. In this paper, we hypothesize that machine can learn enough knowledge to do classification task when listening to partial frequency range instead of full frequency range, which is so called frequency selection technique, and further propose a novel framework of multi-stream Convolutional Neural Network (CNN) with this technique for speaker verification tasks. The proposed framework accommodates diverse temporal embeddings generated from multiple streams to enhance the robustness of acoustic modeling. For the diversity of temporal embeddings, we consider feature augmentation with frequency selection, which is to manually segment the full-band of frequency into several sub-bands, and the feature extractor of each stream can select which sub-bands to use as target frequency domain. Different from conventional single-stream solution wherein each utterance would only be processed for one time, in this framework, there are multiple streams processing it in parallel. The input utterance for each stream is pre-processed by a frequency selector within specified frequency range, and post-processed by mean normalization. The normalized temporal embeddings of each stream will flow into a pooling layer to generate fused embeddings.
We conduct extensive experiments on VoxCeleb dataset, and the experimental results demonstrate that multi-stream CNN significantly outperforms single-stream baseline with 20.53 % of relative improvement in minimum Decision Cost Function (minDCF).

Submitted 12 January, 2021; v1 submitted 21 December, 2020; originally announced December 2020.

Comments: 12 pages, 11 figures, 8 tables

arXiv:2012.01826 [pdf, other]

Singularity-free Guiding Vector Field for Robot Navigation

Authors: Weijia Yao, Hector Garcia de Marina, Bohuan Lin, Ming Cao

Abstract: Most of the existing path-following navigation algorithms cannot guarantee global convergence to desired paths or enable following self-intersected desired paths due to the existence of singular points where navigation algorithms return unreliable or even no solutions. One typical example arises in vector-field guided path-following (VF-PF) navigation algorithms. These algorithms are based on a vector field, and the singular points are exactly where the vector field vanishes. In this paper, we show that it is mathematically impossible for conventional VF-PF algorithms to achieve global convergence to desired paths that are self-intersected or even just simple closed (precisely, homeomorphic to the unit circle). Motivated by this new impossibility result, we propose a novel method to transform self-intersected or simple closed desired paths to non-self-intersected and unbounded (precisely, homeomorphic to the real line) counterparts in a higher-dimensional space. Corresponding to this new desired path, we construct a singularity-free guiding vector field on a higher-dimensional space. The integral curves of this new guiding vector field are thus exploited to enable global convergence to the higher-dimensional desired path, and therefore the projection of the integral curves on a lower-dimensional subspace converges to the physical (lower-dimensional) desired path. Rigorous theoretical analysis is carried out for the theoretical results using dynamical systems theory.
In addition, we show both by theoretical analysis and numerical simulations that our proposed method is an extension combining conventional VF-PF algorithms and trajectory tracking algorithms. Finally, to show the practical value of our proposed approach for complex engineering systems, we conduct outdoor experiments with a fixed-wing airplane in a windy environment to follow both 2D and 3D desired paths.

Submitted 23 October, 2021; v1 submitted 3 December, 2020; originally announced December 2020.

Comments: Accepted for publication in IEEE Transactions on Robotics (T-RO)

arXiv:2011.12365 [pdf, other]

Online Detection of Low-Quality Synchrophasor Data Considering Frequency Similarity

Authors: Wenyun Ju, Horacio Silva-Saravia, Neeraj Nayak, Wenxuan Yao, Yichen Zhang, Qingxin Shi, Fan Ye

Abstract: This letter proposes a new approach for online detection of low-quality synchrophasor data under both normal and event conditions. The proposed approach utilizes the features of synchrophasor data in time and frequency domains to distinguish multiple regional PMU signals and detect low-quality synchrophasor data. The proposed approach does not require any offline study and it is more effective to detect low-quality data with apparently indistinguishable profiles. Case studies from recorded synchrophasor measurements verify the effectiveness of the proposed approach.

Submitted 24 November, 2020; originally announced November 2020.

Comments: 3 pages, 6 figures

arXiv:2008.10415 [pdf, ps, other]

doi 10.1016/j.cnsns.2020.105688

Time irreversibility and amplitude irreversibility measures for nonequilibrium processes

Authors: Wenpo Yao, Jun Wang, Matjaz Perc, Wenli Yao, Jiafei Dai, Daqing Guo, Dezhong Yao

Abstract: Time irreversibility, which characterizes nonequilibrium processes, can be measured based on the probabilistic differences between symmetric vectors. To simplify the quantification of time irreversibility, symmetric permutations instead of symmetric vectors have been employed in some studies. However, although effective in practical applications, this approach is conceptually incorrect. Time irreversibility should be measured based on the permutations of symmetric vectors rather than symmetric permutations, whereas symmetric permutations can instead be employed to determine the quantitative amplitude irreversibility -- a novel parameter proposed in this paper for nonequilibrium calculated by means of the probabilistic difference in amplitude fluctuations. Through theoretical and experimental analyses, we highlight the strong similarities and close associations between the time irreversibility and amplitude irreversibility measures. Our paper clarifies the connections of and the differences between the two types of permutation-based parameters for quantitative nonequilibrium, and by doing so, we bridge the concepts of amplitude irreversibility and time irreversibility and broaden the selection of quantitative tools for studying nonequilibrium processes in complex systems.

Submitted 30 December, 2020; v1 submitted 18 August, 2020; originally announced August 2020.

Comments: 16 pages, 6 figures

Journal ref: Commun. Nonlinear Sci. Numer. Simulat. 96, 105688 (2021)

arXiv:2007.00791 [pdf, other]

Learning a Distributed Control Scheme for Demand Flexibility in Thermostatically Controlled Loads

Authors: Bingqing Chen, Weiran Yao, Jonathan Francis, Mario Bergés

Abstract: Demand flexibility is increasingly important for power grids, in light of the growing penetration of renewable generation. Careful coordination of thermostatically controlled loads (TCLs) can potentially modulate energy demand, decrease operating costs, and increase grid resiliency. However, it is challenging to control a heterogeneous population of TCLs: the control problem has a large state-action space; each TCL has unique and complex dynamics; and multiple system-level objectives need to be optimized simultaneously. To address these challenges, we propose a distributed control solution, which consists of a central load aggregator that optimizes system-level objectives and building-level controllers that track the load profiles planned by the aggregator. To optimize our agents' policies, we draw inspiration from both reinforcement learning (RL) and model predictive control. Specifically, the aggregator is updated with an evolutionary strategy, which was recently demonstrated to be a competitive and scalable alternative to more sophisticated RL algorithms and enables policy updates independent of the building-level controllers. We evaluate our proposed approach across four climate zones in four nine-building clusters, using the newly-introduced CityLearn simulation environment. Our approach achieved an average reduction of 16.8% in the environment cost compared to the benchmark rule-based controller.

Submitted 5 October, 2020; v1 submitted 1 July, 2020; originally announced July 2020.

Comments: Accepted by IEEE SmartGridComm 2020; 7 pages

Journal ref: 2020 IEEE International Conference on Communications, Control, and Computing Technologies for Smart Grids (SmartGridComm), November 2020, Virtual

arXiv:2005.13522 [pdf, other]

doi 10.1177/0361198120917668

Learning to Recommend Signal Plans under Incidents with Real-Time Traffic Prediction

Authors: Weiran Yao, Sean Qian

Abstract: The main question addressed in this paper is how to recommend optimal signal timing plans in real time under incidents by incorporating domain knowledge developed with the traffic signal timing plans tuned for possible incidents, and by learning from historical data of both traffic and implemented signal timing. The effectiveness of traffic incident management is often limited by the late response time and excessive workload of traffic operators. This paper proposes a novel decision-making framework that learns from both data and domain knowledge to recommend, in real time, contingency signal plans that accommodate non-recurrent traffic, using the outputs of real-time traffic prediction at least 30 minutes in advance. Specifically, considering how rarely contingency signal plans are engaged for incidents, we propose to decompose the end-to-end recommendation task into two hierarchical models: real-time traffic prediction and plan association. We learn the connections between the two models through metric learning, which reinforces partial-order preferences observed from historical signal engagement records. We demonstrate the effectiveness of our approach by testing this framework on the traffic network in Cranberry Township in 2019. Results show that our recommendation system has a precision score of 96.75% and recall of 87.5% on the testing plan, and makes recommendations with an average lead time of 22.5 minutes ahead of Waze alerts. The results suggest that our framework is capable of giving traffic operators a significant time window to assess the conditions and respond appropriately.

Submitted 20 May, 2020; originally announced May 2020.

Comments: To be published in Transportation Research Record (2020)

arXiv:2005.09212 [pdf, other]

A Self-ensembling Framework for Semi-supervised Knee Cartilage Defects Assessment with Dual-Consistency

Authors: Jiayu Huo, Liping Si, Xi Ouyang, Kai Xuan, Weiwu Yao, Zhong Xue, Qian Wang, Dinggang Shen, Lichi Zhang

Abstract: Knee osteoarthritis (OA) is one of the most common musculoskeletal disorders and requires early-stage diagnosis. Deep convolutional neural networks have achieved great success in the computer-aided diagnosis field. However, the construction of deep learning models usually requires large amounts of annotated data, which are generally costly to obtain. In this paper, we propose a novel approach for knee cartilage defects assessment, including severity classification and lesion localization. This can be treated as a subtask of knee OA diagnosis. In particular, we design a self-ensembling framework, which is composed of a student network and a teacher network with the same structure. The student network learns from both labeled data and unlabeled data, and the teacher network averages the student model weights over the course of training. A novel attention loss function is developed to obtain accurate attention masks. With dual-consistency checking of the attention in the lesion classification and localization, the two networks can gradually optimize the attention distribution and improve the performance of each other, while the training relies on partially labeled data only and proceeds in a semi-supervised manner. Experiments show that the proposed method can significantly improve the self-ensembling performance in both knee cartilage defects classification and localization, and also greatly reduce the need for annotated data.

Submitted 12 October, 2020; v1 submitted 19 May, 2020; originally announced May 2020.

Comments: accepted by International Workshop on PRedictive Intelligence In MEdicine, 2020

arXiv:2003.12627 [pdf]

doi 10.1016/j.patcog.2021.108103

Reducing Magnetic Resonance Image Spacing by Learning Without Ground-Truth

Authors: Kai Xuan, Liping Si, Lichi Zhang, Zhong Xue, Yining Jiao, Weiwu Yao, Dinggang Shen, Dijia Wu, Qian Wang

Abstract: High-quality magnetic resonance (MR) images, i.e., those with near-isotropic voxel spacing, are desirable in various scenarios of medical image analysis. However, many MR acquisitions use large inter-slice spacing in clinical practice. In this work, we propose a novel deep-learning-based super-resolution algorithm to generate high-resolution (HR) MR images with small slice spacing from low-resolution (LR) inputs of large slice spacing. Note that most existing deep-learning-based methods need paired LR and HR images to supervise the training, but in clinical scenarios, usually no HR images will be acquired. Therefore, our unique goal herein is to design and train the super-resolution network with no real HR ground-truth. Specifically, two training stages are used in our method. First, HR images of reduced slice spacing are synthesized from real LR images using a variational auto-encoder (VAE). Although these synthesized HR images are as realistic as possible, they may still suffer from unexpected morphing induced by the VAE, implying that the synthesized HR images cannot be paired with the real LR images in terms of anatomical structure details. In the second stage, we degrade the synthesized HR images to generate corresponding LR images and train a super-resolution network based on these synthesized HR and degraded LR pairs. The underlying mechanism is that such a super-resolution network is less vulnerable to anatomical variability. Experiments on knee MR images successfully demonstrate the effectiveness of our proposed solution to reduce the slice spacing for better rendering.

Submitted 17 August, 2021; v1 submitted 27 March, 2020; originally announced March 2020.

arXiv:2003.10012 [pdf, other]

Vector Field Guided Path Following Control: Singularity Elimination and Global Convergence

Authors: Weijia Yao, Hector Garcia de Marina, Ming Cao

Abstract: Vector field guided path following (VF-PF) algorithms are fundamental in robot navigation tasks, but may not deliver the desired performance when robots encounter singular points where the vector field becomes zero. The existence of singular points prevents the global convergence of the vector field's integral curves to the desired path. Moreover, VF-PF algorithms, as well as most of the existing path following algorithms, fail to enable following a self-intersected desired path. In this paper, we show that such failures are fundamentally related to the mathematical topology of the path, and that by "stretching" the desired path along a virtual dimension, one can remove the topological obstruction. Consequently, this paper proposes a new guiding vector field defined in a higher-dimensional space, in which self-intersected desired paths become free of self-intersections; more importantly, the new guiding vector field does not have any singular points, enabling the integral curves to converge globally to the "stretched" path. We further introduce the extended dynamics to retain this appealing global convergence property for the desired path in the original lower-dimensional space. Both simulations and experiments are conducted to verify the theory.

Submitted 1 September, 2020; v1 submitted 22 March, 2020; originally announced March 2020.

Comments: Accepted by 2020 IEEE 59th Conference on Decision and Control (CDC). This is the full version

arXiv:1911.02304 [pdf, other]

Path Following Control in 3D Using a Vector Field

Authors: Weijia Yao, Ming Cao

Abstract: Using a designed vector field to control a mobile robot to follow a given desired path is intuitive and practical, and building a rigorous theory to guide its implementation is essential. In this paper, we study the properties of a general 3D vector field for robotic path following. We propose and investigate assumptions that turn out to be crucial for this method, but have rarely been explicitly stated in related works. We derive conditions under which the local path-following error vanishes exponentially in a sufficiently small neighborhood of the desired path, which is key to showing the local input-to-state stability (local ISS) property of the path-following error dynamics. The local ISS property then justifies the control algorithm design for a fixed-wing aircraft model. Our approach is effective for any sufficiently smooth desired path in 3D, bounded or unbounded; note that the case for unbounded desired paths has not been sufficiently discussed in the literature. Simulations are conducted to verify the theoretical results.

Submitted 8 March, 2020; v1 submitted 6 November, 2019; originally announced November 2019.

arXiv:1904.00562 [pdf, other]

Deep Clustering With Intra-class Distance Constraint for Hyperspectral Images

Authors: Jinguang Sun, Wanli Wang, Xian Wei, Li Fang, Xiaoliang Tang, Yusheng Xu, Hui Yu, Wei Yao

Abstract: The high dimensionality of hyperspectral images often results in the degradation of clustering performance. Due to the powerful ability of deep feature extraction and non-linear feature representation, clustering based on deep learning has become a hot research topic in the field of hyperspectral remote sensing. However, most deep clustering algorithms for hyperspectral images utilize deep neural networks as feature extractors without considering prior knowledge constraints that are suitable for clustering. To solve this problem, we propose an intra-class distance constrained deep clustering algorithm for high-dimensional hyperspectral images. The proposed algorithm constrains the feature mapping procedure of the auto-encoder network by intra-class distance so that raw images are transformed from the original high-dimensional space to a low-dimensional feature space that is more conducive to clustering. Furthermore, the related learning process is treated as a joint optimization problem of deep feature extraction and clustering. Experimental results demonstrate the strong competitiveness of the proposed algorithm in comparison with state-of-the-art clustering methods for hyperspectral images.

Submitted 1 April, 2019; originally announced April 2019.

arXiv:1903.10336 [pdf]

Line Outage Detection and Localization via Synchrophasor Measurement

Authors: Xianda Deng, Desong Bian, Di Shi, Wenxuan Yao, Zhihao Jiang, Yilu Liu

Abstract: Since transmission lines are crucial links in the power system, one line outage event may bring about interruption or even cascading failure of the power system. If quick and accurate line outage detection and localization can be achieved, the system operator can take necessary actions in time to mitigate the negative impact. Therefore, the objective of this paper is to study a method for line outage detection and localization via synchrophasor measurements. The density of deployed phasor measurement units (PMUs) has been increasing recently, which greatly improves the visibility of the power grid. Taking advantage of the high-resolution synchrophasor data, the proposed method utilizes frequency measurement for line outage detection and power change for localization. The procedure of the proposed method is given. Compared with conventional methods, it does not require prior knowledge of the system. A simulation study validates the effectiveness of the proposed method.

Submitted 2 April, 2019; v1 submitted 21 March, 2019; originally announced March 2019.

arXiv:1409.2332 [pdf, other]

Partially Independent Control Scheme for Spacecraft Rendezvous in Near-Circular Orbits

Authors: Neng Wan, Weiran Yao

Abstract: Due to the complexity and inconstancy of the space environment, accurate mathematical models for spacecraft rendezvous are difficult to obtain, which consequently complicates the control tasks. In this paper, a linearized time-variant plant model with external perturbations is adopted to approximate the real conditions. To achieve robust stability with optimal performance cost, a partially independent control scheme is proposed, which consists of a robust anti-windup controller for the in-plane motion and an $H_{\infty}$ controller for the out-of-plane motion. Finally, a rendezvous simulation is given to corroborate the practicality and advantages of the partially independent control scheme over a coupled control scheme.

Submitted 3 January, 2016; v1 submitted 8 September, 2014; originally announced September 2014.

Comments: 18 pages, 7 figures, Submitted to ASCE Journal of Aerospace Engineering

Showing 1–42 of 42 results for author: Yao, W