Search | arXiv e-print repository

arXiv:2407.19130 [pdf]

Panoramic single-pixel imaging with megapixel resolution based on rotational subdivision

Authors: Huan Cui, Jie Cao, Haoyu Zhang, Chang Zhou, Haifeng Yao, Yingbo Wang, Qun Hao

Abstract: Single-pixel imaging (SPI) using a single-pixel detector is an unconventional imaging method with great application prospects for high-performance imaging in many fields. In particular, the recently proposed catadioptric panoramic ghost imaging (CPGI) extends the application potential of SPI to high-performance imaging over a wide field of view (FOV), for which demand is growing. However, the resolution of CPGI is limited by the hardware parameters of the digital micromirror device (DMD), which may not meet ultrahigh-resolution panoramic imaging needs that require detailed information. Therefore, to overcome the resolution limitation of CPGI, we propose a panoramic SPI based on rotational subdivision (RSPSI). The key to the proposed RSPSI is to obtain the entire panoramic scene by rotation scanning with a rotating mirror tilted at 45°, so that a single pattern covering only one small sub-FOV can modulate the entire panoramic FOV without interruption during a single pass of pattern projection. Then, based on temporal resolution subdivision, an image sequence of the sub-FOVs subdivided from the entire panoramic FOV can be reconstructed, with adjacent images shifted horizontally at the pixel or even subpixel level. Experimental results using a proof-of-concept setup show that the panoramic image can be obtained at 10428*543 (5,662,404 pixels), which is more than 9.6 times higher than the resolution limit of CPGI using the same DMD. To the best of our knowledge, RSPSI is the first to achieve megapixel resolution via SPI, which can provide potential applications in fields requiring imaging with ultrahigh resolution and a wide FOV.

Submitted 26 July, 2024; originally announced July 2024.

arXiv:2405.02208 [pdf, other]

Reference-Free Image Quality Metric for Degradation and Reconstruction Artifacts

Authors: Han Cui, Alfredo De Goyeneche, Efrat Shimron, Boyuan Ma, Michael Lustig

Abstract: Image Quality Assessment (IQA) is essential in various Computer Vision tasks such as image deblurring and super-resolution. However, most IQA methods require reference images, which are not always available. While there are some reference-free IQA metrics, they have limitations in simulating human perception and discerning subtle image quality variations. We hypothesize that the JPEG quality factor is representative of image quality, and that a well-trained neural network can learn to accurately evaluate image quality without requiring a clean reference, as it can recognize image degradation artifacts based on prior knowledge. Thus, we developed a reference-free quality evaluation network, dubbed "Quality Factor (QF) Predictor". Our QF Predictor is a lightweight, fully convolutional network comprising seven layers. The model is trained in a self-supervised manner: it receives a JPEG-compressed image patch with a random QF as input and is trained to accurately predict the corresponding QF. We demonstrate the versatility of the model by applying it to various tasks. First, our QF Predictor can generalize to measure the severity of various image artifacts, such as Gaussian blur and Gaussian noise. Second, we show that the QF Predictor can be trained to predict the undersampling rate of images reconstructed from Magnetic Resonance Imaging (MRI) data.

Submitted 1 May, 2024; originally announced May 2024.

arXiv:2311.02646 [pdf]

Flexible uniform-sampling foveated Fourier single-pixel imaging

Authors: Huan Cui, Jie Cao, Qun Hao, Haoyu Zhang, Chang Zhou

Abstract: Fourier single-pixel imaging (FSI) is a data-efficient single-pixel imaging (SPI) method. However, obtaining higher imaging quality with fewer measurements remains a serious challenge, which limits the development of real-time SPI. In this work, a uniform-sampling foveated FSI (UFFSI) is proposed with three features (uniform sampling, effective sampling, and a flexible fovea) to achieve high-efficiency, high-quality under-sampled SPI, even in a large-scale scene. First, by flexibly using the three proposed foveated pattern structures, data redundancy is reduced significantly so that high resolution (HR) is required only in regions of interest (ROIs), which radically reduces the total amount of data needed. Next, through non-uniform weight distribution processing, non-uniform spatial sampling is transformed into uniform sampling, so that the fast Fourier transform can be applied accurately and directly to obtain high imaging quality from under-sampled data with further reduced measurements. Experimentally, at a sampling ratio of 0.0084 relative to HR FSI with 1024*768 pixels, UFFSI with 255*341 cells (an 89% reduction in data redundancy) gives the ROI significantly better imaging quality that meets imaging needs. We hope this work can provide a breakthrough for future real-time SPI.

Submitted 5 November, 2023; originally announced November 2023.

Comments: 7 pages,5 figures
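
The figures quoted in the UFFSI abstract above can be cross-checked with a few lines of arithmetic. This is only a sketch under two assumptions that are interpretations of the abstract, not statements from the paper: that the 89% figure is the fractional reduction in cell count relative to the 1024*768 HR pixel grid, and that the sampling ratio is the number of measurements divided by the HR pixel count. All variable names are illustrative.

```python
# Assumption: "89% reduction" compares the foveated cell count
# with the pixel count of the high-resolution (HR) grid.
hr_pixels = 1024 * 768          # HR FSI grid quoted in the abstract
foveated_cells = 255 * 341      # UFFSI cell count quoted in the abstract

reduction = 1 - foveated_cells / hr_pixels

# Assumption: sampling ratio = measurements / HR pixel count.
sampling_ratio = 0.0084
measurements = round(sampling_ratio * hr_pixels)

print(f"HR pixels:       {hr_pixels}")
print(f"Foveated cells:  {foveated_cells}")
print(f"Reduction:       {reduction:.1%}")
print(f"Measurements at ratio {sampling_ratio}: about {measurements}")
```

Under these assumptions the cell-count reduction rounds to the 89% quoted in the abstract, which suggests the interpretation is at least consistent with the reported numbers.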

arXiv:2305.11614 [pdf, other]

Two-Bit RIS-Aided Communications at 3.5GHz: Some Insights from the Measurement Results Under Multiple Practical Scenes

Authors: Shun Zhang, Haoran Sun, Runze Yu, Hongshenyuan Cui, Jian Ren, Feifei Gao, Shi Jin, Hongxiang Xie, Hao Wang

Abstract: In this paper, we propose a two-bit reconfigurable intelligent surface (RIS)-aided communication system, which mainly consists of a two-bit RIS, a transmitter and a receiver. A corresponding prototype verification system is designed to perform experimental tests in practical environments. The carrier frequency is set to 3.5 GHz, and the RIS array has 256 units, each of which adopts two-bit phase quantization. In particular, we adopt a self-developed broadband intelligent communication system 40MHz-Net (BICT-40N) terminal in order to fully acquire the channel information. The terminal mainly includes a baseband board and a radio frequency (RF) front-end board, where the latter can achieve 26 dB transmitting link gain and 33 dB receiving link gain. An orthogonal frequency division multiplexing (OFDM) signal is used for the terminal, with a bandwidth of 40 MHz and a subcarrier spacing of 625 kHz. The terminal also supports a series of modulation modes, including QPSK, QAM, etc. Through experimental tests, we validate several functions and properties of the RIS as follows. First, we validate a novel RIS power consumption model, which considers both the static and the dynamic power consumption. Second, we demonstrate the existence of imaging interference and find that a two-bit RIS can lower the imaging interference by about 10 dBm. Moreover, we verify that the RIS can outperform a metal plate in terms of beam focusing performance. In addition, we find that the RIS can improve the channel stationarity. Then, we realize multi-beam reflection of the RIS using the pattern addition (PA) algorithm. Lastly, we validate the existence of mutual coupling between different RIS units.

Submitted 19 May, 2023; originally announced May 2023.
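
As a rough illustration of what "two-bit phase quantization" means for a single RIS unit, the sketch below snaps a desired continuous phase to the nearest of four states. The uniformly spaced states (0°, 90°, 180°, 270°) and the function name `quantize_phase` are assumptions for illustration; the abstract does not specify the actual phase states of the prototype. The subcarrier count is likewise a back-of-the-envelope figure that assumes the full 40 MHz is divided evenly at 625 kHz spacing, ignoring any guard bands.

```python
import math

def quantize_phase(phase_rad, bits=2):
    """Snap a phase to the nearest of 2**bits uniformly spaced states.

    Returns (state_index, quantized_phase_in_radians).
    Assumption: states are evenly spaced over [0, 2*pi).
    """
    levels = 2 ** bits
    step = 2 * math.pi / levels
    wrapped = phase_rad % (2 * math.pi)       # map into [0, 2*pi)
    index = round(wrapped / step) % levels    # nearest state, wrapping 4 -> 0
    return index, index * step

# Example: desired phases for a few units (arbitrary illustrative values)
for desired_deg in (10, 100, 200, 350):
    idx, q = quantize_phase(math.radians(desired_deg))
    print(f"{desired_deg:3d} deg -> state {idx} ({math.degrees(q):.0f} deg)")

# Assumption: subcarriers = bandwidth / spacing, with no guard bands.
subcarriers = round(40e6 / 625e3)
print("Subcarriers:", subcarriers)
```

With two bits, the worst-case quantization error under this uniform-state assumption is half a step, that is 45°.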

arXiv:2303.15865 [pdf]

Chloride Ion Erosion of Pre-Stressed Concrete Bridges in Cold Regions

Authors: Hongtao Cui, Yi Zhuo, Dongyuan Ke, Zhonglong Li, Shunlong Li

Abstract: The erosion of chloride ions in concrete bridges accelerates the corrosion of reinforcement, which is an important cause of declining bridge durability. The chloride ion erosion process, especially from deicing salt solutions in cold regions, is complex and has many influencing factors. It is very important to use accurate and effective methods to analyze the chloride ion erosion process in concrete. In this study, a retired pre-stressed concrete bridge in a cold region was taken as the research object, and specimens from the whole bridge were obtained by core drilling. The chloride ion concentration was measured at different depths of the specimens. The chloride ion erosion process was simulated in two-dimensional space using COMSOL multi-physical field simulation and compared with the measured results. The simulation method proposed in this paper has good reliability and accuracy.

Submitted 28 March, 2023; originally announced March 2023.

arXiv:2302.10736 [pdf, other]

Bus Admittance Matrix Revisited: Is It Outdated on Modern Computers?

Authors: Hantao Cui

Abstract: The bus admittance matrix is widely used in power engineering for modeling networks. Being highly sparse, it requires fewer CPU operations when used for calculations. Meanwhile, sparse matrix calculations involve numerous indexing and scalar operations, which are unfavorable to modern processors. Without using the admittance matrix, nodal power injections and the corresponding sparse Jacobian can be computed by an element-wise method, which consists of a highly regular, vectorized evaluation step and a reduction step. This paper revisits the admittance matrix from the computational performance perspective by comparing it with the element-wise method. Case studies show that the admittance matrix method is generally slower than the element-wise method for grid test cases with thousands to hundreds of thousands of buses, especially on CPUs with support for wide vector instructions. This paper also analyzes the impact of the width of vector instructions and memory speed to predict the trend for future computers.

Submitted 21 February, 2023; originally announced February 2023.
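
The two formulations contrasted in the abstract above can be sketched on a toy network. The code below is not the paper's implementation: it uses a hypothetical 3-bus system with series branch admittances only (no shunts or transformer taps), a dense matrix instead of a sparse one, and plain Python instead of vectorized code, and it computes only the power injections, not the Jacobian. It shows that a per-branch evaluation step followed by a reduction (scatter-add) step gives the same nodal injections as the admittance-matrix formula; it says nothing about speed.

```python
import cmath

# Hypothetical network: (from_bus, to_bus, series admittance in p.u.)
branches = [
    (0, 1, 1 / (0.01 + 0.05j)),
    (1, 2, 1 / (0.02 + 0.08j)),
    (0, 2, 1 / (0.015 + 0.06j)),
]
# Hypothetical bus voltages (magnitude in p.u., angle in rad)
voltages = [cmath.rect(1.02, 0.0), cmath.rect(1.00, -0.03), cmath.rect(0.98, -0.05)]

def injections_ybus(branches, v):
    """S_i = V_i * conj(sum_j Y_ij * V_j), with Y built from the branch list."""
    n = len(v)
    y = [[0j] * n for _ in range(n)]
    for f, t, ys in branches:
        y[f][f] += ys
        y[t][t] += ys
        y[f][t] -= ys
        y[t][f] -= ys
    return [v[i] * sum(y[i][j] * v[j] for j in range(n)).conjugate()
            for i in range(n)]

def injections_elementwise(branches, v):
    """Evaluate each branch independently, then reduce onto the buses."""
    # Evaluation step: regular, per-branch work with no matrix indexing
    contributions = []
    for f, t, ys in branches:
        i_ft = ys * (v[f] - v[t])                 # current from f to t
        contributions.append((f, v[f] * i_ft.conjugate()))
        contributions.append((t, v[t] * (-i_ft).conjugate()))
    # Reduction step: scatter-add branch powers to their terminal buses
    s = [0j] * len(v)
    for bus, s_branch in contributions:
        s[bus] += s_branch
    return s

s_ybus = injections_ybus(branches, voltages)
s_elem = injections_elementwise(branches, voltages)
max_diff = max(abs(a - b) for a, b in zip(s_ybus, s_elem))
print("max |S_ybus - S_elementwise| =", max_diff)
```

In a vectorized implementation, the evaluation step would operate on whole arrays of branch data at once, which is the regularity the abstract refers to.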

arXiv:2301.13553 [pdf, other]

Millimetre-wave Radar for Low-Cost 3D Imaging: A Performance Study

Authors: Han Cui, Jiacheng Wu, Naim Dahnoun

Abstract: Millimetre-wave (mmWave) radars can generate 3D point clouds to represent objects in the scene. However, the accuracy and density of the generated point cloud can be lower than those of a laser sensor. Although researchers have used mmWave radars for various applications, there are few quantitative evaluations of the quality of the point cloud generated by the radar, and there is no standard for how this quality can be assessed. This work aims to fill this gap in the literature. A radar simulator is built to evaluate the most common data processing chains of 3D point cloud construction and to examine the capability of the mmWave radar as a 3D imaging sensor under various factors. It will be shown that the radar detection can be noisy and have an imbalanced distribution. To address the problem, a novel super-resolution point cloud construction (SRPC) algorithm is proposed to improve the spatial resolution of the point cloud and is shown to produce a more natural point cloud and reduce outliers.

Submitted 31 January, 2023; originally announced January 2023.

Comments: 14 pages, 16 figures

arXiv:2211.11990 [pdf]

doi 10.1109/NAPS58826.2023.10318583

DiME and AGVis: A Distributed Messaging Environment and Geographical Visualizer for Large-scale Power System Simulation

Authors: Nicholas Parsly, Jinning Wang, Nick West, Qiwei Zhang, Hantao Cui, Fangxing Li

Abstract: This paper introduces the messaging environment and the geographical visualization tool of the CURENT Large-scale Testbed (LTB) that can be used for large-scale power system closed-loop simulation. First, Distributed Messaging Environment (DiME) implements an asynchronous shared workspace to enable high-concurrent data exchange. Second, Another Grid Visualizer (AGVis) is presented as a geovisualization tool that facilitates the visualization of real-time power system simulation. Third, case studies show the use of DiME and AGVis. The results demonstrate that, with the modular structure, the LTB is capable of not only federal use for real-time, large-scale power system simulation, but also independent use for customized power system research.

Submitted 17 October, 2023; v1 submitted 21 November, 2022; originally announced November 2022.

Comments: 5 pages, 7 figures, conference

arXiv:2211.00261 [pdf, other]

Learning Task-Aware Effective Brain Connectivity for fMRI Analysis with Graph Neural Networks

Authors: Yue Yu, Xuan Kan, Hejie Cui, Ran Xu, Yujia Zheng, Xiangchen Song, Yanqiao Zhu, Kun Zhang, Razieh Nabi, Ying Guo, Chao Zhang, Carl Yang

Abstract: Functional magnetic resonance imaging (fMRI) has become one of the most common imaging modalities for brain function analysis. Recently, graph neural networks (GNNs) have been adopted for fMRI analysis with superior performance. Unfortunately, traditional functional brain networks are mainly constructed based on similarities among regions of interest (ROIs), which are noisy and agnostic to the downstream prediction tasks and can lead to inferior results for GNN-based models. To better adapt GNNs for fMRI analysis, we propose TBDS, an end-to-end framework based on Task-aware Brain connectivity DAG (short for Directed Acyclic Graph) Structure generation for fMRI analysis. The key component of TBDS is the brain network generator, which adopts a DAG learning approach to transform the raw time series into task-aware brain connectivities. In addition, we design a contrastive regularization to inject task-specific knowledge during the brain network generation process. Comprehensive experiments on two fMRI datasets, namely the Adolescent Brain Cognitive Development (ABCD) and Philadelphia Neuroimaging Cohort (PNC) datasets, demonstrate the efficacy of TBDS. The generated brain networks also highlight the prediction-related brain regions and thus provide unique interpretations of the prediction results. Our implementation will be published to https://github.com/yueyu1030/TBDS upon acceptance.

Submitted 31 October, 2022; originally announced November 2022.

Comments: Work in progress

arXiv:2210.16028

Laugh Betrays You? Learning Robust Speaker Representation From Speech Containing Non-Verbal Fragments

Authors: Yuke Lin, Xiaoyi Qin, Huahua Cui, Zhenyi Zhu, Ming Li

Abstract: The success of automatic speaker verification shows that discriminative speaker representations can be extracted from neutral speech. However, as a kind of non-verbal voice, laughter should intuitively also carry speaker information. Thus, this paper explores speaker verification on utterances containing non-verbal laughter segments. We collect a set of clips with laughter components by running a laughter detection script on VoxCeleb and part of the CN-Celeb dataset. To further filter untrusted clips, probability scores are calculated by our binary laughter detection classifier, which is pre-trained on pure laughter and neutral speech. After that, based on the clips whose scores are over the threshold, we construct trials under two different evaluation scenarios: Laughter-Laughter (LL) and Speech-Laughter (SL). Then a novel method called Laughter-Splicing based Network (LSN) is proposed, which can significantly boost performance in both scenarios while maintaining performance on neutral speech, such as the VoxCeleb1 test set. Specifically, our system achieves relative improvements of 20% and 22% on Laughter-Laughter and Speech-Laughter trials, respectively. The meta-data and sample clips have been released at https://github.com/nevermoreLin/Laugh_LSN.

Submitted 20 November, 2023; v1 submitted 28 October, 2022; originally announced October 2022.

Comments: see 2308.07056 which is a newer version of this work

arXiv:2209.06677 [pdf]

Virtual Inertia Scheduling for Power Systems with High Penetration of Inverter-based Resources

Authors: Buxin She, Fangxing Li, Hantao Cui, Jinning Wang, Qiwei Zhang, Rui Bo

Abstract: This paper proposes a new concept called virtual inertia scheduling (VIS) to efficiently handle the high penetration of inverter-based resources (IBRs). VIS is an inertia management framework that targets security-constrained and economy-oriented inertia scheduling and generation dispatch of power systems with large-scale renewable generation. Specifically, it schedules the proper power setting points and reserved capacities of both synchronous generators and IBRs, as well as the control modes and control parameters of IBRs, to provide secure and cost-effective inertia support. First, a uniform system model is employed to quantify the frequency dynamics of the IBR-penetrated power system after disturbances. Based on the model, the s-domain and time-domain analytical responses of IBRs with inertia support capability are derived. Then, VIS-based real-time economic dispatch (VIS-RTED) is formulated to minimize generation and reserve costs, with full consideration of dynamic frequency constraints and derived inertia support reserve constraints. The virtual inertia and damping of IBRs are formulated as decision variables. To address the non-linearity of the dynamic constraints, deep learning-assisted linearization is employed to solve the optimization problem. Finally, the proposed VIS-RTED is demonstrated on a modified IEEE 39-bus system. A full-order time-domain simulation is performed to verify the scheduling results.

Submitted 14 September, 2022; originally announced September 2022.

Comments: Still under review by IEEE Transaction on Power System, 10 pages, 11 figures

arXiv:2207.02997 [pdf, other]

Impact of Internal Algebraic Variable Treatment on Transient Stability Simulation Performance

Authors: Hantao Cui

Abstract: It is a general notion that, in transient stability simulations, reducing the number of algebraic variables for the differential-algebraic equations (DAE) can improve the simulation performance. Many simulation programs split algebraic variables internal to a dynamic model from the full DAE and evaluate them outside each iterative step, using results from the previous iteration. The updated internal variables are then treated as constants when solving for the current iteration. This letter discusses how such a split formulation can impact simulation performance. Case studies using various systems with synchronous generator and converter models demonstrate the impact of the split on the convergence pattern and simulation performance.

Submitted 6 July, 2022; originally announced July 2022.

arXiv:2206.11407 [pdf]

doi 10.1109/TEC.2023.3258919

Decentralized and Coordinated Vf Control for Islanded Microgrids Considering DER Inadequacy and Demand Control

Authors: Buxin She, Fangxing Li, Hantao Cui, Jinning Wang, Liang Min, Oroghene Oboreh Snapps, Rui Bo

Abstract: This paper proposes a decentralized and coordinated voltage and frequency (Vf) control framework for islanded microgrids, with full consideration of the limited capacity of distributed energy resources (DERs) and Vf-dependent load. First, the concept of DER inadequacy is illustrated along with the challenges it poses. Then, a decentralized and coordinated control framework is proposed to regulate the output of inverter-based generation and reallocate limited DER capacity for Vf control. The control framework is composed of a power regulator and a Vf regulator, which generate the supplementary signals for the primary controller. The power regulator regulates the output of grid-forming inverters according to the real-time capacity constraints of DERs, while the Vf regulator reduces the Vf deviation by leveraging the load sensitivity to Vf. Next, the static feasibility and small-signal stability of the proposed method are rigorously proven through mathematical formulation and eigenvalue analysis. Finally, a MATLAB Simulink simulation demonstrates the functionalities of the control framework. Several goals are fulfilled within the decentralized and coordinated framework, such as making the best use of limited DER capacity, enhancing the DC-side stability of inverter-based generation, and reducing involuntary load shedding.

Submitted 8 April, 2023; v1 submitted 22 June, 2022; originally announced June 2022.

Journal ref: IEEE Transaction on Energy Conversion, 21 March 2023

arXiv:2206.11398 [pdf, other]

doi 10.1109/TSG.2022.3222323

Fusion of Model-free Reinforcement Learning with Microgrid Control: Review and Vision

Authors: Buxin She, Fangxing Li, Hantao Cui, Jingqiu Zhang, Rui Bo

Abstract: Challenges and opportunities coexist in microgrids as a result of emerging large-scale distributed energy resources (DERs) and advanced control techniques. In this paper, a comprehensive review of microgrid control is presented with its fusion of model-free reinforcement learning (MFRL). A high-level research map of microgrid control is developed from six distinct perspectives, followed by bottom-level modularized control blocks illustrating the configurations of grid-following (GFL) and grid-forming (GFM) inverters. Then, mainstream MFRL algorithms are introduced with an explanation of how MFRL can be integrated into the existing control framework. Next, the application guideline of MFRL is summarized with a discussion of three fusing approaches, i.e., model identification and parameter tuning, supplementary signal generation, and controller substitution, with the existing control framework. Finally, the fundamental challenges associated with adopting MFRL in microgrid control and corresponding insights for addressing these concerns are fully discussed.

Submitted 6 February, 2023; v1 submitted 22 June, 2022; originally announced June 2022.

Comments: 14 pages, 4 figures, published on IEEE Transaction on Smart Grid 2022 Nov 15. See: https://ieeexplore-ieee-org.utk.idm.oclc.org/stamp/stamp.jsp?arnumber=9951405

arXiv:2205.12465 [pdf, other]

FBNETGEN: Task-aware GNN-based fMRI Analysis via Functional Brain Network Generation

Authors: Xuan Kan, Hejie Cui, Joshua Lukemire, Ying Guo, Carl Yang

Abstract: Functional magnetic resonance imaging (fMRI) is one of the most common imaging modalities to investigate brain functions. Recent studies in neuroscience stress the great potential of functional brain networks constructed from fMRI data for clinical predictions. Traditional functional brain networks, however, are noisy and unaware of downstream prediction tasks, while also being incompatible with deep graph neural network (GNN) models. In order to fully unleash the power of GNNs in network-based fMRI analysis, we develop FBNETGEN, a task-aware and interpretable fMRI analysis framework via deep brain network generation. In particular, we formulate (1) prominent region of interest (ROI) feature extraction, (2) brain network generation, and (3) clinical predictions with GNNs, in an end-to-end trainable model under the guidance of particular prediction tasks. The key novel component of this process is the graph generator, which learns to transform raw time-series features into task-oriented brain networks. Our learnable graphs also provide unique interpretations by highlighting prediction-related brain regions. Comprehensive experiments on two datasets, i.e., the recently released and currently largest publicly available fMRI dataset Adolescent Brain Cognitive Development (ABCD), and the widely used fMRI dataset PNC, demonstrate the superior effectiveness and interpretability of FBNETGEN. The implementation is available at https://github.com/Wayfear/FBNETGEN.

Submitted 29 May, 2022; v1 submitted 24 May, 2022; originally announced May 2022.

Comments: This paper has been accepted for presentation in MIDL 2022

MSC Class: 68T07; 68T45; 68T20 ACM Class: I.2.6; I.2.10; J.3

arXiv:2205.02682 [pdf]

doi 10.1016/j.optcom.2022.128982

Temporally and Spatially variant-resolution illumination patterns in computational ghost imaging

Authors: Dong Zhou, Jie Cao, Huan Cui, Li-Xing Lin, Haoyu Zhang, Yingqiang Zhang, Qun Hao

Abstract: Conventional computational ghost imaging (CGI) uses light carrying a sequence of patterns with uniform resolution to illuminate the object, then performs a correlation calculation between the light intensity values reflected by the target and the preset patterns to obtain the object image. It requires a large number of measurements to obtain high-quality images, especially if high-resolution images are to be obtained. To solve this problem, we developed temporally variable-resolution illumination patterns, replacing the conventional uniform-resolution illumination patterns with a sequence of patterns of different imaging resolutions. In addition, we propose combining temporally variable-resolution illumination patterns with a spatially variable-resolution structure to develop temporally and spatially variable-resolution (TSV) illumination patterns, which improve not only the imaging quality of the region of interest (ROI) but also the robustness to noise. The methods using the proposed illumination patterns are verified by simulations and experiments in comparison with CGI. For the same number of measurements, the method using temporally variable-resolution illumination patterns has better imaging quality than CGI, but it is less robust to noise. The method using TSV illumination patterns has better imaging quality in the ROI than both the method using temporally variable-resolution illumination patterns and CGI under the same number of measurements. We also experimentally verify that the method using TSV patterns has better imaging performance when applied to higher-resolution imaging. The proposed methods are expected to address the current difficulty of achieving high-resolution, high-quality imaging with computational ghost imaging.

Submitted 14 May, 2022; v1 submitted 5 May, 2022; originally announced May 2022.
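
The correlation step that this abstract attributes to conventional CGI can be illustrated with a minimal sketch. It is not the authors' code: it assumes random binary patterns, a noiseless bucket detector, and the textbook estimator G = <I·P> - <I><P>; the names `simulate`, `reconstruct_cgi`, and the 8-pixel object are hypothetical.

```python
import random

def simulate(obj, n_meas, seed=0):
    """Illuminate a flattened object with random binary patterns (assumption)
    and record one bucket (total intensity) value per pattern."""
    rng = random.Random(seed)
    patterns = [[rng.randint(0, 1) for _ in obj] for _ in range(n_meas)]
    bucket = [sum(o * p for o, p in zip(obj, pat)) for pat in patterns]
    return patterns, bucket

def reconstruct_cgi(patterns, bucket):
    """Per-pixel correlation estimate G_k = <I * P_k> - <I> * <P_k>."""
    n = len(patterns)
    mean_i = sum(bucket) / n
    image = []
    for k in range(len(patterns[0])):
        mean_p = sum(p[k] for p in patterns) / n
        mean_ip = sum(b * p[k] for b, p in zip(bucket, patterns)) / n
        image.append(mean_ip - mean_i * mean_p)
    return image

obj = [0, 1, 0, 0, 1, 1, 0, 0]  # hypothetical flattened 1-D object
patterns, bucket = simulate(obj, n_meas=4000)
recon = reconstruct_cgi(patterns, bucket)
```

With independent 0/1 patterns the estimate converges to the object scaled by the pattern variance (0.25 here), which is why many measurements are needed before the image separates cleanly from the background.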

arXiv:2203.01292 [pdf, other]

Andes_gym: A Versatile Environment for Deep Reinforcement Learning in Power Systems

Authors: Hantao Cui, Yichen Zhang

Abstract: This paper presents Andes_gym, a versatile and high-performance reinforcement learning environment for power system studies. The environment leverages the modeling and simulation capability of ANDES and the reinforcement learning (RL) environment OpenAI Gym to enable the prototyping and demonstration of RL algorithms for power systems. The architecture of the proposed software tool is elaborated to provide the observation and action interfaces for RL algorithms. An example is shown to rapidly prototype a load-frequency control algorithm based on RL trained by available algorithms. The proposed environment is highly generalized by supporting all the power system dynamic models available in ANDES and numerous RL algorithms available for OpenAI Gym.

Submitted 2 March, 2022; originally announced March 2022.

Comments: 5 pages, 7 figures, accepted by 2022 IEEE Power and Energy Society General Meeting

arXiv:2108.05096 [pdf]

doi 10.1364/OL.440660

Omnidirectional ghost imaging system and unwrapping-free panoramic ghost imaging

Authors: Huan Cui, Jie Cao, Qun Hao, Dong Zhou, Mingyuan Tang, Kaiyu Zhang, Yingqiang Zhang

Abstract: Ghost imaging (GI) is a novel imaging method that can reconstruct object information from light intensity correlation measurements. However, at present, the field of view (FOV) is limited to the illuminating range of the light patterns. To enlarge the FOV of GI efficiently, we propose the omnidirectional ghost imaging system (OGIS), which can achieve a 360° omnidirectional FOV in one shot simply by adding a curved mirror. Moreover, by designing retina-like annular patterns with log-polar patterns, OGIS can obtain unwrapping-free, undistorted panoramic images with uniform resolution, which opens up a new way for the application of GI.

Submitted 11 August, 2021; originally announced August 2021.

arXiv:2108.01667 [pdf]

doi 10.1364/OE.439704

Optimization of retina-like illumination patterns in ghost imaging

Authors: Jie Cao, Dong Zhou, Ying-Qiang Zhang, Huan Cui, Fang-Hua Zhang, Qun Hao

Abstract: Ghost imaging (GI) reconstructs images using a single-pixel or bucket detector, which has the advantages of scattering robustness, wide spectrum and beyond-visual-field imaging. However, this technique needs a large number of measurements to obtain a sharp image. Many methods have been proposed to overcome this disadvantage. Retina-like patterns, as one of the compressive sensing approaches, enhance the imaging quality of the region of interest (ROI) without increasing the number of measurements. The design of the retina-like patterns determines the performance of the ROI in the reconstructed image. Unlike the conventional method of filling the ROI with random patterns, we propose to optimize retina-like patterns by filling the ROI with patterns containing the sparsity prior of objects. The proposed method is verified by simulations and experiments and compared with conventional GI, retina-like GI and GI using patterns optimized by principal component analysis. The method using optimized retina-like patterns obtains the best imaging quality in the ROI among the compared methods. Meanwhile, the good generalization ability of the optimized retina-like pattern is also verified. While designing the size and position of the ROI of the retina-like pattern, the feature information of the target can be obtained to optimize the pattern of the ROI. The proposed method paves the way for realizing high-quality GI.

Submitted 2 August, 2021; originally announced August 2021.

arXiv:2108.01666 [pdf]

Complementary Fourier single-pixel imaging

Authors: Dong Zhou, Jie Cao, Huan Cui, Qun Hao, Bing-Kun Chen, Kai Lin

Abstract: Single-pixel imaging, with the advantages of a wide spectrum, beyond-visual-field imaging, and robustness to light scattering, has attracted increasing attention in recent years. Fourier single-pixel imaging (FSI) can reconstruct sharp images under sub-Nyquist sampling. However, conventional FSI has difficulty balancing imaging quality and efficiency. To overcome this issue, we propose a novel approach called complementary Fourier single-pixel imaging (CFSI) to reduce measurements while retaining its robustness. The complementary nature of Fourier patterns based on a four-step phase-shift algorithm is combined with the complementary nature of a digital micromirror device. CFSI only requires two phase-shifted patterns to obtain one Fourier spectral value. Four light intensity values are obtained by loading the two patterns, and the spectral value is calculated through differential measurement, which has good robustness to noise. The proposed method is verified by simulations and experiments and compared with FSI based on two-, three-, and four-step phase-shift algorithms. CFSI performed better than the other methods under the condition that the best imaging quality of CFSI is not reached. The reported technique provides an alternative approach to realize real-time and high-quality imaging.

Submitted 2 August, 2021; originally announced August 2021.
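
For orientation, the conventional four-step phase-shift baseline mentioned in this abstract can be sketched as follows. This is the standard differential formula, not the CFSI scheme itself: it assumes noiseless readings and sinusoidal patterns a + b·cos(θ + φ) with φ in {0, π/2, π, 3π/2}; the function names and the tiny test image are hypothetical.

```python
import cmath
import math

def bucket_value(img, u, v, phase, a=0.5, b=0.5):
    """Simulated single-pixel reading for one sinusoidal pattern
    a + b*cos(2*pi*(u*x/w + v*y/h) + phase)."""
    h, w = len(img), len(img[0])
    total = 0.0
    for y in range(h):
        for x in range(w):
            theta = 2 * math.pi * (u * x / w + v * y / h) + phase
            total += img[y][x] * (a + b * math.cos(theta))
    return total

def four_step_coefficient(img, u, v, b=0.5):
    """Fourier coefficient from four phase-shifted readings:
    (D_0 - D_pi) + j*(D_{pi/2} - D_{3pi/2}), normalised by 2*b."""
    d0, d1, d2, d3 = (bucket_value(img, u, v, k * math.pi / 2, b=b)
                      for k in range(4))
    return ((d0 - d2) + 1j * (d1 - d3)) / (2 * b)

def dft_coefficient(img, u, v):
    """Direct 2-D DFT coefficient, used only as a reference value."""
    h, w = len(img), len(img[0])
    return sum(img[y][x] * cmath.exp(-2j * math.pi * (u * x / w + v * y / h))
               for y in range(h) for x in range(w))

img = [[0.0, 1.0, 2.0], [3.0, 4.0, 5.0]]  # hypothetical 2x3 scene
coeff = four_step_coefficient(img, u=1, v=1)
```

The differences D_0 - D_π and D_{π/2} - D_{3π/2} cancel the constant offset a, which is the source of the noise robustness that differential measurement provides.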

arXiv:2107.05097 [pdf, other]

BrainNNExplainer: An Interpretable Graph Neural Network Framework for Brain Network based Disease Analysis

Authors: Hejie Cui, Wei Dai, Yanqiao Zhu, Xiaoxiao Li, Lifang He, Carl Yang

Abstract: Interpretable brain network models for disease prediction are of great value for the advancement of neuroscience. GNNs are promising to model complicated network data, but they are prone to overfitting and suffer from poor interpretability, which prevents their usage in decision-critical scenarios like healthcare. To bridge this gap, we propose BrainNNExplainer, an interpretable GNN framework for brain network analysis. It is mainly composed of two jointly learned modules: a backbone prediction model that is specifically designed for brain networks and an explanation generator that highlights disease-specific prominent brain network connections. Extensive experimental results with visualizations on two challenging disease prediction datasets demonstrate the unique interpretability and outstanding performance of BrainNNExplainer.

Submitted 11 July, 2021; originally announced July 2021.

Comments: This paper has been accepted to ICML 2021 Workshop on Interpretable Machine Learning in Healthcare

MSC Class: 68T07; 68T45; 68T20 ACM Class: I.2.6; I.2.10; J.3

arXiv:2107.01502 [pdf, other]

doi 10.1007/978-3-030-32226-7_33

Pulmonary Vessel Segmentation based on Orthogonal Fused U-Net++ of Chest CT Images

Authors: Hejie Cui, Xinglong Liu, Ning Huang

Abstract: Pulmonary vessel segmentation is important for clinical diagnosis of pulmonary diseases, but is also challenging due to the complicated structure. In this work, we present an effective framework and refinement process for pulmonary vessel segmentation from chest computed tomographic (CT) images. The key to our approach is a 2.5D segmentation network applied from three orthogonal axes, which presents a robust and fully automated pulmonary vessel segmentation result with lower network complexity and memory usage compared to 3D networks. The slice radius is introduced to convolve the adjacent information of the center slice, and the multi-planar fusion optimizes the presentation of intra- and inter-slice features. In addition, the tree-like structure of the pulmonary vessel is extracted in post-processing, which is used for segmentation refining and pruning. In the evaluation experiments, three fusion methods are tested and the most promising one is compared with state-of-the-art 2D and 3D structures on 300 cases of lung images randomly selected from the LIDC dataset. Our method outperforms other network structures by a large margin and achieves by far the highest average DICE score of 0.9272 and precision of 0.9310, to our knowledge, among the pulmonary vessel segmentation models available in the literature.

Submitted 3 July, 2021; originally announced July 2021.

Comments: Published in Medical Image Computing and Computer Assisted Intervention (MICCAI 2019)

MSC Class: 68T45; 68T07 ACM Class: I.2.10; J.3

arXiv:2104.10832 [pdf, other]

Building Bilingual and Code-Switched Voice Conversion with Limited Training Data Using Embedding Consistency Loss

Authors: Yaogen Yang, Haozhe Zhang, Xiaoyi Qin, Shanshan Liang, Huahua Cui, Mingyang Xu, Ming Li

Abstract: Building cross-lingual voice conversion (VC) systems for multiple speakers and multiple languages has been a challenging task for a long time. This paper describes a parallel non-autoregressive network to achieve bilingual and code-switched voice conversion for multiple speakers when there are only mono-lingual corpora for each language. We achieve cross-lingual VC between Mandarin speech with multiple speakers and English speech with multiple speakers by applying bilingual bottleneck features. To boost voice cloning performance, we use an adversarial speaker classifier with a gradient reversal layer to reduce the source speaker's information in the output of the encoder. Furthermore, in order to improve speaker similarity between reference speech and converted speech, we adopt an embedding consistency loss between the synthesized speech and its natural reference speech in our network. Experimental results show that our proposed method can achieve high-quality converted speech with a mean opinion score (MOS) around 4. The conversion system performs well in terms of speaker similarity for both in-set speaker conversion and out-of-set one-shot conversion.

Submitted 21 April, 2021; originally announced April 2021.

Comments: Submitted to Interspeech 2021

arXiv:2104.09701 [pdf, ps, other]

doi 10.1016/j.knosys.2021.106753

Free-form tumor synthesis in computed tomography images via richer generative adversarial network

Authors: Qiangguo Jin, Hui Cui, Changming Sun, Zhaopeng Meng, Ran Su

Abstract: The insufficiency of annotated medical imaging scans for cancer makes it challenging to train and validate data-hungry deep learning models in precision oncology. We propose a new richer generative adversarial network for free-form 3D tumor/lesion synthesis in computed tomography (CT) images. The network is composed of a new richer convolutional feature enhanced dilated-gated generator (RicherDG) and a hybrid loss function. The RicherDG has dilated-gated convolution layers to enable tumor-painting and to enlarge perceptive fields; and it has a novel richer convolutional feature association branch to recover multi-scale convolutional features especially from uncertain boundaries between tumor and surrounding healthy tissues. The hybrid loss function, which consists of a diverse range of losses, is designed to aggregate complementary information to improve optimization. We perform a comprehensive evaluation of the synthesis results on a wide range of public CT image datasets covering the liver, kidney tumors, and lung nodules. The qualitative and quantitative evaluations and ablation study demonstrated improved synthesizing results over advanced tumor synthesis methods.

Submitted 19 April, 2021; originally announced April 2021.

arXiv:2104.09699 [pdf, ps, other]

doi 10.1016/j.eswa.2021.114848

Domain adaptation based self-correction model for COVID-19 infection segmentation in CT images

Authors: Qiangguo Jin, Hui Cui, Changming Sun, Zhaopeng Meng, Leyi Wei, Ran Su

Abstract: The capability of generalization to unseen domains is crucial for deep learning models when considering real-world scenarios. However, currently available medical image datasets, such as those for COVID-19 CT images, have large variations of infections and domain shift problems. To address this issue, we propose a prior knowledge driven domain adaptation and a dual-domain enhanced self-correction learning scheme. Based on the novel learning schemes, a domain adaptation based self-correction model (DASC-Net) is proposed for COVID-19 infection segmentation on CT images. DASC-Net consists of a novel attention and feature domain enhanced domain adaptation model (AFD-DA) to solve the domain shifts and a self-correction learning process to refine segmentation results. The innovations in AFD-DA include an image-level activation feature extractor with attention to lung abnormalities and a multi-level discrimination module for hierarchical feature domain alignment. The proposed self-correction learning process adaptively aggregates the learned model and corresponding pseudo labels for the propagation of aligned source and target domain information to alleviate overfitting to the noise caused by pseudo labels. Extensive experiments over three publicly available COVID-19 CT datasets demonstrate that DASC-Net consistently outperforms state-of-the-art segmentation, domain shift, and coronavirus infection segmentation methods. Ablation analysis further shows the effectiveness of the major components in our model. The DASC-Net enriches the theory of domain adaptation and self-correction learning in medical imaging and can be generalized to multi-site COVID-19 infection segmentation on CT images for clinical deployment.

Submitted 19 April, 2021; originally announced April 2021.

arXiv:2103.05123 [pdf, other]

Deep Transfer Learning for WiFi Localization

Authors: Peizheng Li, Han Cui, Aftab Khan, Usman Raza, Robert Piechocki, Angela Doufexi, Tim Farnham

Abstract: This paper studies a WiFi indoor localisation technique based on a deep learning model and its transfer strategies. We take CSI packets collected via the WiFi standard channel sounding as the training dataset and verify the CNN model on the subsets collected in three experimental environments. We achieve a localisation accuracy of 46.55 cm in an ideal $(6.5m \times 2.5m)$ office with no obstacles, 58.30 cm in an office with obstacles, and 102.8 cm in a sports hall $(40 \times 35m)$. Then, we evaluate the transfer ability of the proposed model to different environments. The experimental results show that, for a trained localisation model, feature extraction layers can be directly transferred to other models and only the fully connected layers need to be retrained to achieve the same baseline accuracy as non-transferred base models. This can save 60% of the training parameters and reduce the training time by more than half. Finally, an ablation study of the training dataset shows that, in both office and sports hall scenarios, after reusing the feature extraction layers of the base model, only 55% of the training data is required to obtain accuracy similar to that of the base models.

Submitted 8 March, 2021; originally announced March 2021.

Comments: 5 pages, 5 figures, has been accepted for lecture presentation at the 2021 IEEE Radar Conference (IEEE RadarConf 2021)

arXiv:2102.09583 [pdf, other]

doi 10.1109/TPWRS.2021.3110881

Encoding Frequency Constraints in Preventive Unit Commitment Using Deep Learning with Region-of-Interest Active Sampling

Authors: Yichen Zhang, Hantao Cui, Jianzhe Liu, Feng Qiu, Tianqi Hong, Rui Yao, Fangxing Li

Abstract: With the increasing penetration of renewable energy, frequency response and its security are of significant concern for reliable power system operations. Frequency-constrained unit commitment (FCUC) is proposed to address this challenge. Despite existing efforts in modeling frequency characteristics in unit commitment (UC), current strategies can only handle oversimplified low-order frequency response models and do not consider wide-range operating conditions. This paper presents a generic data-driven framework for FCUC under high renewable penetration. Deep neural networks (DNNs) are trained to predict the frequency response using real data or high-fidelity simulation data. Next, the DNN is reformulated as a set of mixed-integer linear constraints to be incorporated into the ordinary UC formulation. In the data generation phase, all possible power injections are considered, and a region-of-interest active sampling is proposed to include power injection samples with frequency nadirs closer to the UFLC threshold, which significantly enhances the accuracy of frequency constraints in FCUC. The proposed FCUC is verified on the IEEE 39-bus system. Then, a full-order dynamic model simulation using PSS/E verifies the effectiveness of FCUC in frequency-secure generator commitments.

Submitted 12 October, 2021; v1 submitted 18 February, 2021; originally announced February 2021.
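
The reformulation of a DNN as mixed-integer linear constraints, mentioned in this abstract, is commonly done neuron by neuron with a big-M encoding. The block below is a sketch of that standard encoding, not necessarily the paper's formulation: it assumes ReLU activations and known pre-activation bounds $\underline{z} < 0 < \bar{z}$, and the symbols $z$, $y$, $a$, $w$, $x$, $c$ are introduced here for illustration.

```latex
% One ReLU neuron y = max(0, z) with pre-activation z = w^T x + c,
% bounds \underline{z} <= z <= \bar{z}, and a binary indicator a.
\begin{align}
  y &\ge z, \\
  y &\ge 0, \\
  y &\le z - \underline{z}\,(1 - a), \\
  y &\le \bar{z}\, a, \\
  a &\in \{0, 1\}.
\end{align}
% a = 1 forces y = z (active neuron, z >= 0);
% a = 0 forces y = 0 (inactive neuron, z <= 0).
```

Tight bounds $\underline{z}$ and $\bar{z}$ matter in practice, because loose big-M values weaken the linear relaxation that the mixed-integer solver works with.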

arXiv:2101.05894 [pdf]

Transmission-and-Distribution Frequency Dynamic Co-Simulation Framework for Distributed Energy Resources Frequency Response

Authors: Wenbo Wang, Xin Fang, Hantao Cui, Fangxing Li

Abstract: The rapid deployment of distributed energy resources (DERs) in distribution networks has brought challenges to balance the system and stabilize frequency. DERs have the ability to provide frequency regulation; however, existing dynamic frequency simulation tools, which were developed mainly for the transmission system, lack the capability to simulate distribution network dynamics with high penetrations of DERs. Although electromagnetic transient (EMT) simulation tools can simulate distribution network dynamics, the computation efficiency limits their use for large-scale transmission-and-distribution (T&D) simulations. This paper presents an efficient T&D dynamic frequency co-simulation framework for DER frequency response based on the HELICS platform and existing off-the-shelf simulators. The challenge of synchronizing frequency between the transmission network and DERs hosted in the distribution network is approached by detailed modeling of DERs in frequency dynamic models while DER phasor models are also preserved in the distribution networks. Thereby, local voltage constraints can be respected when dispatching the DER power for frequency response. The DER frequency responses (primary and secondary) are simulated in case studies to validate the proposed framework. Lastly, a fault-induced delayed voltage recovery (FIDVR) event of a large system is presented to demonstrate the efficiency and effectiveness of the overall framework.

Submitted 14 January, 2021; originally announced January 2021.

arXiv:2012.12468 [pdf, other]

CN-Celeb: multi-genre speaker recognition

Authors: Lantian Li, Ruiqi Liu, Jiawen Kang, Yue Fan, Hao Cui, Yunqi Cai, Ravichander Vipperla, Thomas Fang Zheng, Dong Wang

Abstract: Research on speaker recognition is extending to address the vulnerability in the wild conditions, among which genre mismatch is perhaps the most challenging, for instance, enrollment with reading speech while testing with conversational or singing audio. This mismatch leads to complex and composite inter-session variations, both intrinsic (i.e., speaking style, physiological status) and extrinsic (i.e., recording device, background noise). Unfortunately, the few existing multi-genre corpora are not only limited in size but are also recorded under controlled conditions, which cannot support conclusive research on the multi-genre problem. In this work, we firstly publish CN-Celeb, a large-scale multi-genre corpus that includes in-the-wild speech utterances of 3,000 speakers in 11 different genres. Secondly, using this dataset, we conduct a comprehensive study on the multi-genre phenomenon, in particular the impact of the multi-genre challenge on speaker recognition and the performance gain when the new dataset is used to conduct multi-genre training.

Submitted 24 November, 2021; v1 submitted 22 December, 2020; originally announced December 2020.

Comments: submitted to Speech Communication

arXiv:2011.11880 [pdf, other]

doi 10.1109/TPWRS.2021.3073591

Effective Parallelism for Equation and Jacobian Evaluation in Power Flow Calculation

Authors: Hantao Cui, Fangxing Li, Xin Fang

Abstract: This letter investigates parallelism approaches for equation and Jacobian evaluations in large-scale power flow calculation. Two levels of parallelism are proposed and analyzed: inter-model parallelism, which evaluates models in parallel, and intra-model parallelism, which evaluates calculations within each model in parallel. Parallelism techniques such as multi-threading and single instruction multiple data (SIMD) vectorization are discussed, implemented, and benchmarked as six calculation workflows. Case studies on the 70,000-bus synthetic grid show that equation evaluations can be accelerated by ten times, and the overall Newton power flow advances the state of the art by 20%.

Submitted 21 August, 2021; v1 submitted 23 November, 2020; originally announced November 2020.

arXiv:2011.03525 [pdf, other]

SigNet: A Novel Deep Learning Framework for Radio Signal Classification

Authors: Zhuangzhi Chen, Hui Cui, Jingyang Xiang, Kunfeng Qiu, Liang Huang, Shilian Zheng, Shichuan Chen, Qi Xuan, Xiaoniu Yang

Abstract: Deep learning methods achieve great success in many areas due to their powerful feature extraction capabilities and end-to-end training mechanism, and recently they have also been introduced for radio signal modulation classification. In this paper, we propose a novel deep learning framework called SigNet, where a signal-to-matrix (S2M) operator is adopted to convert the original signal into a square matrix first and is co-trained with a follow-up CNN architecture for classification. This model is further accelerated by integrating 1D convolution operators, leading to the upgraded model SigNet2.0. The simulations on two signal datasets show that both SigNet and SigNet2.0 outperform a number of well-known baselines. More interestingly, our proposed models behave extremely well in small-sample learning when only a small training dataset is provided. They can achieve a relatively high accuracy even when only 1\% of the training data are kept, while other baseline models may lose their effectiveness much more quickly as the datasets get smaller. This result suggests that SigNet/SigNet2.0 could be extremely useful in situations where labeled signal data are difficult to obtain. The visualization of the output features of our models demonstrates that they can separate the different modulation types of signals well in the feature hyper-space.

Submitted 18 October, 2021; v1 submitted 28 October, 2020; originally announced November 2020.

Comments: 13 pages, 8 figures

arXiv:2010.08658 [pdf, other]

Wireless Localisation in WiFi using Novel Deep Architectures

Authors: Peizheng Li, Han Cui, Aftab Khan, Usman Raza, Robert Piechocki, Angela Doufexi, Tim Farnham

Abstract: This paper studies the indoor localisation of WiFi devices based on a commodity chipset and standard channel sounding. First, we present a novel shallow neural network (SNN) in which features are extracted from the channel state information (CSI) corresponding to WiFi subcarriers received on different antennas and used to train the model. The single-layer architecture of this localisation neural network makes it lightweight and easy to deploy on devices with stringent constraints on computational resources. We further investigate the use of deep learning models for localisation and design novel architectures for a convolutional neural network (CNN) and long short-term memory (LSTM). We extensively evaluate these localisation algorithms for continuous tracking in indoor environments. Experimental results show that even an SNN model, after careful handcrafted feature extraction, can achieve accurate localisation. Meanwhile, using a well-organised architecture, the neural network models can be trained directly with raw data from the CSI, and localisation features can be automatically extracted to achieve accurate position estimates. We also found that the performance of neural network-based methods is directly affected by the number of anchor access points (APs) regardless of their structure. With three APs, all neural network models proposed in this paper can obtain a localisation accuracy of around 0.5 metres. In addition, the proposed deep NN architecture reduces the data pre-processing time by 6.5 hours compared with a shallow NN using the data collected in our testbed. In the deployment phase, the inference time is also significantly reduced to 0.1 ms per sample. We also demonstrate the generalisation capability of the proposed method by evaluating models using target movement characteristics different from the ones on which they were trained.

Submitted 16 October, 2020; originally announced October 2020.

Comments: Accepted for presentation at the 25th International Conference on Pattern Recognition (ICPR), IEEE, 2020

arXiv:2008.03883 [pdf, other]

Mass-Matrix Differential-Algebraic Equation Formulation for Transient Stability Simulation

Authors: Hantao Cui, Fangxing Li, Joe H. Chow

Abstract: This letter proposes a mass-matrix differential-algebraic equation (DAE) formulation for transient stability simulation. This formulation has two prominent advantages: it is compatible with a multitude of implicit DAE solvers, and it can be conveniently implemented based on the traditional formulation, for example, by separating the parameters in denominators into the diagonals of the mass matrix. It also allows reducing the dynamics using null time constants. Benchmark studies are presented on the time and accuracy of 17 implicit solvers for the proposed formulation using Kundur's two-area system and a 2,000-bus system.

Submitted 9 August, 2020; originally announced August 2020.

arXiv:2006.02000 [pdf, other]

MultiXNet: Multiclass Multistage Multimodal Motion Prediction

Authors: Nemanja Djuric, Henggang Cui, Zhaoen Su, Shangxuan Wu, Huahua Wang, Fang-Chieh Chou, Luisa San Martin, Song Feng, Rui Hu, Yang Xu, Alyssa Dayan, Sidney Zhang, Brian C. Becker, Gregory P. Meyer, Carlos Vallespi-Gonzalez, Carl K. Wellington

Abstract: One of the critical pieces of the self-driving puzzle is understanding the surroundings of a self-driving vehicle (SDV) and predicting how these surroundings will change in the near future. To address this task we propose MultiXNet, an end-to-end approach for detection and motion prediction based directly on lidar sensor data. This approach builds on prior work by handling multiple classes of traffic actors, adding a jointly trained second-stage trajectory refinement step, and producing a multimodal probability distribution over future actor motion that includes both multiple discrete traffic behaviors and calibrated continuous position uncertainties. The method was evaluated on large-scale, real-world data collected by a fleet of SDVs in several cities, with the results indicating that it outperforms existing state-of-the-art approaches.

Submitted 24 May, 2021; v1 submitted 2 June, 2020; originally announced June 2020.

Comments: Accepted for publication at IEEE Intelligent Vehicles Symposium (IV) 2021

arXiv:2005.05430 [pdf, other]

On the Modeling and Simulation of Anti-Windup Proportional-Integral Controller

Authors: Hantao Cui, Yichen Zhang, Federico Milano, Fangxing Li

Abstract: This paper investigates the chattering and deadlock behaviors of the proportional-integral (PI) controller with an anti-windup (AW) limiter recommended by the IEEE Standard 421.5-2016. Depending on the simulation method, the controller may enter a chattering or deadlock state in some combinations of parameters and inputs. Chattering and deadlock are analyzed in the context of three numerical integration approaches: explicit partitioned method (EPM), execution-list based method (ELM), and implicit trapezoidal method (ITM). This paper derives the chattering stop condition for EPM and ELM, and analyzes the impacts of step size and convergence tolerance for the simultaneous method. The deduced chattering stop conditions and deadlock behavior are verified with numerical simulations.

Submitted 11 May, 2020; originally announced May 2020.

arXiv:2004.06247 [pdf, other]

Improving Movement Predictions of Traffic Actors in Bird's-Eye View Models using GANs and Differentiable Trajectory Rasterization

Authors: Eason Wang, Henggang Cui, Sai Yalamanchi, Mohana Moorthy, Fang-Chieh Chou, Nemanja Djuric

Abstract: One of the most critical pieces of the self-driving puzzle is the task of predicting future movement of surrounding traffic actors, which allows the autonomous vehicle to safely and effectively plan its future route in a complex world. Recently, a number of algorithms have been proposed to address this important problem, spurred by a growing interest of researchers from both industry and academia. Methods based on top-down scene rasterization on one side and Generative Adversarial Networks (GANs) on the other have been shown to be particularly successful, obtaining state-of-the-art accuracies on the task of traffic movement prediction. In this paper we build upon these two directions and propose a raster-based conditional GAN architecture, powered by a novel differentiable rasterizer module at the input of the conditional discriminator that maps generated trajectories into the raster space in a differentiable manner. This simplifies the task for the discriminator as trajectories that are not scene-compliant are easier to discern, and allows the gradients to flow back, forcing the generator to output better, more realistic trajectories. We evaluated the proposed method on a large-scale, real-world data set, showing that it outperforms state-of-the-art GAN-based baselines.

Submitted 11 June, 2020; v1 submitted 13 April, 2020; originally announced April 2020.

Comments: Accepted for publication at ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD) 2020

arXiv:2002.11088 [pdf, other]

Model Watermarking for Image Processing Networks

Authors: Jie Zhang, Dongdong Chen, Jing Liao, Han Fang, Weiming Zhang, Wenbo Zhou, Hao Cui, Nenghai Yu

Abstract: Deep learning has achieved tremendous success in numerous industrial applications. As training a good model often needs massive high-quality data and computation resources, the learned models often have significant business value. However, these valuable deep models are exposed to a huge risk of infringement. For example, if the attacker has the full information of one target model including the network structure and weights, the model can be easily finetuned on new datasets. Even if the attacker can only access the output of the target model, he/she can still train another similar surrogate model by generating a large number of input-output training pairs. How to protect the intellectual property of deep models is a very important but seriously under-researched problem. There are a few recent attempts at classification network protection only. In this paper, we propose the first model watermarking framework for protecting image processing models. To achieve this goal, we leverage the spatial invisible watermarking mechanism. Specifically, given a black-box target model, a unified and invisible watermark is hidden into its outputs, which can be regarded as a special task-agnostic barrier. In this way, when the attacker trains one surrogate model by using the input-output pairs of the target model, the hidden watermark will be learned and extracted afterward. To enable watermarks from binary bits to high-resolution images, both traditional and deep spatial invisible watermarking mechanisms are considered.
Experiments demonstrate the robustness of the proposed watermarking mechanism, which can resist surrogate models learned with different network structures and objective functions. Besides deep models, the proposed method can also be easily extended to protect data and traditional image processing algorithms.

Submitted 25 February, 2020; originally announced February 2020.

Comments: AAAI 2020

arXiv:2002.09455 [pdf, other]

Hybrid Symbolic-Numeric Framework for Power System Modeling and Analysis

Authors: Hantao Cui, Fangxing Li, Kevin Tomsovic

Abstract: With the recent proliferation of open-source packages for computing, power system differential-algebraic equation (DAE) modeling and simulation are being revisited to reduce the programming efforts. Existing open-source tools require manual efforts to develop code for numerical equations, sparse Jacobians, and discontinuous components. This paper proposes a hybrid symbolic-numeric framework, exemplified by an open-source Python-based library ANDES, which consists of a symbolic layer for descriptive modeling and a numeric layer for vector-based numerical computation. This method enables the implementation of DAE models by mixing and matching modeling components, through which models are described. In the framework, a rich set of discontinuous components and standard transfer function blocks are provided besides essential modeling elements for rapid modeling. ANDES can automatically generate robust and fast numerical simulation code, as well as high-quality documentation. Case studies present a) two implementations of turbine governor model TGOV1, b) power flow computation time breakdown for MATPOWER systems, c) validation of time-domain simulation with commercial software using three test systems with a variety of models, and d) the full eigenvalue analysis for Kundur's system. Validation shows that ANDES closely matches the commercial tool DSATools for power flow, time-domain simulation, and eigenvalue analysis.

Submitted 12 August, 2020; v1 submitted 21 February, 2020; originally announced February 2020.

arXiv:1906.08469 [pdf, other]

Predicting Motion of Vulnerable Road Users using High-Definition Maps and Efficient ConvNets

Authors: Fang-Chieh Chou, Tsung-Han Lin, Henggang Cui, Vladan Radosavljevic, Thi Nguyen, Tzu-Kuo Huang, Matthew Niedoba, Jeff Schneider, Nemanja Djuric

Abstract: Following detection and tracking of traffic actors, prediction of their future motion is the next critical component of self-driving vehicle (SDV) technology, allowing the SDV to operate safely and efficiently in its environment. This is particularly important when it comes to vulnerable road users (VRUs), such as pedestrians and bicyclists. These actors need to be handled with special care due to an increased risk of injury, as well as the fact that their behavior is less predictable than that of motorized actors. To address this issue, in the current study we present a deep learning-based method for predicting VRU movement, where we rasterize high-definition maps and actors' surroundings into a bird's-eye view image used as an input to deep convolutional networks. In addition, we propose a fast architecture suitable for real-time inference, and perform an ablation study of various rasterization approaches to find the optimal choice for accurate prediction. The results strongly indicate benefits of using the proposed approach for motion prediction of VRUs, both in terms of accuracy and latency.

Submitted 11 June, 2020; v1 submitted 20 June, 2019; originally announced June 2019.

Comments: Accepted for publication at IEEE Intelligent Vehicles Symposium (IV) 2020

arXiv:1803.02337 [pdf]

Cyber-Physical Testbed for Power System Wide-Area Measurement-Based Control Using Open-Source Software

Authors: Hantao Cui, Fangxing Li, Kevin Tomsovic, Siqi Wang, Riyasat Azim, Yidan Lu, Haoyu Yuan

Abstract: The electric power system is a cyber-physical system with power flow in the physical system and information flow in the cyber. Simulation is crucial to understanding the dynamics and control of electric power systems yet the underlying communication system has historically been ignored in these studies. This paper aims at meeting the increasing needs to simulate the operations of a real power system including the physical system, the energy management system, the communication system, and the emerging wide-area measurement-based controls. This paper proposes a cyber-physical testbed design and implementation for verifying and demonstrating wide-area control methods based on streaming telemetry and phasor measurement unit data. The proposed decoupled architecture is composed of a differential algebraic equation based physical system simulator, a software-defined network, a scripting language environment for prototyping an EMS system and a control system, all of which are integrated over industry-standard communication protocols. The proposed testbed is implemented using open-source software packages managed by a Python dispatcher. Finally, demonstrations are presented to show two wide-area measurement-based controls - system separation control and hierarchical voltage control, in the implemented testbed.

Submitted 6 March, 2018; originally announced March 2018.

Comments: Submitted to IET CPS

Showing 1–40 of 40 results for author: Cui, H