Zum Hauptinhalt springen

Showing 1–50 of 95 results for author: Zheng, X

Searching in archive eess. Search in all archives.
.
  1. arXiv:2408.12829  [pdf, other

    cs.LG cs.SD eess.AS

    Uncertainty-Aware Mean Opinion Score Prediction

    Authors: Hui Wang, Shiwan Zhao, Jiaming Zhou, Xiguang Zheng, Haoqin Sun, Xuechen Wang, Yong Qin

    Abstract: Mean Opinion Score (MOS) prediction has made significant progress in specific domains. However, the unstable performance of MOS prediction models across diverse samples presents ongoing challenges in the practical application of these systems. In this paper, we point out that the absence of uncertainty modeling is a significant limitation hindering MOS prediction systems from applying to the real… ▽ More

    Submitted 23 August, 2024; originally announced August 2024.

    Comments: Accepted by Interspeech 2024, oral

  2. arXiv:2408.10235  [pdf, other

    eess.SP cs.HC cs.LG

    Multi-Source EEG Emotion Recognition via Dynamic Contrastive Domain Adaptation

    Authors: Yun Xiao, Yimeng Zhang, Xiaopeng Peng, Shuzheng Han, Xia Zheng, Dingyi Fang, Xiaojiang Chen

    Abstract: Electroencephalography (EEG) provides reliable indications of human cognition and mental states. Accurate emotion recognition from EEG remains challenging due to signal variations among individuals and across measurement sessions. To address these challenges, we introduce a multi-source dynamic contrastive domain adaptation method (MS-DCDA), which models coarse-grained inter-domain and fine-graine… ▽ More

    Submitted 3 August, 2024; originally announced August 2024.

  3. arXiv:2408.03124  [pdf, other

    eess.SY cs.LG

    Closed-loop Diffusion Control of Complex Physical Systems

    Authors: Long Wei, Haodong Feng, Peiyan Hu, Tao Zhang, Yuchen Yang, Xiang Zheng, Ruiqi Feng, Dixia Fan, Tailin Wu

    Abstract: The control problems of complex physical systems have wide applications in science and engineering. Several previous works have demonstrated that generative control methods based on diffusion models have significant advantages for solving these problems. However, existing generative control methods face challenges in handling closed-loop control, which is an inherent constraint for effective contr… ▽ More

    Submitted 31 July, 2024; originally announced August 2024.

  4. arXiv:2408.02943  [pdf, other

    eess.SP

    Recent Advances in Data-driven Intelligent Control for Wireless Communication: A Comprehensive Survey

    Authors: Wei Huo, Huiwen Yang, Nachuan Yang, Zhaohua Yang, Jiuzhou Zhang, Fuhai Nan, Xingzhou Chen, Yifan Mao, Suyang Hu, Pengyu Wang, Xuanyu Zheng, Mingming Zhao, Ling Shi

    Abstract: The advent of next-generation wireless communication systems heralds an era characterized by high data rates, low latency, massive connectivity, and superior energy efficiency. These systems necessitate innovative and adaptive strategies for resource allocation and device behavior control in wireless networks. Traditional optimization-based methods have been found inadequate in meeting the complex… ▽ More

    Submitted 6 August, 2024; originally announced August 2024.

  5. arXiv:2407.02007  [pdf, other

    eess.AS

    SOT Triggered Neural Clustering for Speaker Attributed ASR

    Authors: Xianrui Zheng, Guangzhi Sun, Chao Zhang, Philip C. Woodland

    Abstract: This paper introduces a novel approach to speaker-attributed ASR transcription using a neural clustering method. With a parallel processing mechanism, diarisation and ASR can be applied simultaneously, helping to prevent the accumulation of errors from one sub-system to the next in a cascaded system. This is achieved by the use of ASR, trained using a serialised output training method, together wi… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: To appear in Interspeech 2024

  6. arXiv:2406.03011  [pdf, other

    cs.IT eess.SP

    Huygens-Fresnel Model Based Position-Aided Phase Configuration for 1-Bit RIS Assisted Wireless Communication

    Authors: Xiao Zheng, Wenchi Cheng, Jiangzhou Wang

    Abstract: Reconfigurable intelligent surface (RIS), composed of nearly passive elements, is regarded as one of the potential paradigms to support multi-gigabit data in real-time. However, in traditional CSI (channel state information) driven frame, the training overhead of channel estimation greatly increases as the number of RIS elements increases to intelligently manipulate the reflected signals. To conve… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: 15 pages, accepted by IEEE TCOM (early access)

    ACM Class: H.1.1

  7. arXiv:2405.06125  [pdf

    eess.SY

    Cooperative Route Guidance and Flow Control for Mixed Road Networks Comprising Expressway and Arterial Network

    Authors: Yunran Di, Haotian Shi, Weihua Zhang, Heng Ding, Xiaoyan Zheng, Bin Ran

    Abstract: Facing the congestion challenges of mixed road networks comprising expressways and arterial road networks, traditional control solutions fall short. To effectively alleviate traffic congestion in mixed road networks, it is crucial to clear the interaction between expressways and arterial networks and achieve orderly coordination between them. This study employs the multi-class cell transmission mo… ▽ More

    Submitted 9 May, 2024; originally announced May 2024.

  8. arXiv:2404.19242  [pdf, other

    cs.CV eess.IV stat.ME

    A Minimal Set of Parameters Based Depth-Dependent Distortion Model and Its Calibration Method for Stereo Vision Systems

    Authors: Xin Ma, Puchen Zhu, Xiao Li, Xiaoyin Zheng, Jianshu Zhou, Xuchen Wang, Kwok Wai Samuel Au

    Abstract: Depth position highly affects lens distortion, especially in close-range photography, which limits the measurement accuracy of existing stereo vision systems. Moreover, traditional depth-dependent distortion models and their calibration methods have remained complicated. In this work, we propose a minimal set of parameters based depth-dependent distortion model (MDM), which considers the radial an… ▽ More

    Submitted 1 May, 2024; v1 submitted 29 April, 2024; originally announced April 2024.

    Comments: This paper has been accepted for publication in IEEE Transactions on Instrumentation and Measurement

  9. arXiv:2403.13346  [pdf, other

    eess.SY

    A Control-Recoverable Added-Noise-based Privacy Scheme for LQ Control in Networked Control Systems

    Authors: Xuening Tang, Xianghui Cao, Wei Xing Zheng

    Abstract: As networked control systems continue to evolve, ensuring the privacy of sensitive data becomes an increasingly pressing concern, especially in situations where the controller is physically separated from the plant. In this paper, we propose a secure control scheme for computing linear quadratic control in a networked control system utilizing two networked controllers, a privacy encoder and a cont… ▽ More

    Submitted 22 March, 2024; v1 submitted 20 March, 2024; originally announced March 2024.

  10. arXiv:2402.01808  [pdf, other

    cs.SD eess.AS

    KS-Net: Multi-band joint speech restoration and enhancement network for 2024 ICASSP SSI Challenge

    Authors: Guochen Yu, Runqiang Han, Chenglin Xu, Haoran Zhao, Nan Li, Chen Zhang, Xiguang Zheng, Chao Zhou, Qi Huang, Bing Yu

    Abstract: This paper presents the speech restoration and enhancement system created by the 1024K team for the ICASSP 2024 Speech Signal Improvement (SSI) Challenge. Our system consists of a generative adversarial network (GAN) in complex-domain for speech restoration and a fine-grained multi-band fusion module for speech enhancement. In the blind test set of SSI, the proposed system achieves an overall mean… ▽ More

    Submitted 2 February, 2024; originally announced February 2024.

    Comments: Accepted to ICASSP 2024; Rank 1st in ICASSP 2024 Speech Signal Improvement (SSI) Challenge

  11. arXiv:2401.11349  [pdf, other

    physics.flu-dyn cs.LG eess.SY

    Asynchronous Parallel Reinforcement Learning for Optimizing Propulsive Performance in Fin Ray Control

    Authors: Xin-Yang Liu, Dariush Bodaghi, Qian Xue, Xudong Zheng, Jian-Xun Wang

    Abstract: Fish fin rays constitute a sophisticated control system for ray-finned fish, facilitating versatile locomotion within complex fluid environments. Despite extensive research on the kinematics and hydrodynamics of fish locomotion, the intricate control strategies in fin-ray actuation remain largely unexplored. While deep reinforcement learning (DRL) has demonstrated potential in managing complex non… ▽ More

    Submitted 20 January, 2024; originally announced January 2024.

    Comments: 37 pages, 12 figures

  12. arXiv:2312.13722  [pdf, other

    cs.SD eess.AS

    BAE-Net: A Low complexity and high fidelity Bandwidth-Adaptive neural network for speech super-resolution

    Authors: Guochen Yu, Xiguang Zheng, Nan Li, Runqiang Han, Chengshi Zheng, Chen Zhang, Chao Zhou, Qi Huang, Bing Yu

    Abstract: Speech bandwidth extension (BWE) has demonstrated promising performance in enhancing the perceptual speech quality in real communication systems. Most existing BWE researches primarily focus on fixed upsampling ratios, disregarding the fact that the effective bandwidth of captured audio may fluctuate frequently due to various capturing devices and transmission conditions. In this paper, we propose… ▽ More

    Submitted 21 December, 2023; originally announced December 2023.

    Comments: Accepted to ICASSP 2024

  13. arXiv:2311.12840  [pdf, other

    cs.CV cs.AI eess.IV

    Wafer Map Defect Patterns Semi-Supervised Classification Using Latent Vector Representation

    Authors: Qiyu Wei, Wei Zhao, Xiaoyan Zheng, Zeng Zeng

    Abstract: As the globalization of semiconductor design and manufacturing processes continues, the demand for defect detection during integrated circuit fabrication stages is becoming increasingly critical, playing a significant role in enhancing the yield of semiconductor products. Traditional wafer map defect pattern detection methods involve manual inspection using electron microscopes to collect sample i… ▽ More

    Submitted 6 October, 2023; originally announced November 2023.

    Comments: 6 pages, 2 figures, CIS confernece

  14. arXiv:2310.15417  [pdf, other

    eess.SY

    A Semantic-driven Approach for Maintenance Digitalization in the Pharmaceutical Industry

    Authors: Ju Wu, Xiaochen Zheng, Marco Madlena, Dimitrios Kyritsis

    Abstract: The digital transformation of pharmaceutical industry is a challenging task due to the high complexity of involved elements and the strict regulatory compliance. Maintenance activities in the pharmaceutical industry play an essential role in ensuring product quality and integral functioning of equipment and premises. This paper first identifies the key challenges of digitalization in pharmaceutica… ▽ More

    Submitted 23 October, 2023; originally announced October 2023.

  15. arXiv:2310.04791  [pdf, other

    eess.AS cs.LG cs.SD

    Conditional Diffusion Model for Target Speaker Extraction

    Authors: Theodor Nguyen, Guangzhi Sun, Xianrui Zheng, Chao Zhang, Philip C Woodland

    Abstract: We propose DiffSpEx, a generative target speaker extraction method based on score-based generative modelling through stochastic differential equations. DiffSpEx deploys a continuous-time stochastic diffusion process in the complex short-time Fourier transform domain, starting from the target speaker source and converging to a Gaussian distribution centred on the mixture of sources. For the reverse… ▽ More

    Submitted 7 October, 2023; originally announced October 2023.

    Comments: 5 pages, 4 figures, submitted to ICASSP 2024

  16. RAMP: Retrieval-Augmented MOS Prediction via Confidence-based Dynamic Weighting

    Authors: Hui Wang, Shiwan Zhao, Xiguang Zheng, Yong Qin

    Abstract: Automatic Mean Opinion Score (MOS) prediction is crucial to evaluate the perceptual quality of the synthetic speech. While recent approaches using pre-trained self-supervised learning (SSL) models have shown promising results, they only partly address the data scarcity issue for the feature extractor. This leaves the data scarcity issue for the decoder unresolved and leading to suboptimal performa… ▽ More

    Submitted 31 August, 2023; originally announced August 2023.

    Comments: Accepted by Interspeech 2023, oral

    Journal ref: INTERSPEECH 2023, 1095-1099

  17. arXiv:2308.07127  [pdf, other

    eess.SY

    A Lightweight Sensor Scheduler Based on AoI Function for Remote State Estimation over Lossy Wireless Channels

    Authors: Taige Chang, Xianghui Cao, Wei Xing Zheng

    Abstract: This paper investigates the problem of sensor scheduling for remotely estimating the states of heterogeneous dynamical systems over resource-limited and lossy wireless channels. Considering the low time complexity and high versatility requirements of schedulers deployed on the transport layer, we propose a lightweight scheduler based on an Age of Information (AoI) function built with the tight sca… ▽ More

    Submitted 30 August, 2023; v1 submitted 14 August, 2023; originally announced August 2023.

  18. arXiv:2306.14143  [pdf, other

    eess.SP

    Intelligent Multi-Modal Sensing-Communication Integration: Synesthesia of Machines

    Authors: Xiang Cheng, Haotian Zhang, Jianan Zhang, Shijian Gao, Sijiang Li, Ziwei Huang, Lu Bai, Zonghui Yang, Xinhu Zheng, Liuqing Yang

    Abstract: In the era of sixth-generation (6G) wireless communications, integrated sensing and communications (ISAC) is recognized as a promising solution to upgrade the physical system by endowing wireless communications with sensing capability. Existing ISAC is mainly oriented to static scenarios with radio-frequency (RF) sensors being the primary participants, thus lacking a comprehensive environment feat… ▽ More

    Submitted 20 November, 2023; v1 submitted 25 June, 2023; originally announced June 2023.

    Comments: This paper has been accepted by IEEE Communications Surveys & Tutorials

  19. arXiv:2306.05358  [pdf, other

    cs.CR cs.AI cs.LG cs.SD eess.AS

    Trustworthy Sensor Fusion against Inaudible Command Attacks in Advanced Driver-Assistance System

    Authors: Jiwei Guan, Lei Pan, Chen Wang, Shui Yu, Longxiang Gao, Xi Zheng

    Abstract: There are increasing concerns about malicious attacks on autonomous vehicles. In particular, inaudible voice command attacks pose a significant threat as voice commands become available in autonomous driving systems. How to empirically defend against these inaudible attacks remains an open question. Previous research investigates utilizing deep learning-based multimodal fusion for defense, without… ▽ More

    Submitted 29 May, 2023; originally announced June 2023.

  20. arXiv:2306.01942  [pdf, other

    cs.CL cs.SD eess.AS

    Can Contextual Biasing Remain Effective with Whisper and GPT-2?

    Authors: Guangzhi Sun, Xianrui Zheng, Chao Zhang, Philip C. Woodland

    Abstract: End-to-end automatic speech recognition (ASR) and large language models, such as Whisper and GPT-2, have recently been scaled to use vast amounts of training data. Despite the large amount of training data, infrequent content words that occur in a particular task may still exhibit poor ASR performance, with contextual biasing a possible remedy. This paper investigates the effectiveness of neural c… ▽ More

    Submitted 2 June, 2023; originally announced June 2023.

    Comments: To appear in Interspeech 2023

  21. arXiv:2305.04929  [pdf, other

    physics.ao-ph eess.SY

    Impact of Climate Simulation Resolutions on Future Energy System Reliability Assessment: A Texas Case Study

    Authors: Xiangtian Zheng, Le Xie, Kiyeob Lee, Dan Fu, Jiahan Wu, Ping Chang

    Abstract: The reliability of energy systems is strongly influenced by the prevailing climate conditions. With the increasing prevalence of renewable energy sources, the interdependence between energy and climate systems has become even stronger. This study examines the impact of different spatial resolutions in climate modeling on energy grid reliability assessment, with the Texas interconnection between 20… ▽ More

    Submitted 5 May, 2023; originally announced May 2023.

  22. arXiv:2304.04952  [pdf, other

    cs.CV cs.LG eess.IV

    Data-Efficient Image Quality Assessment with Attention-Panel Decoder

    Authors: Guanyi Qin, Runze Hu, Yutao Liu, Xiawu Zheng, Haotian Liu, Xiu Li, Yan Zhang

    Abstract: Blind Image Quality Assessment (BIQA) is a fundamental task in computer vision, which however remains unresolved due to the complex distortion conditions and diversified image contents. To confront this challenge, we in this paper propose a novel BIQA pipeline based on the Transformer architecture, which achieves an efficient quality-aware feature representation with much fewer data. More specific… ▽ More

    Submitted 10 April, 2023; originally announced April 2023.

    Comments: Accepted by AAAI 2023

  23. arXiv:2304.00871  [pdf, other

    eess.AS

    Self-Supervised Learning-Based Source Separation for Meeting Data

    Authors: Yuang Li, Xianrui Zheng, Philip C. Woodland

    Abstract: Source separation can improve automatic speech recognition (ASR) under multi-party meeting scenarios by extracting single-speaker signals from overlapped speech. Despite the success of self-supervised learning models in single-channel source separation, most studies have focused on simulated setups. In this paper, seven SSL models were compared on both simulated and real-world corpora. Then, we pr… ▽ More

    Submitted 3 April, 2023; originally announced April 2023.

    Comments: To appear in Proc. ICASSP2023

  24. arXiv:2303.02308  [pdf, other

    eess.SP

    A Physics-based and Data-driven Approach for Localized Statistical Channel Modeling

    Authors: Shutao Zhang, Xinzhi Ning, Xi Zheng, Qingjiang Shi, Tsung-Hui Chang, Zhi-Quan Luo

    Abstract: Localized channel modeling is crucial for offline performance optimization of 5G cellular networks, but the existing channel models are for general scenarios and do not capture local geographical structures. In this paper, we propose a novel physics-based and data-driven localized statistical channel modeling (LSCM), which is capable of sensing the physical geographical structures of the targeted… ▽ More

    Submitted 3 March, 2023; originally announced March 2023.

    Comments: the 34th International Teletraffic Congress (ITC), Shenzhen, China, 2022

  25. arXiv:2212.14189  [pdf, other

    cs.CY eess.SY

    High Resolution Modeling and Analysis of Cryptocurrency Mining's Impact on Power Grids: Carbon Footprint, Reliability, and Electricity Price

    Authors: Ali Menati, Xiangtian Zheng, Kiyeob Lee, Ranyu Shi, Pengwei Du, Chanan Singh, Le Xie

    Abstract: Blockchain technologies are considered one of the most disruptive innovations of the last decade, enabling secure decentralized trust-building. However, in recent years, with the rapid increase in the energy consumption of blockchain-based computations for cryptocurrency mining, there have been growing concerns about their sustainable operation in electric grids. This paper investigates the tri-fa… ▽ More

    Submitted 14 April, 2023; v1 submitted 29 December, 2022; originally announced December 2022.

    Comments: This paper has been accepted for publication in the journal of "Advances in Applied Energy"

  26. arXiv:2212.04250  [pdf, other

    cs.RO eess.SY

    Adaptive Neural Network Backstepping Control Method for Aerial Manipulator Based on Variable Inertia Parameter Modeling

    Authors: Hai Li, Zhan Li, Xiaolong Zheng, Jinhui Liu

    Abstract: For the aerial manipulator that performs aerial work tasks, the actual operating environment it faces is very complex, and it is affected by internal and external multi-source disturbances. In this paper, to effectively improve the anti-disturbance control performance of the aerial manipulator, an adaptive neural network backstepping control method based on variable inertia parameter modeling is p… ▽ More

    Submitted 8 December, 2022; originally announced December 2022.

  27. arXiv:2211.15127  [pdf

    cs.RO eess.SY

    Safety-quantifiable Line Feature-based Monocular Visual Localization with 3D Prior Map

    Authors: Xi Zheng, Weisong Wen, Li-Ta Hsu

    Abstract: Accurate and safety-quantifiable localization is of great significance for safety-critical autonomous systems, such as unmanned ground vehicles (UGV) and unmanned aerial vehicles (UAV). The visual odometry-based method can provide accurate positioning in a short period but is subjected to drift over time. Moreover, the quantification of the safety of the localization solution (the error is bounded… ▽ More

    Submitted 28 November, 2022; originally announced November 2022.

  28. arXiv:2211.13440  [pdf, other

    eess.IV cs.CV

    Iterative Data Refinement for Self-Supervised MR Image Reconstruction

    Authors: Xue Liu, Juan Zou, Xiawu Zheng, Cheng Li, Hairong Zheng, Shanshan Wang

    Abstract: Magnetic Resonance Imaging (MRI) has become an important technique in the clinic for the visualization, detection, and diagnosis of various diseases. However, one bottleneck limitation of MRI is the relatively slow data acquisition process. Fast MRI based on k-space undersampling and high-quality image reconstruction has been widely utilized, and many deep learning-based methods have been develope… ▽ More

    Submitted 24 November, 2022; originally announced November 2022.

    Comments: 5 pages, 2 figures, 1 table

    MSC Class: 68T10 ACM Class: I.4.5

  29. arXiv:2211.07993  [pdf, other

    eess.IV cs.CV cs.LG

    DIGEST: Deeply supervIsed knowledGE tranSfer neTwork learning for brain tumor segmentation with incomplete multi-modal MRI scans

    Authors: Haoran Li, Cheng Li, Weijian Huang, Xiawu Zheng, Yan Xi, Shanshan Wang

    Abstract: Brain tumor segmentation based on multi-modal magnetic resonance imaging (MRI) plays a pivotal role in assisting brain cancer diagnosis, treatment, and postoperative evaluations. Despite the achieved inspiring performance by existing automatic segmentation methods, multi-modal MRI data are still unavailable in real-world clinical applications due to quite a few uncontrollable factors (e.g. differe… ▽ More

    Submitted 15 November, 2022; originally announced November 2022.

    Comments: 4 pages,2 figures,2 tables

  30. arXiv:2211.07966  [pdf, other

    eess.IV cs.CV cs.LG

    Adaptive PromptNet For Auxiliary Glioma Diagnosis without Contrast-Enhanced MRI

    Authors: Yeqi Wang, Weijian Huang, Cheng Li, Xiawu Zheng, Yusong Lin, Shanshan Wang

    Abstract: Multi-contrast magnetic resonance imaging (MRI)-based automatic auxiliary glioma diagnosis plays an important role in the clinic. Contrast-enhanced MRI sequences (e.g., contrast-enhanced T1-weighted imaging) were utilized in most of the existing relevant studies, in which remarkable diagnosis results have been reported. Nevertheless, acquiring contrast-enhanced MRI data is sometimes not feasible d… ▽ More

    Submitted 15 November, 2022; originally announced November 2022.

    Comments: 5 pages, 2 figures, 2 tables

    MSC Class: 68T10 ACM Class: I.4.9

  31. arXiv:2211.04584  [pdf, other

    cs.AI eess.SY

    Energy System Digitization in the Era of AI: A Three-Layered Approach towards Carbon Neutrality

    Authors: Le Xie, Tong Huang, Xiangtian Zheng, Yan Liu, Mengdi Wang, Vijay Vittal, P. R. Kumar, Srinivas Shakkottai, Yi Cui

    Abstract: The transition towards carbon-neutral electricity is one of the biggest game changers in addressing climate change since it addresses the dual challenges of removing carbon emissions from the two largest sectors of emitters: electricity and transportation. The transition to a carbon-neutral electric grid poses significant challenges to conventional paradigms of modern grid planning and operation.… ▽ More

    Submitted 2 November, 2022; originally announced November 2022.

    Comments: To be published in Patterns (Cell Press)

  32. arXiv:2210.11212  [pdf, ps, other

    eess.SY

    Robust prescribed-time coordination control of cooperative-antagonistic networks with disturbances

    Authors: Zhen-Hua Zhu, Huaiyu Wu, Zhi-Hong Guan, Zhi-Wei Liu, Yang Chen, Xiujuan Zheng

    Abstract: This article targets at addressing the robust prescribed-time coordination control (PTCC) problems for single-integrator cooperative-antagonistic networks (CANs) with external disturbances under arbitrary fixed signed digraphs without any structural constraints. Toward this end, the PTCC problems for nominal single-integrator CANs without disturbances are first investigated and a fully distributed… ▽ More

    Submitted 20 October, 2022; originally announced October 2022.

    Comments: 16 pages, 12 figures

  33. arXiv:2210.01337  [pdf, ps, other

    eess.SP

    Compressed CPD-Based Channel Estimation and Joint Beamforming for RIS-Assisted Millimeter Wave Communications

    Authors: Xi Zheng, Jun Fang, Hongwei Wang, Peilan Wang, Hongbin Li

    Abstract: We consider the problem of channel estimation and joint active and passive beamforming for reconfigurable intelligent surface (RIS) assisted millimeter wave (mmWave) multiple-input multiple-output (MIMO) orthogonal frequency division multiplexing (OFDM) systems. We show that, with a well-designed frame-based training protocol, the received pilot signal can be organized into a low-rank third-order… ▽ More

    Submitted 3 October, 2022; originally announced October 2022.

    Comments: arXiv admin note: text overlap with arXiv:2203.16164

  34. arXiv:2210.00902  [pdf

    cs.NI eess.SP

    AdaComm: Tracing Channel Dynamics for Reliable Cross-Technology Communication

    Authors: Weiguo Wang, Xiaolong Zheng, Yuan He, Xiuzhen Guo

    Abstract: Cross-Technology Communication (CTC) is an emerging technology to support direct communication between wireless devices that follow different standards. In spite of the many different proposals from the community to enable CTC, the performance aspect of CTC is an equally important problem but has seldom been studied before. We find this problem is extremely challenging, due to the following reason… ▽ More

    Submitted 30 September, 2022; originally announced October 2022.

  35. arXiv:2209.13645  [pdf, other

    eess.SP cs.LG

    PearNet: A Pearson Correlation-based Graph Attention Network for Sleep Stage Recognition

    Authors: Jianchao Lu, Yuzhe Tian, Shuang Wang, Michael Sheng, Xi Zheng

    Abstract: Sleep stage recognition is crucial for assessing sleep and diagnosing chronic diseases. Deep learning models, such as Convolutional Neural Networks and Recurrent Neural Networks, are trained using grid data as input, making them not capable of learning relationships in non-Euclidean spaces. Graph-based deep models have been developed to address this issue when investigating the external relationsh… ▽ More

    Submitted 16 October, 2022; v1 submitted 26 September, 2022; originally announced September 2022.

  36. arXiv:2209.00805  [pdf, other

    eess.AS

    Multi-scale temporal-frequency attention for music source separation

    Authors: Lianwu Chen, Xiguang Zheng, Chen Zhang, Liang Guo, Bing Yu

    Abstract: In recent years, deep neural networks (DNNs) based approaches have achieved the start-of-the-art performance for music source separation (MSS). Although previous methods have addressed the large receptive field modeling using various methods, the temporal and frequency correlations of the music spectrogram with repeated patterns have not been explicitly explored for the MSS task. In this paper, a… ▽ More

    Submitted 1 September, 2022; originally announced September 2022.

  37. arXiv:2208.04661  [pdf, other

    eess.IV cs.CV

    OL-DN: Online learning based dual-domain network for HEVC intra frame quality enhancement

    Authors: Renwei Yang, Shuyuan Zhu, Xiaozhen Zheng, Bing Zeng

    Abstract: Convolution neural network (CNN) based methods offer effective solutions for enhancing the quality of compressed image and video. However, these methods ignore using the raw data to enhance the quality. In this paper, we adopt the raw data in the quality enhancement for the HEVC intra-coded image by proposing an online learning-based method. When quality enhancement is demanded, we online train ou… ▽ More

    Submitted 9 August, 2022; originally announced August 2022.

  38. arXiv:2208.04130  [pdf, other

    eess.SY cs.LG

    Reliability Analysis of Complex Multi-State System Based on Universal Generating Function and Bayesian Network

    Authors: Xu Liu, Wen Yao, Xiaohu Zheng, Yingchun Xu

    Abstract: In the complex multi-state system (MSS), reliability analysis is a significant research content, both for equipment design, manufacturing, usage and maintenance. Universal Generating Function (UGF) is an important method in the reliability analysis, which efficiently obtains the system reliability by a fast algebraic procedure. However, when structural relationships between subsystems or component… ▽ More

    Submitted 15 June, 2022; originally announced August 2022.

  39. arXiv:2207.03852  [pdf, other

    eess.AS cs.SD

    Tandem Multitask Training of Speaker Diarisation and Speech Recognition for Meeting Transcription

    Authors: Xianrui Zheng, Chao Zhang, Philip C. Woodland

    Abstract: Self-supervised-learning-based pre-trained models for speech data, such as Wav2Vec 2.0 (W2V2), have become the backbone of many speech tasks. In this paper, to achieve speaker diarisation and speech recognition using a single model, a tandem multitask training (TMT) method is proposed to fine-tune W2V2. For speaker diarisation, the tasks of voice activity detection (VAD) and speaker classification… ▽ More

    Submitted 8 July, 2022; originally announced July 2022.

    Comments: To appear in Interspeech 2022

  40. arXiv:2207.02464  [pdf, other

    cs.RO cs.AI eess.SY

    A Learning System for Motion Planning of Free-Float Dual-Arm Space Manipulator towards Non-Cooperative Object

    Authors: Shengjie Wang, Yuxue Cao, Xiang Zheng, Tao Zhang

    Abstract: Recent years have seen the emergence of non-cooperative objects in space, like failed satellites and space junk. These objects are usually operated or collected by free-float dual-arm space manipulators. Thanks to eliminating the difficulties of modeling and manual parameter-tuning, reinforcement learning (RL) methods have shown a more promising sign in the trajectory planning of space manipulator… ▽ More

    Submitted 6 July, 2022; originally announced July 2022.

    Comments: 15 pages, 6 figures

  41. arXiv:2206.00184  [pdf, other

    eess.SY

    How Much Demand Flexibility Could Have Spared Texas from the 2021 Outage?

    Authors: Dongqi Wu, Xiangtian Zheng, Ali Menati, Lane Smith, Bainan Xia, Yixing Xu, Chanan Singh, Le Xie

    Abstract: The February 2021 Texas winter power outage has led to hundreds of deaths and billions of dollars in economic losses, largely due to the generation failure and record-breaking electric demand. In this paper, we study the scaling-up of demand flexibility as a means to avoid load shedding during such an extreme weather event. The three mechanisms considered are interruptible load, residential load r… ▽ More

    Submitted 31 May, 2022; originally announced June 2022.

    Comments: This paper has been submitted to a journal for review

  42. arXiv:2205.05180  [pdf, other

    eess.SY cs.AI

    Massively Digitized Power Grid: Opportunities and Challenges of Use-inspired AI

    Authors: Le Xie, Xiangtian Zheng, Yannan Sun, Tong Huang, Tony Bruton

    Abstract: This article presents a use-inspired perspective of the opportunities and challenges in a massively digitized power grid. It argues that the intricate interplay of data availability, computing capability, and artificial intelligence (AI) algorithm development are the three key factors driving the adoption of digitized solutions in the power grid. The impact of these three factors on critical funct… ▽ More

    Submitted 10 May, 2022; originally announced May 2022.

  43. arXiv:2205.04821  [pdf, other

    eess.IV cs.CV

    Self-supervised regression learning using domain knowledge: Applications to improving self-supervised denoising in imaging

    Authors: Il Yong Chun, Dongwon Park, Xuehang Zheng, Se Young Chun, Yong Long

    Abstract: Regression that predicts continuous quantity is a central part of applications using computational imaging and computer vision technologies. Yet, studying and understanding self-supervised learning for regression tasks - except for a particular regression task, image denoising - have lagged behind. This paper proposes a general self-supervised regression learning (SSRL) framework that enables lear… ▽ More

    Submitted 10 May, 2022; originally announced May 2022.

    Comments: 17 pages, 16 figures, 2 tables, submitted to IEEE T-IP

  44. arXiv:2204.10636  [pdf

    cs.SE eess.SY

    Ontology-based system to support industrial system design for aircraft assembly

    Authors: Xiaodu Hu, Rebeca Arista, Xiaochen Zheng, Joachim Lentes, Jyri Sorvari, Jinzhi Lu, Fernando Ubis, Dimitris Kiritsis

    Abstract: The development of an aircraft industrial system is a complex process which faces the challenge of digital discontinuity in multidisciplinary engineering due to various interfaces between different digital tools, leading to extra development time and costs. This paper proposes an ontology-based system, aiming at functionality integration and design process automation, by Models for Manufacturing m… ▽ More

    Submitted 22 April, 2022; originally announced April 2022.

    Comments: 6 pages, 9 figures, IFAC IMS 2022

  45. arXiv:2204.01327  [pdf

    cs.LG eess.SY

    Algorithms for Bayesian network modeling and reliability inference of complex multistate systems: Part II-Dependent systems

    Authors: Xiaohu Zheng, Wen Yao, Xiaoqian Chen

    Abstract: In using the Bayesian network (BN) to construct the complex multistate system's reliability model as described in Part I, the memory storage requirements of the node probability table (NPT) will exceed the random access memory (RAM) of the computer. However, the proposed inference algorithm of Part I is not suitable for the dependent system. This Part II proposes a novel method for BN reliability… ▽ More

    Submitted 4 April, 2022; originally announced April 2022.

  46. arXiv:2203.16164  [pdf, ps, other

    eess.SP

    Compressed Channel Estimation for IRS-Assisted Millimeter Wave OFDM Systems: A Low-Rank Tensor Decomposition-Based Approach

    Authors: Xi Zheng, Peilan Wang, Jun Fang, Hongbin Li

    Abstract: We consider the problem of downlink channel estimation for intelligent reflecting surface (IRS)-assisted millimeter Wave (mmWave) orthogonal frequency division multiplexing (OFDM) systems. By exploring the inherent sparse scattering characteristics of mmWave channels, we show that the received signals can be expressed as a low-rank third-order tensor that admits a tensor rank decomposition, also k… ▽ More

    Submitted 30 March, 2022; originally announced March 2022.

    Comments: Accepted by IEEE Wireless Communications Letters

  47. arXiv:2203.15655  [pdf

    cs.LG eess.SY

    Consistency regularization-based Deep Polynomial Chaos Neural Network Method for Reliability Analysis

    Authors: Xiaohu Zheng, Wen Yao, Yunyang Zhang, Xiaoya Zhang

    Abstract: Polynomial chaos expansion (PCE) is a powerful surrogate model-based reliability analysis method. Generally, a PCE model with a higher expansion order is usually required to obtain an accurate surrogate model for some complex non-linear stochastic systems. However, the high-order PCE increases the number of labeled data required for solving the expansion coefficients. To alleviate this problem, th… ▽ More

    Submitted 4 April, 2022; v1 submitted 29 March, 2022; originally announced March 2022.

  48. arXiv:2203.14033  [pdf, other

    cs.RO eess.SY

    Aggressive Quadrotor Flight Using Curiosity-Driven Reinforcement Learning

    Authors: Qiyu Sun, Jinbao Fang, Wei Xing Zheng, Yang Tang

    Abstract: The ability to perform aggressive movements, which are called aggressive flights, is important for quadrotors during navigation. However, aggressive quadrotor flights are still a great challenge to practical applications. The existing solutions to aggressive flights heavily rely on a predefined trajectory, which is a time-consuming preprocessing step. To avoid such path planning, we propose a curi… ▽ More

    Submitted 26 March, 2022; originally announced March 2022.

  49. arXiv:2203.03634  [pdf, other

    eess.IV cs.CV cs.HC

    Remote blood pressure measurement via spatiotemporal mapping of a short-time facial video

    Authors: Jialiang Zhuang, Bin Li, Yun Zhang, Yuheng Chen, Xiujuan Zheng

    Abstract: Blood pressure (BP) monitoring is vital in daily healthcare, especially for cardiovascular diseases. However, BP values are mainly acquired through the contact sensing method, which is inconvenient and unfriendly to continuous BP measurement. Hence, we propose an efficient end-to-end network to estimate the BP values from a facial video to achieve remote BP measurement in daily life. In this study… ▽ More

    Submitted 23 June, 2022; v1 submitted 7 March, 2022; originally announced March 2022.

    Comments: 7 pages, 7 figures

  50. arXiv:2202.10372  [pdf, other

    eess.AS cs.LG cs.SD eess.SP

    L3DAS22 Challenge: Learning 3D Audio Sources in a Real Office Environment

    Authors: Eric Guizzo, Christian Marinoni, Marco Pennese, Xinlei Ren, Xiguang Zheng, Chen Zhang, Bruno Masiero, Aurelio Uncini, Danilo Comminiello

    Abstract: The L3DAS22 Challenge is aimed at encouraging the development of machine learning strategies for 3D speech enhancement and 3D sound localization and detection in office-like environments. This challenge improves and extends the tasks of the L3DAS21 edition. We generated a new dataset, which maintains the same general characteristics of L3DAS21 datasets, but with an extended number of data points a… ▽ More

    Submitted 21 February, 2022; originally announced February 2022.

    Comments: Accepted to 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2022). arXiv admin note: substantial text overlap with arXiv:2104.05499

    Journal ref: 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2022, pp. 9186-9190