Search | arXiv e-print repository

arXiv:2408.11982 [pdf, other]

AIM 2024 Challenge on Compressed Video Quality Assessment: Methods and Results

Authors: Maksim Smirnov, Aleksandr Gushchin, Anastasia Antsiferova, Dmitry Vatolin, Radu Timofte, Ziheng Jia, Zicheng Zhang, Wei Sun, Jiaying Qian, Yuqin Cao, Yinan Sun, Yuxin Zhu, Xiongkuo Min, Guangtao Zhai, Kanjar De, Qing Luo, Ao-Xiang Zhang, Peng Zhang, Haibo Lei, Linyan Jiang, Yaqing Li, Wenhui Meng, Xiaoheng Tan, Haiqiang Wang, Xiaozhong Xu , et al. (11 additional authors not shown)

Abstract: Video quality assessment (VQA) is a crucial task in the development of video compression standards, as it directly impacts the viewer experience. This paper presents the results of the Compressed Video Quality Assessment challenge, held in conjunction with the Advances in Image Manipulation (AIM) workshop at ECCV 2024. The challenge aimed to evaluate the performance of VQA methods on a diverse dat… ▽ More Video quality assessment (VQA) is a crucial task in the development of video compression standards, as it directly impacts the viewer experience. This paper presents the results of the Compressed Video Quality Assessment challenge, held in conjunction with the Advances in Image Manipulation (AIM) workshop at ECCV 2024. The challenge aimed to evaluate the performance of VQA methods on a diverse dataset of 459 videos, encoded with 14 codecs of various compression standards (AVC/H.264, HEVC/H.265, AV1, and VVC/H.266) and containing a comprehensive collection of compression artifacts. To measure the methods performance, we employed traditional correlation coefficients between their predictions and subjective scores, which were collected via large-scale crowdsourced pairwise human comparisons. For training purposes, participants were provided with the Compressed Video Quality Assessment Dataset (CVQAD), a previously developed dataset of 1022 videos. Up to 30 participating teams registered for the challenge, while we report the results of 6 teams, which submitted valid final solutions and code for reproducing the results. Moreover, we calculated and present the performance of state-of-the-art VQA methods on the developed dataset, providing a comprehensive benchmark for future research. The dataset, results, and online leaderboard are publicly available at https://challenges.videoprocessing.ai/challenges/compressedvideo-quality-assessment.html. △ Less

Submitted 28 August, 2024; v1 submitted 21 August, 2024; originally announced August 2024.

arXiv:2408.10500 [pdf, other]

doi 10.1145/3689092.3689404

SZTU-CMU at MER2024: Improving Emotion-LLaMA with Conv-Attention for Multimodal Emotion Recognition

Authors: Zebang Cheng, Shuyuan Tu, Dawei Huang, Minghan Li, Xiaojiang Peng, Zhi-Qi Cheng, Alexander G. Hauptmann

Abstract: This paper presents our winning approach for the MER-NOISE and MER-OV tracks of the MER2024 Challenge on multimodal emotion recognition. Our system leverages the advanced emotional understanding capabilities of Emotion-LLaMA to generate high-quality annotations for unlabeled samples, addressing the challenge of limited labeled data. To enhance multimodal fusion while mitigating modality-specific n… ▽ More This paper presents our winning approach for the MER-NOISE and MER-OV tracks of the MER2024 Challenge on multimodal emotion recognition. Our system leverages the advanced emotional understanding capabilities of Emotion-LLaMA to generate high-quality annotations for unlabeled samples, addressing the challenge of limited labeled data. To enhance multimodal fusion while mitigating modality-specific noise, we introduce Conv-Attention, a lightweight and efficient hybrid framework. Extensive experimentation vali-dates the effectiveness of our approach. In the MER-NOISE track, our system achieves a state-of-the-art weighted average F-score of 85.30%, surpassing the second and third-place teams by 1.47% and 1.65%, respectively. For the MER-OV track, our utilization of Emotion-LLaMA for open-vocabulary annotation yields an 8.52% improvement in average accuracy and recall compared to GPT-4V, securing the highest score among all participating large multimodal models. The code and model for Emotion-LLaMA are available at https://github.com/ZebangCheng/Emotion-LLaMA. △ Less

Submitted 21 August, 2024; v1 submitted 19 August, 2024; originally announced August 2024.

Comments: Ranked 1st in MER24@IJCAI and MRAC24@ACM MM (MER-NOISE & MER-OV (self-evaluated))

arXiv:2407.21157 [pdf, other]

Movable Frequency Diverse Array for Wireless Communication Security

Authors: Zihao Cheng, Jiangbo Si, Zan Li, Pengpeng Liu, Yangchao Huang, Naofal Al-Dhahir

Abstract: Frequency diverse array (FDA) is a promising antenna technology to achieve physical layer security by varying the frequency of each antenna at the transmitter. However, when the channels of the legitimate user and eavesdropper are highly correlated, FDA is limited by the frequency constraint and cannot provide satisfactory security performance. In this paper, we propose a novel movable FDA (MFDA)… ▽ More Frequency diverse array (FDA) is a promising antenna technology to achieve physical layer security by varying the frequency of each antenna at the transmitter. However, when the channels of the legitimate user and eavesdropper are highly correlated, FDA is limited by the frequency constraint and cannot provide satisfactory security performance. In this paper, we propose a novel movable FDA (MFDA) antenna technology where the positions of antennas can be dynamically adjusted in a given finite region. Specifically, we aim to maximize the secrecy capacity by jointly optimizing the antenna beamforming vector, antenna frequency vector and antenna position vector. To solve this non-convex optimization problem with coupled variables, we develop a two-stage alternating optimization (AO) algorithm based on block successive upper-bound minimization (BSUM) method. Moreover, to evaluate the security performance provided by MFDA, we introduce two benchmark schemes, i.e., phased array (PA) and FDA. Simulation results demonstrate that MFDA can significantly enhance security performance compared to PA and FDA. In particular, when the frequency constraint is strict, MFDA can further increase the secrecy capacity by adjusting the positions of antennas instead of the frequencies. △ Less

Submitted 25 July, 2024; originally announced July 2024.

Comments: arXiv admin note: substantial text overlap with arXiv:2407.20280

arXiv:2407.20280 [pdf, other]

Movable Frequency Diverse Array-Assisted Covert Communication With Multiple Wardens

Authors: Zihao Cheng, Jiangbo Si, Zan Li, Pengpeng Liu, Xiaoting Wang, Naofal Al-Dhahir

Abstract: The frequency diverse array (FDA) is highly promising for improving covert communication performance by adjusting the frequency of each antenna at the transmitter. However, when faced with the cases of multiple wardens and highly correlated channels, FDA is limited by the frequency constraint and cannot provide satisfactory covert performance. In this paper, we propose a novel movable FDA (MFDA) a… ▽ More The frequency diverse array (FDA) is highly promising for improving covert communication performance by adjusting the frequency of each antenna at the transmitter. However, when faced with the cases of multiple wardens and highly correlated channels, FDA is limited by the frequency constraint and cannot provide satisfactory covert performance. In this paper, we propose a novel movable FDA (MFDA) antenna technology where positions of the antennas can be dynamically adjusted in a given finite region. Specifically, we aim to maximize the covert rate by jointly optimizing the antenna beamforming vector, antenna frequency vector and antenna position vector. To solve this non-convex optimization problem with coupled variables, we develop a two-stage alternating optimization (AO) algorithm based on the block successive upper-bound minimization (BSUM) method. Moreover, considering the challenge of obtaining perfect channel state information (CSI) at multiple wardens, we study the case of imperfect CSI. Simulation results demonstrate that MFDA can significantly enhance covert performance compared to the conventional FDA. In particular, when the frequency constraint is strict, MFDA can further increase the covert rate by adjusting the positions of antennas instead of the frequencies. △ Less

Submitted 25 July, 2024; originally announced July 2024.

arXiv:2407.19209 [pdf, ps, other]

Exploiting Target Location Distribution in MIMO Radar: PCRB vs. PSBP for Waveform Design

Authors: Lingyun Xu, Bowen Wang, Huiyong Li, Ziyang Cheng

Abstract: This paper investigates the issue of how to exploit target location distribution for multiple input multiple output (MIMO) radar waveform design. We consider a MIMO radar aiming to estimate the unknown and random angular location parameters of a point target, whose distribution information can be exploited by the radar. First, we establish the models of the MIMO radar system and the target locatio… ▽ More This paper investigates the issue of how to exploit target location distribution for multiple input multiple output (MIMO) radar waveform design. We consider a MIMO radar aiming to estimate the unknown and random angular location parameters of a point target, whose distribution information can be exploited by the radar. First, we establish the models of the MIMO radar system and the target location distribution. Based on the considered models, we propose the first category of target location distribution exploitation methods by analyzing the radar direction-of-angle (DoA) estimation performance and deriving a general form of posterior Cramer-Rao bound (PCRB) as the lower bound of the mean square error of DoA estimation. Following this, to explore more insights, we proposed the second category of target location distribution exploitation methods by introducing a novel radar metric, probability scaled beampattern (PSBP), from the perspective of radar beampattern. To compare the two methods, we formulate the PCRB and PSBP oriented radar waveform design problems and propose corresponding low-complexity and convergence-guaranteed algorithms to tackle them. Finally, numerical simulations are conducted in different scenarios to provide a comprehensive evaluation and comparison of the radar performance. △ Less

Submitted 8 August, 2024; v1 submitted 27 July, 2024; originally announced July 2024.

arXiv:2407.13401 [pdf, other]

Cooperative Integrated Sensing and Communication Networks: Analysis and Distributed Design

Authors: Bowen Wang, Hongyu Li, Fan Liu, Ziyang Cheng, Shanpu Shen

Abstract: This paper proposes a cooperative integrated sensing and communication network (Co-ISACNet) adopting hybrid beamforming (HBF) architecture, which improves both radar sensing and communication performance. The main contributions of this work are four-fold. First, we introduce a novel cooperative sensing method for the considered Co-ISACNet, followed by a comprehensive analysis of this method. This… ▽ More This paper proposes a cooperative integrated sensing and communication network (Co-ISACNet) adopting hybrid beamforming (HBF) architecture, which improves both radar sensing and communication performance. The main contributions of this work are four-fold. First, we introduce a novel cooperative sensing method for the considered Co-ISACNet, followed by a comprehensive analysis of this method. This analysis mathematically verifies the benefits of Co-ISACNet and provides insightful design guidelines. Second, to show the benefits of Co-ISACNet, we propose to jointly design the HBF to maximize the network communication capacity while satisfying the constraint of beampattern similarity for radar sensing, which results in a highly dimensional and non-convex problem. Third, to facilitate the joint design, we propose a novel distributed optimization framework based on proximal gradient and alternating direction method of multipliers, namely PANDA. Fourth, we further adopt the proposed PANDA framework to solve the joint HBF design problem for the Co-ISACNet. By using the proposed PANDA framework, all access points (APs) optimize the HBF in parallel, where each AP only requires local channel state information and limited message exchange among the APs. Such framework reduces significantly the computational complexity and thus has pronounced benefits in practical scenarios. Simulation results verify the effectiveness of the proposed algorithm compared with the conventional centralized algorithm and show the remarkable performance improvement of radar sensing and communication by deploying Co-ISACNet. △ Less

Submitted 18 July, 2024; originally announced July 2024.

arXiv:2406.01850 [pdf, other]

Cell-free massive MIMO Channels in an Urban Environment -- Measurements and Channel Statistics

Authors: Yuning Zhang, Thomas Choi, Zihang Cheng, Jorge Gomez-Ponce, Issei Kanno, Masaaki Ito, Andreas F. Molisch

Abstract: Cell-free massive MIMO (CF-mMIMO), where each user equipment (UE) is connected to multiple access points (APs), is emerging as an important component for 5G and 6G cellular systems. Accurate channel models based on measurements are required to optimize their design and deployment. This paper presents an extensive measurement campaign for CF-mMIMO in an urban environment. A new "virtual AP" techniq… ▽ More Cell-free massive MIMO (CF-mMIMO), where each user equipment (UE) is connected to multiple access points (APs), is emerging as an important component for 5G and 6G cellular systems. Accurate channel models based on measurements are required to optimize their design and deployment. This paper presents an extensive measurement campaign for CF-mMIMO in an urban environment. A new "virtual AP" technique measures channels between 80 UE locations and more than 20,000 possible microcellular AP locations. Measurements are done at 3.5 GHz carrier frequency with 350 MHz bandwidth (BW). The paper describes the measurement setup and data processing, shows sample results and their physical interpretation, and provides statistics for key quantities such as pathloss, shadowing, delay spread (DS), and delay window. We find pathloss coefficients of 2.9 and 10.4 for line-of-sight (LOS) and non line-of-sight (NLOS), respectively, where the high LOS coefficient is mainly because larger distance leads to more grazing angle of incidence and thus lower antenna gain in our setup. Shadowing standard deviations are 5.1/16.6 dB, and root mean squared (RMS) DSs of -80.6/-72.6 dBs. The measurements can also be used for parameterizing a CUNEC-type model, which will be reported in future work. △ Less

Submitted 3 June, 2024; originally announced June 2024.

Comments: submitted to IEEE TWC

arXiv:2405.20617 [pdf, other]

Large-scale Outdoor Cell-free mMIMO Channel Measurement in an Urban Scenario at 3.5 GHz

Authors: Yuning Zhang, Thomas Choi, Zihang Cheng, Issei Kanno, Masaaki Ito, Jorge Gomez-Ponce, Hussein Hammoud, Bowei Wu, Ashwani Pradhan, Kelvin Arana, Pramod Krishna, Tianyi Yang, Tyler Chen, Ishita Vasishtha, Haoyu Xie, Linyu Sun, Andreas F. Molisch

Abstract: The design of cell-free massive MIMO (CF-mMIMO) systems requires accurate, measurement-based channel models. This paper provides the first results from the by far most extensive outdoor measurement campaign for CF-mMIMO channels in an urban environment. We measured impulse responses between over 20,000 potential access point (AP) locations and 80 user equipments (UEs) at 3.5 GHz with 350 MHz bandw… ▽ More The design of cell-free massive MIMO (CF-mMIMO) systems requires accurate, measurement-based channel models. This paper provides the first results from the by far most extensive outdoor measurement campaign for CF-mMIMO channels in an urban environment. We measured impulse responses between over 20,000 potential access point (AP) locations and 80 user equipments (UEs) at 3.5 GHz with 350 MHz bandwidth (BW). Measurements use a "virtual array" approach at the AP and a hybrid switched/virtual approach at the UE. This paper describes the sounder design, measurement environment, data processing, and sample results, particularly the evolution of the power-delay profiles (PDPs) as a function of the AP locations, and its relation to the propagation environment. △ Less

Submitted 6 June, 2024; v1 submitted 31 May, 2024; originally announced May 2024.

Comments: Submitted to: VTC 2024-Fall

arXiv:2405.15553 [pdf, other]

Massive MIMO-ISAC System With 1-Bit ADCs/DACs

Authors: Bowen Wang, Hongyu Li, Bin Liao, Ziyang Cheng

Abstract: This paper investigates a hardware-efficient massive multiple-input multiple-output integrated sensing and communication (MIMO-ISAC) system with 1-bit analog-to-digital converters (ADCs)/digital-to-analog converters (DACs). The proposed system, referred to as 1BitISAC, employs 1-bit DACs at the ISAC transmitter and 1-bit ADCs at the sensing receiver, achieving significant reductions in power consu… ▽ More This paper investigates a hardware-efficient massive multiple-input multiple-output integrated sensing and communication (MIMO-ISAC) system with 1-bit analog-to-digital converters (ADCs)/digital-to-analog converters (DACs). The proposed system, referred to as 1BitISAC, employs 1-bit DACs at the ISAC transmitter and 1-bit ADCs at the sensing receiver, achieving significant reductions in power consumption and hardware costs. For such kind of systems, two 1BitISAC joint transceiver designs, i.e., i) quality of service constrained 1BitISAC design and ii) quality of detection constrained design, are considered and the corresponding problems are formulated. In order to address these problems, we thoroughly analyze the radar detection performance after 1-bit ADCs quantization and the communication bit error rate. This analysis yields new design insights and leads to unique radar and communication metrics, which enables us to simplify the original problems and employ majorization-minimization and integer linear programming methods to solve the problems. Numerical results are provided to validate the performance analysis of the proposed 1BitISAC and to compare with other ISAC configurations. The superiority of the proposed 1BitISAC system in terms of balancing ISAC performance and energy efficiency is also demonstrated. △ Less

Submitted 24 May, 2024; originally announced May 2024.

arXiv:2405.07281 [pdf, ps, other]

Movable Antennas Aided Multicast MISO Communication Systems

Authors: Zhenqiao Cheng, Nanxi Li, Ruizhe Long, Jianchi Zhu, Chongjun Ouyang, Peng Chen

Abstract: A novel multicast communication system with movable antennas (MAs) is proposed, where the antenna position optimization is exploited to enhance the transmission rate. Specifically, an MA-assisted two-user multicast multiple-input single-input system is considered. The joint optimization of the transmit beamforming vector and transmit MA positions is studied by modeling the motion of the MA element… ▽ More A novel multicast communication system with movable antennas (MAs) is proposed, where the antenna position optimization is exploited to enhance the transmission rate. Specifically, an MA-assisted two-user multicast multiple-input single-input system is considered. The joint optimization of the transmit beamforming vector and transmit MA positions is studied by modeling the motion of the MA elements as discrete movements. A low-complexity greedy search-based algorithm is proposed to tackle this non-convex inter-programming problem. A branch-and-bound (BAB)-based method is proposed to achieve the optimal multicast rate with a reduced time complexity than the brute-force search by assuming the two users suffer similar line-of-sight path losses. Numerical results reveal that the proposed MA systems significantly improve the multicast rate compared to conventional fixed-position antennas (FPAs)-based systems. △ Less

Submitted 12 May, 2024; originally announced May 2024.

Comments: 5 pages

arXiv:2403.11575 [pdf, other]

Task-Oriented Hybrid Beamforming for OFDM-DFRC Systems with Flexibly Controlled Space-Frequency Spectra

Authors: Lingyun Xu, Bowen Wang, Ziyang Cheng

Abstract: This paper investigates the issues of the hybrid beamforming design for the orthogonal frequency division multiplexing dual-function radar-communication (DFRC) system in multiple task scenarios involving the radar scanning and detection task and the target tracking task. To meet different task requirements of the DFRC system, we introduce two novel radar beampattern metrics, the average integrated… ▽ More This paper investigates the issues of the hybrid beamforming design for the orthogonal frequency division multiplexing dual-function radar-communication (DFRC) system in multiple task scenarios involving the radar scanning and detection task and the target tracking task. To meet different task requirements of the DFRC system, we introduce two novel radar beampattern metrics, the average integrated sidelobe to minimum mainlobe ratio (AISMMR) and average peak sidelobe to integrated mainlobe ratio (APSIMR), to characterize the space-frequency spectra in different scenarios. Then, two HBF design problems are formulated for two task scenarios by minimizing the AISMMR and APSIMR respectively subject to the constraints of communication quality-of-service (QoS), power budget, and hardware. Due to the non-linearity and close coupling between the analog and digital beamformers in both the objective functions and QoS constraint, the resultant formulated problems are challenging to solve. Towards that end, a unified optimization algorithm based on a consensus alternating direction method of multipliers (CADMM) is proposed to solve these two problems. Moreover, under the unified CADMM framework, the closed-form solutions of primal variables in the original two problems are obtained with low complexity. Numerical simulations are provided to demonstrate the feasibility and effectiveness of the proposed algorithm. △ Less

Submitted 18 March, 2024; originally announced March 2024.

arXiv:2403.07655 [pdf, ps, other]

doi 10.1109/LWC.2024.3382035

Enhancing Physical Layer Security in Dual-Function Radar-Communication Systems with Hybrid Beamforming Architecture

Authors: Lingyun Xu, Bowen Wang, Huiyong Li, Ziyang Cheng

Abstract: In this letter, we investigate enhancing the physical layer security (PLS) for the dual-function radar-communication (DFRC) system with hybrid beamforming (HBF) architecture, where the base station (BS) achieves downlink communication and radar target detection simultaneously. We consider an eavesdropper intercepting the information transmitted from the BS to the downlink communication users with… ▽ More In this letter, we investigate enhancing the physical layer security (PLS) for the dual-function radar-communication (DFRC) system with hybrid beamforming (HBF) architecture, where the base station (BS) achieves downlink communication and radar target detection simultaneously. We consider an eavesdropper intercepting the information transmitted from the BS to the downlink communication users with imperfectly known channel state information. Additionally, the location of the radar target is also imperfectly known by the BS. To enhance PLS in the considered DFRC system, we propose a novel HBF architecture, which introduces a new integrated sensing and security (I2S) symbol. The secure HBF design problem for DFRC is formulated by maximizing the minimum legitimate user communication rate subject to radar signal-to-interference-plus-noise ratio, eavesdropping rate, hardware and power constraints. To solve this non-convex problem, we propose an alternating optimization based method to jointly optimize transmit and receive beamformers. Numerical simulation results validate the effectiveness of the proposed algorithm and show the superiority of the proposed I2S-aided HBF architecture for achieving DFRC and enhancing PLS. △ Less

Submitted 4 April, 2024; v1 submitted 12 March, 2024; originally announced March 2024.

Journal ref: IEEE Wireless Communications Letters, 2024

arXiv:2401.02662 [pdf, other]

GainNet: Coordinates the Odd Couple of Generative AI and 6G Networks

Authors: Ning Chen, Jie Yang, Zhipeng Cheng, Xuwei Fan, Zhang Liu, Bangzhen Huang, Yifeng Zhao, Lianfen Huang, Xiaojiang Du, Mohsen Guizani

Abstract: The rapid expansion of AI-generated content (AIGC) reflects the iteration from assistive AI towards generative AI (GAI) with creativity. Meanwhile, the 6G networks will also evolve from the Internet-of-everything to the Internet-of-intelligence with hybrid heterogeneous network architectures. In the future, the interplay between GAI and the 6G will lead to new opportunities, where GAI can learn th… ▽ More The rapid expansion of AI-generated content (AIGC) reflects the iteration from assistive AI towards generative AI (GAI) with creativity. Meanwhile, the 6G networks will also evolve from the Internet-of-everything to the Internet-of-intelligence with hybrid heterogeneous network architectures. In the future, the interplay between GAI and the 6G will lead to new opportunities, where GAI can learn the knowledge of personalized data from the massive connected 6G end devices, while GAI's powerful generation ability can provide advanced network solutions for 6G network and provide 6G end devices with various AIGC services. However, they seem to be an odd couple, due to the contradiction of data and resources. To achieve a better-coordinated interplay between GAI and 6G, the GAI-native networks (GainNet), a GAI-oriented collaborative cloud-edge-end intelligence framework, is proposed in this paper. By deeply integrating GAI with 6G network design, GainNet realizes the positive closed-loop knowledge flow and sustainable-evolution GAI model optimization. On this basis, the GAI-oriented generic resource orchestration mechanism with integrated sensing, communication, and computing (GaiRom-ISCC) is proposed to guarantee the efficient operation of GainNet. Two simple case studies demonstrate the effectiveness and robustness of the proposed schemes. Finally, we envision the key challenges and future directions concerning the interplay between GAI models and 6G networks. △ Less

Submitted 5 January, 2024; originally announced January 2024.

Comments: 10 pages, 5 figures, 1 table

arXiv:2312.14018 [pdf, ps, other]

Enabling Secure Wireless Communications via Movable Antennas

Authors: Zhenqiao Cheng, Nanxi Li, Jianchi Zhu, Xiaoming She, Chongjun Ouyang, Peng Chen

Abstract: A pioneering secure transmission scheme is proposed, which harnesses movable antennas (MAs) to optimize antenna positions for augmenting the physical layer security. Particularly, an MA-enabled secure wireless system is considered, where a multi-antenna transmitter communicates with a single-antenna receiver in the presence of an eavesdropper. The beamformer and antenna positions at the transmitte… ▽ More A pioneering secure transmission scheme is proposed, which harnesses movable antennas (MAs) to optimize antenna positions for augmenting the physical layer security. Particularly, an MA-enabled secure wireless system is considered, where a multi-antenna transmitter communicates with a single-antenna receiver in the presence of an eavesdropper. The beamformer and antenna positions at the transmitter are jointly optimized under two criteria: power consumption minimization and secrecy rate maximization. For each scenario, a novel suboptimal algorithm was proposed to tackle the resulting nonconvex optimization problem, capitalizing on the approaches of alternating optimization and gradient descent. Numerical results demonstrate that the proposed MA systems significantly improve physical layer security compared to various benchmark schemes relying on conventional fixed-position antennas (FPAs). △ Less

Submitted 21 December, 2023; originally announced December 2023.

Comments: Accepted by IEEE ICASSP 2024

arXiv:2312.03196 [pdf, other]

Domain Invariant Representation Learning and Sleep Dynamics Modeling for Automatic Sleep Staging

Authors: Seungyeon Lee, Thai-Hoang Pham, Zhao Cheng, Ping Zhang

Abstract: Sleep staging has become a critical task in diagnosing and treating sleep disorders to prevent sleep related diseases. With growing large scale sleep databases, significant progress has been made toward automatic sleep staging. However, previous studies face critical problems in sleep studies; the heterogeneity of subjects' physiological signals, the inability to extract meaningful information fro… ▽ More Sleep staging has become a critical task in diagnosing and treating sleep disorders to prevent sleep related diseases. With growing large scale sleep databases, significant progress has been made toward automatic sleep staging. However, previous studies face critical problems in sleep studies; the heterogeneity of subjects' physiological signals, the inability to extract meaningful information from unlabeled data to improve predictive performances, the difficulty in modeling correlations between sleep stages, and the lack of an effective mechanism to quantify predictive uncertainty. In this study, we propose a neural network based sleep staging model, DREAM, to learn domain generalized representations from physiological signals and models sleep dynamics. DREAM learns sleep related and subject invariant representations from diverse subjects' sleep signals and models sleep dynamics by capturing interactions between sequential signal segments and between sleep stages. We conducted a comprehensive empirical study to demonstrate the superiority of DREAM, including sleep stage prediction experiments, a case study, the usage of unlabeled data, and uncertainty. Notably, the case study validates DREAM's ability to learn generalized decision function for new subjects, especially in case there are differences between testing and training subjects. Uncertainty quantification shows that DREAM provides prediction uncertainty, making the model reliable and helping sleep experts in real world applications. △ Less

Submitted 9 December, 2023; v1 submitted 5 December, 2023; originally announced December 2023.

arXiv:2311.03815 [pdf, other]

Integrated Sensing, Communication, and Computing for Cost-effective Multimodal Federated Perception

Authors: Ning Chen, Zhipeng Cheng, Xuwei Fan, Bangzhen Huang, Yifeng Zhao, Lianfen Huang, Xiaojiang Du, Mohsen Guizani

Abstract: Federated learning (FL) is a classic paradigm of 6G edge intelligence (EI), which alleviates privacy leaks and high communication pressure caused by traditional centralized data processing in the artificial intelligence of things (AIoT). The implementation of multimodal federated perception (MFP) services involves three sub-processes, including sensing-based multimodal data generation, communicati… ▽ More Federated learning (FL) is a classic paradigm of 6G edge intelligence (EI), which alleviates privacy leaks and high communication pressure caused by traditional centralized data processing in the artificial intelligence of things (AIoT). The implementation of multimodal federated perception (MFP) services involves three sub-processes, including sensing-based multimodal data generation, communication-based model transmission, and computing-based model training, ultimately relying on available underlying multi-domain physical resources such as time, frequency, and computing power. How to reasonably coordinate the multi-domain resources scheduling among sensing, communication, and computing, therefore, is crucial to the MFP networks. To address the above issues, this paper investigates service-oriented resource management with integrated sensing, communication, and computing (ISCC). With the incentive mechanism of the MFP service market, the resources management problem is redefined as a social welfare maximization problem, where the idea of "expanding resources" and "reducing costs" is used to improve learning performance gain and reduce resource costs. Experimental results demonstrate the effectiveness and robustness of the proposed resource scheduling mechanisms. △ Less

Submitted 7 November, 2023; originally announced November 2023.

arXiv:2310.08718 [pdf, other]

A Framework for Developing and Evaluating Algorithms for Estimating Multipath Propagation Parameters from Channel Sounder Measurements

Authors: Akbar Sayeed, Damla Guven, Michael Doebereiner, Sebastian Semper, Camillo Gentile, Anuraag Bodi, Zihang Cheng

Abstract: A framework is proposed for developing and evaluating algorithms for extracting multipath propagation components (MPCs) from measurements collected by channel sounders at millimeter-wave frequencies. Sounders equipped with an omnidirectional transmitter and a receiver with a uniform planar array (UPA) are considered. An accurate mathematical model is developed for the spatial frequency response of… ▽ More A framework is proposed for developing and evaluating algorithms for extracting multipath propagation components (MPCs) from measurements collected by channel sounders at millimeter-wave frequencies. Sounders equipped with an omnidirectional transmitter and a receiver with a uniform planar array (UPA) are considered. An accurate mathematical model is developed for the spatial frequency response of the sounder that incorporates the non-ideal cross-polar beampatterns for the UPA elements. Due to the limited Field-of-View (FoV) of each element, the model is extended to accommodate multi-FoV measurements in distinct azimuth directions. A beamspace representation of the spatial frequency response is leveraged to develop three progressively complex algorithms aimed at solving the singlesnapshot maximum likelihood estimation problem: greedy matching pursuit (CLEAN), space-alternative generalized expectationmaximization (SAGE), and RiMAX. The first two are based on purely specular MPCs whereas RiMAX also accommodates diffuse MPCs. Two approaches for performance evaluation are proposed, one with knowledge of ground truth parameters, and one based on reconstruction mean-squared error. The three algorithms are compared through a demanding channel model with hundreds of MPCs and through real measurements. The results demonstrate that CLEAN gives quite reasonable estimates which are improved by SAGE and RiMAX. Lessons learned and directions for future research are discussed. △ Less

Submitted 12 October, 2023; originally announced October 2023.

Comments: 17 pages

arXiv:2309.12596 [pdf, ps, other]

Movable Antenna-Empowered AirComp

Authors: Zhenqiao Cheng, Nanxi Li, Jianchi Zhu, Xiaoming She, Chongjun Ouyang, Peng Chen

Abstract: A novel over-the-air computation (AirComp) framework, empowered by the incorporation of movable antennas (MAs), is proposed to significantly enhance computation accuracy. Within this framework, the joint optimization of transmit power control, antenna positioning, and receive combining is investigated. An efficient method is proposed to tackle the problem of computation mean-squared error (MSE) mi… ▽ More A novel over-the-air computation (AirComp) framework, empowered by the incorporation of movable antennas (MAs), is proposed to significantly enhance computation accuracy. Within this framework, the joint optimization of transmit power control, antenna positioning, and receive combining is investigated. An efficient method is proposed to tackle the problem of computation mean-squared error (MSE) minimization, capitalizing on the approach of alternating optimization. Numerical results are provided to substantiate the superior MSE performance of the proposed framework, which establish its clear advantage over benchmark systems employing conventional fixed-position antennas (FPAs). △ Less

Submitted 21 September, 2023; originally announced September 2023.

arXiv:2309.11135 [pdf, ps, other]

Sum-Rate Maximization for Movable Antenna Enabled Multiuser Communications

Authors: Zhenqiao Cheng, Nanxi Li, Jianchi Zhu, Chongjun Ouyang

Abstract: A novel multiuser communication system with movable antennas (MAs) is proposed, where the antenna position optimization is exploited to enhance the downlink sum-rate. The joint optimization of the transmit beamforming vector and transmit MA positions is studied for a multiuser multiple-input single-input system. An efficient algorithm is proposed to tackle the formulated non-convex problem via cap… ▽ More A novel multiuser communication system with movable antennas (MAs) is proposed, where the antenna position optimization is exploited to enhance the downlink sum-rate. The joint optimization of the transmit beamforming vector and transmit MA positions is studied for a multiuser multiple-input single-input system. An efficient algorithm is proposed to tackle the formulated non-convex problem via capitalizing on fractional programming, alternating optimization, and gradient descent methods. To strike a better performance-complexity trade-off, a zero-forcing beamforming-based design is also proposed as an alternative. Numerical investigations are presented to verify the efficiency of the proposed algorithms and their superior performance compared with the benchmark relying on conventional fixed-position antennas (FPAs). △ Less

Submitted 22 September, 2023; v1 submitted 20 September, 2023; originally announced September 2023.

Comments: 11 pages

arXiv:2305.15079 [pdf, other]

Life cycle economic viability analysis of battery storage in electricity market

Authors: Yinguo Yang, Yiling Ye, Zhuoxiao Cheng, Guangchun Ruan, Qiuyu Lu, Xuan Wang, Haiwang Zhong

Abstract: Battery storage is essential to enhance the flexibility and reliability of electric power systems by providing auxiliary services and load shifting. Storage owners typically gains incentives from quick responses to auxiliary service prices, but frequent charging and discharging also reduce its lifetime. Therefore, this paper embeds the battery degradation cost into the operation simulation to avoi… ▽ More Battery storage is essential to enhance the flexibility and reliability of electric power systems by providing auxiliary services and load shifting. Storage owners typically gains incentives from quick responses to auxiliary service prices, but frequent charging and discharging also reduce its lifetime. Therefore, this paper embeds the battery degradation cost into the operation simulation to avoid overestimated profits caused by an aggressive bidding strategy. Based on an operation simulation model, this paper conducts the economic viability analysis of whole life cycle using the internal rate of return(IRR). A clustering method and a typical day method are developed to reduce the huge computational burdens in the life-cycle simulation of battery storage. Our models and algorithms are validated by the case study of two mainstream technology routes currently: lithium nickel cobalt manganese oxide (NCM) batteries and lithium iron phosphate (LFP) batteries. Then a sensitivity analysis is presented to identify the critical factors that boost battery storage in the future. We evaluate the IRR results of different types of battery storage to provide guidance for investment portfolio. △ Less

Submitted 28 May, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

Comments: 17 pages, accepted by JPS

arXiv:2305.11548 [pdf, ps, other]

Sensing Aided Uplink Transmission in OTFS ISAC with Joint Parameter Association, Channel Estimation and Signal Detection

Authors: Xi Yang, Hang Li, Qinghua Guo, J. Andrew Zhang, Xiaojing Huang, Zhiqun Cheng

Abstract: In this work, we study sensing-aided uplink transmission in an integrated sensing and communication (ISAC) vehicular network with the use of orthogonal time frequency space (OTFS) modulation. To exploit sensing parameters for improving uplink communications, the parameters must be first associated with the transmitters, which is a challenging task. We propose a scheme that jointly conducts paramet… ▽ More In this work, we study sensing-aided uplink transmission in an integrated sensing and communication (ISAC) vehicular network with the use of orthogonal time frequency space (OTFS) modulation. To exploit sensing parameters for improving uplink communications, the parameters must be first associated with the transmitters, which is a challenging task. We propose a scheme that jointly conducts parameter association, channel estimation and signal detection by formulating it as a constrained bilinear recovery problem. Then we develop a message passing algorithm to solve the problem, leveraging the bilinear unitary approximate message passing (Bi-UAMP) algorithm. Numerical results validate the proposed scheme, which show that relevant performance bounds can be closely approached. △ Less

Submitted 19 May, 2023; originally announced May 2023.

arXiv:2305.04294

PELE scores: Pelvic X-ray Landmark Detection by Pelvis Extraction and Enhancement

Authors: Zhen Huang, Han Li, Shitong Shao, Heqin Zhu, Huijie Hu, Zhiwei Cheng, Jianji Wang, S. Kevin Zhou

Abstract: The pelvis, the lower part of the trunk, supports and balances the trunk. Landmark detection from a pelvic X-ray (PXR) facilitates downstream analysis and computer-assisted diagnosis and treatment of pelvic diseases. Although PXRs have the advantages of low radiation and reduced cost compared to computed tomography (CT) images, their 2D pelvis-tissue superposition of 3D structures confuses clinica… ▽ More The pelvis, the lower part of the trunk, supports and balances the trunk. Landmark detection from a pelvic X-ray (PXR) facilitates downstream analysis and computer-assisted diagnosis and treatment of pelvic diseases. Although PXRs have the advantages of low radiation and reduced cost compared to computed tomography (CT) images, their 2D pelvis-tissue superposition of 3D structures confuses clinical decision-making. In this paper, we propose a PELvis Extraction (PELE) module that utilizes 3D prior anatomical knowledge in CT to guide and well isolate the pelvis from PXRs, thereby eliminating the influence of soft tissue. We conduct an extensive evaluation based on two public datasets and one private dataset, totaling 850 PXRs. The experimental results show that the proposed PELE module significantly improves the accuracy of PXRs landmark detection and achieves state-of-the-art performances in several benchmark metrics, thus better serving downstream tasks. △ Less

Submitted 7 June, 2023; v1 submitted 7 May, 2023; originally announced May 2023.

Comments: will revise it and resubmit it again later

arXiv:2303.14506 [pdf, other]

doi 10.1109/TPAMI.2024.3401048

Toward DNN of LUTs: Learning Efficient Image Restoration with Multiple Look-Up Tables

Authors: Jiacheng Li, Chang Chen, Zhen Cheng, Zhiwei Xiong

Abstract: The widespread usage of high-definition screens on edge devices stimulates a strong demand for efficient image restoration algorithms. The way of caching deep learning models in a look-up table (LUT) is recently introduced to respond to this demand. However, the size of a single LUT grows exponentially with the increase of its indexing capacity, which restricts its receptive field and thus the per… ▽ More The widespread usage of high-definition screens on edge devices stimulates a strong demand for efficient image restoration algorithms. The way of caching deep learning models in a look-up table (LUT) is recently introduced to respond to this demand. However, the size of a single LUT grows exponentially with the increase of its indexing capacity, which restricts its receptive field and thus the performance. To overcome this intrinsic limitation of the single-LUT solution, we propose a universal method to construct multiple LUTs like a neural network, termed MuLUT. Firstly, we devise novel complementary indexing patterns, as well as a general implementation for arbitrary patterns, to construct multiple LUTs in parallel. Secondly, we propose a re-indexing mechanism to enable hierarchical indexing between cascaded LUTs. Finally, we introduce channel indexing to allow cross-channel interaction, enabling LUTs to process color channels jointly. In these principled ways, the total size of MuLUT is linear to its indexing capacity, yielding a practical solution to obtain superior performance with the enlarged receptive field. We examine the advantage of MuLUT on various image restoration tasks, including super-resolution, demosaicing, denoising, and deblocking. MuLUT achieves a significant improvement over the single-LUT solution, e.g., up to 1.1dB PSNR for super-resolution and up to 2.8dB PSNR for grayscale denoising, while preserving its efficiency, which is 100$\times$ less in energy cost compared with lightweight deep neural networks. Our code and trained models are publicly available at https://github.com/ddlee-cn/MuLUT. △ Less

Submitted 25 March, 2023; originally announced March 2023.

Comments: Project Page: https://mulut.pages.dev/

Journal ref: IEEE Transactions on Pattern Analysis and Machine Intelligence (T-PAMI), 2024, early access

arXiv:2302.14204 [pdf, other]

HalluAudio: Hallucinating Frequency as Concepts for Few-Shot Audio Classification

Authors: Zhongjie Yu, Shuyang Wang, Lin Chen, Zhongwei Cheng

Abstract: Few-shot audio classification is an emerging topic that attracts more and more attention from the research community. Most existing work ignores the specificity of the form of the audio spectrogram and focuses largely on the embedding space borrowed from image tasks, while in this work, we aim to take advantage of this special audio format and propose a new method by hallucinating high-frequency a… ▽ More Few-shot audio classification is an emerging topic that attracts more and more attention from the research community. Most existing work ignores the specificity of the form of the audio spectrogram and focuses largely on the embedding space borrowed from image tasks, while in this work, we aim to take advantage of this special audio format and propose a new method by hallucinating high-frequency and low-frequency parts as structured concepts. Extensive experiments on ESC-50 and our curated balanced Kaggle18 dataset show the proposed method outperforms the baseline by a notable margin. The way that our method hallucinates high-frequency and low-frequency parts also enables its interpretability and opens up new potentials for the few-shot audio classification. △ Less

Submitted 27 February, 2023; originally announced February 2023.

Comments: Accepted at ICASSP 2023

arXiv:2302.03601 [pdf, other]

Industrial computed tomography based intelligent non-destructive testing method for power capacitor

Authors: Zhenxing Cheng, Peng Wang, Yue Liu, Wei Qin, Zidi Tang

Abstract: Power capacitor device is a widely used reactive power compensation equipment in power transmission and distribution system which can easily have internal fault and therefore affects the safe operation of the power system. An intelligent non-destructive testing (I-NDT) method based on ICT is proposed to test the quality of power capacitors automatically in this study. The internal structure of pow… ▽ More Power capacitor device is a widely used reactive power compensation equipment in power transmission and distribution system which can easily have internal fault and therefore affects the safe operation of the power system. An intelligent non-destructive testing (I-NDT) method based on ICT is proposed to test the quality of power capacitors automatically in this study. The internal structure of power capacitors would be scanned by the ICT device and then defects could be recognized by the SSD algorithm. Moreover, the data data augmentation algorithm is used to extend the image set to improve the stability and accuracy of the trained SSD model. △ Less

Submitted 6 February, 2023; originally announced February 2023.

arXiv:2301.08087 [pdf, other]

Relative Entropy-Based Constant-Envelope Beamforming for Target Detection in Large-Scale MIMO Radar With Low-Resoultion ADCs

Authors: Ziyang Cheng, Linlong Wu, Bowen Wang, Julan Xie, Huiyong Li

Abstract: Hybrid digital/analog architecture and low-resolution analog-to-digital/digital-to-analog converters (ADCs /DACs) are two low-cost implementations for large-scale millimeter wave (mmWave) systems. In this paper, we investigate the problem of constant-envelope transmit beamforming for large-scale multiple-input multiple-output (MIMO) radar system, where the transmit array adopts a hybrid digital/an… ▽ More Hybrid digital/analog architecture and low-resolution analog-to-digital/digital-to-analog converters (ADCs /DACs) are two low-cost implementations for large-scale millimeter wave (mmWave) systems. In this paper, we investigate the problem of constant-envelope transmit beamforming for large-scale multiple-input multiple-output (MIMO) radar system, where the transmit array adopts a hybrid digital/analog architecture with a small number of RF chains and the receive array adopts a fully digital architecture with low-resolution ADCs. We derive the relative entropy between the probability density functions associated with the two test hypotheses under low-resolution ADCs. We formulate our optimization problem by maximizing the relative entropy, subject to the constant envelope and orthogonality constraints. To suboptimally solve the resultant problem, a two-stage framework is developed. In the first stage, we optimize the transmit power at the directions of the target and clutter. In the second stage, an efficient iterative algorithm based on majorization-minimization is presented to obtain the constant-envelope beamformer according to the attained transmit power. Specifically, we apply a quadratic function as the minorizer, leading to a low-complexity solution at each iteration. In addition, to further facilitate low-cost implementation of the constant-envelope beamformer, we consider the problem of one-bit beamforming design and propose an efficient iterative method based on the Nesterov-like gradient method to solve it. Numerical simulations are provided to demonstrate the effectiveness of the proposed schemes. △ Less

Submitted 5 March, 2023; v1 submitted 19 January, 2023; originally announced January 2023.

arXiv:2301.03286 [pdf, other]

doi 10.1109/TCOMM.2024.3447917.

A Dual-Function Radar-Communication System Empowered by Beyond Diagonal Reconfigurable Intelligent Surface

Authors: Bowen Wang, Hongyu Li, Shanpu Shen, Ziyang Cheng, Bruno Clerckx

Abstract: This work focuses on the use of reconfigurable intelligent surface (RIS) in dual-function radar-communication (DFRC) systems to improve communication capacity and sensing precision, and enhance coverage for both functions. In contrast to most of the existing RIS aided DFRC works where the RIS is modeled as a diagonal phase shift matrix and can only reflect signals to half space, we propose a novel… ▽ More This work focuses on the use of reconfigurable intelligent surface (RIS) in dual-function radar-communication (DFRC) systems to improve communication capacity and sensing precision, and enhance coverage for both functions. In contrast to most of the existing RIS aided DFRC works where the RIS is modeled as a diagonal phase shift matrix and can only reflect signals to half space, we propose a novel beyond diagonal RIS (BD-RIS) aided DFRC system. Specifically, the proposed BD-RIS supports the hybrid reflecting and transmitting mode, and is compatible with flexible architectures, enabling the system to realize full-space coverage and to achieve enhanced performance. To achieve the expected benefits, we jointly optimize the transmit waveform, the BD-RIS matrices, and sensing receive filters, by maximizing the minimum signal-to-clutter-plus-noise ratio for fair target detection, subject to the constraints of the communication quality of service, different BD-RIS architectures and power budget. To solve the non-convex and non-smooth max-min problem, a general solution based on the alternating direction method of multipliers is provided. Numerical simulations validate the efficacy of the proposed algorithm and show the superiority of the BD-RIS aided DFRC system in terms of both communication and sensing compared to conventional RIS aided DFRC. △ Less

Submitted 22 August, 2024; v1 submitted 9 January, 2023; originally announced January 2023.

Comments: IEEE Transactions on Communications, 2024

arXiv:2211.05256 [pdf, other]

Power Efficient Video Super-Resolution on Mobile NPUs with Deep Learning, Mobile AI & AIM 2022 challenge: Report

Authors: Andrey Ignatov, Radu Timofte, Cheng-Ming Chiang, Hsien-Kai Kuo, Yu-Syuan Xu, Man-Yu Lee, Allen Lu, Chia-Ming Cheng, Chih-Cheng Chen, Jia-Ying Yong, Hong-Han Shuai, Wen-Huang Cheng, Zhuang Jia, Tianyu Xu, Yijian Zhang, Long Bao, Heng Sun, Diankai Zhang, Si Gao, Shaoli Liu, Biao Wu, Xiaofeng Zhang, Chengjian Zheng, Kaidi Lu, Ning Wang , et al. (29 additional authors not shown)

Abstract: Video super-resolution is one of the most popular tasks on mobile devices, being widely used for an automatic improvement of low-bitrate and low-resolution video streams. While numerous solutions have been proposed for this problem, they are usually quite computationally demanding, demonstrating low FPS rates and power efficiency on mobile devices. In this Mobile AI challenge, we address this prob… ▽ More Video super-resolution is one of the most popular tasks on mobile devices, being widely used for an automatic improvement of low-bitrate and low-resolution video streams. While numerous solutions have been proposed for this problem, they are usually quite computationally demanding, demonstrating low FPS rates and power efficiency on mobile devices. In this Mobile AI challenge, we address this problem and propose the participants to design an end-to-end real-time video super-resolution solution for mobile NPUs optimized for low energy consumption. The participants were provided with the REDS training dataset containing video sequences for a 4X video upscaling task. The runtime and power efficiency of all models was evaluated on the powerful MediaTek Dimensity 9000 platform with a dedicated AI processing unit capable of accelerating floating-point and quantized neural networks. All proposed solutions are fully compatible with the above NPU, demonstrating an up to 500 FPS rate and 0.2 [Watt / 30 FPS] power consumption. A detailed description of all models developed in the challenge is provided in this paper. △ Less

Submitted 7 November, 2022; originally announced November 2022.

Comments: arXiv admin note: text overlap with arXiv:2105.08826, arXiv:2105.07809, arXiv:2211.04470, arXiv:2211.03885

arXiv:2211.03058 [pdf, other]

Towards Real World HDRTV Reconstruction: A Data Synthesis-based Approach

Authors: Zhen Cheng, Tao Wang, Yong Li, Fenglong Song, Chang Chen, Zhiwei Xiong

Abstract: Existing deep learning based HDRTV reconstruction methods assume one kind of tone mapping operators (TMOs) as the degradation procedure to synthesize SDRTV-HDRTV pairs for supervised training. In this paper, we argue that, although traditional TMOs exploit efficient dynamic range compression priors, they have several drawbacks on modeling the realistic degradation: information over-preservation, c… ▽ More Existing deep learning based HDRTV reconstruction methods assume one kind of tone mapping operators (TMOs) as the degradation procedure to synthesize SDRTV-HDRTV pairs for supervised training. In this paper, we argue that, although traditional TMOs exploit efficient dynamic range compression priors, they have several drawbacks on modeling the realistic degradation: information over-preservation, color bias and possible artifacts, making the trained reconstruction networks hard to generalize well to real-world cases. To solve this problem, we propose a learning-based data synthesis approach to learn the properties of real-world SDRTVs by integrating several tone mapping priors into both network structures and loss functions. In specific, we design a conditioned two-stream network with prior tone mapping results as a guidance to synthesize SDRTVs by both global and local transformations. To train the data synthesis network, we form a novel self-supervised content loss to constraint different aspects of the synthesized SDRTVs at regions with different brightness distributions and an adversarial loss to emphasize the details to be more realistic. To validate the effectiveness of our approach, we synthesize SDRTV-HDRTV pairs with our method and use them to train several HDRTV reconstruction networks. Then we collect two inference datasets containing both labeled and unlabeled real-world SDRTVs, respectively. Experimental results demonstrate that, the networks trained with our synthesized data generalize significantly better to these two real-world datasets than existing solutions. △ Less

Submitted 6 November, 2022; originally announced November 2022.

arXiv:2209.04848 [pdf, other]

doi 10.1109/TVT.2023.3312759

Dynamic Hybrid Beamforming Design for Dual-Function Radar-Communication Systems

Authors: Bowen Wang, Hongyu Li, Ziyang Cheng

Abstract: This paper investigates dynamic hybrid beamforming (HBF) for a dual-function radar-communication (DFRC) system, where the DFRC base station (BS) simultaneously serves multiple single-antenna users and senses a target in the presence of multiple clutters. Particularly, we apply a HBF architecture with dynamic subarrays and double phase shifters in the DFRC BS. Aiming at maximizing the radar mutual… ▽ More This paper investigates dynamic hybrid beamforming (HBF) for a dual-function radar-communication (DFRC) system, where the DFRC base station (BS) simultaneously serves multiple single-antenna users and senses a target in the presence of multiple clutters. Particularly, we apply a HBF architecture with dynamic subarrays and double phase shifters in the DFRC BS. Aiming at maximizing the radar mutual information, we consider jointly designing the dynamic HBF of the DFRC system, subject to the constraints of communication quality of service (QoS), transmit power, and analog beamformer. To solve the complicated non-convex optimization, an efficient alternating optimization algorithm based on the majorization-minimization methods is developed. Simulation results verify the advancement of the considered HBF architecture and the effectiveness of the proposed design method. △ Less

Submitted 14 September, 2023; v1 submitted 11 September, 2022; originally announced September 2022.

arXiv:2209.04656 [pdf, other]

Hybrid Beamforming in mmWave Dual-Function Radar-Communication Systems: Models, Technologies, and Challenges

Authors: Ziyang Cheng, Linlong Wu, Bowen Wang, Bhavani Shankar, Bin Liao, Björn Ottersten

Abstract: As a promising technology in beyond-5G (B5G) and 6G, dual-function radar-communication (DFRC) aims to ensure both radar sensing and communication on a single integrated platform with unified signaling schemes. To achieve accurate sensing and reliable communication, large-scale arrays are anticipated to be implemented in such systems, which brings out the prominent issues on hardware cost and power… ▽ More As a promising technology in beyond-5G (B5G) and 6G, dual-function radar-communication (DFRC) aims to ensure both radar sensing and communication on a single integrated platform with unified signaling schemes. To achieve accurate sensing and reliable communication, large-scale arrays are anticipated to be implemented in such systems, which brings out the prominent issues on hardware cost and power consumption. To address these issues, hybrid beamforming (HBF), beyond its successful deployment in communication-only systems, could be a promising approach in the emerging DFRC ones. In this article, we investigate the development of the HBF techniques on the DFRC system in a self-contained manner. Specifically, we first introduce the basics of the HBF based DFRC system, where the system model and different receive modes are discussed with focus. Then we illustrate the corresponding design principles, which span from the performance metrics and optimization formulations to the design approaches and our preliminary results. Finally, potential extension and key research opportunities, such as the combination with the reconfigurable intelligent surface, are discussed concisely. △ Less

Submitted 4 March, 2024; v1 submitted 10 September, 2022; originally announced September 2022.

Comments: This manuscript has not been fully modified

arXiv:2208.14022 [pdf, other]

Stabilize, Decompose, and Denoise: Self-Supervised Fluoroscopy Denoising

Authors: Ruizhou Liu, Qiang Ma, Zhiwei Cheng, Yuanyuan Lyu, Jianji Wang, S. Kevin Zhou

Abstract: Fluoroscopy is an imaging technique that uses X-ray to obtain a real-time 2D video of the interior of a 3D object, helping surgeons to observe pathological structures and tissue functions especially during intervention. However, it suffers from heavy noise that mainly arises from the clinical use of a low dose X-ray, thereby necessitating the technology of fluoroscopy denoising. Such denoising is… ▽ More Fluoroscopy is an imaging technique that uses X-ray to obtain a real-time 2D video of the interior of a 3D object, helping surgeons to observe pathological structures and tissue functions especially during intervention. However, it suffers from heavy noise that mainly arises from the clinical use of a low dose X-ray, thereby necessitating the technology of fluoroscopy denoising. Such denoising is challenged by the relative motion between the object being imaged and the X-ray imaging system. We tackle this challenge by proposing a self-supervised, three-stage framework that exploits the domain knowledge of fluoroscopy imaging. (i) Stabilize: we first construct a dynamic panorama based on optical flow calculation to stabilize the non-stationary background induced by the motion of the X-ray detector. (ii) Decompose: we then propose a novel mask-based Robust Principle Component Analysis (RPCA) decomposition method to separate a video with detector motion into a low-rank background and a sparse foreground. Such a decomposition accommodates the reading habit of experts. (iii) Denoise: we finally denoise the background and foreground separately by a self-supervised learning strategy and fuse the denoised parts into the final output via a bilateral, spatiotemporal filter. To assess the effectiveness of our work, we curate a dedicated fluoroscopy dataset of 27 videos (1,568 frames) and corresponding ground truth. Our experiments demonstrate that it achieves significant improvements in terms of denoising and enhancement effects when compared with standard approaches. Finally, expert rating confirms this efficacy. △ Less

Submitted 30 August, 2022; originally announced August 2022.

Comments: 11 pages, 18 figures

arXiv:2205.05448 [pdf, other]

Symphony Generation with Permutation Invariant Language Model

Authors: Jiafeng Liu, Yuanliang Dong, Zehua Cheng, Xinran Zhang, Xiaobing Li, Feng Yu, Maosong Sun

Abstract: In this work, we propose a permutation invariant language model, SymphonyNet, as a solution for symbolic symphony music generation. We propose a novel Multi-track Multi-instrument Repeatable (MMR) representation for symphonic music and model the music sequence using a Transformer-based auto-regressive language model with specific 3-D positional embedding. To overcome length overflow when modeling… ▽ More In this work, we propose a permutation invariant language model, SymphonyNet, as a solution for symbolic symphony music generation. We propose a novel Multi-track Multi-instrument Repeatable (MMR) representation for symphonic music and model the music sequence using a Transformer-based auto-regressive language model with specific 3-D positional embedding. To overcome length overflow when modeling extra-long symphony tokens, we also propose a modified Byte Pair Encoding algorithm (Music BPE) for music tokens and introduce a novel linear transformer decoder architecture as a backbone. Meanwhile, we train the decoder to learn automatic orchestration as a joint task by masking instrument information from the input. We also introduce a large-scale symbolic symphony dataset for the advance of symphony generation research. Empirical results show that the proposed approach can generate coherent, novel, complex and harmonious symphony as a pioneer solution for multi-track multi-instrument symbolic music generation. △ Less

Submitted 16 September, 2022; v1 submitted 10 May, 2022; originally announced May 2022.

Journal ref: International Society for Music Information Retrieval (ISMIR) 2022

arXiv:2205.02939 [pdf]

Modelling Pre-fatigue, Low-velocity Impact and Fatigue behaviours of Composite Helicopter Tail Structures under Multipoint Coordinated Loading Spectrum

Authors: Zheng-Qiang Cheng, Wei Tan, Jun-Jiang Xiong

Abstract: This paper aims to numerically study the pre-fatigue, low-velocity impact (LVI) and fatigue progressive damage behaviours of a full-scale composite helicopter tail structure under multipoint coordinated loading spectrum. First, a fatigue progressive damage model (PDM) incorporating multiaxial fatigue residual strength degradation rule, fatigue failure criteria based on fatigue residual strength co… ▽ More This paper aims to numerically study the pre-fatigue, low-velocity impact (LVI) and fatigue progressive damage behaviours of a full-scale composite helicopter tail structure under multipoint coordinated loading spectrum. First, a fatigue progressive damage model (PDM) incorporating multiaxial fatigue residual strength degradation rule, fatigue failure criteria based on fatigue residual strength concept and sudden stiffness degradation rule was proposed. Then, an LVI progressive damage model for plain-weave (PW) and unidirectional (UD) composites was developed. Moreover, a full-process analysis algorithm with a reasonable damage transfer strategy for pre-fatigue, LVI and fatigue progressive damage analysis was proposed. Finally, a highly computational efficient and accurate full-scale global-local finite element (FE) model of helicopter tail structure was built to predict strain distribution under two flight working conditions, to predict LVI damage under impact loading, and to assess fatigue damage behaviours under multipoint coordinated loading spectrum. The numerical predictions agree well with test results from this work and literature data, indicating that the developed pre-fatigue, LVI, fatigue PDMs and algorithms, as well as the global-local FE modelling based on shell-to-solid coupling, can effectively analyse the impact damage tolerance of full-scale aircraft structures. △ Less

Submitted 5 May, 2022; originally announced May 2022.

Comments: 43 pages, 16 figures

arXiv:2203.13678 [pdf, other]

LQoCo: Learning to Optimize Cache Capacity Overloading in Storage Systems

Authors: Ji Zhang, Xijun Li, Xiyao Zhou, Mingxuan Yuan, Zhuo Cheng, Keji Huang, Yifan Li

Abstract: Cache plays an important role to maintain high and stable performance (i.e. high throughput, low tail latency and throughput jitter) in storage systems. Existing rule-based cache management methods, coupled with engineers' manual configurations, cannot meet ever-growing requirements of both time-varying workloads and complex storage systems, leading to frequent cache overloading. In this paper, we… ▽ More Cache plays an important role to maintain high and stable performance (i.e. high throughput, low tail latency and throughput jitter) in storage systems. Existing rule-based cache management methods, coupled with engineers' manual configurations, cannot meet ever-growing requirements of both time-varying workloads and complex storage systems, leading to frequent cache overloading. In this paper, we for the first time propose a light-weight learning-based cache bandwidth control technique, called \LQoCo which can adaptively control the cache bandwidth so as to effectively prevent cache overloading in storage systems. Extensive experiments with various workloads on real systems show that LQoCo, with its strong adaptability and fast learning ability, can adapt to various workloads to effectively control cache bandwidth, thereby significantly improving the storage performance (e.g. increasing the throughput by 10\%-20\% and reducing the throughput jitter and tail latency by 2X-6X and 1.5X-4X, respectively, compared with two representative rule-based methods). △ Less

Submitted 21 March, 2022; originally announced March 2022.

Comments: This paper has been accepted by DAC 2022. Xijun is the correspoonding author

arXiv:2112.02496 [pdf, other]

Double-Phase-Shifter based Hybrid Beamforming for mmWave DFRC in the Presence of Extended Target and Clutters

Authors: Ziyang Cheng, Linlong Wu, Bowen Wang, Bhavani Shankar M. R., Björn Ottersten

Abstract: In millimeter-wave (mmWave) dual-function radar-communication (DFRC) systems, hybrid beamforming (HBF) is recognized as a promising technique utilizing a limited number of radio frequency chains. In this work, in the presence of extended target and clutters, a HBF design based on the subarray connection architecture is proposed for a multiple-input multiple-output (MIMO) DFRC system. In this HBF,… ▽ More In millimeter-wave (mmWave) dual-function radar-communication (DFRC) systems, hybrid beamforming (HBF) is recognized as a promising technique utilizing a limited number of radio frequency chains. In this work, in the presence of extended target and clutters, a HBF design based on the subarray connection architecture is proposed for a multiple-input multiple-output (MIMO) DFRC system. In this HBF, the double-phase-shifter (DPS) structure is embedded to further increase the design flexibility. We derive the communication spectral efficiency (SE) and radar signal-to-interference-plus-noise-ratio (SINR) with respect to the transmit HBF and radar receiver, and formulate the HBF design problem as the SE maximization subjecting to the radar SINR and power constraints. To solve the formulated nonconvex problem, the joinT Hybrid bRamforming and Radar rEceiver OptimizatioN (THEREON) is proposed, in which the radar receiver is optimized via the generalized eigenvalue decomposition, and the transmit HBF is updated with low complexity in a parallel manner using the consensus alternating direction method of multipliers (consensus-ADMM). Furthermore, we extend the proposed method to the multi-user multiple-input single-output (MU-MISO) scenario. Numerical simulations demonstrate the efficacy of the proposed algorithm and show that the solution provides a good trade-off between number of phase shifters and performance gain of the DPS HBF. △ Less

Submitted 4 November, 2022; v1 submitted 5 December, 2021; originally announced December 2021.

arXiv:2112.00729

doi 10.1002/mp.16163

Total-Body Low-Dose CT Image Denoising using Prior Knowledge Transfer Technique with Contrastive Regularization Mechanism

Authors: Minghan Fu, Yanhua Duan, Zhaoping Cheng, Wenjian Qin, Ying Wang, Dong Liang, Zhanli Hu

Abstract: Reducing the radiation exposure for patients in Total-body CT scans has attracted extensive attention in the medical imaging community. Given the fact that low radiation dose may result in increased noise and artifacts, which greatly affected the clinical diagnosis. To obtain high-quality Total-body Low-dose CT (LDCT) images, previous deep-learning-based research work has introduced various networ… ▽ More Reducing the radiation exposure for patients in Total-body CT scans has attracted extensive attention in the medical imaging community. Given the fact that low radiation dose may result in increased noise and artifacts, which greatly affected the clinical diagnosis. To obtain high-quality Total-body Low-dose CT (LDCT) images, previous deep-learning-based research work has introduced various network architectures. However, most of these methods only adopt Normal-dose CT (NDCT) images as ground truths to guide the training of the denoising network. Such simple restriction leads the model to less effectiveness and makes the reconstructed images suffer from over-smoothing effects. In this paper, we propose a novel intra-task knowledge transfer method that leverages the distilled knowledge from NDCT images to assist the training process on LDCT images. The derived architecture is referred to as the Teacher-Student Consistency Network (TSC-Net), which consists of the teacher network and the student network with identical architecture. Through the supervision between intermediate features, the student network is encouraged to imitate the teacher network and gain abundant texture details. Moreover, to further exploit the information contained in CT scans, a contrastive regularization mechanism (CRM) built upon contrastive learning is introduced.CRM performs to pull the restored CT images closer to the NDCT samples and push far away from the LDCT samples in the latent space. In addition, based on the attention and deformable convolution mechanism, we design a Dynamic Enhancement Module (DEM) to improve the network transformation capability. △ Less

Submitted 5 December, 2021; v1 submitted 1 December, 2021; originally announced December 2021.

Comments: Want to improve the methodology

arXiv:2111.15102 [pdf, other]

Manifold Optimization Methods for Hybrid beamforming in mmWave Dual-Function Radar-Communication System

Authors: Bowen Wang, Ziyang Cheng, Zishu He

Abstract: As a cost-effective alternative, hybrid analog and digital beamforming architecture is a promising scheme for millimeter wave (mmWave) system. This paper considers two hybrid beamforming architectures, i.e. the partially-connected and fully-connected structures, for mmWave dual-function radar communication (DFRC) system, where the transmitter communicates with the downlink users and detects radar… ▽ More As a cost-effective alternative, hybrid analog and digital beamforming architecture is a promising scheme for millimeter wave (mmWave) system. This paper considers two hybrid beamforming architectures, i.e. the partially-connected and fully-connected structures, for mmWave dual-function radar communication (DFRC) system, where the transmitter communicates with the downlink users and detects radar targets simultaneously. The optimization problems are formulated by minimizing a weighted summation of radar and communication performance, subject to constant modulus and power constraints. To tackle the non-convexities caused by the two resultant problems, effective Riemannian optimization algorithms are proposed. Specifically, for the fully-connected structure, a manifold algorithm based on the alternating direction method of multipliers (ADMM) is developed. While for the partially-connected structure, a low-complexity Riemannian product manifold trust region (RPM-TR) algorithm is proposed to approach the near-optional solution. Numerical simulations are provided to demonstrate the effectiveness of the proposed methods. △ Less

Submitted 4 September, 2022; v1 submitted 29 November, 2021; originally announced November 2021.

arXiv:2110.12150 [pdf, other]

Spatio-Temporal Graph Complementary Scattering Networks

Authors: Zida Cheng, Siheng Chen, Ya Zhang

Abstract: Spatio-temporal graph signal analysis has a significant impact on a wide range of applications, including hand/body pose action recognition. To achieve effective analysis, spatio-temporal graph convolutional networks (ST-GCN) leverage the powerful learning ability to achieve great empirical successes; however, those methods need a huge amount of high-quality training data and lack theoretical inte… ▽ More Spatio-temporal graph signal analysis has a significant impact on a wide range of applications, including hand/body pose action recognition. To achieve effective analysis, spatio-temporal graph convolutional networks (ST-GCN) leverage the powerful learning ability to achieve great empirical successes; however, those methods need a huge amount of high-quality training data and lack theoretical interpretation. To address this issue, the spatio-temporal graph scattering transform (ST-GST) was proposed to put forth a theoretically interpretable framework; however, the empirical performance of this approach is constrainted by the fully mathematical design. To benefit from both sides, this work proposes a novel complementary mechanism to organically combine the spatio-temporal graph scattering transform and neural networks, resulting in the proposed spatio-temporal graph complementary scattering networks (ST-GCSN). The essence is to leverage the mathematically designed graph wavelets with pruning techniques to cover major information and use trainable networks to capture complementary information. The empirical experiments on hand pose action recognition show that the proposed ST-GCSN outperforms both ST-GCN and ST-GST. △ Less

Submitted 23 October, 2021; originally announced October 2021.

Comments: 5 pages, 3 figures

arXiv:2110.10960 [pdf, other]

doi 10.1109/TSP.2022.3176953

One-Bit ADCs/DACs based MIMO Radar: Performance Analysis and Joint Design

Authors: Minglong Deng, Ziyang Cheng, Linlong Wu, Bhavani Shankar, Zishu He

Abstract: Extremely low-resolution (e.g. one-bit) analog-to-digital converters (ADCs) and digital-to-analog converters (DACs) can substantially reduce hardware cost and power consumption for MIMO radar especially with large scale antennas. In this paper, we focus on the detection performance analysis and joint design for the MIMO radar with one-bit ADCs and DACs. Specifically, under the assumption of low si… ▽ More Extremely low-resolution (e.g. one-bit) analog-to-digital converters (ADCs) and digital-to-analog converters (DACs) can substantially reduce hardware cost and power consumption for MIMO radar especially with large scale antennas. In this paper, we focus on the detection performance analysis and joint design for the MIMO radar with one-bit ADCs and DACs. Specifically, under the assumption of low signal-to-noise ratio (SNR) and interference-to-noise ratio (INR), we derive the expressions of probability of detection ($\mathcal{P}_d$) and probability of false alarm ($\mathcal{P}_f$) for one-bit MIMO radar and also the theoretical performance gap to infinite-bit MIMO radars for the noise-only case. We further find that for a fixed $\mathcal{P}_f$, $\mathcal{P}_d$ depends on the defined quantized signal-to-interference-plus-noise ratio (QSINR), which is a function of the transmit waveform and receive filter. Thus, an optimization problem arises naturally to maximize the QSINR by joint designing the waveform and filter. For the formulated problem, we propose an alternatin\emph{g} wavefo\emph{r}m and filt\emph{e}r d\emph{e}sign for QSINR maximiza\emph{t}ion (GREET). At each iteration of GREET, the receive filter is upadted via the minimum variance distortionless response (MVDR) method, and the one-bit waveform is optimized based on the alternating direction method of multipliers (ADMM) algorithm where the closed-form solutions are obtained for both the primary and slack variables. Numerical simulations are consistent to the theoretical performance analysis and demonstrate the effectiveness of the proposed design algorithm. △ Less

Submitted 24 December, 2021; v1 submitted 21 October, 2021; originally announced October 2021.

arXiv:2109.06131 [pdf, other]

A Framework for Developing Algorithms for Estimating Propagation Parameters from Measurements

Authors: Akbar Sayeed, Peter Vouras, Camillo Gentile, Alec Weiss, Jeanne Quimby, Zihang Cheng, Bassel Modad, Yuning Zhang, Chethan Anjinappa, Fatih Erden, Ozgur Ozdemir, Robert Muller, Diego Dupleich, Han Niu, 6David Michelson, 6Aidan Hughes

Abstract: A framework is proposed for developing and evaluating algorithms for extracting multipath propagation components (MPCs) from measurements collected by sounders at millimeter-wave (mmW) frequencies. To focus on algorithmic performance, an idealized model is proposed for the spatial frequency response of the propagation environment measured by a sounder. The input to the sounder model is a pre-deter… ▽ More A framework is proposed for developing and evaluating algorithms for extracting multipath propagation components (MPCs) from measurements collected by sounders at millimeter-wave (mmW) frequencies. To focus on algorithmic performance, an idealized model is proposed for the spatial frequency response of the propagation environment measured by a sounder. The input to the sounder model is a pre-determined set of MPC parameters that serve as the "ground truth." A three-dimensional angle-delay (beamspace) representation of the measured spatial frequency response serves as a natural domain for implementing and analyzing MPC extraction algorithms. Metrics for quantifying the error in estimated MPC parameters are introduced. Initial results are presented for a greedy matching pursuit algorithm that performs a least-squares (LS) reconstruction of the MPC path gains within the iterations. The results indicate that the simple greedy-LS algorithm has the ability to extract MPCs over a large dynamic range, and suggest several avenues for further performance improvement through extensions of the greedy-LS algorithm as well as by incorporating features of other algorithms, such as SAGE and RIMAX. △ Less

Submitted 13 September, 2021; originally announced September 2021.

Journal ref: IEEE Globecom 2020

arXiv:2109.05287 [pdf, other]

Dual-view Snapshot Compressive Imaging via Optical Flow Aided Recurrent Neural Network

Authors: Ruiying Lu, Bo Chen, Guanliang Liu, Ziheng Cheng, Mu Qiao, Xin Yuan

Abstract: Dual-view snapshot compressive imaging (SCI) aims to capture videos from two field-of-views (FoVs) using a 2D sensor (detector) in a single snapshot, achieving joint FoV and temporal compressive sensing, and thus enjoying the advantages of low-bandwidth, low-power, and low-cost. However, it is challenging for existing model-based decoding algorithms to reconstruct each individual scene, which usua… ▽ More Dual-view snapshot compressive imaging (SCI) aims to capture videos from two field-of-views (FoVs) using a 2D sensor (detector) in a single snapshot, achieving joint FoV and temporal compressive sensing, and thus enjoying the advantages of low-bandwidth, low-power, and low-cost. However, it is challenging for existing model-based decoding algorithms to reconstruct each individual scene, which usually require exhaustive parameter tuning with extremely long running time for large scale data. In this paper, we propose an optical flow-aided recurrent neural network for dual video SCI systems, which provides high-quality decoding in seconds. Firstly, we develop a diversity amplification method to enlarge the differences between scenes of two FoVs, and design a deep convolutional neural network with dual branches to separate different scenes from the single measurement. Secondly, we integrate the bidirectional optical flow extracted from adjacent frames with the recurrent neural network to jointly reconstruct each video in a sequential manner. Extensive results on both simulation and real data demonstrate the superior performance of our proposed model in a short inference time. The code and data are available at https://github.com/RuiyingLu/OFaNet-for-Dual-view-SCI. △ Less

Submitted 11 September, 2021; originally announced September 2021.

arXiv:2103.12968 [pdf, other]

Receding Horizon Motion Planning for Multi-Agent Systems: A Velocity Obstacle Based Probabilistic Method

Authors: Xiaoxue Zhang, Jun Ma, Zilong Cheng, Sunan Huang, Tong Heng Lee

Abstract: In this paper, a novel and innovative methodology for feasible motion planning in the multi-agent system is developed. On the basis of velocity obstacles characteristics, the chance constraints are formulated in the receding horizon control (RHC) problem, and geometric information of collision cones is used to generate the feasible regions of velocities for the host agent. By this approach, the mo… ▽ More In this paper, a novel and innovative methodology for feasible motion planning in the multi-agent system is developed. On the basis of velocity obstacles characteristics, the chance constraints are formulated in the receding horizon control (RHC) problem, and geometric information of collision cones is used to generate the feasible regions of velocities for the host agent. By this approach, the motion planning is conducted at the velocity level instead of the position level. Thus, it guarantees a safer collision-free trajectory for the multi-agent system, especially for the systems with high-speed moving agents. Moreover, a probability threshold of potential collisions can be satisfied during the motion planning process. In order to validate the effectiveness of the methodology, different scenarios for multiple agents are investigated, and the simulation results clearly show that the proposed approach can effectively avoid potential collisions with a collision probability less than a specific threshold. △ Less

Submitted 23 March, 2021; originally announced March 2021.

Comments: 8 pages, 7 figures

arXiv:2103.12580 [pdf, other]

doi 10.1109/TMECH.2021.3096601

Global Iterative Sliding Mode Control of an Industrial Biaxial Gantry System for Contouring Motion Tasks

Authors: Wenxin Wang, Jun Ma, Zilong Cheng, Xiaocong Li, Clarence W de Silva, Tong Heng Lee

Abstract: This paper proposes a global iterative sliding mode control approach for high-precision contouring tasks of a flexure-linked biaxial gantry system. For such high-precision contouring tasks, it is the typical situation that the involved multi-axis cooperation is one of the most challenging problems. As also would be inevitably encountered, various factors render the multi-axis cooperation rather di… ▽ More This paper proposes a global iterative sliding mode control approach for high-precision contouring tasks of a flexure-linked biaxial gantry system. For such high-precision contouring tasks, it is the typical situation that the involved multi-axis cooperation is one of the most challenging problems. As also would be inevitably encountered, various factors render the multi-axis cooperation rather difficult; such as the strong coupling (which naturally brings nonlinearity) between different axes due to its mechanical structure, the backlash and deadzone caused by the friction, and the difficulties in system identification, etc. To overcome the above-mentioned issues, this work investigates an intelligent model-free contouring control method for such a multi-axis motion stage. Essentially in the methodology developed here, it is firstly ensured that all the coupling, friction, nonlinearity, and disturbance (regarded as uncertain dynamics in each axis) are suitably posed as `uncertainties'. Then, a varying-gain sliding mode control method is proposed to adaptively compensate for the matched unknown dynamics in the time domain, while an iterative learning law is applied to suppress the undesirable effects (arising from the repetitive matched and unmatched uncertainties in the iteration domain). With this approach, the chattering that typically results from the overestimated control gains in the sliding mode control is thus suppressed during the iterations. To analyze the contouring performance and show the improved outcomes, rigorous proof is furnished on both the stability in the time domain and the convergence in the iteration domain; and the real-time experiments also illustrate that the requirements of precision motion control towards high-speed and complex-curvature references can be satisfied using the proposed method, without prior knowledge of the boundary to the unknown dynamics. △ Less

Submitted 27 August, 2021; v1 submitted 23 March, 2021; originally announced March 2021.

Comments: 11 pages, 8 figures

arXiv:2103.12567 [pdf, other]

Generalized Iterative Super-Twisting Sliding Mode Control: A Case Study on Flexure-Joint Dual-Drive H-Gantry Stage

Authors: Wenxin Wang, Jun Ma, Zilong Cheng, Xiaocong Li, Abdullah Al Mamun, Tong Heng Lee

Abstract: Mechatronic systems are commonly used in the industry, where fast and accurate motion performance is always required to guarantee manufacturing precision and efficiency. Nevertheless, the system model and parameters are difficult to be obtained accurately. Moreover, the high-order modes, strong coupling in the multi-axis systems, or unmodeled frictions will bring uncertain dynamics to the system.… ▽ More Mechatronic systems are commonly used in the industry, where fast and accurate motion performance is always required to guarantee manufacturing precision and efficiency. Nevertheless, the system model and parameters are difficult to be obtained accurately. Moreover, the high-order modes, strong coupling in the multi-axis systems, or unmodeled frictions will bring uncertain dynamics to the system. To overcome the above-mentioned issues and enhance the motion performance, this paper introduces a novel intelligent and totally model-free control method for mechatronic systems with unknown dynamics. In detail, a 2-degree-of-freedom (DOF) architecture is designed, which organically merges a generalized super-twisting algorithm with a unique iterative learning law. The controller solely utilizes the input-output data collected in iterations such that it works without any knowledge of the system parameters. The rigorous proof of convergence ability is given and a case study on flexture-joint dual-drive H-gantry stage is shown to validate the effectiveness of the proposed method. △ Less

Submitted 23 March, 2021; originally announced March 2021.

Comments: 7 pages, 8 figures

arXiv:2103.11569 [pdf, other]

Convex Parameterization and Optimization for Robust Tracking of a Magnetically Levitated Planar Positioning System

Authors: Jun Ma, Zilong Cheng, Haiyue Zhu, Xiaocong Li, Masayoshi Tomizuka, Tong Heng Lee

Abstract: Magnetic levitation positioning technology has attracted considerable research efforts and dedicated attention due to its extremely attractive features. The technology offers high-precision, contactless, dust/lubricant-free, multi-axis, and large-stroke positioning. In this work, we focus on the accurate and smooth tracking problem of a multi-axis magnetically levitated (maglev) planar positioning… ▽ More Magnetic levitation positioning technology has attracted considerable research efforts and dedicated attention due to its extremely attractive features. The technology offers high-precision, contactless, dust/lubricant-free, multi-axis, and large-stroke positioning. In this work, we focus on the accurate and smooth tracking problem of a multi-axis magnetically levitated (maglev) planar positioning system for a specific S-curve reference trajectory. The floating characteristics and the multi-axis coupling make accurate identification of the system dynamics difficult, which lead to a challenge to design a high performance control system. Here, the tracking task is achieved by a 2-Degree of Freedom (DoF) controller consisting of a feedforward controller and a robust stabilizing feedback controller with a prescribed sparsity pattern. The approach proposed in this paper utilizes the basis of an H-infinity controller formulation and a suitably established convex inner approximation. Particularly, a subset of robust stabilizable controllers with prescribed structural constraints is characterized in the parameter space, and so thus the re-formulated convex optimization problem can be easily solved by several powerful numerical algorithms and solvers. With this approach, the robust stability of the overall system is ensured with a satisfactory system performance despite the presence of parametric uncertainties. Furthermore, experimental results clearly demonstrate the effectiveness of the proposed approach. △ Less

Submitted 30 December, 2021; v1 submitted 21 March, 2021; originally announced March 2021.

Comments: 11 pages, 9 figures

arXiv:2103.03089 [pdf, other]

Memory-Efficient Network for Large-scale Video Compressive Sensing

Authors: Ziheng Cheng, Bo Chen, Guanliang Liu, Hao Zhang, Ruiying Lu, Zhengjue Wang, Xin Yuan

Abstract: Video snapshot compressive imaging (SCI) captures a sequence of video frames in a single shot using a 2D detector. The underlying principle is that during one exposure time, different masks are imposed on the high-speed scene to form a compressed measurement. With the knowledge of masks, optimization algorithms or deep learning methods are employed to reconstruct the desired high-speed video frame… ▽ More Video snapshot compressive imaging (SCI) captures a sequence of video frames in a single shot using a 2D detector. The underlying principle is that during one exposure time, different masks are imposed on the high-speed scene to form a compressed measurement. With the knowledge of masks, optimization algorithms or deep learning methods are employed to reconstruct the desired high-speed video frames from this snapshot measurement. Unfortunately, though these methods can achieve decent results, the long running time of optimization algorithms or huge training memory occupation of deep networks still preclude them in practical applications. In this paper, we develop a memory-efficient network for large-scale video SCI based on multi-group reversible 3D convolutional neural networks. In addition to the basic model for the grayscale SCI system, we take one step further to combine demosaicing and SCI reconstruction to directly recover color video from Bayer measurements. Extensive results on both simulation and real data captured by SCI cameras demonstrate that our proposed model outperforms previous state-of-the-art with less memory and thus can be used in large-scale problems. The code is at https://github.com/BoChenGroup/RevSCI-net. △ Less

Submitted 5 March, 2021; v1 submitted 4 March, 2021; originally announced March 2021.

arXiv:2103.01786 [pdf, other]

MetaSCI: Scalable and Adaptive Reconstruction for Video Compressive Sensing

Authors: Zhengjue Wang, Hao Zhang, Ziheng Cheng, Bo Chen, Xin Yuan

Abstract: To capture high-speed videos using a two-dimensional detector, video snapshot compressive imaging (SCI) is a promising system, where the video frames are coded by different masks and then compressed to a snapshot measurement. Following this, efficient algorithms are desired to reconstruct the high-speed frames, where the state-of-the-art results are achieved by deep learning networks. However, the… ▽ More To capture high-speed videos using a two-dimensional detector, video snapshot compressive imaging (SCI) is a promising system, where the video frames are coded by different masks and then compressed to a snapshot measurement. Following this, efficient algorithms are desired to reconstruct the high-speed frames, where the state-of-the-art results are achieved by deep learning networks. However, these networks are usually trained for specific small-scale masks and often have high demands of training time and GPU memory, which are hence {\bf \em not flexible} to $i$) a new mask with the same size and $ii$) a larger-scale mask. We address these challenges by developing a Meta Modulated Convolutional Network for SCI reconstruction, dubbed MetaSCI. MetaSCI is composed of a shared backbone for different masks, and light-weight meta-modulation parameters to evolve to different modulation parameters for each mask, thus having the properties of {\bf \em fast adaptation} to new masks (or systems) and ready to {\bf \em scale to large data}. Extensive simulation and real data results demonstrate the superior performance of our proposed approach. Our code is available at {\small\url{https://github.com/xyvirtualgroup/MetaSCI-CVPR2021}}. △ Less

Submitted 2 March, 2021; originally announced March 2021.

Comments: 12 pages, 6 figures, CVPR 2021

arXiv:2101.09894 [pdf, other]

Alternating Direction Method of Multipliers-Based Parallel Optimization for Multi-Agent Collision-Free Model Predictive Control

Authors: Zilong Cheng, Jun Ma, Wenxin Wang, Zicheng Zhu, Clarence W. de Silva, Tong Heng Lee

Abstract: This paper investigates the collision-free control problem for multi-agent systems. For such multi-agent systems, it is the typical situation where conventional methods using either the usual centralized model predictive control (MPC), or even the distributed counterpart, would suffer from substantial difficulty in balancing optimality and computational efficiency. Additionally, the non-convex cha… ▽ More This paper investigates the collision-free control problem for multi-agent systems. For such multi-agent systems, it is the typical situation where conventional methods using either the usual centralized model predictive control (MPC), or even the distributed counterpart, would suffer from substantial difficulty in balancing optimality and computational efficiency. Additionally, the non-convex characteristics that invariably arise in such collision-free control and optimization problems render it difficult to effectively derive a reliable solution (and also to thoroughly analyze the associated convergence properties). To overcome these challenging issues, this work establishes a suitably novel parallel computation framework through an innovative mathematical problem formulation; and then with this framework and formulation, a parallel algorithm based on alternating direction method of multipliers (ADMM) is presented to solve the sub-problems arising from the resulting parallel structure. Furthermore, an efficient and intuitive initialization procedure is developed to accelerate the optimization process, and the optimum is thus determined with significantly improved computational efficiency. As supported by rigorous proofs, the convergence of the proposed ADMM iterations for this non-convex optimization problem is analyzed and discussed in detail. Finally, a simulation with a group of unmanned aerial vehicles (UAVs) serves as an illustrative example here to demonstrate the effectiveness and efficiency of the proposed approach. Also, the simulation results verify significant improvements in accuracy and computational efficiency compared to other baselines, including primal quadratic mixed integer programming (PQ-MIP), non-convex quadratic mixed integer programming (NC-MIP), and non-convex quadratically constrained quadratic programming (NC-QCQP). △ Less

Submitted 6 February, 2024; v1 submitted 24 January, 2021; originally announced January 2021.

arXiv:2101.00202

Sequential Convex Programming for Collaboration of Connected and Automated Vehicles

Authors: Xiaoxue Zhang, Jun Ma, Zilong Cheng, Frank L. Lewis, Tong Heng Lee

Abstract: This paper investigates the collaboration of multiple connected and automated vehicles (CAVs) in different scenarios. In general, the collaboration of CAVs can be formulated as a nonlinear and nonconvex model predictive control (MPC) problem. Most of the existing approaches available for utilization to solve such an optimization problem suffer from the drawback of considerable computational burden… ▽ More This paper investigates the collaboration of multiple connected and automated vehicles (CAVs) in different scenarios. In general, the collaboration of CAVs can be formulated as a nonlinear and nonconvex model predictive control (MPC) problem. Most of the existing approaches available for utilization to solve such an optimization problem suffer from the drawback of considerable computational burden, which hinders the practical implementation in real time. This paper proposes the use of sequential convex programming (SCP), which is a powerful approach to solving the nonlinear and nonconvex MPC problem in real time. To appropriately deploy the methodology, as a first stage, SCP requires linearization and discretization when addressing the nonlinear dynamics of the system model adequately. Based on the linearization and discretization, the original MPC problem can be transformed into a quadratically constrained quadratic programming (QCQP) problem. Besides, SCP also involves convexification to handle the associated nonconvex constraints. Thus, the nonconvex QCQP can be reduced to a quadratic programming (QP) problem that can be solved rather quickly. Therefore, the computational efficiency is suitably improved despite the existence of nonlinear and nonconvex characteristics, whereby the implementation is realized in real time. Furthermore, simulation results in three different scenarios of autonomous driving are presented to validate the effectiveness and efficiency of our proposed approach. △ Less

Submitted 24 July, 2022; v1 submitted 1 January, 2021; originally announced January 2021.

Comments: With internal discussions and upon agreement from all co-authors, we would like to withdraw this preprint

Showing 1–50 of 80 results for author: Cheng, Z