Zum Hauptinhalt springen

Showing 1–50 of 123 results for author: Cao, X

Searching in archive eess. Search in all archives.
.
  1. arXiv:2408.06667  [pdf, ps, other

    eess.SP

    Joint Source-Channel Optimization for UAV Video Coding and Transmission

    Authors: Kesong Wu, Xianbin Cao, Peng Yang, Haijun Zhang, Tony Q. S. Quek, Dapeng Oliver Wu

    Abstract: This paper is concerned with unmanned aerial vehicle (UAV) video coding and transmission in scenarios such as emergency rescue and environmental monitoring. Unlike existing methods of modeling UAV video source coding and channel transmission separately, we investigate the joint source-channel optimization issue for video coding and transmission. Particularly, we design eight-dimensional delay-powe… ▽ More

    Submitted 19 August, 2024; v1 submitted 13 August, 2024; originally announced August 2024.

  2. arXiv:2408.04192  [pdf, ps, other

    eess.SP

    Pilot-Aided Joint Time Synchronization and Channel Estimation for OTFS

    Authors: Jiazheng Sun, Peng Yang, Xianbin Cao, Zehui Xiong, Haijun Zhang, Tony Q. S. Quek

    Abstract: This letter proposes a pilot-aided joint time synchronization and channel estimation (JTSCE) algorithm for orthogonal time frequency space (OTFS) systems. Unlike existing algorithms, JTSCE employs a maximum length sequence (MLS) rather than an isolated signal as the pilot. Distinctively, JTSCE explores MLS's autocorrelation properties to estimate timing offset and channel delay taps. After obtaini… ▽ More

    Submitted 13 August, 2024; v1 submitted 7 August, 2024; originally announced August 2024.

  3. arXiv:2407.21395  [pdf, other

    eess.IV

    HINER: Neural Representation for Hyperspectral Image

    Authors: Junqi Shi, Mingyi Jiang, Ming Lu, Tong Chen, Xun Cao, Zhan Ma

    Abstract: This paper introduces {HINER}, a novel neural representation for compressing HSI and ensuring high-quality downstream tasks on compressed HSI. HINER fully exploits inter-spectral correlations by explicitly encoding of spectral wavelengths and achieves a compact representation of the input HSI sample through joint optimization with a learnable decoder. By additionally incorporating the Content Angl… ▽ More

    Submitted 31 July, 2024; originally announced July 2024.

    Comments: ACM MM24

  4. arXiv:2407.16554  [pdf, other

    cs.MM cs.CV cs.SD eess.AS

    Coarse-to-Fine Proposal Refinement Framework for Audio Temporal Forgery Detection and Localization

    Authors: Junyan Wu, Wei Lu, Xiangyang Luo, Rui Yang, Qian Wang, Xiaochun Cao

    Abstract: Recently, a novel form of audio partial forgery has posed challenges to its forensics, requiring advanced countermeasures to detect subtle forgery manipulations within long-duration audio. However, existing countermeasures still serve a classification purpose and fail to perform meaningful analysis of the start and end timestamps of partial forgery segments. To address this challenge, we introduce… ▽ More

    Submitted 23 July, 2024; originally announced July 2024.

    Comments: 9pages, 3figures. This paper has been accepted for ACM MM 2024

    MSC Class: 68T07; 68T10 ACM Class: I.2; I.5

  5. arXiv:2407.08509  [pdf, other

    eess.IV cs.CV

    Haar Nuclear Norms with Applications to Remote Sensing Imagery Restoration

    Authors: Shuang Xu, Chang Yu, Jiangjun Peng, Xiangyong Cao

    Abstract: Remote sensing image restoration aims to reconstruct missing or corrupted areas within images. To date, low-rank based models have garnered significant interest in this field. This paper proposes a novel low-rank regularization term, named the Haar nuclear norm (HNN), for efficient and effective remote sensing image restoration. It leverages the low-rank properties of wavelet coefficients derive… ▽ More

    Submitted 11 July, 2024; originally announced July 2024.

  6. arXiv:2407.06633  [pdf, other

    eess.IV cs.CV

    Variational Zero-shot Multispectral Pansharpening

    Authors: Xiangyu Rui, Xiangyong Cao, Yining Li, Deyu Meng

    Abstract: Pansharpening aims to generate a high spatial resolution multispectral image (HRMS) by fusing a low spatial resolution multispectral image (LRMS) and a panchromatic image (PAN). The most challenging issue for this task is that only the to-be-fused LRMS and PAN are available, and the existing deep learning-based methods are unsuitable since they rely on many training pairs. Traditional variational… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

  7. arXiv:2407.06064  [pdf, other

    eess.IV cs.CV

    Pan-denoising: Guided Hyperspectral Image Denoising via Weighted Represent Coefficient Total Variation

    Authors: Shuang Xu, Qiao Ke, Jiangjun Peng, Xiangyong Cao, Zixiang Zhao

    Abstract: This paper introduces a novel paradigm for hyperspectral image (HSI) denoising, which is termed \textit{pan-denoising}. In a given scene, panchromatic (PAN) images capture similar structures and textures to HSIs but with less noise. This enables the utilization of PAN images to guide the HSI denoising process. Consequently, pan-denoising, which incorporates an additional prior, has the potential t… ▽ More

    Submitted 8 July, 2024; originally announced July 2024.

  8. arXiv:2407.04928  [pdf, other

    cs.CV eess.IV

    CLIPVQA:Video Quality Assessment via CLIP

    Authors: Fengchuang Xing, Mingjie Li, Yuan-Gen Wang, Guopu Zhu, Xiaochun Cao

    Abstract: In learning vision-language representations from web-scale data, the contrastive language-image pre-training (CLIP) mechanism has demonstrated a remarkable performance in many vision tasks. However, its application to the widely studied video quality assessment (VQA) task is still an open issue. In this paper, we propose an efficient and effective CLIP-based Transformer method for the VQA problem… ▽ More

    Submitted 5 July, 2024; originally announced July 2024.

  9. arXiv:2407.03335  [pdf, other

    math.NA cs.CV eess.IV

    Dual-Domain Deep D-bar Method for Solving Electrical Impedance Tomography

    Authors: Xiang Cao, Qiaoqiao Ding, Xiaoqun Zhang

    Abstract: The regularized D-bar method is one of the most prominent methods for solving Electrical Impedance Tomography (EIT) problems due to its efficiency and simplicity. It provides a direct approach by applying low-pass filtering to the scattering data in the non-linear Fourier domain, thereby yielding a smoothed conductivity approximation. However, D-bar images often present low contrast and low resolu… ▽ More

    Submitted 12 May, 2024; originally announced July 2024.

    Comments: 15 pages, 7 figures

  10. arXiv:2407.00933  [pdf, other

    cs.DC eess.SP

    Reconfigurable Intelligent Computational Surfaces for MEC-Assisted Autonomous Driving Networks: Design Optimization and Analysis

    Authors: Xueyao Zhang, Bo Yang, Zhiwen Yu, Xuelin Cao, George C. Alexandropoulos, Yan Zhang, Merouane Debbah, Chau Yuen

    Abstract: This paper investigates autonomous driving safety improvement via task offloading from cellular vehicles (CVs) to a multi-access edge computing (MEC) server using vehicle-to-infrastructure (V2I) links. Considering that the latter links can be reused by vehicle-to-vehicle (V2V) communications to improve spectrum utilization, the receiver of the V2I link may suffer from severe interference that can… ▽ More

    Submitted 30 June, 2024; originally announced July 2024.

  11. arXiv:2406.18055  [pdf, other

    cs.IT eess.SP

    Filtering Reconfigurable Intelligent Computational Surface for RF Spectrum Purification

    Authors: Kaining Wang, Bo Yang, Zhiwen Yu, Xuelin Cao, Mérouane Debbah, Chau Yuen

    Abstract: The increasing demand for communication is degrading the electromagnetic (EM) transmission environment due to severe EM interference, significantly reducing the efficiency of the radio frequency (RF) spectrum. Metasurfaces, a promising technology for controlling desired EM waves, have recently received significant attention from both academia and industry. However, the potential impact of out-of-b… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

  12. arXiv:2406.13335  [pdf, other

    cs.NI eess.SP

    AI-Empowered Multiple Access for 6G: A Survey of Spectrum Sensing, Protocol Designs, and Optimizations

    Authors: Xuelin Cao, Bo Yang, Kaining Wang, Xinghua Li, Zhiwen Yu, Chau Yuen, Yan Zhang, Zhu Han

    Abstract: With the rapidly increasing number of bandwidth-intensive terminals capable of intelligent computing and communication, such as smart devices equipped with shallow neural network models, the complexity of multiple access for these intelligent terminals is increasing due to the dynamic network environment and ubiquitous connectivity in 6G systems. Traditional multiple access (MA) design and optimiz… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

  13. arXiv:2406.05982  [pdf

    eess.IV cs.LG physics.med-ph

    Artificial Intelligence for Neuro MRI Acquisition: A Review

    Authors: Hongjia Yang, Guanhua Wang, Ziyu Li, Haoxiang Li, Jialan Zheng, Yuxin Hu, Xiaozhi Cao, Congyu Liao, Huihui Ye, Qiyuan Tian

    Abstract: Magnetic resonance imaging (MRI) has significantly benefited from the resurgence of artificial intelligence (AI). By leveraging AI's capabilities in large-scale optimization and pattern recognition, innovative methods are transforming the MRI acquisition workflow, including planning, sequence design, and correction of acquisition artifacts. These emerging algorithms demonstrate substantial potenti… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

    Comments: Magn Reson Mater Phy (2024)

  14. arXiv:2405.10513  [pdf, other

    cs.LG eess.SP

    Federated Learning With Energy Harvesting Devices: An MDP Framework

    Authors: Kai Zhang, Xuanyu Cao

    Abstract: Federated learning (FL) requires edge devices to perform local training and exchange information with a parameter server, leading to substantial energy consumption. A critical challenge in practical FL systems is the rapid energy depletion of battery-limited edge devices, which curtails their operational lifespan and affects the learning performance. To address this issue, we apply energy harvesti… ▽ More

    Submitted 16 May, 2024; originally announced May 2024.

  15. arXiv:2404.19167  [pdf

    eess.IV physics.med-ph

    Advancing low-field MRI with a universal denoising imaging transformer: Towards fast and high-quality imaging

    Authors: Zheren Zhu, Azaan Rehman, Xiaozhi Cao, Congyu Liao, Yoo Jin Lee, Michael Ohliger, Hui Xue, Yang Yang

    Abstract: Recent developments in low-field (LF) magnetic resonance imaging (MRI) systems present remarkable opportunities for affordable and widespread MRI access. A robust denoising method to overcome the intrinsic low signal-noise-ratio (SNR) barrier is critical to the success of LF MRI. However, current data-driven MRI denoising methods predominantly handle magnitude images and rely on customized models… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

  16. GNSS Measurement-Based Context Recognition for Vehicle Navigation using Gated Recurrent Unit

    Authors: Sheng Liu, Zhiqiang Yao, Xuemeng Cao, Xiaowen Cai

    Abstract: Recent years, people have put forward higher and higher requirements for context-adaptive navigation (CAN). CAN system realizes seamless navigation in complex environments by recognizing the ambient surroundings of vehicles, and it is crucial to develop a fast, reliable, and robust navigational context recognition (NCR) method to enable CAN systems to operate effectively. Environmental context rec… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

    Comments: 9 pages, 9 figures, 5 tables

    Journal ref: Proceedings of the 36th International Technical Meeting of the Satellite Division of The Institute of Navigation (ION GNSS+ 2023)

  17. arXiv:2404.07215  [pdf, other

    cs.NI cs.AI eess.SP

    Computation Offloading for Multi-server Multi-access Edge Vehicular Networks: A DDQN-based Method

    Authors: Siyu Wang, Bo Yang, Zhiwen Yu, Xuelin Cao, Yan Zhang, Chau Yuen

    Abstract: In this paper, we investigate a multi-user offloading problem in the overlapping domain of a multi-server mobile edge computing system. We divide the original problem into two stages: the offloading decision making stage and the request scheduling stage. To prevent the terminal from going out of service area during offloading, we consider the mobility parameter of the terminal according to the hum… ▽ More

    Submitted 20 February, 2024; originally announced April 2024.

  18. arXiv:2404.03327  [pdf, other

    cs.CV eess.IV

    DI-Retinex: Digital-Imaging Retinex Theory for Low-Light Image Enhancement

    Authors: Shangquan Sun, Wenqi Ren, Jingyang Peng, Fenglong Song, Xiaochun Cao

    Abstract: Many existing methods for low-light image enhancement (LLIE) based on Retinex theory ignore important factors that affect the validity of this theory in digital imaging, such as noise, quantization error, non-linearity, and dynamic range overflow. In this paper, we propose a new expression called Digital-Imaging Retinex theory (DI-Retinex) through theoretical and experimental analysis of Retinex t… ▽ More

    Submitted 4 April, 2024; originally announced April 2024.

  19. Stochastic-Robust Planning of Networked Hydrogen-Electrical Microgrids: A Study on Induced Refueling Demand

    Authors: Xunhang Sun, Xiaoyu Cao, Bo Zeng, Qiaozhu Zhai, Tamer Başar, Xiaohong Guan

    Abstract: Hydrogen-electrical microgrids are increasingly assuming an important role on the pathway toward decarbonization of energy and transportation systems. This paper studies networked hydrogen-electrical microgrids planning (NHEMP), considering a critical but often-overlooked issue, i.e., the demand-inducing effect (DIE) associated with infrastructure development decisions. Specifically, higher refuel… ▽ More

    Submitted 27 August, 2024; v1 submitted 31 March, 2024; originally announced April 2024.

  20. arXiv:2403.13562  [pdf, other

    eess.SY

    Augmented LRFS-based Filter: Holistic Tracking of Group Objects

    Authors: Chaoqun Yang, Xiaowei Liang, Zhiguo Shi, Heng Zhang, Xianghui Cao

    Abstract: This paper addresses the problem of group target tracking (GTT), wherein multiple closely spaced targets within a group pose a coordinated motion. To improve the tracking performance, the labeled random finite sets (LRFSs) theory is adopted, and this paper develops a new kind of LRFSs, i.e., augmented LRFSs, which introduces group information into the definition of LRFSs. Specifically, for each el… ▽ More

    Submitted 19 August, 2024; v1 submitted 20 March, 2024; originally announced March 2024.

  21. arXiv:2403.13346  [pdf, other

    eess.SY

    A Control-Recoverable Added-Noise-based Privacy Scheme for LQ Control in Networked Control Systems

    Authors: Xuening Tang, Xianghui Cao, Wei Xing Zheng

    Abstract: As networked control systems continue to evolve, ensuring the privacy of sensitive data becomes an increasingly pressing concern, especially in situations where the controller is physically separated from the plant. In this paper, we propose a secure control scheme for computing linear quadratic control in a networked control system utilizing two networked controllers, a privacy encoder and a cont… ▽ More

    Submitted 22 March, 2024; v1 submitted 20 March, 2024; originally announced March 2024.

  22. arXiv:2403.05906  [pdf, other

    eess.IV cs.CV

    Segmentation Guided Sparse Transformer for Under-Display Camera Image Restoration

    Authors: Jingyun Xue, Tao Wang, Jun Wang, Kaihao Zhang, Wenhan Luo, Wenqi Ren, Zikun Liu, Hyunhee Park, Xiaochun Cao

    Abstract: Under-Display Camera (UDC) is an emerging technology that achieves full-screen display via hiding the camera under the display panel. However, the current implementation of UDC causes serious degradation. The incident light required for camera imaging undergoes attenuation and diffraction when passing through the display panel, leading to various artifacts in UDC imaging. Presently, the prevailing… ▽ More

    Submitted 9 March, 2024; originally announced March 2024.

    Comments: 13 pages, 10 figures, conference or other essential info

  23. arXiv:2403.05247  [pdf, other

    cs.CV eess.IV

    Hide in Thicket: Generating Imperceptible and Rational Adversarial Perturbations on 3D Point Clouds

    Authors: Tianrui Lou, Xiaojun Jia, Jindong Gu, Li Liu, Siyuan Liang, Bangyan He, Xiaochun Cao

    Abstract: Adversarial attack methods based on point manipulation for 3D point cloud classification have revealed the fragility of 3D models, yet the adversarial examples they produce are easily perceived or defended against. The trade-off between the imperceptibility and adversarial strength leads most point attack methods to inevitably introduce easily detectable outlier points upon a successful attack. An… ▽ More

    Submitted 8 March, 2024; originally announced March 2024.

    Comments: Accepted by CVPR 2024

  24. arXiv:2402.15865  [pdf, other

    cs.CV eess.IV

    HIR-Diff: Unsupervised Hyperspectral Image Restoration Via Improved Diffusion Models

    Authors: Li Pang, Xiangyu Rui, Long Cui, Hongzhong Wang, Deyu Meng, Xiangyong Cao

    Abstract: Hyperspectral image (HSI) restoration aims at recovering clean images from degraded observations and plays a vital role in downstream tasks. Existing model-based methods have limitations in accurately modeling the complex image characteristics with handcraft priors, and deep learning-based methods suffer from poor generalization ability. To alleviate these issues, this paper proposes an unsupervis… ▽ More

    Submitted 24 February, 2024; originally announced February 2024.

  25. arXiv:2402.00398  [pdf, other

    cs.DC eess.SP

    Reconfigurable Intelligent Computational Surfaces for MEC-Assisted Autonomous Driving Networks

    Authors: Bo Yang, Xueyao Zhang, Zhiwen Yu, Xuelin Cao, Chongwen Huang, George C. Alexandropoulos, Yan Zhang, Merouane Debbah, Chau Yuen

    Abstract: In this paper, we focus on improving autonomous driving safety via task offloading from cellular vehicles (CVs), using vehicle-to-infrastructure (V2I) links, to an multi-access edge computing (MEC) server. Considering that the frequencies used for V2I links can be reused for vehicle-to-vehicle (V2V) communications to improve spectrum utilization, the receiver of each V2I link may suffer from sever… ▽ More

    Submitted 1 February, 2024; originally announced February 2024.

  26. DJCM: A Deep Joint Cascade Model for Singing Voice Separation and Vocal Pitch Estimation

    Authors: Haojie Wei, Xueke Cao, Wenbo Xu, Tangpeng Dan, Yueguo Chen

    Abstract: Singing voice separation and vocal pitch estimation are pivotal tasks in music information retrieval. Existing methods for simultaneous extraction of clean vocals and vocal pitches can be classified into two categories: pipeline methods and naive joint learning methods. However, the efficacy of these methods is limited by the following problems: On the one hand, pipeline methods train models for e… ▽ More

    Submitted 8 January, 2024; originally announced January 2024.

    Comments: This paper has been accepted by ICASSP 2024

  27. arXiv:2401.03381  [pdf, other

    math.OC eess.SY

    Distributionally Robust Frequency-Constrained Microgrid Scheduling Towards Seamless Islanding

    Authors: Lun Yang, Haoxiang Yang, Xiaoyu Cao, Xiaohong Guan

    Abstract: Unscheduled islanding events of microgrids result in the transition between grid-connected and islanded modes and induce a sudden and unknown power imbalance, posing a threat to frequency security. To achieve seamless islanding, we propose a distributionally robust frequency-constrained microgrid scheduling model considering unscheduled islanding events. This model co-optimizes unit commitments, p… ▽ More

    Submitted 6 January, 2024; originally announced January 2024.

    Comments: 10 pages, 7 figures

  28. arXiv:2312.13523  [pdf

    physics.med-ph eess.IV

    High-resolution myelin-water fraction and quantitative relaxation mapping using 3D ViSTa-MR fingerprinting

    Authors: Congyu Liao, Xiaozhi Cao, Siddharth Srinivasan Iyer, Sophie Schauman, Zihan Zhou, Xiaoqian Yan, Quan Chen, Zhitao Li, Nan Wang, Ting Gong, Zhe Wu, Hongjian He, Jianhui Zhong, Yang Yang, Adam Kerr, Kalanit Grill-Spector, Kawin Setsompop

    Abstract: Purpose: This study aims to develop a high-resolution whole-brain multi-parametric quantitative MRI approach for simultaneous mapping of myelin-water fraction (MWF), T1, T2, and proton-density (PD), all within a clinically feasible scan time. Methods: We developed 3D ViSTa-MRF, which combined Visualization of Short Transverse relaxation time component (ViSTa) technique with MR Fingerprinting (MR… ▽ More

    Submitted 20 December, 2023; originally announced December 2023.

    Comments: 38 pages, 12 figures and 1 table

    Journal ref: Magnetic Resonance in Medicine 2023

  29. arXiv:2312.13127  [pdf, other

    eess.IV cs.CV

    Pixel-to-Abundance Translation: Conditional Generative Adversarial Networks Based on Patch Transformer for Hyperspectral Unmixing

    Authors: Li Wang, Xiaohua Zhang, Longfei Li, Hongyun Meng, Xianghai Cao

    Abstract: Spectral unmixing is a significant challenge in hyperspectral image processing. Existing unmixing methods utilize prior knowledge about the abundance distribution to solve the regularization optimization problem, where the difficulty lies in choosing appropriate prior knowledge and solving the complex regularization optimization problem. To solve these problems, we propose a hyperspectral conditio… ▽ More

    Submitted 20 December, 2023; originally announced December 2023.

  30. arXiv:2312.09488  [pdf

    eess.IV cs.LG physics.med-ph

    Sequence adaptive field-imperfection estimation (SAFE): retrospective estimation and correction of $B_1^+$ and $B_0$ inhomogeneities for enhanced MRF quantification

    Authors: Mengze Gao, Xiaozhi Cao, Daniel Abraham, Zihan Zhou, Kawin Setsompop

    Abstract: $B_1^+$ and $B_0$ field-inhomogeneities can significantly reduce accuracy and robustness of MRF's quantitative parameter estimates. Additional $B_1^+$ and $B_0$ calibration scans can mitigate this but add scan time and cannot be applied retrospectively to previously collected data. Here, we proposed a calibration-free sequence-adaptive deep-learning framework, to estimate and correct for $B_1^+… ▽ More

    Submitted 14 December, 2023; originally announced December 2023.

    Comments: 12 pages, 5 figures, submitted to International Society for Magnetic Resonance in Medicine 31th Scientific Meeting, 2024

  31. arXiv:2311.15556  [pdf, other

    cs.CV eess.IV

    PKU-I2IQA: An Image-to-Image Quality Assessment Database for AI Generated Images

    Authors: Jiquan Yuan, Xinyan Cao, Changjin Li, Fanyi Yang, Jinlong Lin, Xixin Cao

    Abstract: As image generation technology advances, AI-based image generation has been applied in various fields and Artificial Intelligence Generated Content (AIGC) has garnered widespread attention. However, the development of AI-based image generative models also brings new problems and challenges. A significant challenge is that AI-generated images (AIGI) may exhibit unique distortions compared to natura… ▽ More

    Submitted 29 November, 2023; v1 submitted 27 November, 2023; originally announced November 2023.

    Comments: 18 pages

  32. arXiv:2310.10823  [pdf, other

    eess.SP

    Implicit Representation of GRAPPA Kernels for Fast MRI Reconstruction

    Authors: Daniel Abraham, Mark Nishimura, Xiaozhi Cao, Congyu Liao, Kawin Setsompop

    Abstract: MRI data is acquired in Fourier space/k-space. Data acquisition is typically performed on a Cartesian grid in this space to enable the use of a fast Fourier transform algorithm to achieve fast and efficient reconstruction. However, it has been shown that for multiple applications, non-Cartesian data acquisition can improve the performance of MR imaging by providing fast and more efficient data acq… ▽ More

    Submitted 14 January, 2024; v1 submitted 16 October, 2023; originally announced October 2023.

  33. arXiv:2310.09025  [pdf, other

    cs.NI eess.SP

    Survey on Near-Space Information Networks: Channel Modeling, Networking, and Transmission Perspectives

    Authors: Xianbin Cao, Peng Yang, Xiaoning Su

    Abstract: Near-space information networks (NSINs) composed of high-altitude platforms (HAPs) and high- and low-altitude unmanned aerial vehicles (UAVs) are a new regime for providing quick, robust, and cost-efficient sensing and communication services. Precipitated by innovations and breakthroughs in manufacturing, materials, communications, electronics, and control techniques, NSINs have been envisioned as… ▽ More

    Submitted 13 May, 2024; v1 submitted 13 October, 2023; originally announced October 2023.

  34. Continuous 3D Myocardial Motion Tracking via Echocardiography

    Authors: Chengkang Shen, Hao Zhu, You Zhou, Yu Liu, Si Yi, Lili Dong, Weipeng Zhao, David J. Brady, Xun Cao, Zhan Ma, Yi Lin

    Abstract: Myocardial motion tracking stands as an essential clinical tool in the prevention and detection of cardiovascular diseases (CVDs), the foremost cause of death globally. However, current techniques suffer from incomplete and inaccurate motion estimation of the myocardium in both spatial and temporal dimensions, hindering the early identification of myocardial dysfunction. To address these challenge… ▽ More

    Submitted 27 June, 2024; v1 submitted 4 October, 2023; originally announced October 2023.

    Comments: 18 pages, 11 figures

    Journal ref: IEEE Transactions on Medical Imaging, June 2024

  35. arXiv:2310.02690  [pdf, other

    eess.IV cs.CV

    Multi-Dimension-Embedding-Aware Modality Fusion Transformer for Psychiatric Disorder Clasification

    Authors: Guoxin Wang, Xuyang Cao, Shan An, Fengmei Fan, Chao Zhang, Jinsong Wang, Feng Yu, Zhiren Wang

    Abstract: Deep learning approaches, together with neuroimaging techniques, play an important role in psychiatric disorders classification. Previous studies on psychiatric disorders diagnosis mainly focus on using functional connectivity matrices of resting-state functional magnetic resonance imaging (rs-fMRI) as input, which still needs to fully utilize the rich temporal information of the time series of rs… ▽ More

    Submitted 4 October, 2023; originally announced October 2023.

  36. arXiv:2309.16372  [pdf, other

    cs.CV eess.IV

    Aperture Diffraction for Compact Snapshot Spectral Imaging

    Authors: Tao Lv, Hao Ye, Quan Yuan, Zhan Shi, Yibo Wang, Shuming Wang, Xun Cao

    Abstract: We demonstrate a compact, cost-effective snapshot spectral imaging system named Aperture Diffraction Imaging Spectrometer (ADIS), which consists only of an imaging lens with an ultra-thin orthogonal aperture mask and a mosaic filter sensor, requiring no additional physical footprint compared to common RGB cameras. Then we introduce a new optical design that each point in the object space is multip… ▽ More

    Submitted 27 September, 2023; originally announced September 2023.

    Comments: accepted by International Conference on Computer Vision (ICCV) 2023

  37. arXiv:2309.13166  [pdf, other

    cs.SD cs.CR cs.LG eess.AS

    Invisible Watermarking for Audio Generation Diffusion Models

    Authors: Xirong Cao, Xiang Li, Divyesh Jadav, Yanzhao Wu, Zhehui Chen, Chen Zeng, Wenqi Wei

    Abstract: Diffusion models have gained prominence in the image domain for their capabilities in data generation and transformation, achieving state-of-the-art performance in various tasks in both image and audio domains. In the rapidly evolving field of audio-based machine learning, safeguarding model integrity and establishing data copyright are of paramount importance. This paper presents the first waterm… ▽ More

    Submitted 31 October, 2023; v1 submitted 22 September, 2023; originally announced September 2023.

    Comments: This is an invited paper for IEEE TPS, part of the IEEE CIC/CogMI/TPS 2023 conference

  38. arXiv:2309.11745  [pdf, other

    eess.IV cs.CV cs.LG

    PIE: Simulating Disease Progression via Progressive Image Editing

    Authors: Kaizhao Liang, Xu Cao, Kuei-Da Liao, Tianren Gao, Wenqian Ye, Zhengyu Chen, Jianguo Cao, Tejas Nama, Jimeng Sun

    Abstract: Disease progression simulation is a crucial area of research that has significant implications for clinical diagnosis, prognosis, and treatment. One major challenge in this field is the lack of continuous medical imaging monitoring of individual patients over time. To address this issue, we develop a novel framework termed Progressive Image Editing (PIE) that enables controlled manipulation of dis… ▽ More

    Submitted 5 October, 2023; v1 submitted 20 September, 2023; originally announced September 2023.

    Comments: Code and checkpoints for replicating our results can be found at https://github.com/IrohXu/PIE and https://huggingface.co/IrohXu/stable-diffusion-mimic-cxr-v0.1

  39. arXiv:2309.05964  [pdf, other

    cs.NI eess.SP

    Massive Access of Static and Mobile Users via Reconfigurable Intelligent Surfaces: Protocol Design and Performance Analysis

    Authors: Xuelin Cao, Bo Yang, Chongwen Huang, George C. Alexandropoulos, Chau Yuen, Zhu Han, H. Vincent Poor, Lajos Hanzo

    Abstract: The envisioned wireless networks of the future entail the provisioning of massive numbers of connections, heterogeneous data traffic, ultra-high spectral efficiency, and low latency services. This vision is spurring research activities focused on defining a next generation multiple access (NGMA) protocol that can accommodate massive numbers of users in different resource blocks, thereby, achieving… ▽ More

    Submitted 12 September, 2023; originally announced September 2023.

  40. arXiv:2309.00907  [pdf, other

    eess.SP cs.LG

    A Multi-Head Ensemble Multi-Task Learning Approach for Dynamical Computation Offloading

    Authors: Ruihuai Liang, Bo Yang, Zhiwen Yu, Xuelin Cao, Derrick Wing Kwan Ng, Chau Yuen

    Abstract: Computation offloading has become a popular solution to support computationally intensive and latency-sensitive applications by transferring computing tasks to mobile edge servers (MESs) for execution, which is known as mobile/multi-access edge computing (MEC). To improve the MEC performance, it is required to design an optimal offloading strategy that includes offloading decision (i.e., whether o… ▽ More

    Submitted 2 September, 2023; originally announced September 2023.

  41. arXiv:2308.16612  [pdf, other

    cs.CV eess.IV

    Neural Gradient Regularizer

    Authors: Shuang Xu, Yifan Wang, Zixiang Zhao, Jiangjun Peng, Xiangyong Cao, Deyu Meng, Yulun Zhang, Radu Timofte, Luc Van Gool

    Abstract: Owing to its significant success, the prior imposed on gradient maps has consistently been a subject of great interest in the field of image processing. Total variation (TV), one of the most representative regularizers, is known for its ability to capture the intrinsic sparsity prior underlying gradient maps. Nonetheless, TV and its variants often underestimate the gradient maps, leading to the we… ▽ More

    Submitted 13 September, 2023; v1 submitted 31 August, 2023; originally announced August 2023.

  42. arXiv:2308.10196  [pdf, other

    cs.CV eess.IV

    Blind Face Restoration for Under-Display Camera via Dictionary Guided Transformer

    Authors: Jingfan Tan, Xiaoxu Chen, Tao Wang, Kaihao Zhang, Wenhan Luo, Xiaocun Cao

    Abstract: By hiding the front-facing camera below the display panel, Under-Display Camera (UDC) provides users with a full-screen experience. However, due to the characteristics of the display, images taken by UDC suffer from significant quality degradation. Methods have been proposed to tackle UDC image restoration and advances have been achieved. There are still no specialized methods and datasets for res… ▽ More

    Submitted 1 December, 2023; v1 submitted 20 August, 2023; originally announced August 2023.

    Comments: To appear in IEEE TCSVT

  43. arXiv:2308.07127  [pdf, other

    eess.SY

    A Lightweight Sensor Scheduler Based on AoI Function for Remote State Estimation over Lossy Wireless Channels

    Authors: Taige Chang, Xianghui Cao, Wei Xing Zheng

    Abstract: This paper investigates the problem of sensor scheduling for remotely estimating the states of heterogeneous dynamical systems over resource-limited and lossy wireless channels. Considering the low time complexity and high versatility requirements of schedulers deployed on the transport layer, we propose a lightweight scheduler based on an Age of Information (AoI) function built with the tight sca… ▽ More

    Submitted 30 August, 2023; v1 submitted 14 August, 2023; originally announced August 2023.

  44. arXiv:2307.12264  [pdf, ps, other

    cs.NI eess.SP

    QoE-Driven Video Transmission: Energy-Efficient Multi-UAV Network Optimization

    Authors: Kesong Wu, Xianbin Cao, Peng Yang, Zongyang Yu, Dapeng Oliver Wu, Tony Q. S. Quek

    Abstract: This paper is concerned with the issue of improving video subscribers' quality of experience (QoE) by deploying a multi-unmanned aerial vehicle (UAV) network. Different from existing works, we characterize subscribers' QoE by video bitrates, latency, and frame freezing and propose to improve their QoE by energy-efficiently and dynamically optimizing the multi-UAV network in terms of serving UAV se… ▽ More

    Submitted 23 July, 2023; originally announced July 2023.

  45. arXiv:2306.17799  [pdf, other

    cs.CV cs.SD eess.AS

    A Low-rank Matching Attention based Cross-modal Feature Fusion Method for Conversational Emotion Recognition

    Authors: Yuntao Shou, Xiangyong Cao, Deyu Meng, Bo Dong, Qinghua Zheng

    Abstract: Conversational emotion recognition (CER) is an important research topic in human-computer interactions. Although deep learning (DL) based CER approaches have achieved excellent performance, existing cross-modal feature fusion methods used in these DL-based approaches either ignore the intra-modal and inter-modal emotional interaction or have high computational complexity. To address these issues,… ▽ More

    Submitted 16 June, 2023; originally announced June 2023.

    Comments: 10 pages, 4 figures

  46. arXiv:2306.17797  [pdf, other

    cs.CV eess.IV

    HIDFlowNet: A Flow-Based Deep Network for Hyperspectral Image Denoising

    Authors: Li Pang, Weizhen Gu, Xiangyong Cao, Xiangyu Rui, Jiangjun Peng, Shuang Xu, Gang Yang, Deyu Meng

    Abstract: Hyperspectral image (HSI) denoising is essentially ill-posed since a noisy HSI can be degraded from multiple clean HSIs. However, current deep learning-based approaches ignore this fact and restore the clean image with deterministic mapping (i.e., the network receives a noisy HSI and outputs a clean HSI). To alleviate this issue, this paper proposes a flow-based HSI denoising network (HIDFlowNet)… ▽ More

    Submitted 20 June, 2023; originally announced June 2023.

    Comments: 10 pages, 8 figures

  47. RMVPE: A Robust Model for Vocal Pitch Estimation in Polyphonic Music

    Authors: Haojie Wei, Xueke Cao, Tangpeng Dan, Yueguo Chen

    Abstract: Vocal pitch is an important high-level feature in music audio processing. However, extracting vocal pitch in polyphonic music is more challenging due to the presence of accompaniment. To eliminate the influence of the accompaniment, most previous methods adopt music source separation models to obtain clean vocals from polyphonic music before predicting vocal pitches. As a result, the performance o… ▽ More

    Submitted 27 June, 2023; v1 submitted 27 June, 2023; originally announced June 2023.

    Comments: This paper has been accepted by INTERSPEECH 2023

  48. arXiv:2305.15635  [pdf

    cs.RO eess.SY

    Vehicle-in-Virtual-Environment (VVE)

    Authors: Xincheng Cao, Haochong Chen, Sukru Yaren Gelbal, Bilin Aksun-Guvenc, Levent Guvenc

    Abstract: The current approach to connected and autonomous driving function development and evaluation uses model-in-the-loop simulation, hardware-in-the-loop simulation, and limited proving ground work followed by public road deployment of beta version of software and technology. The rest of the road users are involuntarily forced into taking part in the development and evaluation of these connected and au… ▽ More

    Submitted 24 May, 2023; originally announced May 2023.

  49. arXiv:2305.10925  [pdf, other

    cs.CV eess.IV

    Unsupervised Hyperspectral Pansharpening via Low-rank Diffusion Model

    Authors: Xiangyu Rui, Xiangyong Cao, Li Pang, Zeyu Zhu, Zongsheng Yue, Deyu Meng

    Abstract: Hyperspectral pansharpening is a process of merging a high-resolution panchromatic (PAN) image and a low-resolution hyperspectral (LRHS) image to create a single high-resolution hyperspectral (HRHS) image. Existing Bayesian-based HS pansharpening methods require designing handcraft image prior to characterize the image features, and deep learning-based HS pansharpening methods usually require a la… ▽ More

    Submitted 19 November, 2023; v1 submitted 18 May, 2023; originally announced May 2023.

  50. arXiv:2305.07774  [pdf, other

    cs.CV eess.IV

    PanFlowNet: A Flow-Based Deep Network for Pan-sharpening

    Authors: Gang Yang, Xiangyong Cao, Wenzhe Xiao, Man Zhou, Aiping Liu, Xun chen, Deyu Meng

    Abstract: Pan-sharpening aims to generate a high-resolution multispectral (HRMS) image by integrating the spectral information of a low-resolution multispectral (LRMS) image with the texture details of a high-resolution panchromatic (PAN) image. It essentially inherits the ill-posed nature of the super-resolution (SR) task that diverse HRMS images can degrade into an LRMS image. However, existing deep learn… ▽ More

    Submitted 16 May, 2023; v1 submitted 12 May, 2023; originally announced May 2023.