Zum Hauptinhalt springen

Showing 1–50 of 139 results for author: Ren, H

Searching in archive eess. Search in all archives.
.
  1. arXiv:2408.16455  [pdf, other

    cs.IT eess.SP

    Addressing the Mutual Interference in Uplink ISAC Receivers: A Projection Method

    Authors: Zhiyuan Yu, Hong Ren, Cunhua Pan, Gui Zhou, Ruizhe Wang, Mengyu Liu, Jiangzhou Wang

    Abstract: Dual function radar and communication (DFRC) is a promising research direction within integrated sensing and communication (ISAC), improving hardware and spectrum efficiency by merging sensing and communication (S&C) functionalities into a shared platform. However, the DFRC receiver (DFRC-R) is tasked with both uplink communication signal detection and simultaneously target-related parameter estim… ▽ More

    Submitted 29 August, 2024; originally announced August 2024.

    Comments: 5 pages, 3 figures, accepted by IEEE WCL

  2. arXiv:2408.16277  [pdf

    eess.IV cs.CV

    Fine-grained Classification of Port Wine Stains Using Optical Coherence Tomography Angiography

    Authors: Xiaofeng Deng, Defu Chen, Bowen Liu, Xiwan Zhang, Haixia Qiu, Wu Yuan, Hongliang Ren

    Abstract: Accurate classification of port wine stains (PWS, vascular malformations present at birth), is critical for subsequent treatment planning. However, the current method of classifying PWS based on the external skin appearance rarely reflects the underlying angiopathological heterogeneity of PWS lesions, resulting in inconsistent outcomes with the common vascular-targeted photodynamic therapy (V-PDT)… ▽ More

    Submitted 29 August, 2024; originally announced August 2024.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  3. arXiv:2408.04593  [pdf, other

    cs.CV cs.RO eess.IV

    SAM 2 in Robotic Surgery: An Empirical Evaluation for Robustness and Generalization in Surgical Video Segmentation

    Authors: Jieming Yu, An Wang, Wenzhen Dong, Mengya Xu, Mobarakol Islam, Jie Wang, Long Bai, Hongliang Ren

    Abstract: The recent Segment Anything Model (SAM) 2 has demonstrated remarkable foundational competence in semantic segmentation, with its memory mechanism and mask decoder further addressing challenges in video tracking and object occlusion, thereby achieving superior results in interactive segmentation for both images and videos. Building upon our previous empirical studies, we further explore the zero-sh… ▽ More

    Submitted 8 August, 2024; originally announced August 2024.

    Comments: Empirical study. Previous work "SAM Meets Robotic Surgery" is accessible at: arXiv:2308.07156

  4. arXiv:2407.03228  [pdf, other

    eess.SP

    Movable Antenna-enabled RIS-aided Integrated Sensing and Communication

    Authors: Haisu Wu, Hong Ren, Cunhua Pan

    Abstract: In this paper, we investigate a movable antenna (MA)-aided integrated sensing and communication (ISAC) system, where a reconfigurable intelligent surface (RIS) is employed to enhance wireless communication and sensing performance in dead zones. Specifically, this paper aims to maximize the minimum beampattern gain at the RIS by jointly optimizing beamforming matrix at the base station (BS), the re… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: 14 pages, 11 figures

  5. arXiv:2406.16876  [pdf, other

    eess.SP

    Near-Field Mobile Tracking: A Framework of Using XL-RIS Information

    Authors: Tuo Wu, Cunhua Pan, Kangda Zhi, Junteng Yao, Hong Ren, Maged Elkashlan, Chau Yuen

    Abstract: This paper introduces a novel mobile tracking framework leveraging the high-dimensional signal received from extremely large-scale (XL) reconfigurable intelligent surfaces (RIS). This received signal, named XL-RIS information, has a much larger data dimension and therefore offers a richer feature set compared to the traditional base station (BS) received signal, i.e., BS information, enabling more… ▽ More

    Submitted 5 August, 2024; v1 submitted 3 April, 2024; originally announced June 2024.

  6. arXiv:2406.13705  [pdf, other

    eess.IV cs.AI cs.CV

    EndoUIC: Promptable Diffusion Transformer for Unified Illumination Correction in Capsule Endoscopy

    Authors: Long Bai, Tong Chen, Qiaozhi Tan, Wan Jun Nah, Yanheng Li, Zhicheng He, Sishen Yuan, Zhen Chen, Jinlin Wu, Mobarakol Islam, Zhen Li, Hongbin Liu, Hongliang Ren

    Abstract: Wireless Capsule Endoscopy (WCE) is highly valued for its non-invasive and painless approach, though its effectiveness is compromised by uneven illumination from hardware constraints and complex internal dynamics, leading to overexposed or underexposed images. While researchers have discussed the challenges of low-light enhancement in WCE, the issue of correcting for different exposure levels rema… ▽ More

    Submitted 8 July, 2024; v1 submitted 19 June, 2024; originally announced June 2024.

    Comments: To appear in MICCAI 2024. Code and dataset availability: https://github.com/longbai1006/EndoUIC

  7. arXiv:2405.18775  [pdf, other

    eess.SP

    Synchronization Scheme based on Pilot Sharing in Cell-Free Massive MIMO Systems

    Authors: Qihao Peng, Hong Ren, Zhendong Peng, Cunhua Pan, Maged Elkashlan, Dongming Wang, Jiangzhou Wang, Xiaohu You

    Abstract: This paper analyzes the impact of pilot-sharing scheme on synchronization performance in a scenario where several slave access points (APs) with uncertain carrier frequency offsets (CFOs) and timing offsets (TOs) share a common pilot sequence. First, the Cramer-Rao bound (CRB) with pilot contamination is derived for pilot-pairing estimation. Furthermore, a maximum likelihood algorithm is presented… ▽ More

    Submitted 30 May, 2024; v1 submitted 29 May, 2024; originally announced May 2024.

    Comments: Submitted to IEEE Journal for pos

  8. arXiv:2405.10948  [pdf, other

    cs.CV cs.AI cs.RO eess.IV

    Surgical-LVLM: Learning to Adapt Large Vision-Language Model for Grounded Visual Question Answering in Robotic Surgery

    Authors: Guankun Wang, Long Bai, Wan Jun Nah, Jie Wang, Zhaoxi Zhang, Zhen Chen, Jinlin Wu, Mobarakol Islam, Hongbin Liu, Hongliang Ren

    Abstract: Recent advancements in Surgical Visual Question Answering (Surgical-VQA) and related region grounding have shown great promise for robotic and medical applications, addressing the critical need for automated methods in personalized surgical mentorship. However, existing models primarily provide simple structured answers and struggle with complex scenarios due to their limited capability in recogni… ▽ More

    Submitted 22 March, 2024; originally announced May 2024.

  9. arXiv:2405.10550  [pdf, other

    eess.IV cs.CV

    LighTDiff: Surgical Endoscopic Image Low-Light Enhancement with T-Diffusion

    Authors: Tong Chen, Qingcheng Lyu, Long Bai, Erjian Guo, Huxin Gao, Xiaoxiao Yang, Hongliang Ren, Luping Zhou

    Abstract: Advances in endoscopy use in surgeries face challenges like inadequate lighting. Deep learning, notably the Denoising Diffusion Probabilistic Model (DDPM), holds promise for low-light image enhancement in the medical field. However, DDPMs are computationally demanding and slow, limiting their practical medical applications. To bridge this gap, we propose a lightweight DDPM, dubbed LighTDiff. It ad… ▽ More

    Submitted 17 May, 2024; originally announced May 2024.

  10. arXiv:2405.08672  [pdf, other

    eess.IV cs.CV

    EndoDAC: Efficient Adapting Foundation Model for Self-Supervised Depth Estimation from Any Endoscopic Camera

    Authors: Beilei Cui, Mobarakol Islam, Long Bai, An Wang, Hongliang Ren

    Abstract: Depth estimation plays a crucial role in various tasks within endoscopic surgery, including navigation, surface reconstruction, and augmented reality visualization. Despite the significant achievements of foundation models in vision tasks, including depth estimation, their direct application to the medical domain often results in suboptimal performance. This highlights the need for efficient adapt… ▽ More

    Submitted 14 May, 2024; originally announced May 2024.

    Comments: early accepted by MICCAI 2024

  11. arXiv:2405.07218  [pdf, other

    physics.med-ph eess.SY

    Chained Flexible Capsule Endoscope: Unraveling the Conundrum of Size Limitations and Functional Integration for Gastrointestinal Transitivity

    Authors: Sishen Yuan, Guang Li, Baijia Liang, Lailu Li, Qingzhuo Zheng, Shuang Song, Zhen Li, Hongliang Ren

    Abstract: Capsule endoscopes, predominantly serving diagnostic functions, provide lucid internal imagery but are devoid of surgical or therapeutic capabilities. Consequently, despite lesion detection, physicians frequently resort to traditional endoscopic or open surgical procedures for treatment, resulting in more complex, potentially risky interventions. To surmount these limitations, this study introduce… ▽ More

    Submitted 12 May, 2024; originally announced May 2024.

  12. arXiv:2405.07216  [pdf, other

    eess.SY

    Magnetic-Guided Flexible Origami Robot toward Long-Term Phototherapy of H. pylori in the Stomach

    Authors: Sishen Yuan, Baijia Liang, Po Wa Wong, Mingjing Xu, Chi Hsuan Li, Zhen Li, Hongliang Ren

    Abstract: Helicobacter pylori, a pervasive bacterial infection associated with gastrointestinal disorders such as gastritis, peptic ulcer disease, and gastric cancer, impacts approximately 50% of the global population. The efficacy of standard clinical eradication therapies is diminishing due to the rise of antibiotic-resistant strains, necessitating alternative treatment strategies. Photodynamic therapy (P… ▽ More

    Submitted 12 May, 2024; originally announced May 2024.

    Comments: IEEE ICRA 2024

  13. arXiv:2405.06946  [pdf, other

    eess.SP

    Two-Timescale Design for Reconfigurable Intelligent Surface-Aided URLLC

    Authors: Qihao Peng, Hong Ren, Cunhua Pan, Maged Elkashlan, Ana Garcia Armada, Petar Popovski

    Abstract: In this paper, to tackle the blockage issue in massive multiple-input-multiple-output (mMIMO) systems, a reconfigurable intelligent surface (RIS) is seamlessly deployed to support devices with ultra-reliable and low-latency communications (URLLC). The transmission power of the base station and the phase shifts of the RIS are jointly devised to maximize the weighted sum rate while considering the s… ▽ More

    Submitted 11 May, 2024; originally announced May 2024.

    Comments: This paper has already been accepted by IEEE Transactions on Wireless Communications

  14. arXiv:2405.00734  [pdf, other

    eess.SP cs.AI cs.LG

    EEG-MACS: Manifold Attention and Confidence Stratification for EEG-based Cross-Center Brain Disease Diagnosis under Unreliable Annotations

    Authors: Zhenxi Song, Ruihan Qin, Huixia Ren, Zhen Liang, Yi Guo, Min Zhang, Zhiguo Zhang

    Abstract: Cross-center data heterogeneity and annotation unreliability significantly challenge the intelligent diagnosis of diseases using brain signals. A notable example is the EEG-based diagnosis of neurodegenerative diseases, which features subtler abnormal neural dynamics typically observed in small-group settings. To advance this area, in this work, we introduce a transferable framework employing Mani… ▽ More

    Submitted 13 August, 2024; v1 submitted 29 April, 2024; originally announced May 2024.

  15. arXiv:2404.15854  [pdf, other

    cs.CR cs.LG cs.SD eess.AS

    CLAD: Robust Audio Deepfake Detection Against Manipulation Attacks with Contrastive Learning

    Authors: Haolin Wu, Jing Chen, Ruiying Du, Cong Wu, Kun He, Xingcan Shang, Hao Ren, Guowen Xu

    Abstract: The increasing prevalence of audio deepfakes poses significant security threats, necessitating robust detection methods. While existing detection systems exhibit promise, their robustness against malicious audio manipulations remains underexplored. To bridge the gap, we undertake the first comprehensive study of the susceptibility of the most widely adopted audio deepfake detectors to manipulation… ▽ More

    Submitted 24 April, 2024; originally announced April 2024.

    Comments: Submitted to IEEE TDSC

  16. arXiv:2404.15469  [pdf, other

    cs.IT eess.SP

    NMBEnet: Efficient Near-field mmWave Beam Training for Multiuser OFDM Systems Using Sub-6 GHz Pilots

    Authors: Wang Liu, Cunhua Pan, Hong Ren, Cheng-Xiang Wang, Jiangzhou Wang, Xiaohu You

    Abstract: Combining millimetre-wave (mmWave) communications with an extremely large-scale antenna array (ELAA) presents a promising avenue for meeting the spectral efficiency demands of the future sixth generation (6G) mobile communications. However, beam training for mmWave ELAA systems is challenged by excessive pilot overheads as well as insufficient accuracy, as the huge near-field codebook has to be ac… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

  17. Multi-view X-ray Image Synthesis with Multiple Domain Disentanglement from CT Scans

    Authors: Lixing Tan, Shuang Song, Kangneng Zhou, Chengbo Duan, Lanying Wang, Huayang Ren, Linlin Liu, Wei Zhang, Ruoxiu Xiao

    Abstract: X-ray images play a vital role in the intraoperative processes due to their high resolution and fast imaging speed and greatly promote the subsequent segmentation, registration and reconstruction. However, over-dosed X-rays superimpose potential risks to human health to some extent. Data-driven algorithms from volume scans to X-ray images are restricted by the scarcity of paired X-ray and volume d… ▽ More

    Submitted 30 July, 2024; v1 submitted 18 April, 2024; originally announced April 2024.

    Comments: 13 pages, 10 figures, ACM MM2024

  18. arXiv:2404.10640  [pdf, other

    eess.IV

    Adapting SAM for Surgical Instrument Tracking and Segmentation in Endoscopic Submucosal Dissection Videos

    Authors: Jieming Yu, Long Bai, Guankun Wang, An Wang, Xiaoxiao Yang, Huxin Gao, Hongliang Ren

    Abstract: The precise tracking and segmentation of surgical instruments have led to a remarkable enhancement in the efficiency of surgical procedures. However, the challenge lies in achieving accurate segmentation of surgical instruments while minimizing the need for manual annotation and reducing the time required for the segmentation process. To tackle this, we propose a novel framework for surgical instr… ▽ More

    Submitted 8 August, 2024; v1 submitted 16 April, 2024; originally announced April 2024.

    Comments: IEEE ICRA 2024 C4SR+ Workshop

  19. arXiv:2403.16529  [pdf, other

    eess.SP

    Exploit High-Dimensional RIS Information to Localization: What Is the Impact of Faulty Element?

    Authors: Tuo Wu, Cunhua Pan, Kangda Zhi, Hong Ren, Maged Elkashlan, Cheng-Xiang Wang, Robert Schober, Xiaohu You

    Abstract: This paper proposes a novel localization algorithm using the reconfigurable intelligent surface (RIS) received signal, i.e., RIS information. Compared with BS received signal, i.e., BS information, RIS information offers higher dimension and richer feature set, thereby providing an enhanced capacity to distinguish positions of the mobile users (MUs). Additionally, we address a practical scenario w… ▽ More

    Submitted 28 May, 2024; v1 submitted 25 March, 2024; originally announced March 2024.

    Comments: 17 pages, Accepted by IEEE JSAC

  20. arXiv:2403.16521  [pdf, other

    eess.SP

    Employing High-Dimensional RIS Information for RIS-aided Localization Systems

    Authors: Tuo Wu, Cunhua Pan, Kangda Zhi, Hong Ren, Maged Elkashlan, Jiangzhou Wang, Chau Yuen

    Abstract: Reconfigurable intelligent surface (RIS)-aided localization systems have attracted extensive research attention due to their accuracy enhancement capabilities. However, most studies primarily utilized the base stations (BS) received signal, i.e., BS information, for localization algorithm design, neglecting the potential of RIS received signal, i.e., RIS information. Compared with BS information,… ▽ More

    Submitted 16 May, 2024; v1 submitted 25 March, 2024; originally announced March 2024.

  21. arXiv:2403.11061  [pdf, other

    eess.SP

    Beamforming Design for Double-Active-RIS-aided Communication Systems with Inter-Excitation

    Authors: Boshi Wang, Cunhua Pan, Hong Ren, Zhiyuan Yu, Yang Zhang, Mengyu Liu, Gui Zhou

    Abstract: In this paper, we investigate a double-active-reconfigurable intelligent surface (RIS)-aided downlink wireless communication system, where a multi-antenna base station (BS) serves multiple single-antenna users with both double reflection and single reflection links. Due to the signal amplification capability of active RISs, they can effectively mitigate the multiplicative fading effect. However, t… ▽ More

    Submitted 23 August, 2024; v1 submitted 16 March, 2024; originally announced March 2024.

  22. arXiv:2403.10009  [pdf

    eess.IV cs.CV

    Temporal-spatial Adaptation of Promptable SAM Enhance Accuracy and Generalizability of cine CMR Segmentation

    Authors: Zhennong Chen, Sekeun Kim, Hui Ren, Quanzheng Li, Xiang Li

    Abstract: Accurate myocardium segmentation across all phases in one cardiac cycle in cine cardiac magnetic resonance (CMR) scans is crucial for comprehensively cardiac function analysis. Despite advancements in deep learning (DL) for automatic cine CMR segmentation, generalizability on unseen data remains a significant challenge. Recently, the segment-anything-model (SAM) has been invented as a segmentation… ▽ More

    Submitted 15 July, 2024; v1 submitted 15 March, 2024; originally announced March 2024.

    Comments: 10 pages, 3 figures

  23. arXiv:2403.09058  [pdf, ps, other

    cs.IT eess.SP

    Performance Analysis on RIS-Aided Wideband Massive MIMO OFDM Systems with Low-Resolution ADCs

    Authors: Xianzhe Chen, Hong Ren, Cunhua Pan, Zhangjie Peng, Kangda Zhi, Yong Liu, Xiaojun Xi, Ana Garcia Armada, Cheng-Xiang Wang

    Abstract: This paper investigates a reconfigurable intelligent surface (RIS)-aided wideband massive multiple-input multiple-output (MIMO) orthogonal frequency division multiplexing (OFDM) system with low-resolution analog-to-digital converters (ADCs). Frequency-selective Rician fading channels are considered, and the OFDM data transmission process is presented in time domain. This paper derives the closed-f… ▽ More

    Submitted 13 March, 2024; originally announced March 2024.

  24. arXiv:2403.06940  [pdf, other

    eess.IV cs.LG q-bio.QM

    Conditional Score-Based Diffusion Model for Cortical Thickness Trajectory Prediction

    Authors: Qing Xiao, Siyeop Yoon, Hui Ren, Matthew Tivnan, Lichao Sun, Quanzheng Li, Tianming Liu, Yu Zhang, Xiang Li

    Abstract: Alzheimer's Disease (AD) is a neurodegenerative condition characterized by diverse progression rates among individuals, with changes in cortical thickness (CTh) closely linked to its progression. Accurately forecasting CTh trajectories can significantly enhance early diagnosis and intervention strategies, providing timely care. However, the longitudinal data essential for these studies often suffe… ▽ More

    Submitted 11 March, 2024; originally announced March 2024.

  25. arXiv:2403.04269  [pdf, other

    cs.IT eess.SP

    Secure MIMO Communication Relying on Movable Antennas

    Authors: Jun Tang, Cunhua Pan, Yang Zhang, Hong Ren, Kezhi Wang

    Abstract: This paper considers a movable antenna (MA)-aided secure multiple-input multiple-output (MIMO) communication system consisting of a base station (BS), a legitimate information receiver (IR) and an eavesdropper (Eve), where the BS is equipped with MAs to enhance the system's physical layer security (PLS). Specifically, we aim to maximize the secrecy rate (SR) by jointly optimizing the transmit prec… ▽ More

    Submitted 7 March, 2024; originally announced March 2024.

  26. arXiv:2403.02942  [pdf, other

    cs.IT eess.SP

    Channel Estimation for mmWave MIMO-OFDM Systems in High-Mobility Scenarios: Instantaneous Model or Statistical Model?

    Authors: Ruizhe Wang, Hong Ren, Cunhua Pan, Gui Zhou, Ruisong Weng, Jiangzhou Wang

    Abstract: Classical linear statistical models, like the first-order auto-regressive (AR) model, are commonly used as channel model in high-mobility scenarios. However, compared to sub-6G, the effect of Doppler frequency shifts is more significant at millimeter wave (mmWave) frequencies, and the effectiveness of the statistical channel model in high-mobility mmWave scenarios should be reconsidered. In this p… ▽ More

    Submitted 27 August, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

  27. arXiv:2403.02028  [pdf, other

    eess.SP

    Target Localization in Cooperative ISAC Systems: A Scheme Based on 5G NR OFDM Signals

    Authors: Zhenkun Zhang, Hong Ren, Cunhua Pan, Sheng Hong, Dongming Wang, Jiangzhou Wang, Xiaohu You

    Abstract: The integration of sensing capabilities into communication systems, by sharing physical resources, has a significant potential for reducing spectrum, hardware, and energy costs while inspiring innovative applications. Cooperative networks, in particular, are expected to enhance sensing services by enlarging the coverage area and enriching sensing measurements, thus improving the service availabili… ▽ More

    Submitted 22 August, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

  28. arXiv:2402.13798  [pdf, other

    eess.SY

    AFPR-CIM: An Analog-Domain Floating-Point RRAM-based Compute-In-Memory Architecture with Dynamic Range Adaptive FP-ADC

    Authors: Haobo Liu, Zhengyang Qian, Wei Wu, Hongwei Ren, Zhiwei Liu, Leibin Ni

    Abstract: Power consumption has become the major concern in neural network accelerators for edge devices. The novel non-volatile-memory (NVM) based computing-in-memory (CIM) architecture has shown great potential for better energy efficiency. However, most of the recent NVM-CIM solutions mainly focus on fixed-point calculation and are not applicable to floating-point (FP) processing. In this paper, we propo… ▽ More

    Submitted 21 February, 2024; originally announced February 2024.

    Comments: Accepted by DATE 2024

  29. arXiv:2402.13692  [pdf, other

    eess.SP

    Reconfigurable Intelligent Surface assisted Integrated Communication, Sensing, and Computation Systems

    Authors: Jiahua Wan, Hong Ren, Zhiyuan Yu, Zhenkun Zhang, Yang Zhang, Cunhua Pan, Jiangzhou Wang

    Abstract: This paper studies a mobile edge computing (MEC) assisted integrated sensing and communication (ISAC), where reconfigurable intelligent surface (RIS) is used to alleviate the attenuation of communication links during computational offloading. In this paradigm, the dual function radar and communication (DFRC)-enabled user equipments (UEs) simultaneously perform radar sensing and communication tasks… ▽ More

    Submitted 14 March, 2024; v1 submitted 21 February, 2024; originally announced February 2024.

  30. arXiv:2402.13597  [pdf, other

    cs.IT eess.SP

    Near-Field Multiuser Beam-Training for Extremely Large-Scale MIMO Systems

    Authors: Wang Liu, Cunhua Pan, Hong Ren, Jiangzhou Wang, Robert Schober, Lajos Hanzo

    Abstract: Extremely large-scale multiple-input multiple-output (XL-MIMO) systems are capable of improving spectral efficiency by employing far more antennas than conventional massive MIMO at the base station (BS). However, beam training in multiuser XL-MIMO systems is challenging. To tackle these issues, we conceive a three-phase graph neural network (GNN)-based beam training scheme for multiuser XL-MIMO sy… ▽ More

    Submitted 25 March, 2024; v1 submitted 21 February, 2024; originally announced February 2024.

    Comments: submitted to IEEE

  31. arXiv:2402.05847  [pdf, other

    eess.SP

    Reconfigurable Intelligent Surface-Aided Dual-Function Radar and Communication Systems With MU-MIMO Communication

    Authors: Yasheng Jin, Hong Ren, Cunhua Pan, Zhiyuan Yu, Ruisong Weng, Boshi Wang, Gui Zhou, Yongchao He, Maged Elkashlan

    Abstract: In this paper, we investigate an reconfigurable intelligent surface (RIS)-aided integrated sensing and communication (ISAC) system. Our objective is to maximize the achievable sum rate of the multi-antenna communication users through the joint active and passive beamforming. {Specifically}, the weighted minimum mean-square error (WMMSE) method is { first} used to reformulate the original problem i… ▽ More

    Submitted 8 February, 2024; originally announced February 2024.

  32. arXiv:2402.04532  [pdf, other

    eess.SP

    Joint Beamforming Design for Double Active RIS-assisted Radar-Communication Coexistence Systems

    Authors: Mengyu Liu, Hong Ren, Cunhua Pan, Boshi Wang, Zhiyuan Yu, Ruisong Weng, Kangda Zhi, Yongchao He

    Abstract: Integrated sensing and communication (ISAC) technology has been considered as one of the key candidate technologies in the next-generation wireless communication systems. However, when radar and communication equipment coexist in the same system, i.e. radar-communication coexistence (RCC), the interference from communication systems to radar can be large and cannot be ignored. Recently, reconfigur… ▽ More

    Submitted 6 February, 2024; originally announced February 2024.

  33. arXiv:2402.02122  [pdf, other

    eess.SP

    Secure Wireless Communication in Active RIS-Assisted DFRC System

    Authors: Yang Zhang, Hong Ren, Cunhua Pan, Boshi Wang, Zhiyuan Yu, Ruisong Weng, Tuo Wu, Yongchao He

    Abstract: This work considers a dual-functional radar and communication (DFRC) system with an active reconfigurable intelligent surface (RIS) and a potential eavesdropper. Our purpose is to maximize the secrecy rate (SR) of the system by jointly designing the beamforming matrix at the DFRC base station (BS) and the reflecting coefficients at the active RIS, subject to the signal-to-interference-plus-noise-r… ▽ More

    Submitted 3 February, 2024; originally announced February 2024.

    Comments: 13 pages, 9 figures

  34. arXiv:2401.07446  [pdf, other

    cs.IT eess.SP

    Quantized RIS-aided mmWave Massive MIMO Channel Estimation with Uniform Planar Arrays

    Authors: Ruizhe Wang, Hong Ren, Cunhua Pan, Shi Jin, Petar Popovski, Jiangzhou Wang

    Abstract: In this paper, we investigate a cascaded channel estimation method for a millimeter wave (mmWave) massive multiple-input multiple-output (MIMO) system aided by a reconfigurable intelligent surface (RIS) with the BS equipped with low-resolution analog-to-digital converters (ADCs), where the BS and the RIS are both equipped with a uniform planar array (UPA). Due to the sparse property of mmWave chan… ▽ More

    Submitted 14 January, 2024; originally announced January 2024.

  35. DISCOVER: 2-D Multiview Summarization of Optical Coherence Tomography Angiography for Automatic Diabetic Retinopathy Diagnosis

    Authors: Mostafa El Habib Daho, Yihao Li, Rachid Zeghlache, Hugo Le Boité, Pierre Deman, Laurent Borderie, Hugang Ren, Niranchana Mannivanan, Capucine Lepicard, Béatrice Cochener, Aude Couturier, Ramin Tadayoni, Pierre-Henri Conze, Mathieu Lamard, Gwenolé Quellec

    Abstract: Diabetic Retinopathy (DR), an ocular complication of diabetes, is a leading cause of blindness worldwide. Traditionally, DR is monitored using Color Fundus Photography (CFP), a widespread 2-D imaging modality. However, DR classifications based on CFP have poor predictive power, resulting in suboptimal DR management. Optical Coherence Tomography Angiography (OCTA) is a recent 3-D imaging modality o… ▽ More

    Submitted 10 January, 2024; originally announced January 2024.

    Journal ref: Artificial Intelligence in Medicine 2024, 102803

  36. arXiv:2312.05832  [pdf, other

    cs.CV eess.IV

    Spatial-wise Dynamic Distillation for MLP-like Efficient Visual Fault Detection of Freight Trains

    Authors: Yang Zhang, Huilin Pan, Mingying Li, An Wang, Yang Zhou, Hongliang Ren

    Abstract: Despite the successful application of convolutional neural networks (CNNs) in object detection tasks, their efficiency in detecting faults from freight train images remains inadequate for implementation in real-world engineering scenarios. Existing modeling shortcomings of spatial invariance and pooling layers in conventional CNNs often ignore the neglect of crucial global information, resulting i… ▽ More

    Submitted 10 December, 2023; originally announced December 2023.

    Comments: 10 pages, 6 figures

  37. arXiv:2310.09937  [pdf, other

    eess.IV eess.SP

    Joint Sparse Representations and Coupled Dictionary Learning in Multi-Source Heterogeneous Image Pseudo-color Fusion

    Authors: Long Bai, Shilong Yao, Kun Gao, Yanjun Huang, Ruijie Tang, Hong Yan, Max Q. -H. Meng, Hongliang Ren

    Abstract: Considering that Coupled Dictionary Learning (CDL) method can obtain a reasonable linear mathematical relationship between resource images, we propose a novel CDL-based Synthetic Aperture Radar (SAR) and multispectral pseudo-color fusion method. Firstly, the traditional Brovey transform is employed as a pre-processing method on the paired SAR and multispectral images. Then, CDL is used to capture… ▽ More

    Submitted 15 October, 2023; originally announced October 2023.

    Comments: To appear in IEEE Sensors Journal

  38. arXiv:2308.07156  [pdf, other

    eess.IV cs.CV

    SAM Meets Robotic Surgery: An Empirical Study on Generalization, Robustness and Adaptation

    Authors: An Wang, Mobarakol Islam, Mengya Xu, Yang Zhang, Hongliang Ren

    Abstract: The Segment Anything Model (SAM) serves as a fundamental model for semantic segmentation and demonstrates remarkable generalization capabilities across a wide range of downstream scenarios. In this empirical study, we examine SAM's robustness and zero-shot generalizability in the field of robotic surgery. We comprehensively explore different scenarios, including prompted and unprompted situations,… ▽ More

    Submitted 14 August, 2023; originally announced August 2023.

    Comments: Accepted as Oral Presentation at MedAGI Workshop - MICCAI 2023 1st International Workshop on Foundation Models for General Medical AI. arXiv admin note: substantial text overlap with arXiv:2304.14674

  39. arXiv:2308.02845  [pdf, other

    eess.IV cs.CV cs.RO

    Landmark Detection using Transformer Toward Robot-assisted Nasal Airway Intubation

    Authors: Tianhang Liu, Hechen Li, Long Bai, Yanan Wu, An Wang, Mobarakol Islam, Hongliang Ren

    Abstract: Robot-assisted airway intubation application needs high accuracy in locating targets and organs. Two vital landmarks, nostrils and glottis, can be detected during the intubation to accommodate the stages of nasal intubation. Automated landmark detection can provide accurate localization and quantitative evaluation. The Detection Transformer (DeTR) leads object detectors to a new paradigm with long… ▽ More

    Submitted 5 August, 2023; originally announced August 2023.

    Comments: ICBIR 2023 (Best Student Paper Award). Code availability: https://github.com/ConorLTH/airway_intubation_landmarks_detection

  40. arXiv:2307.02514  [pdf, other

    eess.AS cs.AI cs.SD

    Exploring Multimodal Approaches for Alzheimer's Disease Detection Using Patient Speech Transcript and Audio Data

    Authors: Hongmin Cai, Xiaoke Huang, Zhengliang Liu, Wenxiong Liao, Haixing Dai, Zihao Wu, Dajiang Zhu, Hui Ren, Quanzheng Li, Tianming Liu, Xiang Li

    Abstract: Alzheimer's disease (AD) is a common form of dementia that severely impacts patient health. As AD impairs the patient's language understanding and expression ability, the speech of AD patients can serve as an indicator of this disease. This study investigates various methods for detecting AD using patients' speech and transcripts data from the DementiaBank Pitt database. The proposed approach invo… ▽ More

    Submitted 5 July, 2023; originally announced July 2023.

  41. arXiv:2307.02452  [pdf, other

    eess.IV cs.CV cs.RO

    LLCaps: Learning to Illuminate Low-Light Capsule Endoscopy with Curved Wavelet Attention and Reverse Diffusion

    Authors: Long Bai, Tong Chen, Yanan Wu, An Wang, Mobarakol Islam, Hongliang Ren

    Abstract: Wireless capsule endoscopy (WCE) is a painless and non-invasive diagnostic tool for gastrointestinal (GI) diseases. However, due to GI anatomical constraints and hardware manufacturing limitations, WCE vision signals may suffer from insufficient illumination, leading to a complicated screening and examination procedure. Deep learning-based low-light image enhancement (LLIE) in the medical field gr… ▽ More

    Submitted 22 July, 2023; v1 submitted 5 July, 2023; originally announced July 2023.

    Comments: To appear in MICCAI 2023. Code availability: https://github.com/longbai1006/LLCaps

  42. arXiv:2306.16285  [pdf, other

    eess.IV cs.CV

    Generalizing Surgical Instruments Segmentation to Unseen Domains with One-to-Many Synthesis

    Authors: An Wang, Mobarakol Islam, Mengya Xu, Hongliang Ren

    Abstract: Despite their impressive performance in various surgical scene understanding tasks, deep learning-based methods are frequently hindered from deploying to real-world surgical applications for various causes. Particularly, data collection, annotation, and domain shift in-between sites and patients are the most common obstacles. In this work, we mitigate data-related issues by efficiently leveraging… ▽ More

    Submitted 28 June, 2023; originally announced June 2023.

    Comments: First two authors contributed equally. Accepted by IROS2023

  43. arXiv:2306.03581  [pdf

    eess.SY

    Optimal sizing of solar photovoltaic and lithium battery storage to reduce grid electricity reliance in buildings

    Authors: Han Kun Ren, Malcolm McCulloch, David Wallom

    Abstract: In alignment with the Paris Agreement, the city of Oxford in the UK aims to become carbon neutral by 2040. Renewable energy help achieve this target by reducing the reliance on carbon-intensive grid electricity. This research seeks to optimally size solar photovoltaic and lithium battery storage systems, reducing Oxford's grid electricity reliance in buildings. The analysis starts with modeling th… ▽ More

    Submitted 6 June, 2023; originally announced June 2023.

    Comments: 10 pages, 8 figures, published in the conference of ECEEE 2022 Summer Study on energy efficiency: agents of change

    Report number: 8-096-22

    Journal ref: ECEEE 2022 Summer Study on energy efficiency: agents of change, (2022), 1199-1208, ECEEE

  44. arXiv:2306.03511  [pdf, other

    eess.IV cs.CV

    Curriculum-Based Augmented Fourier Domain Adaptation for Robust Medical Image Segmentation

    Authors: An Wang, Mobarakol Islam, Mengya Xu, Hongliang Ren

    Abstract: Accurate and robust medical image segmentation is fundamental and crucial for enhancing the autonomy of computer-aided diagnosis and intervention systems. Medical data collection normally involves different scanners, protocols, and populations, making domain adaptation (DA) a highly demanding research field to alleviate model degradation in the deployment site. To preserve the model performance ac… ▽ More

    Submitted 6 June, 2023; originally announced June 2023.

    Comments: Work under review. First three authors contributed equally

  45. arXiv:2306.00451  [pdf, other

    eess.IV cs.CV

    S$^2$ME: Spatial-Spectral Mutual Teaching and Ensemble Learning for Scribble-supervised Polyp Segmentation

    Authors: An Wang, Mengya Xu, Yang Zhang, Mobarakol Islam, Hongliang Ren

    Abstract: Fully-supervised polyp segmentation has accomplished significant triumphs over the years in advancing the early diagnosis of colorectal cancer. However, label-efficient solutions from weak supervision like scribbles are rarely explored yet primarily meaningful and demanding in medical practice due to the expensiveness and scarcity of densely-annotated polyp data. Besides, various deployment issues… ▽ More

    Submitted 1 June, 2023; originally announced June 2023.

    Comments: MICCAI 2023 Early Acceptance

  46. arXiv:2305.11686  [pdf, other

    eess.IV cs.CV cs.RO

    Domain Adaptive Sim-to-Real Segmentation of Oropharyngeal Organs Towards Robot-assisted Intubation

    Authors: Guankun Wang, Tian-Ao Ren, Jiewen Lai, Long Bai, Hongliang Ren

    Abstract: Robotic-assisted tracheal intubation requires the robot to distinguish anatomical features like an experienced physician using deep-learning techniques. However, real datasets of oropharyngeal organs are limited due to patient privacy issues, making it challenging to train deep-learning models for accurate image segmentation. We hereby consider generating a new data modality through a virtual envi… ▽ More

    Submitted 27 June, 2023; v1 submitted 19 May, 2023; originally announced May 2023.

    Comments: Extended abstract in IEEE ICRA 2023 Workshop (New Evolutions in Surgical Robotics: Embracing Multimodal Imaging Guidance, Intelligence, and Bio-inspired Mechanisms). arXiv admin note: text overlap with arXiv:2305.10883

  47. arXiv:2305.10883  [pdf, other

    cs.AI cs.CV eess.IV

    Domain Adaptive Sim-to-Real Segmentation of Oropharyngeal Organs

    Authors: Guankun Wang, Tian-Ao Ren, Jiewen Lai, Long Bai, Hongliang Ren

    Abstract: Video-assisted transoral tracheal intubation (TI) necessitates using an endoscope that helps the physician insert a tracheal tube into the glottis instead of the esophagus. The growing trend of robotic-assisted TI would require a medical robot to distinguish anatomical features like an experienced physician which can be imitated by utilizing supervised deep-learning techniques. However, the real d… ▽ More

    Submitted 27 July, 2023; v1 submitted 18 May, 2023; originally announced May 2023.

    Comments: The manuscript is accepted by Medical & Biological Engineering & Computing. Code and dataset: https://github.com/gkw0010/EISOST-Sim2Real-Dataset-Release

  48. arXiv:2304.14674  [pdf, other

    eess.IV cs.CV cs.RO

    SAM Meets Robotic Surgery: An Empirical Study in Robustness Perspective

    Authors: An Wang, Mobarakol Islam, Mengya Xu, Yang Zhang, Hongliang Ren

    Abstract: Segment Anything Model (SAM) is a foundation model for semantic segmentation and shows excellent generalization capability with the prompts. In this empirical study, we investigate the robustness and zero-shot generalizability of the SAM in the domain of robotic surgery in various settings of (i) prompted vs. unprompted; (ii) bounding box vs. points-based prompt; (iii) generalization under corrupt… ▽ More

    Submitted 28 April, 2023; originally announced April 2023.

    Comments: Work under active progress

  49. arXiv:2304.09974  [pdf, other

    cs.CV cs.AI eess.IV

    SurgicalGPT: End-to-End Language-Vision GPT for Visual Question Answering in Surgery

    Authors: Lalithkumar Seenivasan, Mobarakol Islam, Gokul Kannan, Hongliang Ren

    Abstract: Advances in GPT-based large language models (LLMs) are revolutionizing natural language processing, exponentially increasing its use across various domains. Incorporating uni-directional attention, these autoregressive LLMs can generate long and coherent paragraphs. However, for visual question answering (VQA) tasks that require both vision and language processing, models with bi-directional atten… ▽ More

    Submitted 22 July, 2023; v1 submitted 19 April, 2023; originally announced April 2023.

    Comments: The manuscript is accepted in MICCAI 2023. Code are available at: https://github.com/lalithjets/SurgicalGPT

  50. arXiv:2304.00172  [pdf, other

    eess.SP

    Performance Analysis and Low-Complexity Design for XL-MIMO with Near-Field Spatial Non-Stationarities

    Authors: Kangda Zhi, Cunhua Pan, Hong Ren, Kok Keong Chai, Cheng-Xiang Wang, Robert Schober, Xiaohu You

    Abstract: Extremely large-scale multiple-input multiple-output (XL-MIMO) is capable of supporting extremely high system capacities with large numbers of users. In this work, we build a framework for the analysis and low-complexity design of XL-MIMO in the near field with spatial non-stationarities. Specifically, we first analyze the theoretical performance of discrete-aperture XL-MIMO using an electromagnet… ▽ More

    Submitted 12 October, 2023; v1 submitted 31 March, 2023; originally announced April 2023.

    Comments: 18 pages. Accepted by IEEE JSAC