Zum Hauptinhalt springen

Showing 1–50 of 114 results for author: Shi, Z

Searching in archive eess. Search in all archives.
.
  1. arXiv:2408.05645  [pdf

    eess.IV cs.CV cs.LG

    BeyondCT: A deep learning model for predicting pulmonary function from chest CT scans

    Authors: Kaiwen Geng, Zhiyi Shi, Xiaoyan Zhao, Alaa Ali, Jing Wang, Joseph Leader, Jiantao Pu

    Abstract: Abstract Background: Pulmonary function tests (PFTs) and computed tomography (CT) imaging are vital in diagnosing, managing, and monitoring lung diseases. A common issue in practice is the lack of access to recorded pulmonary functions despite available chest CT scans. Purpose: To develop and validate a deep learning algorithm for predicting pulmonary function directly from chest CT scans. M… ▽ More

    Submitted 10 August, 2024; originally announced August 2024.

    Comments: 5 tables, 7 figures,22 pages

  2. arXiv:2408.02095  [pdf, other

    cs.IT eess.SP

    Secure Semantic Communications: From Perspective of Physical Layer Security

    Authors: Yongkang Li, Zheng Shi, Han Hu, Yaru Fu, Hong Wang, Hongjiang Lei

    Abstract: Semantic communications have been envisioned as a potential technique that goes beyond Shannon paradigm. Unlike modern communications that provide bit-level security, the eaves-dropping of semantic communications poses a significant risk of potentially exposing intention of legitimate user. To address this challenge, a novel deep neural network (DNN) enabled secure semantic communication (DeepSSC)… ▽ More

    Submitted 4 August, 2024; originally announced August 2024.

  3. arXiv:2407.20518  [pdf, other

    eess.IV cs.AI cs.CV

    High-Resolution Spatial Transcriptomics from Histology Images using HisToSGE

    Authors: Zhiceng Shi, Shuailin Xue, Fangfang Zhu, Wenwen Min

    Abstract: Spatial transcriptomics (ST) is a groundbreaking genomic technology that enables spatial localization analysis of gene expression within tissue sections. However, it is significantly limited by high costs and sparse spatial resolution. An alternative, more cost-effective strategy is to use deep learning methods to predict high-density gene expression profiles from histological images. However, exi… ▽ More

    Submitted 29 July, 2024; originally announced July 2024.

  4. arXiv:2407.17902  [pdf, other

    eess.AS

    Multi-Stage Face-Voice Association Learning with Keynote Speaker Diarization

    Authors: Ruijie Tao, Zhan Shi, Yidi Jiang, Duc-Tuan Truong, Eng-Siong Chng, Massimo Alioto, Haizhou Li

    Abstract: The human brain has the capability to associate the unknown person's voice and face by leveraging their general relationship, referred to as ``cross-modal speaker verification''. This task poses significant challenges due to the complex relationship between the modalities. In this paper, we propose a ``Multi-stage Face-voice Association Learning with Keynote Speaker Diarization''~(MFV-KSD) framewo… ▽ More

    Submitted 25 July, 2024; originally announced July 2024.

  5. arXiv:2407.10427  [pdf, other

    eess.IV cs.CV

    Transformer for Multitemporal Hyperspectral Image Unmixing

    Authors: Hang Li, Qiankun Dong, Xueshuo Xie, Xia Xu, Tao Li, Zhenwei Shi

    Abstract: Multitemporal hyperspectral image unmixing (MTHU) holds significant importance in monitoring and analyzing the dynamic changes of surface. However, compared to single-temporal unmixing, the multitemporal approach demands comprehensive consideration of information across different phases, rendering it a greater challenge. To address this challenge, we propose the Multitemporal Hyperspectral Image U… ▽ More

    Submitted 15 July, 2024; originally announced July 2024.

  6. arXiv:2407.08216  [pdf, other

    eess.IV cs.AI cs.CV q-bio.QM

    Multimodal contrastive learning for spatial gene expression prediction using histology images

    Authors: Wenwen Min, Zhiceng Shi, Jun Zhang, Jun Wan, Changmiao Wang

    Abstract: In recent years, the advent of spatial transcriptomics (ST) technology has unlocked unprecedented opportunities for delving into the complexities of gene expression patterns within intricate biological systems. Despite its transformative potential, the prohibitive cost of ST technology remains a significant barrier to its widespread adoption in large-scale studies. An alternative, more cost-effect… ▽ More

    Submitted 11 July, 2024; originally announced July 2024.

    Comments: BIB, Code: https://github.com/shizhiceng/mclSTExp

  7. arXiv:2407.00987  [pdf, other

    cs.NI eess.SY

    Exploiting Dependency-Aware Priority Adjustment for Mixed-Criticality TSN Flow Scheduling

    Authors: Miao Guo, Yifei Sun, Chaojie Gu, Shibo He, Zhiguo Shi

    Abstract: Time-Sensitive Networking (TSN) serves as a one-size-fits-all solution for mixed-criticality communication, in which flow scheduling is vital to guarantee real-time transmissions. Traditional approaches statically assign priorities to flows based on their associated applications, resulting in significant queuing delays. In this paper, we observe that assigning different priorities to a flow leads… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: This paper has been accepted by IWQoS'24

  8. arXiv:2406.19043  [pdf

    eess.IV cs.AI cs.CV cs.DB

    CMRxRecon2024: A Multi-Modality, Multi-View K-Space Dataset Boosting Universal Machine Learning for Accelerated Cardiac MRI

    Authors: Zi Wang, Fanwen Wang, Chen Qin, Jun Lyu, Ouyang Cheng, Shuo Wang, Yan Li, Mengyao Yu, Haoyu Zhang, Kunyuan Guo, Zhang Shi, Qirong Li, Ziqiang Xu, Yajing Zhang, Hao Li, Sha Hua, Binghua Chen, Longyu Sun, Mengting Sun, Qin Li, Ying-Hua Chu, Wenjia Bai, Jing Qin, Xiahai Zhuang, Claudia Prieto , et al. (7 additional authors not shown)

    Abstract: Cardiac magnetic resonance imaging (MRI) has emerged as a clinically gold-standard technique for diagnosing cardiac diseases, thanks to its ability to provide diverse information with multiple modalities and anatomical views. Accelerated cardiac MRI is highly expected to achieve time-efficient and patient-friendly imaging, and then advanced image reconstruction approaches are required to recover h… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: 19 pages, 3 figures, 2 tables

  9. arXiv:2406.18993  [pdf, ps, other

    eess.SP

    Interference Cancellation Based Neural Receiver for Superimposed Pilot in Multi-Layer Transmission

    Authors: Han Xiao, Wenqiang Tian, Shi Jin, Wendong Liu, Jia Shen, Zhihua Shi, Zhi Zhang

    Abstract: In this paper, an interference cancellation based neural receiver for superimposed pilot (SIP) in multi-layer transmission is proposed, where the data and pilot are non-orthogonally superimposed in the same time-frequency resource. Specifically, to deal with the intra-layer and inter-layer interference of SIP under multi-layer transmission, the interference cancellation with superimposed symbol ai… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

  10. arXiv:2404.15284  [pdf, other

    eess.SP cs.AI

    Global 4D Ionospheric STEC Prediction based on DeepONet for GNSS Rays

    Authors: Dijia Cai, Zenghui Shi, Haiyang Fu, Huan Liu, Hongyi Qian, Yun Sui, Feng Xu, Ya-Qiu Jin

    Abstract: The ionosphere is a vitally dynamic charged particle region in the Earth's upper atmosphere, playing a crucial role in applications such as radio communication and satellite navigation. The Slant Total Electron Contents (STEC) is an important parameter for characterizing wave propagation, representing the integrated electron density along the ray of radio signals passing through the ionosphere. Th… ▽ More

    Submitted 12 March, 2024; originally announced April 2024.

  11. arXiv:2404.07956  [pdf, other

    cs.LG cs.AI cs.RO eess.SY math.OC

    Lyapunov-stable Neural Control for State and Output Feedback: A Novel Formulation

    Authors: Lujie Yang, Hongkai Dai, Zhouxing Shi, Cho-Jui Hsieh, Russ Tedrake, Huan Zhang

    Abstract: Learning-based neural network (NN) control policies have shown impressive empirical performance in a wide range of tasks in robotics and control. However, formal (Lyapunov) stability guarantees over the region-of-attraction (ROA) for NN controllers with nonlinear dynamical systems are challenging to obtain, and most existing approaches rely on expensive solvers such as sums-of-squares (SOS), mixed… ▽ More

    Submitted 4 June, 2024; v1 submitted 11 April, 2024; originally announced April 2024.

    Comments: Paper accepted by ICML 2024

  12. arXiv:2404.01082  [pdf, other

    eess.IV

    The state-of-the-art in Cardiac MRI Reconstruction: Results of the CMRxRecon Challenge in MICCAI 2023

    Authors: Jun Lyu, Chen Qin, Shuo Wang, Fanwen Wang, Yan Li, Zi Wang, Kunyuan Guo, Cheng Ouyang, Michael Tänzer, Meng Liu, Longyu Sun, Mengting Sun, Qin Li, Zhang Shi, Sha Hua, Hao Li, Zhensen Chen, Zhenlin Zhang, Bingyu Xin, Dimitris N. Metaxas, George Yiasemis, Jonas Teuwen, Liping Zhang, Weitian Chen, Yidong Zhao , et al. (25 additional authors not shown)

    Abstract: Cardiac MRI, crucial for evaluating heart structure and function, faces limitations like slow imaging and motion artifacts. Undersampling reconstruction, especially data-driven algorithms, has emerged as a promising solution to accelerate scans and enhance imaging performance using highly under-sampled data. Nevertheless, the scarcity of publicly available cardiac k-space datasets and evaluation p… ▽ More

    Submitted 16 April, 2024; v1 submitted 1 April, 2024; originally announced April 2024.

    Comments: 25 pages, 17 figures

  13. arXiv:2404.00863  [pdf, other

    eess.AS

    Voice Conversion Augmentation for Speaker Recognition on Defective Datasets

    Authors: Ruijie Tao, Zhan Shi, Yidi Jiang, Tianchi Liu, Haizhou Li

    Abstract: Modern speaker recognition system relies on abundant and balanced datasets for classification training. However, diverse defective datasets, such as partially-labelled, small-scale, and imbalanced datasets, are common in real-world applications. Previous works usually studied specific solutions for each scenario from the algorithm perspective. However, the root cause of these problems lies in data… ▽ More

    Submitted 31 March, 2024; originally announced April 2024.

    Comments: 5 pages

  14. arXiv:2403.20198  [pdf, other

    cs.IT eess.SY

    Minimizing End-to-End Latency for Joint Source-Channel Coding Systems

    Authors: Kaiyi Chi, Qianqian Yang, Yuanchao Shu, Zhaohui Yang, Zhiguo Shi

    Abstract: While existing studies have highlighted the advantages of deep learning (DL)-based joint source-channel coding (JSCC) schemes in enhancing transmission efficiency, they often overlook the crucial aspect of resource management during the deployment phase. In this paper, we propose an approach to minimize the transmission latency in an uplink JSCC-based system. We first analyze the correlation betwe… ▽ More

    Submitted 29 March, 2024; originally announced March 2024.

    Comments: 7 Pages, 5 Figures, accepted by 2024 IEEE ICC Workshop

  15. arXiv:2403.18134  [pdf, other

    eess.IV cs.CV

    Integrative Graph-Transformer Framework for Histopathology Whole Slide Image Representation and Classification

    Authors: Zhan Shi, Jingwei Zhang, Jun Kong, Fusheng Wang

    Abstract: In digital pathology, the multiple instance learning (MIL) strategy is widely used in the weakly supervised histopathology whole slide image (WSI) classification task where giga-pixel WSIs are only labeled at the slide level. However, existing attention-based MIL approaches often overlook contextual information and intrinsic spatial relationships between neighboring tissue tiles, while graph-based… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

  16. arXiv:2403.13562  [pdf, other

    eess.SY

    Augmented LRFS-based Filter: Holistic Tracking of Group Objects

    Authors: Chaoqun Yang, Xiaowei Liang, Zhiguo Shi, Heng Zhang, Xianghui Cao

    Abstract: This paper addresses the problem of group target tracking (GTT), wherein multiple closely spaced targets within a group pose a coordinated motion. To improve the tracking performance, the labeled random finite sets (LRFSs) theory is adopted, and this paper develops a new kind of LRFSs, i.e., augmented LRFSs, which introduces group information into the definition of LRFSs. Specifically, for each el… ▽ More

    Submitted 19 August, 2024; v1 submitted 20 March, 2024; originally announced March 2024.

  17. arXiv:2403.01093  [pdf, other

    eess.SP

    Variational Bayesian Learning Based Localization and Channel Reconstruction in RIS-aided Systems

    Authors: Yunfei Li, Yiting Luo, Xianda Wu, Zheng Shi, Shaodan Ma, Guanghua Yang

    Abstract: The emerging immersive and autonomous services have posed stringent requirements on both communications and localization. By considering the great potential of reconfigurable intelligent surface (RIS), this paper focuses on the joint channel estimation and localization for RIS-aided wireless systems. As opposed to existing works that treat channel estimation and localization independently, this pa… ▽ More

    Submitted 1 March, 2024; originally announced March 2024.

  18. arXiv:2401.15619  [pdf, ps, other

    eess.SP

    A semidefinite programming approach for robust elliptic localization

    Authors: Wenxin Xiong, Jiajun He, Zhang-Lei Shi, Keyuan Hu, Hing Cheung So, Chi-Sing Leung

    Abstract: This short communication addresses the problem of elliptic localization with outlier measurements, whose occurrences are prevalent in various location-enabled applications and can significantly compromise the positioning performance if not adequately handled. In contrast to the reliance on $M$-estimation adopted in the majority of existing solutions, we take a different path, specifically explorin… ▽ More

    Submitted 28 January, 2024; originally announced January 2024.

  19. arXiv:2401.15564  [pdf

    eess.SY cs.AI

    Design of UAV flight state recognition and trajectory prediction system based on trajectory feature construction

    Authors: Xingyu Zhou, Zhuoyong Shi

    Abstract: With the impact of artificial intelligence on the traditional UAV industry, autonomous UAV flight has become a current hot research field. Based on the demand for research on critical technologies for autonomous flying UAVs, this paper addresses the field of flight state recognition and trajectory prediction of UAVs. This paper proposes a method to improve the accuracy of UAV trajectory prediction… ▽ More

    Submitted 27 January, 2024; originally announced January 2024.

  20. arXiv:2401.11960  [pdf, other

    cs.CV eess.IV

    Observation-Guided Meteorological Field Downscaling at Station Scale: A Benchmark and a New Method

    Authors: Zili Liu, Hao Chen, Lei Bai, Wenyuan Li, Keyan Chen, Zhengyi Wang, Wanli Ouyang, Zhengxia Zou, Zhenwei Shi

    Abstract: Downscaling (DS) of meteorological variables involves obtaining high-resolution states from low-resolution meteorological fields and is an important task in weather forecasting. Previous methods based on deep learning treat downscaling as a super-resolution task in computer vision and utilize high-resolution gridded meteorological fields as supervision to improve resolution at specific grid scales… ▽ More

    Submitted 22 January, 2024; originally announced January 2024.

  21. arXiv:2312.15575  [pdf, other

    eess.IV cs.CV cs.LG

    Neural Born Series Operator for Biomedical Ultrasound Computed Tomography

    Authors: Zhijun Zeng, Yihang Zheng, Youjia Zheng, Yubing Li, Zuoqiang Shi, He Sun

    Abstract: Ultrasound Computed Tomography (USCT) provides a radiation-free option for high-resolution clinical imaging. Despite its potential, the computationally intensive Full Waveform Inversion (FWI) required for tissue property reconstruction limits its clinical utility. This paper introduces the Neural Born Series Operator (NBSO), a novel technique designed to speed up wave simulations, thereby facilita… ▽ More

    Submitted 24 December, 2023; originally announced December 2023.

    ACM Class: I.4.5; J.3

  22. arXiv:2312.04377  [pdf, other

    cs.IT eess.SP

    HARQ-IR Aided Short Packet Communications: BLER Analysis and Throughput Maximization

    Authors: Fuchao He, Zheng Shi, Guanghua Yang, Xiaofan Li, Xinrong Ye, Shaodan Ma

    Abstract: This paper introduces hybrid automatic repeat request with incremental redundancy (HARQ-IR) to boost the reliability of short packet communications. The finite blocklength information theory and correlated decoding events tremendously preclude the analysis of average block error rate (BLER). Fortunately, the recursive form of average BLER motivates us to calculate its value through the trapezoidal… ▽ More

    Submitted 9 January, 2024; v1 submitted 7 December, 2023; originally announced December 2023.

    Comments: 13 pages, 10 figures

  23. arXiv:2311.02389  [pdf, other

    eess.SY cs.GT cs.RO

    Multiplayer Homicidal Chauffeur Reach-Avoid Games: A Pursuit Enclosure Function Approach

    Authors: Rui Yan, Xiaoming Duan, Rui Zou, Xin He, Zongying Shi, Francesco Bullo

    Abstract: This paper presents a multiplayer Homicidal Chauffeur reach-avoid differential game, which involves Dubins-car pursuers and simple-motion evaders. The goal of the pursuers is to cooperatively protect a planar convex region from the evaders, who strive to reach the region. We propose a cooperative strategy for the pursuers based on subgames for multiple pursuers against one evader and optimal task… ▽ More

    Submitted 22 December, 2023; v1 submitted 4 November, 2023; originally announced November 2023.

    Comments: 17 pages, 5 figures

  24. arXiv:2310.15548  [pdf, ps, other

    eess.SP

    Knowledge-driven Meta-learning for CSI Feedback

    Authors: Han Xiao, Wenqiang Tian, Wendong Liu, Jiajia Guo, Zhi Zhang, Shi Jin, Zhihua Shi, Li Guo, Jia Shen

    Abstract: Accurate and effective channel state information (CSI) feedback is a key technology for massive multiple-input and multiple-output systems. Recently, deep learning (DL) has been introduced for CSI feedback enhancement through massive collected training data and lengthy training time, which is quite costly and impractical for realistic deployment. In this article, a knowledge-driven meta-learning a… ▽ More

    Submitted 25 October, 2023; v1 submitted 24 October, 2023; originally announced October 2023.

    Comments: arXiv admin note: text overlap with arXiv:2301.13475

  25. arXiv:2310.10964  [pdf, other

    cs.IT eess.SP

    Spectral-Efficiency and Energy-Efficiency of Variable-Length XP-HARQ

    Authors: Jiahui Feng, Zheng Shi, Yaru Fu, Hong Wang, Guanghua Yang, Shaodan Ma

    Abstract: A variable-length cross-packet hybrid automatic repeat request (VL-XP-HARQ) is proposed to boost the spectral efficiency (SE) and the energy efficiency (EE) of communications. The SE is firstly derived in terms of the outage probabilities, with which the SE is proved to be upper bounded by the ergodic capacity (EC). Moreover, to facilitate the maximization of the SE, the asymptotic outage probabil… ▽ More

    Submitted 16 October, 2023; originally announced October 2023.

  26. arXiv:2310.06259  [pdf, other

    eess.IV cs.SD eess.AS

    Cross-modal Cognitive Consensus guided Audio-Visual Segmentation

    Authors: Zhaofeng Shi, Qingbo Wu, Fanman Meng, Linfeng Xu, Hongliang Li

    Abstract: Audio-Visual Segmentation (AVS) aims to extract the sounding object from a video frame, which is represented by a pixel-wise segmentation mask for application scenarios such as multi-modal video editing, augmented reality, and intelligent robot systems. The pioneering work conducts this task through dense feature-level audio-visual interaction, which ignores the dimension gap between different mod… ▽ More

    Submitted 17 July, 2024; v1 submitted 9 October, 2023; originally announced October 2023.

    Comments: Accepted by IEEE Transactions on Multimedia; 16 pages

    MSC Class: 68U10 ACM Class: I.4.6

  27. arXiv:2309.16372  [pdf, other

    cs.CV eess.IV

    Aperture Diffraction for Compact Snapshot Spectral Imaging

    Authors: Tao Lv, Hao Ye, Quan Yuan, Zhan Shi, Yibo Wang, Shuming Wang, Xun Cao

    Abstract: We demonstrate a compact, cost-effective snapshot spectral imaging system named Aperture Diffraction Imaging Spectrometer (ADIS), which consists only of an imaging lens with an ultra-thin orthogonal aperture mask and a mosaic filter sensor, requiring no additional physical footprint compared to common RGB cameras. Then we introduce a new optical design that each point in the object space is multip… ▽ More

    Submitted 27 September, 2023; originally announced September 2023.

    Comments: accepted by International Conference on Computer Vision (ICCV) 2023

  28. arXiv:2309.07141  [pdf

    eess.SP cs.AI cs.LG

    Design of Recognition and Evaluation System for Table Tennis Players' Motor Skills Based on Artificial Intelligence

    Authors: Zhuo-yong Shi, Ye-tao Jia, Ke-xin Zhang, Ding-han Wang, Long-meng Ji, Yong Wu

    Abstract: With the rapid development of electronic science and technology, the research on wearable devices is constantly updated, but for now, it is not comprehensive for wearable devices to recognize and analyze the movement of specific sports. Based on this, this paper improves wearable devices of table tennis sport, and realizes the pattern recognition and evaluation of table tennis players' motor skill… ▽ More

    Submitted 4 September, 2023; originally announced September 2023.

    Comments: 34pages, 16figures

    MSC Class: 93-01 ACM Class: G.1; H.4

  29. arXiv:2308.04304  [pdf, other

    cs.IT cs.CR cs.LG eess.IV

    The Model Inversion Eavesdropping Attack in Semantic Communication Systems

    Authors: Yuhao Chen, Qianqian Yang, Zhiguo Shi, Jiming Chen

    Abstract: In recent years, semantic communication has been a popular research topic for its superiority in communication efficiency. As semantic communication relies on deep learning to extract meaning from raw messages, it is vulnerable to attacks targeting deep learning models. In this paper, we introduce the model inversion eavesdropping attack (MIEA) to reveal the risk of privacy leaks in the semantic c… ▽ More

    Submitted 8 August, 2023; originally announced August 2023.

    Comments: Accepted by 2023 IEEE Global Communications Conference (GLOBECOM)

  30. arXiv:2308.02140  [pdf, ps, other

    cs.IT eess.SP

    Deep Reinforcement Learning Empowered Rate Selection of XP-HARQ

    Authors: Da Wu, Jiahui Feng, Zheng Shi, Hongjiang Lei, Guanghua Yang, Shaodan Ma

    Abstract: The complex transmission mechanism of cross-packet hybrid automatic repeat request (XP-HARQ) hinders its optimal system design. To overcome this difficulty, this letter attempts to use the deep reinforcement learning (DRL) to solve the rate selection problem of XP-HARQ over correlated fading channels. In particular, the long term average throughput (LTAT) is maximized by properly choosing the incr… ▽ More

    Submitted 4 August, 2023; originally announced August 2023.

  31. arXiv:2308.02131  [pdf, other

    cs.IT eess.SP

    Graph Convolutional Network Enabled Power-Constrained HARQ Strategy for URLLC

    Authors: Yi Chen, Zheng Shi, Hong Wang, Yaru Fu, Guanghua Yang, Shaodan Ma, Haichuan Ding

    Abstract: In this paper, a power-constrained hybrid automatic repeat request (HARQ) transmission strategy is developed to support ultra-reliable low-latency communications (URLLC). In particular, we aim to minimize the delivery latency of HARQ schemes over time-correlated fading channels, meanwhile ensuring the high reliability and limited power consumption. To ease the optimization, the simple asymptotic o… ▽ More

    Submitted 4 August, 2023; originally announced August 2023.

  32. arXiv:2307.13220  [pdf

    eess.IV cs.AI physics.med-ph

    One for Multiple: Physics-informed Synthetic Data Boosts Generalizable Deep Learning for Fast MRI Reconstruction

    Authors: Zi Wang, Xiaotong Yu, Chengyan Wang, Weibo Chen, Jiazheng Wang, Ying-Hua Chu, Hongwei Sun, Rushuai Li, Peiyong Li, Fan Yang, Haiwei Han, Taishan Kang, Jianzhong Lin, Chen Yang, Shufu Chang, Zhang Shi, Sha Hua, Yan Li, Juan Hu, Liuhong Zhu, Jianjun Zhou, Meijing Lin, Jiefeng Guo, Congbo Cai, Zhong Chen , et al. (3 additional authors not shown)

    Abstract: Magnetic resonance imaging (MRI) is a widely used radiological modality renowned for its radiation-free, comprehensive insights into the human body, facilitating medical diagnoses. However, the drawback of prolonged scan times hinders its accessibility. The k-space undersampling offers a solution, yet the resultant artifacts necessitate meticulous removal during image reconstruction. Although Deep… ▽ More

    Submitted 28 February, 2024; v1 submitted 24 July, 2023; originally announced July 2023.

    Comments: 38 pages, 19 figures, 5 tables

  33. arXiv:2306.13296  [pdf, other

    eess.SP

    Semantic-aware Transmission for Robust Point Cloud Classification

    Authors: Tianxiao Han, Kaiyi Chi, Qianqian Yang, Zhiguo Shi

    Abstract: As three-dimensional (3D) data acquisition devices become increasingly prevalent, the demand for 3D point cloud transmission is growing. In this study, we introduce a semantic-aware communication system for robust point cloud classification that capitalizes on the advantages of pre-trained Point-BERT models. Our proposed method comprises four main components: the semantic encoder, channel encoder,… ▽ More

    Submitted 23 June, 2023; originally announced June 2023.

    Comments: submitted to globecom 2023

  34. arXiv:2305.03546  [pdf, other

    eess.IV cs.CV

    Breast Cancer Immunohistochemical Image Generation: a Benchmark Dataset and Challenge Review

    Authors: Chuang Zhu, Shengjie Liu, Zekuan Yu, Feng Xu, Arpit Aggarwal, Germán Corredor, Anant Madabhushi, Qixun Qu, Hongwei Fan, Fangda Li, Yueheng Li, Xianchao Guan, Yongbing Zhang, Vivek Kumar Singh, Farhan Akram, Md. Mostafa Kamal Sarker, Zhongyue Shi, Mulan Jin

    Abstract: For invasive breast cancer, immunohistochemical (IHC) techniques are often used to detect the expression level of human epidermal growth factor receptor-2 (HER2) in breast tissue to formulate a precise treatment plan. From the perspective of saving manpower, material and time costs, directly generating IHC-stained images from Hematoxylin and Eosin (H&E) stained images is a valuable research direct… ▽ More

    Submitted 22 September, 2023; v1 submitted 5 May, 2023; originally announced May 2023.

    Comments: 12 pages, 12 figures, 2tables

  35. arXiv:2305.01871  [pdf

    physics.med-ph eess.IV

    Convolutional neural network-based single-shot speckle tracking for x-ray phase-contrast imaging

    Authors: Serena Qinyun Z. Shi, Nadav Shapira, Peter B. Noël, Sebastian Meyer

    Abstract: X-ray phase-contrast imaging offers enhanced sensitivity for weakly-attenuating materials, such as breast and brain tissue, but has yet to be widely implemented clinically due to high coherence requirements and expensive x-ray optics. Speckle-based phase contrast imaging has been proposed as an affordable and simple alternative; however, obtaining high-quality phase-contrast images requires accura… ▽ More

    Submitted 2 May, 2023; originally announced May 2023.

  36. arXiv:2304.12184  [pdf, other

    eess.SP cs.AI cs.IT cs.LG

    Active RIS-aided EH-NOMA Networks: A Deep Reinforcement Learning Approach

    Authors: Zhaoyuan Shi, Huabing Lu, Xianzhong Xie, Helin Yang, Chongwen Huang, Jun Cai, Zhiguo Ding

    Abstract: An active reconfigurable intelligent surface (RIS)-aided multi-user downlink communication system is investigated, where non-orthogonal multiple access (NOMA) is employed to improve spectral efficiency, and the active RIS is powered by energy harvesting (EH). The problem of joint control of the RIS's amplification matrix and phase shift matrix is formulated to maximize the communication success ra… ▽ More

    Submitted 11 April, 2023; originally announced April 2023.

  37. arXiv:2304.11341  [pdf, ps, other

    cs.IT eess.SP

    Performance Analysis and Optimal Design of HARQ-IR-Aided Terahertz Communications

    Authors: Ziyang Song, Zheng Shi, Jiaji Su, Qingping Dou, Guanghua Yang, Haichuan Ding, Shaodan Ma

    Abstract: Terahertz (THz) communications are envisioned to be a promising technology for 6G thanks to its broad bandwidth. However, the large path loss, antenna misalignment, and atmospheric influence of THz communications severely deteriorate its reliability. To address this, hybrid automatic repeat request (HARQ) is recognized as an effective technique to ensure reliable THz communications. This paper del… ▽ More

    Submitted 22 April, 2023; originally announced April 2023.

    Comments: Blockage, hybrid automatic repeat request (HARQ), outage probability, terahertz (THz) communications

  38. arXiv:2303.14095  [pdf, other

    cs.CV cs.RO eess.IV

    PanoVPR: Towards Unified Perspective-to-Equirectangular Visual Place Recognition via Sliding Windows across the Panoramic View

    Authors: Ze Shi, Hao Shi, Kailun Yang, Zhe Yin, Yining Lin, Kaiwei Wang

    Abstract: Visual place recognition has gained significant attention in recent years as a crucial technology in autonomous driving and robotics. Currently, the two main approaches are the perspective view retrieval (P2P) paradigm and the equirectangular image retrieval (E2E) paradigm. However, it is practical and natural to assume that users only have consumer-grade pinhole cameras to obtain query perspectiv… ▽ More

    Submitted 28 July, 2023; v1 submitted 24 March, 2023; originally announced March 2023.

    Comments: Accepted to ITSC 2023. Code and datasets will be made available at https://github.com/zafirshi/PanoVPR

  39. arXiv:2302.12662  [pdf, other

    eess.IV cs.CV

    FedDBL: Communication and Data Efficient Federated Deep-Broad Learning for Histopathological Tissue Classification

    Authors: Tianpeng Deng, Yanqi Huang, Guoqiang Han, Zhenwei Shi, Jiatai Lin, Qi Dou, Zaiyi Liu, Xiao-jing Guo, C. L. Philip Chen, Chu Han

    Abstract: Histopathological tissue classification is a fundamental task in computational pathology. Deep learning-based models have achieved superior performance but centralized training with data centralization suffers from the privacy leakage problem. Federated learning (FL) can safeguard privacy by keeping training samples locally, but existing FL-based frameworks require a large number of well-annotated… ▽ More

    Submitted 17 December, 2023; v1 submitted 24 February, 2023; originally announced February 2023.

  40. arXiv:2302.12004  [pdf

    cs.LG eess.SP

    Knowledge Distillation-based Information Sharing for Online Process Monitoring in Decentralized Manufacturing System

    Authors: Zhangyue Shi, Yuxuan Li, Chenang Liu

    Abstract: In advanced manufacturing, the incorporation of sensing technology provides an opportunity to achieve efficient in-situ process monitoring using machine learning methods. Meanwhile, the advances of information technologies also enable a connected and decentralized environment for manufacturing systems, making different manufacturing units in the system collaborate more closely. In a decentralized… ▽ More

    Submitted 25 July, 2023; v1 submitted 8 February, 2023; originally announced February 2023.

  41. arXiv:2302.02608  [pdf, ps, other

    cs.IT eess.SP

    Cooperative Task-Oriented Communication for Multi-Modal Data with Transmission Control

    Authors: Shiqi Wang, Qianqian Yang, Zhiguo Shi, Zhaohui Yang, Zhaoyang Zhang

    Abstract: Real-time intelligence applications in Internet of Things (IoT) environment depend on timely data communication. However, it is challenging to transmit and analyse massive data of various modalities. Recently proposed task-oriented communication methods based on deep learning have showed its superiority in communication efficiency. In this paper, we propose a cooperative task-oriented communicatio… ▽ More

    Submitted 6 February, 2023; originally announced February 2023.

  42. arXiv:2301.13475  [pdf, ps, other

    eess.SP

    A Knowledge-Driven Meta-Learning Method for CSI Feedback

    Authors: Han Xiao, Wenqiang Tian, Wendong Liu, Zhi Zhang, Zhihua Shi, Li Guo, Jia Shen

    Abstract: Accurate and effective channel state information (CSI) feedback is a key technology for massive multiple-input and multiple-output (MIMO) systems. Recently, deep learning (DL) has been introduced to enhance CSI feedback in massive MIMO application, where the massive collected training data and lengthy training time are costly and impractical for realistic deployment. In this paper, a knowledge-dri… ▽ More

    Submitted 31 January, 2023; originally announced January 2023.

  43. arXiv:2211.10287  [pdf, other

    eess.IV

    Generative Model Based Highly Efficient Semantic Communication Approach for Image Transmission

    Authors: Tianxiao Han, Jiancheng Tang, Qianqian Yang, Yiping Duan, Zhaoyang Zhang, Zhiguo Shi

    Abstract: Deep learning (DL) based semantic communication methods have been explored to transmit images efficiently in recent years. In this paper, we propose a generative model based semantic communication to further improve the efficiency of image transmission and protect private information. In particular, the transmitter extracts the interpretable latent representation from the original image by a gener… ▽ More

    Submitted 18 November, 2022; originally announced November 2022.

    Comments: submitted to ICASSP 2023

  44. arXiv:2211.00648  [pdf

    eess.IV physics.optics

    Non-line-of-sight imaging with arbitrary illumination and detection pattern

    Authors: Xintong Liu, Jianyu Wang, Leping Xiao, Zuoqiang Shi, Xing Fu, Lingyun Qiu

    Abstract: Non-line-of-sight (NLOS) imaging aims at reconstructing targets obscured from the direct line of sight. Existing NLOS imaging algorithms require dense measurements at rectangular grid points in a large area of the relay surface, which severely hinders their availability to variable relay scenarios in practical applications such as robotic vision, autonomous driving, rescue operations and remote se… ▽ More

    Submitted 1 November, 2022; originally announced November 2022.

    Comments: main article: 32 pages with 8 figures; supplementary information: 49 pages with 26 figures

  45. arXiv:2210.15903  [pdf, other

    eess.AS cs.SD eess.SP

    Speaker recognition with two-step multi-modal deep cleansing

    Authors: Ruijie Tao, Kong Aik Lee, Zhan Shi, Haizhou Li

    Abstract: Neural network-based speaker recognition has achieved significant improvement in recent years. A robust speaker representation learns meaningful knowledge from both hard and easy samples in the training set to achieve good performance. However, noisy samples (i.e., with wrong labels) in the training set induce confusion and cause the network to learn the incorrect representation. In this paper, we… ▽ More

    Submitted 28 October, 2022; originally announced October 2022.

    Comments: 5 pages, 3 figures

  46. arXiv:2210.06385  [pdf, other

    eess.IV cs.CV physics.med-ph

    The Extreme Cardiac MRI Analysis Challenge under Respiratory Motion (CMRxMotion)

    Authors: Shuo Wang, Chen Qin, Chengyan Wang, Kang Wang, Haoran Wang, Chen Chen, Cheng Ouyang, Xutong Kuang, Chengliang Dai, Yuanhan Mo, Zhang Shi, Chenchen Dai, Xinrong Chen, He Wang, Wenjia Bai

    Abstract: The quality of cardiac magnetic resonance (CMR) imaging is susceptible to respiratory motion artifacts. The model robustness of automated segmentation techniques in face of real-world respiratory motion artifacts is unclear. This manuscript describes the design of extreme cardiac MRI analysis challenge under respiratory motion (CMRxMotion Challenge). The challenge aims to establish a public benchm… ▽ More

    Submitted 12 October, 2022; originally announced October 2022.

    Comments: Summary of CMRxMotion Challenge Design

  47. arXiv:2209.13638  [pdf, ps, other

    cs.IT eess.SP

    Outage Probability Analysis of HARQ-Aided Terahertz Communications

    Authors: Ziyang Song, Zheng Shi, Qingping Dou, Guanghua Yang, Yunfei Li, Shaodan Ma

    Abstract: Although terahertz (THz) communications can provide mobile broadband services, it usually has a large path loss and is vulnerable to antenna misalignment. This significantly degrades the reception reliability. To address this issue, the hybrid automatic repeat request (HARQ) is proposed to further enhance the reliability of THz communications. This paper provides an in-depth investigation on the o… ▽ More

    Submitted 24 September, 2022; originally announced September 2022.

  48. arXiv:2209.11382  [pdf, ps, other

    cs.IT eess.SP

    Zero-Forcing Based Downlink Virtual MIMO-NOMA Communications in IoT Networks

    Authors: Zheng Shi, Hong Wang, Yaru Fu, Guanghua Yang, Shaodan Ma, Fen Hou, Theodoros A. Tsiftsis

    Abstract: To support massive connectivity and boost spectral efficiency for internet of things (IoT), a downlink scheme combining virtual multiple-input multiple-output (MIMO) and nonorthogonal multiple access (NOMA) is proposed. All the single-antenna IoT devices in each cluster cooperate with each other to establish a virtual MIMO entity, and multiple independent data streams are requested by each cluster… ▽ More

    Submitted 22 September, 2022; originally announced September 2022.

  49. Ziv-Zakai Bound for DOAs Estimation

    Authors: Zongyu Zhang, Zhiguo Shi, Yujie Gu

    Abstract: Lower bounds on the mean square error (MSE) play an important role in evaluating the direction-of-arrival (DOA) estimation performance. Among numerous bounds for DOA estimation, the local Cramer-Rao bound (CRB) is only tight asymptotically. By contrast, the existing global tight Ziv-Zakai bound (ZZB) is appropriate for evaluating the single source estimation only. In this paper, we derive an expli… ▽ More

    Submitted 6 December, 2022; v1 submitted 8 September, 2022; originally announced September 2022.

  50. arXiv:2209.01424  [pdf, ps, other

    eess.SP

    Dynamic Write-Voltage Design and Read-Voltage Optimization for MLC NAND Flash Memory

    Authors: Runbin Cai, Yi Fang, Zhifang Shi, Lin Dai, Guojun Han

    Abstract: To mitigate the impact of noise and interference on multi-level-cell (MLC) flash memory with the use of low-density parity-check (LDPC) codes, we propose a dynamic write-voltage design scheme considering the asymmetric property of raw bit error rate (RBER), which can obtain the optimal write voltage by minimizing a cost function. In order to further improve the decoding performance of flash memory… ▽ More

    Submitted 3 September, 2022; originally announced September 2022.

    Comments: 12 pages, 6 figures, submitted to China Communication