Zum Hauptinhalt springen

Showing 1–50 of 87 results for author: Zhou, M

Searching in archive eess. Search in all archives.
.
  1. arXiv:2408.15667  [pdf, other

    cs.CV cs.LG cs.SD eess.AS

    Towards reliable respiratory disease diagnosis based on cough sounds and vision transformers

    Authors: Qian Wang, Zhaoyang Bu, Jiaxuan Mao, Wenyu Zhu, Jingya Zhao, Wei Du, Guochao Shi, Min Zhou, Si Chen, Jieming Qu

    Abstract: Recent advancements in deep learning techniques have sparked performance boosts in various real-world applications including disease diagnosis based on multi-modal medical data. Cough sound data-based respiratory disease (e.g., COVID-19 and Chronic Obstructive Pulmonary Disease) diagnosis has also attracted much attention. However, existing works usually utilise traditional machine learning or dee… ▽ More

    Submitted 28 August, 2024; originally announced August 2024.

  2. arXiv:2407.16634  [pdf, other

    eess.IV cs.AI cs.CV cs.HC

    Knowledge-driven AI-generated data for accurate and interpretable breast ultrasound diagnoses

    Authors: Haojun Yu, Youcheng Li, Nan Zhang, Zihan Niu, Xuantong Gong, Yanwen Luo, Quanlin Wu, Wangyan Qin, Mengyuan Zhou, Jie Han, Jia Tao, Ziwei Zhao, Di Dai, Di He, Dong Wang, Binghui Tang, Ling Huo, Qingli Zhu, Yong Wang, Liwei Wang

    Abstract: Data-driven deep learning models have shown great capabilities to assist radiologists in breast ultrasound (US) diagnoses. However, their effectiveness is limited by the long-tail distribution of training data, which leads to inaccuracies in rare cases. In this study, we address a long-standing challenge of improving the diagnostic model performance on rare cases using long-tailed data. Specifical… ▽ More

    Submitted 23 July, 2024; originally announced July 2024.

  3. arXiv:2404.16484  [pdf, other

    cs.CV eess.IV

    Real-Time 4K Super-Resolution of Compressed AVIF Images. AIS 2024 Challenge Survey

    Authors: Marcos V. Conde, Zhijun Lei, Wen Li, Cosmin Stejerean, Ioannis Katsavounidis, Radu Timofte, Kihwan Yoon, Ganzorig Gankhuyag, Jiangtao Lv, Long Sun, Jinshan Pan, Jiangxin Dong, Jinhui Tang, Zhiyuan Li, Hao Wei, Chenyang Ge, Dongyang Zhang, Tianle Liu, Huaian Chen, Yi Jin, Menghan Zhou, Yiqiang Yan, Si Gao, Biao Wu, Shaoli Liu , et al. (50 additional authors not shown)

    Abstract: This paper introduces a novel benchmark as part of the AIS 2024 Real-Time Image Super-Resolution (RTSR) Challenge, which aims to upscale compressed images from 540p to 4K resolution (4x factor) in real-time on commercial GPUs. For this, we use a diverse test set containing a variety of 4K images ranging from digital art to gaming and photography. The images are compressed using the modern AVIF cod… ▽ More

    Submitted 25 April, 2024; originally announced April 2024.

    Comments: CVPR 2024, AI for Streaming (AIS) Workshop

  4. arXiv:2404.12804  [pdf, other

    cs.CV eess.IV

    Linearly-evolved Transformer for Pan-sharpening

    Authors: Junming Hou, Zihan Cao, Naishan Zheng, Xuan Li, Xiaoyu Chen, Xinyang Liu, Xiaofeng Cong, Man Zhou, Danfeng Hong

    Abstract: Vision transformer family has dominated the satellite pan-sharpening field driven by the global-wise spatial information modeling mechanism from the core self-attention ingredient. The standard modeling rules within these promising pan-sharpening methods are to roughly stack the transformer variants in a cascaded manner. Despite the remarkable advancement, their success may be at the huge cost of… ▽ More

    Submitted 19 April, 2024; originally announced April 2024.

    Comments: 10 pages

  5. arXiv:2404.10343  [pdf, other

    cs.CV eess.IV

    The Ninth NTIRE 2024 Efficient Super-Resolution Challenge Report

    Authors: Bin Ren, Yawei Li, Nancy Mehta, Radu Timofte, Hongyuan Yu, Cheng Wan, Yuxin Hong, Bingnan Han, Zhuoyuan Wu, Yajun Zou, Yuqing Liu, Jizhe Li, Keji He, Chao Fan, Heng Zhang, Xiaolin Zhang, Xuanwu Yin, Kunlong Zuo, Bohao Liao, Peizhe Xia, Long Peng, Zhibo Du, Xin Di, Wangkai Li, Yang Wang , et al. (109 additional authors not shown)

    Abstract: This paper provides a comprehensive review of the NTIRE 2024 challenge, focusing on efficient single-image super-resolution (ESR) solutions and their outcomes. The task of this challenge is to super-resolve an input image with a magnification factor of x4 based on pairs of low and corresponding high-resolution images. The primary objective is to develop networks that optimize various aspects such… ▽ More

    Submitted 25 June, 2024; v1 submitted 16 April, 2024; originally announced April 2024.

    Comments: The report paper of NTIRE2024 Efficient Super-resolution, accepted by CVPRW2024

  6. arXiv:2403.15483  [pdf

    eess.SP cs.LG

    Rolling bearing fault diagnosis method based on generative adversarial enhanced multi-scale convolutional neural network model

    Authors: Maoxuan Zhou, Wei Kang, Kun He

    Abstract: In order to solve the problem that current convolutional neural networks can not capture the correlation features between the time domain signals of rolling bearings effectively, and the model accuracy is limited by the number and quality of samples, a rolling bearing fault diagnosis method based on generative adversarial enhanced multi-scale convolutional neural network model is proposed. Firstly… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

  7. arXiv:2403.00987  [pdf, other

    cs.MA cs.RO eess.SY

    Composite Distributed Learning and Synchronization of Nonlinear Multi-Agent Systems with Complete Uncertain Dynamics

    Authors: Emadodin Jandaghi, Dalton L. Stein, Adam Hoburg, Paolo Stegagno, Mingxi Zhou, Chengzhi Yuan

    Abstract: This paper addresses the problem of composite synchronization and learning control in a network of multi-agent robotic manipulator systems with heterogeneous nonlinear uncertainties under a leader-follower framework. A novel two-layer distributed adaptive learning control strategy is introduced, comprising a first-layer distributed cooperative estimator and a second-layer decentralized determinist… ▽ More

    Submitted 9 May, 2024; v1 submitted 1 March, 2024; originally announced March 2024.

  8. arXiv:2401.00160  [pdf, other

    eess.SP

    Acceleration Estimation of Signal Propagation Path Length Changes for Wireless Sensing

    Authors: Jiacheng Wang, Hongyang Du, Dusit Niyato, Mu Zhou, Jiawen Kang, H. Vincent Poor

    Abstract: As indoor applications grow in diversity, wireless sensing, vital in areas like localization and activity recognition, is attracting renewed interest. Indoor wireless sensing relies on signal processing, particularly channel state information (CSI) based signal parameter estimation. Nonetheless, regarding reflected signals induced by dynamic human targets, no satisfactory algorithm yet exists for… ▽ More

    Submitted 30 December, 2023; originally announced January 2024.

  9. arXiv:2312.09576  [pdf, other

    eess.IV cs.CV

    SegRap2023: A Benchmark of Organs-at-Risk and Gross Tumor Volume Segmentation for Radiotherapy Planning of Nasopharyngeal Carcinoma

    Authors: Xiangde Luo, Jia Fu, Yunxin Zhong, Shuolin Liu, Bing Han, Mehdi Astaraki, Simone Bendazzoli, Iuliana Toma-Dasu, Yiwen Ye, Ziyang Chen, Yong Xia, Yanzhou Su, Jin Ye, Junjun He, Zhaohu Xing, Hongqiu Wang, Lei Zhu, Kaixiang Yang, Xin Fang, Zhiwei Wang, Chan Woong Lee, Sang Joon Park, Jaehee Chun, Constantin Ulrich, Klaus H. Maier-Hein , et al. (17 additional authors not shown)

    Abstract: Radiation therapy is a primary and effective NasoPharyngeal Carcinoma (NPC) treatment strategy. The precise delineation of Gross Tumor Volumes (GTVs) and Organs-At-Risk (OARs) is crucial in radiation treatment, directly impacting patient prognosis. Previously, the delineation of GTVs and OARs was performed by experienced radiation oncologists. Recently, deep learning has achieved promising results… ▽ More

    Submitted 15 December, 2023; originally announced December 2023.

    Comments: A challenge report of SegRap2023 (organized in conjunction with MICCAI2023)

  10. arXiv:2312.04767  [pdf, other

    eess.SY

    Finite Horizon Reinforcement Learning in Solving Optimal Control of State-Dependent Switched Systems

    Authors: Mi Zhou

    Abstract: In this article, the deep deterministic policy gradient (DDPG) method is used to learn an optimal control policy of a multi-region state-dependent switched system. We observe good performance of this model-free method and explain it in a rigorous mathematical language. The performance of the learning-based methods is compared with the optimal solution given by vanilla differential dynamic programm… ▽ More

    Submitted 14 December, 2023; v1 submitted 7 December, 2023; originally announced December 2023.

  11. arXiv:2312.00951  [pdf, other

    cs.RO eess.SY

    AV4EV: Open-Source Modular Autonomous Electric Vehicle Platform for Making Mobility Research Accessible

    Authors: Zhijie Qiao, Mingyan Zhou, Zhijun Zhuang, Tejas Agarwal, Felix Jahncke, Po-Jen Wang, Jason Friedman, Hongyi Lai, Divyanshu Sahu, Tomáš Nagy, Martin Endler, Jason Schlessman, Rahul Mangharam

    Abstract: When academic researchers develop and validate autonomous driving algorithms, there is a challenge in balancing high-performance capabilities with the cost and complexity of the vehicle platform. Much of today's research on autonomous vehicles (AV) is limited to experimentation on expensive commercial vehicles that require large skilled teams to retrofit the vehicles and test them in dedicated fac… ▽ More

    Submitted 12 April, 2024; v1 submitted 1 December, 2023; originally announced December 2023.

    Comments: 6 pages, 5 figures

  12. arXiv:2311.03557  [pdf, other

    cs.LG cs.CV eess.IV

    Spatio-Temporal Similarity Measure based Multi-Task Learning for Predicting Alzheimer's Disease Progression using MRI Data

    Authors: Xulong Wang, Yu Zhang, Menghui Zhou, Tong Liu, Jun Qi, Po Yang

    Abstract: Identifying and utilising various biomarkers for tracking Alzheimer's disease (AD) progression have received many recent attentions and enable helping clinicians make the prompt decisions. Traditional progression models focus on extracting morphological biomarkers in regions of interest (ROIs) from MRI/PET images, such as regional average cortical thickness and regional volume. They are effective… ▽ More

    Submitted 6 November, 2023; originally announced November 2023.

  13. arXiv:2311.02691  [pdf, ps, other

    cs.IT eess.SP

    Age of Information Analysis for CR-NOMA Aided Uplink Systems with Randomly Arrived Packets

    Authors: Yanshi Sun, Yanglin Ye, Zhiguo Ding, Momiao Zhou, Lei Liu

    Abstract: This paper studies the application of cognitive radio inspired non-orthogonal multiple access (CR-NOMA) to reduce age of information (AoI) for uplink transmission. In particular, a time division multiple access (TDMA) based legacy network is considered, where each user is allocated with a dedicated time slot to transmit its status update information. The CR-NOMA is implemented as an add-on to the… ▽ More

    Submitted 5 November, 2023; originally announced November 2023.

  14. arXiv:2310.04722  [pdf, other

    cs.SD cs.AI eess.AS

    A Holistic Evaluation of Piano Sound Quality

    Authors: Monan Zhou, Shangda Wu, Shaohua Ji, Zijin Li, Wei Li

    Abstract: This paper aims to develop a holistic evaluation method for piano sound quality to assist in purchasing decisions. Unlike previous studies that focused on the effect of piano performance techniques on sound quality, this study evaluates the inherent sound quality of different pianos. To derive quality evaluation systems, the study uses subjective questionnaires based on a piano sound quality datas… ▽ More

    Submitted 7 October, 2023; originally announced October 2023.

  15. arXiv:2309.17315  [pdf, other

    eess.SY

    Data-Driven Newton Raphson Controller Based on Koopman Operator Theory

    Authors: Mi Zhou

    Abstract: Newton-Raphson controller is a powerful prediction-based variable gain integral controller. Basically, the classical model-based Newton-Raphson controller requires two elements: the prediction of the system output and the derivative of the predicted output with respect to the control input. In real applications, the model may not be known and it is infeasible to predict the system sometime ahead a… ▽ More

    Submitted 29 September, 2023; originally announced September 2023.

  16. arXiv:2309.16834  [pdf, other

    eess.SY

    Energy Optimal Control of a Harmonic Oscillator with a State Inequality Constraint

    Authors: Mi Zhou, Erik I Verriest, Chaouki Abdallah

    Abstract: In this article, the optimal control problem for a harmonic oscillator with an inequality constraint is considered. The applied energy of the oscillator during a fixed final time period is used as the performance criterion. The analytical solution with both small and large terminal time is found for a special case when the undriven oscillator system is initially at rest. For other initial states o… ▽ More

    Submitted 28 September, 2023; originally announced September 2023.

  17. arXiv:2309.13259  [pdf, other

    cs.IR cs.AI cs.SD eess.AS

    WikiMT++ Dataset Card

    Authors: Monan Zhou, Shangda Wu, Yuan Wang, Wei Li

    Abstract: WikiMT++ is an expanded and refined version of WikiMusicText (WikiMT), featuring 1010 curated lead sheets in ABC notation. To expand application scenarios of WikiMT, we add both objective (album, lyrics, video) and subjective emotion (12 emotion adjectives) and emo\_4q (Russell 4Q) attributes, enhancing its usability for music information retrieval, conditional music generation, automatic composit… ▽ More

    Submitted 23 September, 2023; originally announced September 2023.

  18. arXiv:2309.01958  [pdf, other

    cs.CV eess.IV

    Empowering Low-Light Image Enhancer through Customized Learnable Priors

    Authors: Naishan Zheng, Man Zhou, Yanmeng Dong, Xiangyu Rui, Jie Huang, Chongyi Li, Feng Zhao

    Abstract: Deep neural networks have achieved remarkable progress in enhancing low-light images by improving their brightness and eliminating noise. However, most existing methods construct end-to-end mapping networks heuristically, neglecting the intrinsic prior of image enhancement task and lacking transparency and interpretability. Although some unfolding solutions have been proposed to relieve these issu… ▽ More

    Submitted 5 September, 2023; originally announced September 2023.

    Comments: Accepted by ICCV 2023

  19. arXiv:2308.16083  [pdf, other

    cs.CV eess.IV

    Learned Image Reasoning Prior Penetrates Deep Unfolding Network for Panchromatic and Multi-Spectral Image Fusion

    Authors: Man Zhou, Jie Huang, Naishan Zheng, Chongyi Li

    Abstract: The success of deep neural networks for pan-sharpening is commonly in a form of black box, lacking transparency and interpretability. To alleviate this issue, we propose a novel model-driven deep unfolding framework with image reasoning prior tailored for the pan-sharpening task. Different from existing unfolding solutions that deliver the proximal operator networks as the uncertain and vague prio… ▽ More

    Submitted 30 August, 2023; originally announced August 2023.

    Comments: 10 pages; Accepted by ICCV 2023

  20. arXiv:2307.00479  [pdf, other

    eess.IV cs.CV

    Domain Transfer Through Image-to-Image Translation for Uncertainty-Aware Prostate Cancer Classification

    Authors: Meng Zhou, Amoon Jamzad, Jason Izard, Alexandre Menard, Robert Siemens, Parvin Mousavi

    Abstract: Prostate Cancer (PCa) is a prevalent disease among men, and multi-parametric MRIs offer a non-invasive method for its detection. While MRI-based deep learning solutions have shown promise in supporting PCa diagnosis, acquiring sufficient training data, particularly in local clinics remains challenging. One potential solution is to take advantage of publicly available datasets to pre-train deep mod… ▽ More

    Submitted 3 June, 2024; v1 submitted 2 July, 2023; originally announced July 2023.

    Comments: Preprint. In Submission

  21. arXiv:2306.14274  [pdf, other

    eess.IV cs.CV

    MEPNet: A Model-Driven Equivariant Proximal Network for Joint Sparse-View Reconstruction and Metal Artifact Reduction in CT Images

    Authors: Hong Wang, Minghao Zhou, Dong Wei, Yuexiang Li, Yefeng Zheng

    Abstract: Sparse-view computed tomography (CT) has been adopted as an important technique for speeding up data acquisition and decreasing radiation dose. However, due to the lack of sufficient projection data, the reconstructed CT images often present severe artifacts, which will be further amplified when patients carry metallic implants. For this joint sparse-view reconstruction and metal artifact reductio… ▽ More

    Submitted 25 June, 2023; originally announced June 2023.

    Comments: MICCAI 2023

  22. Visual-Aware Text-to-Speech

    Authors: Mohan Zhou, Yalong Bai, Wei Zhang, Ting Yao, Tiejun Zhao, Tao Mei

    Abstract: Dynamically synthesizing talking speech that actively responds to a listening head is critical during the face-to-face interaction. For example, the speaker could take advantage of the listener's facial expression to adjust the tones, stressed syllables, or pauses. In this work, we present a new visual-aware text-to-speech (VA-TTS) task to synthesize speech conditioned on both textual inputs and s… ▽ More

    Submitted 21 June, 2023; originally announced June 2023.

    Comments: accepted as oral and top 3% paper by ICASSP 2023

    Journal ref: ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2023, 1-5

  23. arXiv:2305.07774  [pdf, other

    cs.CV eess.IV

    PanFlowNet: A Flow-Based Deep Network for Pan-sharpening

    Authors: Gang Yang, Xiangyong Cao, Wenzhe Xiao, Man Zhou, Aiping Liu, Xun chen, Deyu Meng

    Abstract: Pan-sharpening aims to generate a high-resolution multispectral (HRMS) image by integrating the spectral information of a low-resolution multispectral (LRMS) image with the texture details of a high-resolution panchromatic (PAN) image. It essentially inherits the ill-posed nature of the super-resolution (SR) task that diverse HRMS images can degrade into an LRMS image. However, existing deep learn… ▽ More

    Submitted 16 May, 2023; v1 submitted 12 May, 2023; originally announced May 2023.

  24. arXiv:2304.04484  [pdf, other

    cs.IT eess.SP

    Quasi-Synchronous Random Access for Massive MIMO-Based LEO Satellite Constellations

    Authors: Keke Ying, Zhen Gao, Sheng Chen, Mingyu Zhou, Dezhi Zheng, Symeon Chatzinotas, Björn Ottersten, H. Vincent Poor

    Abstract: Low earth orbit (LEO) satellite constellation-enabled communication networks are expected to be an important part of many Internet of Things (IoT) deployments due to their unique advantage of providing seamless global coverage. In this paper, we investigate the random access problem in massive multiple-input multiple-output-based LEO satellite systems, where the multi-satellite cooperative process… ▽ More

    Submitted 10 April, 2023; originally announced April 2023.

    Comments: 38 pages, 16 figures. This paper has been accepted by IEEE JSAC SI on 3GPP Technologies: 5G-Advanced and Beyond. Copyright may be transferred without notice, after which this version may no longer be accessible

  25. arXiv:2303.13046  [pdf, other

    cs.IT eess.SP

    Quantized Phase Alignment by Discrete Phase Shifts for Reconfigurable Intelligent Surface-Assisted Communication Systems

    Authors: Jian Sang, Jifeng Lan, Mingyong Zhou, Boning Gao, Wankai Tang, Xiao Li, Xinping Yi, Shi Jin

    Abstract: Reconfigurable intelligent surface (RIS) has aroused a surge of interest in recent years. In this paper, we investigate the joint phase alignment and phase quantization on discrete phase shift designs for RIS-assisted single-input single-output (SISO) system. Firstly, the phenomena of phase distribution in far field and near field are respectively unveiled, paving the way for discretization of pha… ▽ More

    Submitted 23 March, 2023; originally announced March 2023.

  26. arXiv:2303.04414  [pdf, other

    cs.IT eess.SP

    Next-Generation URLLC with Massive Devices: A Unified Semi-Blind Detection Framework for Sourced and Unsourced Random Access

    Authors: Malong Ke, Zhen Gao, Mingyu Zhou, Dezhi Zheng, Derrick Wing Kwan Ng, H. Vincent Poor

    Abstract: This paper proposes a unified semi-blind detection framework for sourced and unsourced random access (RA), which enables next-generation ultra-reliable low-latency communications (URLLC) with massive devices. Specifically, the active devices transmit their uplink access signals in a grant-free manner to realize ultra-low access latency. Meanwhile, the base station aims to achieve ultra-reliable da… ▽ More

    Submitted 20 March, 2023; v1 submitted 8 March, 2023; originally announced March 2023.

    Comments: This paper has been accepted by IEEE JSAC special issue on next-generation URLLC in 6G

  27. arXiv:2302.05816  [pdf, ps, other

    math.OC cs.LG eess.SY

    A Policy Gradient Framework for Stochastic Optimal Control Problems with Global Convergence Guarantee

    Authors: Mo Zhou, Jianfeng Lu

    Abstract: We consider policy gradient methods for stochastic optimal control problem in continuous time. In particular, we analyze the gradient flow for the control, viewed as a continuous time limit of the policy gradient method. We prove the global convergence of the gradient flow and establish a convergence rate under some regularity assumptions. The main novelty in the analysis is the notion of local op… ▽ More

    Submitted 22 April, 2023; v1 submitted 11 February, 2023; originally announced February 2023.

    Comments: 53 pages

    MSC Class: 93E20 (Primary); 49L12 49M05 (secondary)

  28. arXiv:2301.02277  [pdf

    cs.CV cs.AI eess.IV

    LostNet: A smart way for lost and find

    Authors: Meihua Zhou, Ivan Fung, Li Yang, Nan Wan, Keke Di, Tingting Wang

    Abstract: Due to the enormous population growth of cities in recent years, objects are frequently lost and unclaimed on public transportation, in restaurants, or any other public areas. While services like Find My iPhone can easily identify lost electronic devices, more valuable objects cannot be tracked in an intelligent manner, making it impossible for administrators to reclaim a large number of lost and… ▽ More

    Submitted 5 January, 2023; originally announced January 2023.

  29. arXiv:2210.08181  [pdf, other

    cs.CV eess.IV

    Panchromatic and Multispectral Image Fusion via Alternating Reverse Filtering Network

    Authors: Keyu Yan, Man Zhou, Jie Huang, Feng Zhao, Chengjun Xie, Chongyi Li, Danfeng Hong

    Abstract: Panchromatic (PAN) and multi-spectral (MS) image fusion, named Pan-sharpening, refers to super-resolve the low-resolution (LR) multi-spectral (MS) images in the spatial domain to generate the expected high-resolution (HR) MS images, conditioning on the corresponding high-resolution PAN images. In this paper, we present a simple yet effective \textit{alternating reverse filtering network} for pan-s… ▽ More

    Submitted 14 October, 2022; originally announced October 2022.

    Journal ref: NeurIPS2022

  30. arXiv:2209.12775  [pdf, other

    math.OC eess.SY

    Jump Law of Co-State in Optimal Control for State-Dependent Switched Systems and Applications

    Authors: Mi Zhou, Erik I. Verriest, Yue Guan, Chaouki Abdallah

    Abstract: This paper presents the jump law of co-states in optimal control for state-dependent switched systems. The number of switches and the switching modes are assumed to be known a priori. A proposed jump law is rigorously derived by theoretical analysis and illustrated by simulation results. An algorithm is then proposed to solve optimal control for state-dependent hybrid systems. Through numerical si… ▽ More

    Submitted 26 September, 2022; originally announced September 2022.

  31. Model-Guided Multi-Contrast Deep Unfolding Network for MRI Super-resolution Reconstruction

    Authors: Gang Yang, Li Zhang, Man Zhou, Aiping Liu, Xun Chen, Zhiwei Xiong, Feng Wu

    Abstract: Magnetic resonance imaging (MRI) with high resolution (HR) provides more detailed information for accurate diagnosis and quantitative image analysis. Despite the significant advances, most existing super-resolution (SR) reconstruction network for medical images has two flaws: 1) All of them are designed in a black-box principle, thus lacking sufficient interpretability and further limiting their p… ▽ More

    Submitted 14 September, 2022; originally announced September 2022.

    Comments: Accepted to ACMMM 2022, 9 pages

  32. Precise Repositioning of Robotic Ultrasound: Improving Registration-based Motion Compensation using Ultrasound Confidence Optimization

    Authors: Zhongliang Jiang, Nehil Danis, Yuan Bi, Mingchuan Zhou, Markus Kroenke, Thomas Wendler, Nassir Navab

    Abstract: Robotic ultrasound (US) imaging has been seen as a promising solution to overcome the limitations of free-hand US examinations, i.e., inter-operator variability. However, the fact that robotic US systems cannot react to subject movements during scans limits their clinical acceptance. Regarding human sonographers, they often react to patient movements by repositioning the probe or even restarting t… ▽ More

    Submitted 5 September, 2022; v1 submitted 10 August, 2022; originally announced August 2022.

    Comments: The paper has been accepted by IEEE TIM. Video: https://www.youtube.com/watch?v=MUtgSXS7EZI

  33. DeepWSD: Projecting Degradations in Perceptual Space to Wasserstein Distance in Deep Feature Space

    Authors: Xingran Liao, Baoliang Chen, Hanwei Zhu, Shiqi Wang, Mingliang Zhou, Sam Kwong

    Abstract: Existing deep learning-based full-reference IQA (FR-IQA) models usually predict the image quality in a deterministic way by explicitly comparing the features, gauging how severely distorted an image is by how far the corresponding feature lies from the space of the reference images. Herein, we look at this problem from a different viewpoint and propose to model the quality degradation in perceptua… ▽ More

    Submitted 4 August, 2022; originally announced August 2022.

    Comments: ACM Multimedia 2022 accepted thesis

  34. arXiv:2207.01983  [pdf, ps, other

    cs.IT eess.SP

    Massive Access in Extra Large-Scale MIMO with Mixed-ADC over Near Field Channels

    Authors: Yikun Mei, Zhen Gao, De Mi, Mingyu Zhou, Dezhi Zheng, Michail Matthaiou, Pei Xiao, Robert Schober

    Abstract: Massive connectivity for extra large-scale multi-input multi-output (XL-MIMO) systems is a challenging issue due to the near-field access channels and the prohibitive cost. In this paper, we propose an uplink grant-free massive access scheme for XL-MIMO systems, in which a mixed-analog-to-digital converters (ADC) architecture is adopted to strike the right balance between access performance and po… ▽ More

    Submitted 3 April, 2023; v1 submitted 5 July, 2022; originally announced July 2022.

    Comments: Accepted by IEEE TVT

  35. arXiv:2206.07163  [pdf, other

    cs.CV cs.LG eess.IV

    DeepRecon: Joint 2D Cardiac Segmentation and 3D Volume Reconstruction via A Structure-Specific Generative Method

    Authors: Qi Chang, Zhennan Yan, Mu Zhou, Di Liu, Khalid Sawalha, Meng Ye, Qilong Zhangli, Mikael Kanski, Subhi Al Aref, Leon Axel, Dimitris Metaxas

    Abstract: Joint 2D cardiac segmentation and 3D volume reconstruction are fundamental to building statistical cardiac anatomy models and understanding functional mechanisms from motion patterns. However, due to the low through-plane resolution of cine MR and high inter-subject variance, accurately segmenting cardiac images and reconstructing the 3D volume are challenging. In this study, we propose an end-to-… ▽ More

    Submitted 14 June, 2022; originally announced June 2022.

    Comments: MICCAI2022

  36. arXiv:2205.09452  [pdf, other

    cs.LG eess.SY

    Learning-based AC-OPF Solvers on Realistic Network and Realistic Loads

    Authors: Tsun Ho Aaron Cheung, Min Zhou, Minghua Chen

    Abstract: Deep learning approaches for the Alternating Current-Optimal Power Flow (AC-OPF) problem are under active research in recent years. A common shortcoming in this area of research is the lack of a dataset that includes both a realistic power network topology and the corresponding realistic loads. To address this issue, we construct an AC-OPF formulation-ready dataset called TAS-97 that contains real… ▽ More

    Submitted 19 May, 2022; originally announced May 2022.

    Comments: 8 pages, 6 figures

  37. arXiv:2204.03125  [pdf, other

    eess.SY cs.LG

    Deep transfer learning for system identification using long short-term memory neural networks

    Authors: Kaicheng Niu, Mi Zhou, Chaouki T. Abdallah, Mohammad Hayajneh

    Abstract: Recurrent neural networks (RNNs) have many advantages over more traditional system identification techniques. They may be applied to linear and nonlinear systems, and they require fewer modeling assumptions. However, these neural network models may also need larger amounts of data to learn and generalize. Furthermore, neural networks training is a time-consuming process. Hence, building upon long-… ▽ More

    Submitted 6 April, 2022; originally announced April 2022.

  38. Proximal Policy Optimization-based Transmit Beamforming and Phase-shift Design in an IRS-aided ISAC System for the THz Band

    Authors: Xiangnan Liu, Haijun Zhang, Keping Long, Mingyu Zhou, Yonghui Li, H. Vincent Poor

    Abstract: In this paper, an IRS-aided integrated sensing and communications (ISAC) system operating in the terahertz (THz) band is proposed to maximize the system capacity. Transmit beamforming and phase-shift design are transformed into a universal optimization problem with ergodic constraints. Then the joint optimization of transmit beamforming and phase-shift design is achieved by gradient-based, primal-… ▽ More

    Submitted 21 March, 2022; originally announced March 2022.

  39. arXiv:2203.10726  [pdf, other

    eess.IV cs.CV

    TransFusion: Multi-view Divergent Fusion for Medical Image Segmentation with Transformers

    Authors: Di Liu, Yunhe Gao, Qilong Zhangli, Ligong Han, Xiaoxiao He, Zhaoyang Xia, Song Wen, Qi Chang, Zhennan Yan, Mu Zhou, Dimitris Metaxas

    Abstract: Combining information from multi-view images is crucial to improve the performance and robustness of automated methods for disease diagnosis. However, due to the non-alignment characteristics of multi-view images, building correlation and data fusion across views largely remain an open problem. In this study, we present TransFusion, a Transformer-based architecture to merge divergent multi-view im… ▽ More

    Submitted 5 September, 2022; v1 submitted 21 March, 2022; originally announced March 2022.

  40. arXiv:2203.04960  [pdf, other

    eess.IV cs.CV

    Memory-augmented Deep Unfolding Network for Guided Image Super-resolution

    Authors: Man Zhou, Keyu Yan, Jinshan Pan, Wenqi Ren, Qi Xie, Xiangyong Cao

    Abstract: Guided image super-resolution (GISR) aims to obtain a high-resolution (HR) target image by enhancing the spatial resolution of a low-resolution (LR) target image under the guidance of a HR image. However, previous model-based methods mainly takes the entire image as a whole, and assume the prior distribution between the HR target image and the HR guidance image, simply ignoring many non-local comm… ▽ More

    Submitted 12 February, 2022; originally announced March 2022.

    Comments: 24 pages, 16 figures

  41. arXiv:2203.00131  [pdf, other

    eess.IV cs.CV

    A Data-scalable Transformer for Medical Image Segmentation: Architecture, Model Efficiency, and Benchmark

    Authors: Yunhe Gao, Mu Zhou, Di Liu, Zhennan Yan, Shaoting Zhang, Dimitris N. Metaxas

    Abstract: Transformers have demonstrated remarkable performance in natural language processing and computer vision. However, existing vision Transformers struggle to learn from limited medical data and are unable to generalize on diverse medical image tasks. To tackle these challenges, we present MedFormer, a data-scalable Transformer designed for generalizable 3D medical image segmentation. Our approach in… ▽ More

    Submitted 4 April, 2023; v1 submitted 28 February, 2022; originally announced March 2022.

  42. arXiv:2202.08916  [pdf, other

    eess.IV cs.AI cs.CV cs.LG

    Graph Convolutional Networks for Multi-modality Medical Imaging: Methods, Architectures, and Clinical Applications

    Authors: Kexin Ding, Mu Zhou, Zichen Wang, Qiao Liu, Corey W. Arnold, Shaoting Zhang, Dimitri N. Metaxas

    Abstract: Image-based characterization and disease understanding involve integrative analysis of morphological, spatial, and topological information across biological scales. The development of graph convolutional networks (GCNs) has created the opportunity to address this information complexity via graph-driven architectures, since GCNs can perform feature aggregation, interaction, and reasoning with remar… ▽ More

    Submitted 20 April, 2022; v1 submitted 17 February, 2022; originally announced February 2022.

  43. arXiv:2112.07087  [pdf, other

    cs.NE cs.CV cs.LG eess.IV

    Heuristic Hyperparameter Optimization for Convolutional Neural Networks using Genetic Algorithm

    Authors: Meng Zhou

    Abstract: In recent years, people from all over the world are suffering from one of the most severe diseases in history, known as Coronavirus disease 2019, COVID-19 for short. When the virus reaches the lungs, it has a higher probability to cause lung pneumonia and sepsis. X-ray image is a powerful tool in identifying the typical features of the infection for COVID-19 patients. The radiologists and patholog… ▽ More

    Submitted 13 December, 2021; originally announced December 2021.

    Comments: 8 pages, 3 figures

  44. arXiv:2110.14285  [pdf, ps, other

    eess.SP

    Over-the-Air Aggregation for Federated Learning: Waveform Superposition and Prototype Validation

    Authors: Huayan Guo, Yifan Zhu, Haoyu Ma, Vincent K. N. Lau, Kaibin Huang, Xiaofan Li, Huabin Nong, Mingyu Zhou

    Abstract: In this paper, we develop an orthogonal-frequency-division-multiplexing (OFDM)-based over-the-air (OTA) aggregation solution for wireless federated learning (FL). In particular, the local gradients in massive IoT devices are modulated by an analog waveform and are then transmitted using the same wireless resources. To this end, achieving perfect waveform superposition is the key challenge, which i… ▽ More

    Submitted 27 October, 2021; originally announced October 2021.

  45. arXiv:2110.07469  [pdf, other

    cs.GT eess.SY

    Shaping Large Population Agent Behaviors Through Entropy-Regularized Mean-Field Games

    Authors: Yue Guan, Mi Zhou, Ali Pakniyat, Panagiotis Tsiotras

    Abstract: Mean-field games (MFG) were introduced to efficiently analyze approximate Nash equilibria in large population settings. In this work, we consider entropy-regularized mean-field games with a finite state-action space in a discrete time setting. We show that entropy regularization provides the necessary regularity conditions, that are lacking in the standard finite mean field games. Such regularity… ▽ More

    Submitted 22 July, 2022; v1 submitted 14 October, 2021; originally announced October 2021.

  46. arXiv:2110.05765  [pdf, other

    cs.SD cs.LG cs.MM eess.AS

    Music Sentiment Transfer

    Authors: Miles Sigel, Michael Zhou, Jiebo Luo

    Abstract: Music sentiment transfer is a completely novel task. Sentiment transfer is a natural evolution of the heavily-studied style transfer task, as sentiment transfer is rooted in applying the sentiment of a source to be the new sentiment for a target piece of media; yet compared to style transfer, sentiment transfer has been only scantily studied on images. Music sentiment transfer attempts to apply th… ▽ More

    Submitted 12 October, 2021; originally announced October 2021.

    Comments: NSF REU: Computational Methods for Understanding Music, Media, and Minds, University of Rochester

  47. Deformation-Aware Robotic 3D Ultrasound

    Authors: Zhongliang Jiang, Yue Zhou, Yuan Bi, Mingchuan Zhou, Thomas Wendler, Nassir Navab

    Abstract: Tissue deformation in ultrasound (US) imaging leads to geometrical errors when measuring tissues due to the pressure exerted by probes. Such deformation has an even larger effect on 3D US volumes as the correct compounding is limited by the inconsistent location and geometry. This work proposes a patient-specified stiffness-based method to correct the tissue deformations in robotic 3D US acquisiti… ▽ More

    Submitted 18 July, 2021; originally announced July 2021.

    Comments: Accepted for publication in IEEE Robotics and Automation Letters; Video: https://www.youtube.com/watch?v=MlZtugQ2cvQ

    Journal ref: IEEE Robotics and Automation Letters 2021

  48. arXiv:2105.14758  [pdf, other

    eess.IV cs.CV

    Low-Dose CT Denoising Using a Structure-Preserving Kernel Prediction Network

    Authors: Lu Xu, Yuwei Zhang, Ying Liu, Daoye Wang, Mu Zhou, Jimmy Ren, Jingwei Wei, Zhaoxiang Ye

    Abstract: Low-dose CT has been a key diagnostic imaging modality to reduce the potential risk of radiation overdose to patient health. Despite recent advances, CNN-based approaches typically apply filters in a spatially invariant way and adopt similar pixel-level losses, which treat all regions of the CT image equally and can be inefficient when fine-grained structures coexist with non-uniformly distributed… ▽ More

    Submitted 23 July, 2021; v1 submitted 31 May, 2021; originally announced May 2021.

    Comments: ICIP2021

  49. arXiv:2104.07667  [pdf

    eess.IV cs.CV

    Shoulder Implant X-Ray Manufacturer Classification: Exploring with Vision Transformer

    Authors: Meng Zhou, Shanglin Mo

    Abstract: Shoulder replacement surgery, also called total shoulder replacement, is a common and complex surgery in Orthopedics discipline. It involves replacing a dead shoulder joint with an artificial implant. In the market, there are many artificial implant manufacturers and each of them may produce different implants with different structures compares to other providers. The problem arises in the followi… ▽ More

    Submitted 21 April, 2021; v1 submitted 15 April, 2021; originally announced April 2021.

    Comments: 11 pages, 12 figures

  50. arXiv:2103.16493  [pdf, other

    cs.CV eess.IV

    Enabling Data Diversity: Efficient Automatic Augmentation via Regularized Adversarial Training

    Authors: Yunhe Gao, Zhiqiang Tang, Mu Zhou, Dimitris Metaxas

    Abstract: Data augmentation has proved extremely useful by increasing training data variance to alleviate overfitting and improve deep neural networks' generalization performance. In medical image analysis, a well-designed augmentation policy usually requires much expert knowledge and is difficult to generalize to multiple tasks due to the vast discrepancies among pixel intensities, image appearances, and o… ▽ More

    Submitted 30 March, 2021; originally announced March 2021.

    Comments: Accepted by IPMI 2021