Zum Hauptinhalt springen

Showing 1–50 of 153 results for author: Lin, Z

Searching in archive eess. Search in all archives.
.
  1. arXiv:2408.11582  [pdf, other

    cs.RO eess.SY

    Enhanced Visual SLAM for Collision-free Driving with Lightweight Autonomous Cars

    Authors: Zhihao Lin, Zhen Tian, Qi Zhang, Hanyang Zhuang, Jianglin Lan

    Abstract: The paper presents a vision-based obstacle avoidance strategy for lightweight self-driving cars that can be run on a CPU-only device using a single RGB-D camera. The method consists of two steps: visual perception and path planning. The visual perception part uses ORBSLAM3 enhanced with optical flow to estimate the car's poses and extract rich texture information from the scene. In the path planni… ▽ More

    Submitted 21 August, 2024; originally announced August 2024.

    Comments: 16 pages; Submitted to a journal

  2. arXiv:2408.09951  [pdf

    cs.AI eess.SP

    Principle Driven Parameterized Fiber Model based on GPT-PINN Neural Network

    Authors: Yubin Zang, Boyu Hua, Zhenzhou Tang, Zhipeng Lin, Fangzheng Zhang, Simin Li, Zuxing Zhang, Hongwei Chen

    Abstract: In cater the need of Beyond 5G communications, large numbers of data driven artificial intelligence based fiber models has been put forward as to utilize artificial intelligence's regression ability to predict pulse evolution in fiber transmission at a much faster speed compared with the traditional split step Fourier method. In order to increase the physical interpretabiliy, principle driven fibe… ▽ More

    Submitted 19 August, 2024; originally announced August 2024.

  3. arXiv:2408.09947  [pdf

    cs.AI eess.SP

    Fiber Transmission Model with Parameterized Inputs based on GPT-PINN Neural Network

    Authors: Yubin Zang, Boyu Hua, Zhipeng Lin, Fangzheng Zhang, Simin Li, Zuxing Zhang, Hongwei Chen

    Abstract: In this manuscript, a novelty principle driven fiber transmission model for short-distance transmission with parameterized inputs is put forward. By taking into the account of the previously proposed principle driven fiber model, the reduced basis expansion method and transforming the parameterized inputs into parameterized coefficients of the Nonlinear Schrodinger Equations, universal solutions w… ▽ More

    Submitted 19 August, 2024; originally announced August 2024.

  4. arXiv:2408.09027  [pdf, other

    cs.SD cs.AI eess.AS

    Efficient Autoregressive Audio Modeling via Next-Scale Prediction

    Authors: Kai Qiu, Xiang Li, Hao Chen, Jie Sun, Jinglu Wang, Zhe Lin, Marios Savvides, Bhiksha Raj

    Abstract: Audio generation has achieved remarkable progress with the advance of sophisticated generative models, such as diffusion models (DMs) and autoregressive (AR) models. However, due to the naturally significant sequence length of audio, the efficiency of audio generation remains an essential issue to be addressed, especially for AR models that are incorporated in large language models (LLMs). In this… ▽ More

    Submitted 16 August, 2024; originally announced August 2024.

    Comments: 7 pages, 6 figures, 7 tables

  5. arXiv:2408.08242  [pdf, ps, other

    cs.RO cs.AI cs.LG eess.SY

    A Conflicts-free, Speed-lossless KAN-based Reinforcement Learning Decision System for Interactive Driving in Roundabouts

    Authors: Zhihao Lin, Zhen Tian, Qi Zhang, Ziyang Ye, Hanyang Zhuang, Jianglin Lan

    Abstract: Safety and efficiency are crucial for autonomous driving in roundabouts, especially in the context of mixed traffic where autonomous vehicles (AVs) and human-driven vehicles coexist. This paper introduces a learning-based algorithm tailored to foster safe and efficient driving behaviors across varying levels of traffic flows in roundabouts. The proposed algorithm employs a deep Q-learning network… ▽ More

    Submitted 15 August, 2024; originally announced August 2024.

    Comments: 15 pages, 12 figures, submitted to an IEEE journal

  6. arXiv:2407.13076  [pdf, other

    cs.MA cs.NI eess.SP

    Matching-Driven Deep Reinforcement Learning for Energy-Efficient Transmission Parameter Allocation in Multi-Gateway LoRa Networks

    Authors: Ziqi Lin, Xu Zhang, Shimin Gong, Lanhua Li, Zhou Su, Bo Gu

    Abstract: Long-range (LoRa) communication technology, distinguished by its low power consumption and long communication range, is widely used in the Internet of Things. Nevertheless, the LoRa MAC layer adopts pure ALOHA for medium access control, which may suffer from severe packet collisions as the network scale expands, consequently reducing the system energy efficiency (EE). To address this issue, it is… ▽ More

    Submitted 17 July, 2024; originally announced July 2024.

  7. arXiv:2407.10632  [pdf, other

    eess.IV cs.AI cs.CV

    Bidirectional Stereo Image Compression with Cross-Dimensional Entropy Model

    Authors: Zhening Liu, Xinjie Zhang, Jiawei Shao, Zehong Lin, Jun Zhang

    Abstract: With the rapid advancement of stereo vision technologies, stereo image compression has emerged as a crucial field that continues to draw significant attention. Previous approaches have primarily employed a unidirectional paradigm, where the compression of one view is dependent on the other, resulting in imbalanced compression. To address this issue, we introduce a symmetric bidirectional stereo im… ▽ More

    Submitted 15 July, 2024; originally announced July 2024.

    Comments: ECCV 2024

  8. arXiv:2407.04675  [pdf, other

    eess.AS cs.SD

    Seed-ASR: Understanding Diverse Speech and Contexts with LLM-based Speech Recognition

    Authors: Ye Bai, Jingping Chen, Jitong Chen, Wei Chen, Zhuo Chen, Chuang Ding, Linhao Dong, Qianqian Dong, Yujiao Du, Kepan Gao, Lu Gao, Yi Guo, Minglun Han, Ting Han, Wenchao Hu, Xinying Hu, Yuxiang Hu, Deyu Hua, Lu Huang, Mingkun Huang, Youjia Huang, Jishuo Jin, Fanliu Kong, Zongwei Lan, Tianyu Li , et al. (30 additional authors not shown)

    Abstract: Modern automatic speech recognition (ASR) model is required to accurately transcribe diverse speech signals (from different domains, languages, accents, etc) given the specific contextual information in various application scenarios. Classic end-to-end models fused with extra language models perform well, but mainly in data matching scenarios and are gradually approaching a bottleneck. In this wor… ▽ More

    Submitted 10 July, 2024; v1 submitted 5 July, 2024; originally announced July 2024.

  9. arXiv:2406.15222  [pdf

    eess.IV cs.AI cs.CV

    Rapid and Accurate Diagnosis of Acute Aortic Syndrome using Non-contrast CT: A Large-scale, Retrospective, Multi-center and AI-based Study

    Authors: Yujian Hu, Yilang Xiang, Yan-Jie Zhou, Yangyan He, Shifeng Yang, Xiaolong Du, Chunlan Den, Youyao Xu, Gaofeng Wang, Zhengyao Ding, Jingyong Huang, Wenjun Zhao, Xuejun Wu, Donglin Li, Qianqian Zhu, Zhenjiang Li, Chenyang Qiu, Ziheng Wu, Yunjun He, Chen Tian, Yihui Qiu, Zuodong Lin, Xiaolong Zhang, Yuan He, Zhenpeng Yuan , et al. (15 additional authors not shown)

    Abstract: Chest pain symptoms are highly prevalent in emergency departments (EDs), where acute aortic syndrome (AAS) is a catastrophic cardiovascular emergency with a high fatality rate, especially when timely and accurate treatment is not administered. However, current triage practices in the ED can cause up to approximately half of patients with AAS to have an initially missed diagnosis or be misdiagnosed… ▽ More

    Submitted 16 July, 2024; v1 submitted 13 June, 2024; originally announced June 2024.

  10. arXiv:2406.14973  [pdf, other

    cs.CV eess.IV

    LU2Net: A Lightweight Network for Real-time Underwater Image Enhancement

    Authors: Haodong Yang, Jisheng Xu, Zhiliang Lin, Jianping He

    Abstract: Computer vision techniques have empowered underwater robots to effectively undertake a multitude of tasks, including object tracking and path planning. However, underwater optical factors like light refraction and absorption present challenges to underwater vision, which cause degradation of underwater images. A variety of underwater image enhancement methods have been proposed to improve the effe… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

  11. arXiv:2406.10856  [pdf, other

    cs.NI eess.SY

    LEO Satellite Networks Assisted Geo-distributed Data Processing

    Authors: Zhiyuan Zhao, Zhe Chen, Zheng Lin, Wenjun Zhu, Kun Qiu, Chaoqun You, Yue Gao

    Abstract: Nowadays, the increasing deployment of edge clouds globally provides users with low-latency services. However, connecting an edge cloud to a core cloud via optic cables in terrestrial networks poses significant barriers due to the prohibitively expensive building cost of optic cables. Fortunately, emerging Low Earth Orbit (LEO) satellite networks (e.g., Starlink) offer a more cost-effective soluti… ▽ More

    Submitted 16 June, 2024; originally announced June 2024.

    Comments: 6 pages, 5 figures

  12. arXiv:2406.04589  [pdf, other

    cs.SD cs.LG eess.AS

    MUSE: Flexible Voiceprint Receptive Fields and Multi-Path Fusion Enhanced Taylor Transformer for U-Net-based Speech Enhancement

    Authors: Zizhen Lin, Xiaoting Chen, Junyu Wang

    Abstract: Achieving a balance between lightweight design and high performance remains a challenging task for speech enhancement. In this paper, we introduce Multi-path Enhanced Taylor (MET) Transformer based U-net for Speech Enhancement (MUSE), a lightweight speech enhancement network built upon the Unet architecture. Our approach incorporates a novel Multi-path Enhanced Taylor (MET) Transformer block, whic… ▽ More

    Submitted 19 June, 2024; v1 submitted 6 June, 2024; originally announced June 2024.

    Comments: This paper was accepted by Interspeech 2024

  13. arXiv:2405.15705  [pdf, other

    cs.AR eess.SY

    Sums: Sniffing Unknown Multiband Signals under Low Sampling Rates

    Authors: Jinbo Peng, Zhe Chen, Zheng Lin, Haoxuan Yuan, Zihan Fang, Lingzhong Bao, Zihang Song, Ying Li, Jing Ren, Yue Gao

    Abstract: Due to sophisticated deployments of all kinds of wireless networks (e.g., 5G, Wi-Fi, Bluetooth, LEO satellite, etc.), multiband signals distribute in a large bandwidth (e.g., from 70 MHz to 8 GHz). Consequently, for network monitoring and spectrum sharing applications, a sniffer for extracting physical layer information, such as structure of packet, with low sampling rate (especially, sub-Nyquist… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

    Comments: 12 pages, 9 figures

  14. arXiv:2405.15542  [pdf, other

    cs.NI cs.DC cs.LG eess.SP

    SATSense: Multi-Satellite Collaborative Framework for Spectrum Sensing

    Authors: Haoxuan Yuan, Zhe Chen, Zheng Lin, Jinbo Peng, Zihan Fang, Yuhang Zhong, Zihang Song, Yue Gao

    Abstract: Low Earth Orbit satellite Internet has recently been deployed, providing worldwide service with non-terrestrial networks. With the large-scale deployment of both non-terrestrial and terrestrial networks, limited spectrum resources will not be allocated enough. Consequently, dynamic spectrum sharing is crucial for their coexistence in the same spectrum, where accurate spectrum sensing is essential.… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

    Comments: 13 pages, 16 figures

  15. arXiv:2405.12584  [pdf, other

    eess.IV cs.CV cs.LG

    Is Dataset Quality Still a Concern in Diagnosis Using Large Foundation Model?

    Authors: Ziqin Lin, Heng Li, Zinan Li, Huazhu Fu, Jiang Liu

    Abstract: Recent advancements in pre-trained large foundation models (LFM) have yielded significant breakthroughs across various domains, including natural language processing and computer vision. These models have been particularly impactful in the domain of medical diagnostic tasks. With abundant unlabeled data, an LFM has been developed for fundus images using the Vision Transformer (VIT) and a self-supe… ▽ More

    Submitted 21 May, 2024; originally announced May 2024.

    Comments: 10 pages, 6 figures

  16. arXiv:2405.05252  [pdf, other

    cs.CV cs.AI cs.LG eess.IV eess.SP

    Attention-Driven Training-Free Efficiency Enhancement of Diffusion Models

    Authors: Hongjie Wang, Difan Liu, Yan Kang, Yijun Li, Zhe Lin, Niraj K. Jha, Yuchen Liu

    Abstract: Diffusion Models (DMs) have exhibited superior performance in generating high-quality and diverse images. However, this exceptional performance comes at the cost of expensive architectural design, particularly due to the attention module heavily used in leading models. Existing works mainly adopt a retraining process to enhance DM efficiency. This is computationally expensive and not very scalable… ▽ More

    Submitted 8 May, 2024; originally announced May 2024.

    Comments: Accepted to IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2024

  17. arXiv:2404.15750  [pdf, other

    eess.SP

    A Reconfigurable Subarray Architecture and Hybrid Beamforming for Millimeter-Wave Dual-Function-Radar-Communication Systems

    Authors: Xin Jin, Tiejun Lv, Wei Ni, Zhipeng Lin, Qiuming Zhu, Ekram Hossain, H. Vincent Poor

    Abstract: Dual-function-radar-communication (DFRC) is a promising candidate technology for next-generation networks. By integrating hybrid analog-digital (HAD) beamforming into a multi-user millimeter-wave (mmWave) DFRC system, we design a new reconfigurable subarray (RS) architecture and jointly optimize the HAD beamforming to maximize the communication sum-rate and ensure a prescribed signal-to-clutter-pl… ▽ More

    Submitted 24 April, 2024; originally announced April 2024.

    Comments: 14 pages, 9 figures, Accepted by IEEE TWC

  18. arXiv:2403.18811  [pdf, other

    cs.CV cs.GR cs.SD eess.AS

    Duolando: Follower GPT with Off-Policy Reinforcement Learning for Dance Accompaniment

    Authors: Li Siyao, Tianpei Gu, Zhitao Yang, Zhengyu Lin, Ziwei Liu, Henghui Ding, Lei Yang, Chen Change Loy

    Abstract: We introduce a novel task within the field of 3D dance generation, termed dance accompaniment, which necessitates the generation of responsive movements from a dance partner, the "follower", synchronized with the lead dancer's movements and the underlying musical rhythm. Unlike existing solo or group dance generation tasks, a duet dance scenario entails a heightened degree of interaction between t… ▽ More

    Submitted 27 March, 2024; originally announced March 2024.

    Comments: ICLR 2024

  19. arXiv:2402.05725  [pdf, other

    cs.RO eess.SP

    Dual-modal Tactile E-skin: Enabling Bidirectional Human-Robot Interaction via Integrated Tactile Perception and Feedback

    Authors: Shilong Mu, Runze Zhao, Zenan Lin, Yan Huang, Shoujie Li, Chenchang Li, Xiao-Ping Zhang, Wenbo Ding

    Abstract: To foster an immersive and natural human-robot interaction, the implementation of tactile perception and feedback becomes imperative, effectively bridging the conventional sensory gap. In this paper, we propose a dual-modal electronic skin (e-skin) that integrates magnetic tactile sensing and vibration feedback for enhanced human-robot interaction. The dual-modal tactile e-skin offers multi-functi… ▽ More

    Submitted 8 February, 2024; originally announced February 2024.

    Comments: 7 pages, 8 figures. Submitted to 2024 IEEE International Conference on Robotics and Automation (ICRA), Japan, Yokohama

  20. arXiv:2401.15955  [pdf

    eess.SP eess.SY

    A Novel Geometric Solution for Moving Target Localization through Multistatic Sensing in the ISAC System

    Authors: S. Zhuge, Y. Ma, Z. Lin, Y. Zeng

    Abstract: This paper proposes a novel geometric solution for tracking a moving target through multistatic sensing. In contrast to existing two-step weighted least square (2SWLS) methods which use the bistatic range (BR) and bistatic range rate (BRR) measurements, the proposed method incorporates an additional direction of arrival (DOA) measurement of the target obtained from a communication receiver in an i… ▽ More

    Submitted 29 January, 2024; originally announced January 2024.

  21. arXiv:2401.10256  [pdf, ps, other

    cs.CV eess.IV

    Active headrest combined with a depth camera-based ear-positioning system

    Authors: Yuteng Liu, Haowen Li, Haishan Zou, Jing Lu, Zhibin Lin

    Abstract: Active headrests can reduce low-frequency noise around ears based on active noise control (ANC) system. Both the control system using fixed control filters and the remote microphone-based adaptive control system provide good noise reduction performance when the head is in the original position. However, their performance degrades significantly when the head is in motion. In this paper, a human ear… ▽ More

    Submitted 25 December, 2023; originally announced January 2024.

  22. arXiv:2401.07532  [pdf, other

    cs.SD cs.AI eess.AS

    Multi-view MidiVAE: Fusing Track- and Bar-view Representations for Long Multi-track Symbolic Music Generation

    Authors: Zhiwei Lin, Jun Chen, Boshi Tang, Binzhu Sha, Jing Yang, Yaolong Ju, Fan Fan, Shiyin Kang, Zhiyong Wu, Helen Meng

    Abstract: Variational Autoencoders (VAEs) constitute a crucial component of neural symbolic music generation, among which some works have yielded outstanding results and attracted considerable attention. Nevertheless, previous VAEs still encounter issues with overly long feature sequences and generated results lack contextual coherence, thus the challenge of modeling long multi-track symbolic music still re… ▽ More

    Submitted 15 January, 2024; originally announced January 2024.

    Comments: Accepted by ICASSP 2024

  23. arXiv:2312.15653  [pdf, other

    cs.IT eess.SP

    Index Modulation for Fluid Antenna-Assisted MIMO Communications: System Design and Performance Analysis

    Authors: Jing Zhu, Gaojie Chen, Pengyu Gao, Pei Xiao, Zihuai Lin, Atta Quddus

    Abstract: In this paper, we propose a transmission mechanism for fluid antennas (FAs) enabled multiple-input multiple-output (MIMO) communication systems based on index modulation (IM), named FA-IM, which incorporates the principle of IM into FAs-assisted MIMO system to improve the spectral efficiency (SE) without increasing the hardware complexity. In FA-IM, the information bits are mapped not only to the… ▽ More

    Submitted 25 December, 2023; originally announced December 2023.

    Comments: 12 pages,9 figures, publish to TWC

  24. arXiv:2312.05910  [pdf, other

    cs.LG eess.SP stat.ML

    Ensemble Kalman Filtering Meets Gaussian Process SSM for Non-Mean-Field and Online Inference

    Authors: Zhidi Lin, Yiyong Sun, Feng Yin, Alexandre Hoang Thiéry

    Abstract: The Gaussian process state-space models (GPSSMs) represent a versatile class of data-driven nonlinear dynamical system models. However, the presence of numerous latent variables in GPSSM incurs unresolved issues for existing variational inference approaches, particularly under the more realistic non-mean-field (NMF) assumption, including extensive training effort, compromised inference accuracy, a… ▽ More

    Submitted 22 July, 2024; v1 submitted 10 December, 2023; originally announced December 2023.

    Comments: Gaussian process, state-space model, ensemble Kalman filter, online learning, variational inference

  25. arXiv:2312.03986  [pdf, other

    eess.SP

    An Unsupervised Machine Learning Scheme for Index-Based CSI Feedback in Wi-Fi

    Authors: Mrugen Deshmukh, Zinan Lin, Hanqing Lou, Mahmoud Kamel, Rui Yang, Ismail Guvenc

    Abstract: With the ever-increasing demand for high-speed wireless data transmission, beamforming techniques have been proven to be crucial in improving the data rate and the signal-to-noise ratio (SNR) at the receiver. However, they require feedback mechanisms that need an overhead of information and increase the system complexity, potentially challenging the efficiency and capacity of modern wireless netwo… ▽ More

    Submitted 6 December, 2023; originally announced December 2023.

  26. arXiv:2312.03978  [pdf, other

    eess.SP

    Enhanced Index-Based Feedback Overhead Reduction for WLANs

    Authors: Mrugen Deshmukh, Zinan Lin, Hanqing Lou, Mahmoud Kamel, Rui Yang, Ismail Guvenc

    Abstract: Compressed beamforming algorithm is used in the current Wi-Fi standard to reduce the beamforming feedback overhead (BFO). However, with each new amendment of the standard the number of supported antennas in Wi-Fi devices increases, leading to increased BFO and hampering the throughput despite using compressed beamforming. In this paper, a novel index-based method is presented to reduce the BFO in… ▽ More

    Submitted 6 December, 2023; originally announced December 2023.

  27. arXiv:2312.01338  [pdf, other

    eess.IV cs.CV cs.LG

    Enhancing and Adapting in the Clinic: Source-free Unsupervised Domain Adaptation for Medical Image Enhancement

    Authors: Heng Li, Ziqin Lin, Zhongxi Qiu, Zinan Li, Huazhu Fu, Yan Hu, Jiang Liu

    Abstract: Medical imaging provides many valuable clues involving anatomical structure and pathological characteristics. However, image degradation is a common issue in clinical practice, which can adversely impact the observation and diagnosis by physicians and algorithms. Although extensive enhancement models have been developed, these models require a well pre-training before deployment, while failing to… ▽ More

    Submitted 3 December, 2023; originally announced December 2023.

    Comments: 14 pages, 9 figures, in IEEE Transactions on Medical Imaging

  28. Improving Whispered Speech Recognition Performance using Pseudo-whispered based Data Augmentation

    Authors: Zhaofeng Lin, Tanvina Patel, Odette Scharenborg

    Abstract: Whispering is a distinct form of speech known for its soft, breathy, and hushed characteristics, often used for private communication. The acoustic characteristics of whispered speech differ substantially from normally phonated speech and the scarcity of adequate training data leads to low automatic speech recognition (ASR) performance. To address the data scarcity issue, we use a signal processin… ▽ More

    Submitted 9 November, 2023; originally announced November 2023.

    Comments: Accepted to ASRU 2023

  29. arXiv:2311.01653  [pdf

    eess.IV cs.CV

    INeAT: Iterative Neural Adaptive Tomography

    Authors: Bo Xiong, Changqing Su, Zihan Lin, You Zhou, Zhaofei Yu

    Abstract: Computed Tomography (CT) with its remarkable capability for three-dimensional imaging from multiple projections, enjoys a broad range of applications in clinical diagnosis, scientific observation, and industrial detection. Neural Adaptive Tomography (NeAT) is a recently proposed 3D rendering method based on neural radiance field for CT, and it demonstrates superior performance compared to traditio… ▽ More

    Submitted 2 November, 2023; originally announced November 2023.

  30. arXiv:2310.09467  [pdf

    eess.IV

    PC-bzip2: a phase-space continuity enhanced lossless compression algorithm for light field microscopy data

    Authors: Changqing Su, Zihan Lin, You Zhou, Shuai Wang, Yuhan Gao, Chenggang Yan, Bo Xiong

    Abstract: Light-field fluorescence microscopy (LFM) is a powerful elegant compact method for long-term high-speed imaging of complex biological systems, such as neuron activities and rapid movements of organelles. LFM experiments typically generate terabytes image data and require a huge number of storage space. Some lossy compression algorithms have been proposed recently with good compression performance.… ▽ More

    Submitted 13 October, 2023; originally announced October 2023.

  31. arXiv:2310.03985  [pdf, other

    cs.CL cs.LG cs.SD eess.AS

    Dementia Assessment Using Mandarin Speech with an Attention-based Speech Recognition Encoder

    Authors: Zih-Jyun Lin, Yi-Ju Chen, Po-Chih Kuo, Likai Huang, Chaur-Jong Hu, Cheng-Yu Chen

    Abstract: Dementia diagnosis requires a series of different testing methods, which is complex and time-consuming. Early detection of dementia is crucial as it can prevent further deterioration of the condition. This paper utilizes a speech recognition model to construct a dementia assessment system tailored for Mandarin speakers during the picture description task. By training an attention-based speech reco… ▽ More

    Submitted 15 December, 2023; v1 submitted 5 October, 2023; originally announced October 2023.

    Comments: Accepted to IEEE ICASSP 2024

  32. arXiv:2309.12010  [pdf, other

    eess.IV cs.CV

    Convolution and Attention Mixer for Synthetic Aperture Radar Image Change Detection

    Authors: Haopeng Zhang, Zijing Lin, Feng Gao, Junyu Dong, Qian Du, Heng-Chao Li

    Abstract: Synthetic aperture radar (SAR) image change detection is a critical task and has received increasing attentions in the remote sensing community. However, existing SAR change detection methods are mainly based on convolutional neural networks (CNNs), with limited consideration of global attention mechanism. In this letter, we explore Transformer-like architecture for SAR change detection to incorpo… ▽ More

    Submitted 21 September, 2023; originally announced September 2023.

    Comments: Accepted by IEEE GRSL

  33. arXiv:2309.11411  [pdf, other

    eess.SY

    Distributed Finite-Time Cooperative Localization for Three-Dimensional Sensor Networks

    Authors: Jinze Wu, Lorenzo Zino, Zhiyun Lin, Alessandro Rizzo

    Abstract: This paper addresses the distributed localization problem for a network of sensors placed in a three-dimensional space, in which sensors are able to perform range measurements, i.e., measure the relative distance between them, and exchange information on a network structure. First, we derive a necessary and sufficient condition for node localizability using barycentric coordinates. Then, building… ▽ More

    Submitted 20 September, 2023; originally announced September 2023.

    Comments: 39 pages, 7 figures, under review

  34. arXiv:2309.08201  [pdf, other

    cs.LG eess.SP math.OC

    Sparsity-Aware Distributed Learning for Gaussian Processes with Linear Multiple Kernel

    Authors: Richard Cornelius Suwandi, Zhidi Lin, Feng Yin, Zhiguo Wang, Sergios Theodoridis

    Abstract: Gaussian processes (GPs) stand as crucial tools in machine learning and signal processing, with their effectiveness hinging on kernel design and hyper-parameter optimization. This paper presents a novel GP linear multiple kernel (LMK) and a generic sparsity-aware distributed learning framework to optimize the hyper-parameters. The newly proposed grid spectral mixture (GSM) kernel is tailored for m… ▽ More

    Submitted 26 December, 2023; v1 submitted 15 September, 2023; originally announced September 2023.

  35. arXiv:2309.01074  [pdf, other

    cs.LG eess.SP eess.SY

    Towards Efficient Modeling and Inference in Multi-Dimensional Gaussian Process State-Space Models

    Authors: Zhidi Lin, Juan Maroñas, Ying Li, Feng Yin, Sergios Theodoridis

    Abstract: The Gaussian process state-space model (GPSSM) has attracted extensive attention for modeling complex nonlinear dynamical systems. However, the existing GPSSM employs separate Gaussian processes (GPs) for each latent state dimension, leading to escalating computational complexity and parameter proliferation, thus posing challenges for modeling dynamical systems with high-dimensional latent states.… ▽ More

    Submitted 3 September, 2023; originally announced September 2023.

  36. arXiv:2308.11849  [pdf, other

    eess.SY cs.AI cs.LG

    A Mobile Data-Driven Hierarchical Deep Reinforcement Learning Approach for Real-time Demand-Responsive Railway Rescheduling and Station Overcrowding Mitigation

    Authors: Enze Liu, Zhiyuan Lin, Judith Y. T. Wang, Hong Chen

    Abstract: Real-time railway rescheduling is an important technique to enable operational recovery in response to unexpected and dynamic conditions in a timely and flexible manner. Current research relies mostly on OD based data and model-based methods for estimating train passenger demands. These approaches primarily focus on averaged disruption patterns, often overlooking the immediate uneven distribution… ▽ More

    Submitted 6 November, 2023; v1 submitted 22 August, 2023; originally announced August 2023.

    Comments: 42 pages,20 figures

  37. arXiv:2308.11627  [pdf, other

    eess.SP cs.AI cs.CV eess.IV eess.SY

    Non-Intrusive Electric Load Monitoring Approach Based on Current Feature Visualization for Smart Energy Management

    Authors: Yiwen Xu, Dengfeng Liu, Liangtao Huang, Zhiquan Lin, Tiesong Zhao, Sam Kwong

    Abstract: The state-of-the-art smart city has been calling for an economic but efficient energy management over large-scale network, especially for the electric power system. It is a critical issue to monitor, analyze and control electric loads of all users in system. In this paper, we employ the popular computer vision techniques of AI to design a non-invasive load monitoring method for smart electric ener… ▽ More

    Submitted 8 August, 2023; originally announced August 2023.

  38. arXiv:2308.01425  [pdf, ps, other

    eess.SP

    Exploiting Structured Sparsity with Low Complexity Sparse Bayesian Learning for RIS-assisted MIMO Channel Estimation

    Authors: W. Li, Z. Lin, Q. Guo, B. Vucetic

    Abstract: As an emerging communication auxiliary technology, reconfigurable intelligent surface (RIS) is expected to play a significant role in the upcoming 6G networks. Due to its total reflection characteristics, it is challenging to implement conventional channel estimation algorithms. This work focuses on RIS-assisted MIMO communications. Although many algorithms have been proposed to address this issue… ▽ More

    Submitted 2 August, 2023; originally announced August 2023.

  39. arXiv:2307.02779  [pdf, other

    cs.IT cs.LG cs.NI eess.SP

    Large Language Models Empowered Autonomous Edge AI for Connected Intelligence

    Authors: Yifei Shen, Jiawei Shao, Xinjie Zhang, Zehong Lin, Hao Pan, Dongsheng Li, Jun Zhang, Khaled B. Letaief

    Abstract: The evolution of wireless networks gravitates towards connected intelligence, a concept that envisions seamless interconnectivity among humans, objects, and intelligence in a hyper-connected cyber-physical world. Edge artificial intelligence (Edge AI) is a promising solution to achieve connected intelligence by delivering high-quality, low-latency, and privacy-preserving AI services at the network… ▽ More

    Submitted 25 December, 2023; v1 submitted 6 July, 2023; originally announced July 2023.

    Comments: IEEE Communication Magazine

  40. arXiv:2307.02011  [pdf, other

    eess.SP

    Precise WiFi Indoor Positioning using Deep Learning Algorithms

    Authors: Minxue Cai, Zihuai Lin

    Abstract: This study demonstrates a WiFi indoor positioning system using Deep Learning algorithms. A new method using fitting function in MATLAB will be utilized to compute the path loss coefficient and log-normal fading variance. To reduce the error, a new hybrid localization approach utilizing Received Signal Strength Indicator (RSSI) and Angle of Arrival (AoA) has been created. Three Deep Learning algori… ▽ More

    Submitted 4 July, 2023; originally announced July 2023.

  41. arXiv:2307.00307  [pdf, other

    eess.IV

    Spatio-Temporal Classification of Lung Ventilation Patterns using 3D EIT Images: A General Approach for Individualized Lung Function Evaluation

    Authors: Shuzhe Chen, Li Li, Zhichao Lin, Ke Zhang, Ying Gong, Lu Wang, Xu Wu, Maokun Li, Yuanlin Song, Fan Yang, Shenheng Xu

    Abstract: The Pulmonary Function Test (PFT) is an widely utilized and rigorous classification test for lung function evaluation, serving as a comprehensive tool for lung diagnosis. Meanwhile, Electrical Impedance Tomography (EIT) is a rapidly advancing clinical technique that visualizes conductivity distribution induced by ventilation. EIT provides additional spatial and temporal information on lung ventila… ▽ More

    Submitted 1 July, 2023; originally announced July 2023.

  42. arXiv:2306.15230  [pdf, other

    cs.IT eess.SP

    Probability of Error for Optimal Codes in a Reconfigurable Intelligent Surface Aided URLLC System

    Authors: Likun Sui, Zihuai Lin

    Abstract: The lower bound on the decoding error probability for the optimal code given a signal-to-noise ratio and a code rate are investigated in this letter for the reconfigurable intelligent surface (RIS) communication system over a Rician fading channel at the short blocklength regime, which is the key characteristic of ultra-reliable low-latency communications (URLLC) to meet the need for strict adhere… ▽ More

    Submitted 27 June, 2023; originally announced June 2023.

  43. You Only Train Once: Learning a General Anomaly Enhancement Network with Random Masks for Hyperspectral Anomaly Detection

    Authors: Zhaoxu Li, Yingqian Wang, Chao Xiao, Qiang Ling, Zaiping Lin, Wei An

    Abstract: In this paper, we introduce a new approach to address the challenge of generalization in hyperspectral anomaly detection (AD). Our method eliminates the need for adjusting parameters or retraining on new test scenes as required by most existing methods. Employing an image-level training paradigm, we achieve a general anomaly enhancement network for hyperspectral AD that only needs to be trained on… ▽ More

    Submitted 31 March, 2023; originally announced March 2023.

    Journal ref: TGRS 2023

  44. arXiv:2303.09456  [pdf, other

    eess.SY physics.data-an

    Modeling and Analysis on Efficiency Degradation of Lithium-ion Batteries

    Authors: Zihui Lin, Dagang Li

    Abstract: Efficiency of Battery Energy Storage Systems (BESSs) is increasingly critical as renewable energy generation becomes more prevalent on the grid. Therefore, it is necessary to study the energy efficiency of lithium-ion batteries, which are typically used in BESSs. The purpose of this study is to propose the State of Efficiency (SOE) as a measure of how efficiently batteries transfer energy, and to… ▽ More

    Submitted 19 March, 2023; v1 submitted 13 March, 2023; originally announced March 2023.

  45. arXiv:2302.13018  [pdf, other

    eess.SP

    Sparse Bayesian Learning-Based 3D Spectrum Environment Map Construction-Sampling Optimization, Scenario-Dependent Dictionary Construction and Sparse Recovery

    Authors: Jie Wang, Qiuming Zhu, Zhipeng Lin, Qihui Wu, Yang Huang, Xuezhao Cai, Weizhi Zhong, Yi Zhao

    Abstract: The spectrum environment map (SEM), which can visualize the information of invisible electromagnetic spectrum, is vital for monitoring, management, and security of spectrum resources in cognitive radio (CR) networks. In view of a limited number of spectrum sensors and constrained sampling time, this paper presents a new three-dimensional (3D) SEM construction scheme based on sparse Bayesian learni… ▽ More

    Submitted 25 February, 2023; originally announced February 2023.

    Comments: 13 pages, 13 figures

  46. arXiv:2302.01712  [pdf, other

    physics.optics eess.IV

    Transcending shift-invariance in the paraxial regime via end-to-end inverse design of freeform nanophotonics

    Authors: William F. Li, Gaurav Arya, Charles Roques-Carmes, Zin Lin, Steven G. Johnson, Marin Soljačić

    Abstract: Traditional optical elements and conventional metasurfaces obey shift-invariance in the paraxial regime. For imaging systems obeying paraxial shift-invariance, a small shift in input angle causes a corresponding shift in the sensor image. Shift-invariance has deep implications for the design and functionality of optical devices, such as the necessity of free space between components (as in compoun… ▽ More

    Submitted 3 February, 2023; originally announced February 2023.

    Journal ref: Optics Express 31 (2023) 24260-24272

  47. arXiv:2301.08843  [pdf, other

    cs.LG eess.SP

    Towards Flexibility and Interpretability of Gaussian Process State-Space Model

    Authors: Zhid Lin, Feng Yin, Juan Maroñas

    Abstract: The Gaussian process state-space model (GPSSM) has garnered considerable attention over the past decade. However, the standard GP with a preliminary kernel, such as the squared exponential kernel or Matérn kernel, that is commonly used in GPSSM studies, limits the model's representation power and substantially restricts its applicability to complex scenarios. To address this issue, we propose a ne… ▽ More

    Submitted 6 April, 2023; v1 submitted 20 January, 2023; originally announced January 2023.

    Comments: preprint

  48. arXiv:2301.06267  [pdf, other

    cs.CV cs.AI cs.LG cs.SD eess.AS

    Multimodality Helps Unimodality: Cross-Modal Few-Shot Learning with Multimodal Models

    Authors: Zhiqiu Lin, Samuel Yu, Zhiyi Kuang, Deepak Pathak, Deva Ramanan

    Abstract: The ability to quickly learn a new task with minimal instruction - known as few-shot learning - is a central aspect of intelligent agents. Classical few-shot benchmarks make use of few-shot samples from a single modality, but such samples may not be sufficient to characterize an entire concept class. In contrast, humans use cross-modal information to learn new concepts efficiently. In this work, w… ▽ More

    Submitted 27 August, 2024; v1 submitted 16 January, 2023; originally announced January 2023.

    Comments: Published at CVPR 2023. Project site: https://linzhiqiu.github.io/papers/cross_modal/

  49. arXiv:2301.01887  [pdf, other

    eess.SP cs.HC

    A Novel Exploitative and Explorative GWO-SVM Algorithm for Smart Emotion Recognition

    Authors: Xucun Yan, Zihuai Lin, Zhiyun Lin, Branka Vucetic

    Abstract: Emotion recognition or detection is broadly utilized in patient-doctor interactions for diseases such as schizophrenia and autism and the most typical techniques are speech detection and facial recognition. However, features extracted from these behavior-based emotion recognitions are not reliable since humans can disguise their emotions. Recording voices or tracking facial expressions for a long… ▽ More

    Submitted 4 January, 2023; originally announced January 2023.

  50. arXiv:2212.07731  [pdf, other

    cs.IT eess.SP

    Quantum Sensing Based Joint 3D Beam Training for UAV-mounted STAR-RIS Aided TeraHertz Multi-user Massive MIMO Systems

    Authors: Xufang Wang, Zihuai Lin, Feng Lin, Pei Xiao

    Abstract: Terahertz (THz) systems are capable of supporting ultra-high data rates thanks to large bandwidth, and the potential to harness high-gain beamforming to combat high pathloss. In this paper, a novel quantum sensing (Ghost Imaging (GI)) based beam training is proposed for Simultaneously Transmitting and Reflecting Reconfigurable Intelligent Surface (STAR RIS) aided THz multi-user massive MIMO system… ▽ More

    Submitted 15 December, 2022; originally announced December 2022.

    Comments: arXiv admin note: text overlap with arXiv:2201.10757