Skip to main content

Showing 1–50 of 135 results for author: Li, N

Searching in archive eess. Search in all archives.
.
  1. arXiv:2407.10408  [pdf, other

    cs.IT eess.SP

    Latency Minimization for IRS-enhanced Wideband MEC Networks with Practical Reflection Model

    Authors: N. Li, W. Hao, X. Li, Z. Zhu, Z. Tang, S. Yang

    Abstract: Intelligent reflecting surface (IRS) has been considered as an efficient way to boost the computation capability of mobile edge computing (MEC) system, especially when the communication links is blocked or the communication signal is weak. However, most existing works are restricted to narrow-band channel and ideal IRS reflection model, which is not practical and may lead to significant performanc… ▽ More

    Submitted 14 July, 2024; originally announced July 2024.

    Comments: 13 pages, 9 figures

  2. arXiv:2406.18069  [pdf, other

    eess.SP cs.AI cs.CL

    Large Language Models for Cuffless Blood Pressure Measurement From Wearable Biosignals

    Authors: Zengding Liu, Chen Chen, Jiannong Cao, Minglei Pan, Jikui Liu, Nan Li, Fen Miao, Ye Li

    Abstract: Large language models (LLMs) have captured significant interest from both academia and industry due to their impressive performance across various textual tasks. However, the potential of LLMs to analyze physiological time-series data remains an emerging research field. Particularly, there is a notable gap in the utilization of LLMs for analyzing wearable biosignals to achieve cuffless blood press… ▽ More

    Submitted 4 July, 2024; v1 submitted 26 June, 2024; originally announced June 2024.

  3. arXiv:2406.05325  [pdf, other

    eess.AS cs.SD

    LDM-SVC: Latent Diffusion Model Based Zero-Shot Any-to-Any Singing Voice Conversion with Singer Guidance

    Authors: Shihao Chen, Yu Gu, Jie Zhang, Na Li, Rilin Chen, Liping Chen, Lirong Dai

    Abstract: Any-to-any singing voice conversion (SVC) is an interesting audio editing technique, aiming to convert the singing voice of one singer into that of another, given only a few seconds of singing data. However, during the conversion process, the issue of timbre leakage is inevitable: the converted singing voice still sounds like the original singer's voice. To tackle this, we propose a latent diffusi… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: Accepted by Interspeech 2024

  4. arXiv:2406.04243  [pdf, other

    math.OC eess.SY math.DG

    Policy Optimization in Control: Geometry and Algorithmic Implications

    Authors: Shahriar Talebi, Yang Zheng, Spencer Kraisler, Na Li, Mehran Mesbahi

    Abstract: This survey explores the geometric perspective on policy optimization within the realm of feedback control systems, emphasizing the intrinsic relationship between control design and optimization. By adopting a geometric viewpoint, we aim to provide a nuanced understanding of how various ``complete parameterization'' -- referring to the policy parameters together with its Riemannian geometry -- of… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

  5. arXiv:2405.12031  [pdf, other

    cs.SD eess.AS

    Neighborhood Attention Transformer with Progressive Channel Fusion for Speaker Verification

    Authors: Nian Li, Jianguo Wei

    Abstract: Transformer-based architectures for speaker verification typically require more training data than ECAPA-TDNN. Therefore, recent work has generally been trained on VoxCeleb1&2. We propose a backbone network based on self-attention, which can achieve competitive results when trained on VoxCeleb2 alone. The network alternates between neighborhood attention and global attention to capture local and g… ▽ More

    Submitted 29 May, 2024; v1 submitted 20 May, 2024; originally announced May 2024.

    Comments: 8 pages, 2 figures, 3 tables; added github link

  6. arXiv:2405.07281  [pdf, ps, other

    eess.SP

    Movable Antennas Aided Multicast MISO Communication Systems

    Authors: Zhenqiao Cheng, Nanxi Li, Ruizhe Long, Jianchi Zhu, Chongjun Ouyang, Peng Chen

    Abstract: A novel multicast communication system with movable antennas (MAs) is proposed, where the antenna position optimization is exploited to enhance the transmission rate. Specifically, an MA-assisted two-user multicast multiple-input single-input system is considered. The joint optimization of the transmit beamforming vector and transmit MA positions is studied by modeling the motion of the MA element… ▽ More

    Submitted 12 May, 2024; originally announced May 2024.

    Comments: 5 pages

  7. arXiv:2405.06089  [pdf, other

    eess.SY cs.IT cs.LG

    Learning Low-dimensional Latent Dynamics from High-dimensional Observations: Non-asymptotics and Lower Bounds

    Authors: Yuyang Zhang, Shahriar Talebi, Na Li

    Abstract: In this paper, we focus on learning a linear time-invariant (LTI) model with low-dimensional latent variables but high-dimensional observations. We provide an algorithm that recovers the high-dimensional features, i.e. column space of the observer, embeds the data into low dimensions and learns the low-dimensional model parameters. Our algorithm enjoys a sample complexity guarantee of order… ▽ More

    Submitted 25 June, 2024; v1 submitted 9 May, 2024; originally announced May 2024.

  8. arXiv:2405.05787  [pdf, other

    cs.RO cs.CV eess.SY

    Autonomous Robotic Ultrasound System for Liver Follow-up Diagnosis: Pilot Phantom Study

    Authors: Tianpeng Zhang, Sekeun Kim, Jerome Charton, Haitong Ma, Kyungsang Kim, Na Li, Quanzheng Li

    Abstract: The paper introduces a novel autonomous robot ultrasound (US) system targeting liver follow-up scans for outpatients in local communities. Given a computed tomography (CT) image with specific target regions of interest, the proposed system carries out the autonomous follow-up scan in three steps: (i) initial robot contact to surface, (ii) coordinate mapping between CT image and robot, and (iii) ta… ▽ More

    Submitted 9 May, 2024; originally announced May 2024.

  9. arXiv:2405.05353  [pdf, other

    eess.SY

    Eco-driving Accounting for Interactive Cut-in Vehicles

    Authors: Chaozhe R. He, Nan Li

    Abstract: Automated vehicles can gather information about surrounding traffic and plan safe and energy-efficient driving behavior, which is known as eco-driving. Conventional eco-driving designs only consider preceding vehicles in the same lane as the ego vehicle. In heavy traffic, however, vehicles in adjacent lanes may cut into the ego vehicle's lane, influencing the ego vehicle's eco-driving behavior and… ▽ More

    Submitted 8 May, 2024; originally announced May 2024.

    Comments: Accepted at 2024 IEEE International Conference on Mobility: Operations, Services, and Technologies (MOST)

  10. arXiv:2404.15278  [pdf, other

    eess.SP cs.CR cs.NI

    Security-Sensitive Task Offloading in Integrated Satellite-Terrestrial Networks

    Authors: Wenjun Lan, Kongyang Chen, Jiannong Cao, Yikai Li, Ning Li, Qi Chen, Yuvraj Sahni

    Abstract: With the rapid development of sixth-generation (6G) communication technology, global communication networks are moving towards the goal of comprehensive and seamless coverage. In particular, low earth orbit (LEO) satellites have become a critical component of satellite communication networks. The emergence of LEO satellites has brought about new computational resources known as the \textit{LEO sat… ▽ More

    Submitted 20 January, 2024; originally announced April 2024.

  11. arXiv:2404.13605  [pdf, other

    cs.CV eess.IV

    Turb-Seg-Res: A Segment-then-Restore Pipeline for Dynamic Videos with Atmospheric Turbulence

    Authors: Ripon Kumar Saha, Dehao Qin, Nianyi Li, Jinwei Ye, Suren Jayasuriya

    Abstract: Tackling image degradation due to atmospheric turbulence, particularly in dynamic environment, remains a challenge for long-range imaging systems. Existing techniques have been primarily designed for static scenes or scenes with small motion. This paper presents the first segment-then-restore pipeline for restoring the videos of dynamic scenes in turbulent environment. We leverage mean optical flo… ▽ More

    Submitted 21 April, 2024; originally announced April 2024.

    Comments: CVPR 2024 Paper

  12. arXiv:2404.12163  [pdf, other

    eess.IV cs.CV

    Unsupervised Microscopy Video Denoising

    Authors: Mary Aiyetigbo, Alexander Korte, Ethan Anderson, Reda Chalhoub, Peter Kalivas, Feng Luo, Nianyi Li

    Abstract: In this paper, we introduce a novel unsupervised network to denoise microscopy videos featured by image sequences captured by a fixed location microscopy camera. Specifically, we propose a DeepTemporal Interpolation method, leveraging a temporal signal filter integrated into the bottom CNN layers, to restore microscopy videos corrupted by unknown noise types. Our unsupervised denoising architectur… ▽ More

    Submitted 17 April, 2024; originally announced April 2024.

    Comments: Accepted at CVPRW 2024

  13. arXiv:2403.14935  [pdf, ps, other

    math.OC eess.SY

    Data-Driven Predictive Control with Adaptive Disturbance Attenuation for Constrained Systems

    Authors: Nan Li, Ilya Kolmanovsky, Hong Chen

    Abstract: In this paper, we propose a novel data-driven predictive control approach for systems subject to time-domain constraints. The approach combines the strengths of H-infinity control for rejecting disturbances and MPC for handling constraints. In particular, the approach can dynamically adapt H-infinity disturbance attenuation performance depending on measured system state and forecasted disturbance… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

    Comments: 11 pages, 2 figures

  14. arXiv:2403.12215  [pdf, other

    eess.SY

    Aggregate Peak EV Charging Demand: The Impact of Segmented Network Tariffs

    Authors: Nanda Kishor Panda, Na Li, Simon H. Tindemans

    Abstract: Aggregate peak Electric Vehicle (EV) charging demand is a matter of growing concern for network operators as it severely limits the network's capacity, preventing its reliable operation. Various tariff schemes have been proposed to limit peak demand by incentivizing flexible asset users to shift their demand from peak periods. However, fewer studies quantify the effect of these tariff schemes on t… ▽ More

    Submitted 2 April, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

    Comments: 6 pages, 4 figures, 2 columns

  15. Exploring the Capabilities and Limitations of Large Language Models in the Electric Energy Sector

    Authors: Subir Majumder, Lin Dong, Fatemeh Doudi, Yuting Cai, Chao Tian, Dileep Kalathi, Kevin Ding, Anupam A. Thatte, Na Li, Le Xie

    Abstract: Large Language Models (LLMs) as chatbots have drawn remarkable attention thanks to their versatile capability in natural language processing as well as in a wide range of tasks. While there has been great enthusiasm towards adopting such foundational model-based artificial intelligence tools in all sectors possible, the capabilities and limitations of such LLMs in improving the operation of the el… ▽ More

    Submitted 20 June, 2024; v1 submitted 14 March, 2024; originally announced March 2024.

  16. arXiv:2403.09030  [pdf

    cs.SD cs.LG eess.AS

    An AI-Driven Approach to Wind Turbine Bearing Fault Diagnosis from Acoustic Signals

    Authors: Zhao Wang, Xiaomeng Li, Na Li, Longlong Shu

    Abstract: This study aimed to develop a deep learning model for the classification of bearing faults in wind turbine generators from acoustic signals. A convolutional LSTM model was successfully constructed and trained by using audio data from five predefined fault types for both training and validation. To create the dataset, raw audio signal data was collected and processed in frames to capture time and f… ▽ More

    Submitted 13 March, 2024; originally announced March 2024.

  17. arXiv:2403.05772  [pdf, other

    cs.SD cs.NE eess.AS

    sVAD: A Robust, Low-Power, and Light-Weight Voice Activity Detection with Spiking Neural Networks

    Authors: Qu Yang, Qianhui Liu, Nan Li, Meng Ge, Zeyang Song, Haizhou Li

    Abstract: Speech applications are expected to be low-power and robust under noisy conditions. An effective Voice Activity Detection (VAD) front-end lowers the computational need. Spiking Neural Networks (SNNs) are known to be biologically plausible and power-efficient. However, SNN-based VADs have yet to achieve noise robustness and often require large models for high performance. This paper introduces a no… ▽ More

    Submitted 8 March, 2024; originally announced March 2024.

    Comments: Accepted by ICASSP 2024

  18. arXiv:2402.01808  [pdf, other

    cs.SD eess.AS

    KS-Net: Multi-band joint speech restoration and enhancement network for 2024 ICASSP SSI Challenge

    Authors: Guochen Yu, Runqiang Han, Chenglin Xu, Haoran Zhao, Nan Li, Chen Zhang, Xiguang Zheng, Chao Zhou, Qi Huang, Bing Yu

    Abstract: This paper presents the speech restoration and enhancement system created by the 1024K team for the ICASSP 2024 Speech Signal Improvement (SSI) Challenge. Our system consists of a generative adversarial network (GAN) in complex-domain for speech restoration and a fine-grained multi-band fusion module for speech enhancement. In the blind test set of SSI, the proposed system achieves an overall mean… ▽ More

    Submitted 2 February, 2024; originally announced February 2024.

    Comments: Accepted to ICASSP 2024; Rank 1st in ICASSP 2024 Speech Signal Improvement (SSI) Challenge

  19. arXiv:2401.17575  [pdf, other

    eess.SP

    Can We Improve Channel Reciprocity via Loop-back Compensation for RIS-assisted Physical Layer Key Generation

    Authors: Ningya Xu, Guoshun Nan, Xiaofeng Tao, Na Li, Pengxuan Mao, Tianyuan Yang

    Abstract: Reconfigurable intelligent surface (RIS) facilitates the extraction of unpredictable channel features for physical layer key generation (PKG), securing communications among legitimate users with symmetric keys. Previous works have demonstrated that channel reciprocity plays a crucial role in generating symmetric keys in PKG systems, whereas, in reality, reciprocity is greatly affected by hardware… ▽ More

    Submitted 30 April, 2024; v1 submitted 30 January, 2024; originally announced January 2024.

    Comments: Accepted by ICC 2024

  20. arXiv:2401.16183  [pdf, ps, other

    eess.SY

    Scalable Reinforcement Learning for Linear-Quadratic Control of Networks

    Authors: Johan Olsson, Runyu Zhang, Emma Tegling, Na Li

    Abstract: Distributed optimal control is known to be challenging and can become intractable even for linear-quadratic regulator problems. In this work, we study a special class of such problems where distributed state feedback controllers can give near-optimal performance. More specifically, we consider networked linear-quadratic controllers with decoupled costs and spatially exponentially decaying dynamics… ▽ More

    Submitted 13 March, 2024; v1 submitted 29 January, 2024; originally announced January 2024.

    Comments: 8 pages, 4 figures

  21. arXiv:2401.12167  [pdf, other

    eess.IV cs.AI cs.LG

    Dynamic Semantic Compression for CNN Inference in Multi-access Edge Computing: A Graph Reinforcement Learning-based Autoencoder

    Authors: Nan Li, Alexandros Iosifidis, Qi Zhang

    Abstract: This paper studies the computational offloading of CNN inference in dynamic multi-access edge computing (MEC) networks. To address the uncertainties in communication time and computation resource availability, we propose a novel semantic compression method, autoencoder-based CNN architecture (AECNN), for effective semantic extraction and compression in partial offloading. In the semantic encoder,… ▽ More

    Submitted 19 January, 2024; originally announced January 2024.

    Comments: arXiv admin note: text overlap with arXiv:2211.13745

  22. arXiv:2312.14018  [pdf, ps, other

    eess.SP

    Enabling Secure Wireless Communications via Movable Antennas

    Authors: Zhenqiao Cheng, Nanxi Li, Jianchi Zhu, Xiaoming She, Chongjun Ouyang, Peng Chen

    Abstract: A pioneering secure transmission scheme is proposed, which harnesses movable antennas (MAs) to optimize antenna positions for augmenting the physical layer security. Particularly, an MA-enabled secure wireless system is considered, where a multi-antenna transmitter communicates with a single-antenna receiver in the presence of an eavesdropper. The beamformer and antenna positions at the transmitte… ▽ More

    Submitted 21 December, 2023; originally announced December 2023.

    Comments: Accepted by IEEE ICASSP 2024

  23. arXiv:2312.13722  [pdf, other

    cs.SD eess.AS

    BAE-Net: A Low complexity and high fidelity Bandwidth-Adaptive neural network for speech super-resolution

    Authors: Guochen Yu, Xiguang Zheng, Nan Li, Runqiang Han, Chengshi Zheng, Chen Zhang, Chao Zhou, Qi Huang, Bing Yu

    Abstract: Speech bandwidth extension (BWE) has demonstrated promising performance in enhancing the perceptual speech quality in real communication systems. Most existing BWE researches primarily focus on fixed upsampling ratios, disregarding the fact that the effective bandwidth of captured audio may fluctuate frequently due to various capturing devices and transmission conditions. In this paper, we propose… ▽ More

    Submitted 21 December, 2023; originally announced December 2023.

    Comments: Accepted to ICASSP 2024

  24. arXiv:2312.05724  [pdf, other

    eess.SY math.OC

    Minimum-Time Trajectory Optimization With Data-Based Models: A Linear Programming Approach

    Authors: Nan Li, Ehsan Taheri, Ilya Kolmanovsky, Dimitar Filev

    Abstract: In this paper, we develop a computationally-efficient approach to minimum-time trajectory optimization using input-output data-based models, to produce an end-to-end data-to-control solution to time-optimal planning/control of dynamic systems and hence facilitate their autonomous operation. The approach integrates a non-parametric data-based model for trajectory prediction and a continuous optimiz… ▽ More

    Submitted 9 December, 2023; originally announced December 2023.

    Comments: 11 pages, 4 figures

  25. arXiv:2312.05332  [pdf, other

    eess.SY cs.LG cs.RO math.OC

    MPC-Inspired Reinforcement Learning for Verifiable Model-Free Control

    Authors: Yiwen Lu, Zishuo Li, Yihan Zhou, Na Li, Yilin Mo

    Abstract: In this paper, we introduce a new class of parameterized controllers, drawing inspiration from Model Predictive Control (MPC). The controller resembles a Quadratic Programming (QP) solver of a linear MPC problem, with the parameters of the controller being trained via Deep Reinforcement Learning (DRL) rather than derived from system models. This approach addresses the limitations of common control… ▽ More

    Submitted 9 April, 2024; v1 submitted 8 December, 2023; originally announced December 2023.

  26. Channel Estimation and Training Design for Active RIS Aided Wireless Communications

    Authors: Hao Chen, Nanxi Li, Ruizhe Long, Ying-Chang Liang

    Abstract: Active reconfigurable intelligent surface (ARIS) is a newly emerging RIS technique that leverages radio frequency (RF) reflection amplifiers to empower phase-configurable reflection elements (REs) in amplifying the incident signal. Thereby, ARIS can enhance wireless communications with the strengthened ARIS-aided links. In this letter, we propose exploiting the signal amplification capability of A… ▽ More

    Submitted 6 November, 2023; originally announced November 2023.

    Comments: This paper has been accepted for publication in IEEE Wireless Communications Letters

    Journal ref: IEEE Wireless Communications Letters, early access, 2023

  27. arXiv:2310.12507  [pdf, other

    eess.IV

    Multi-granularity Backprojection Transformer for Remote Sensing Image Super-Resolution

    Authors: Jinglei Hao, Wukai Li, Binglu Wang, Shunzhou Wang, Yuting Lu, Ning Li, Yongqiang Zhao

    Abstract: Backprojection networks have achieved promising super-resolution performance for nature images but not well be explored in the remote sensing image super-resolution (RSISR) field due to the high computation costs. In this paper, we propose a Multi-granularity Backprojection Transformer termed MBT for RSISR. MBT incorporates the backprojection learning strategy into a Transformer framework. It cons… ▽ More

    Submitted 19 October, 2023; originally announced October 2023.

  28. arXiv:2310.05508  [pdf, other

    eess.SY math.OC

    A Comparison between Markov Chain and Koopman Operator Based Data-Driven Modeling of Dynamical Systems

    Authors: Saeid Tafazzol, Nan Li, Ilya Kolmanovsky, Dimitar Filev

    Abstract: Markov chain-based modeling and Koopman operator-based modeling are two popular frameworks for data-driven modeling of dynamical systems. They share notable similarities from a computational and practitioner's perspective, especially for modeling autonomous systems. The first part of this paper aims to elucidate these similarities. For modeling systems with control inputs, the models produced by t… ▽ More

    Submitted 1 April, 2024; v1 submitted 9 October, 2023; originally announced October 2023.

  29. arXiv:2309.12596  [pdf, ps, other

    eess.SP

    Movable Antenna-Empowered AirComp

    Authors: Zhenqiao Cheng, Nanxi Li, Jianchi Zhu, Xiaoming She, Chongjun Ouyang, Peng Chen

    Abstract: A novel over-the-air computation (AirComp) framework, empowered by the incorporation of movable antennas (MAs), is proposed to significantly enhance computation accuracy. Within this framework, the joint optimization of transmit power control, antenna positioning, and receive combining is investigated. An efficient method is proposed to tackle the problem of computation mean-squared error (MSE) mi… ▽ More

    Submitted 21 September, 2023; originally announced September 2023.

  30. arXiv:2309.11135  [pdf, ps, other

    eess.SP

    Sum-Rate Maximization for Movable Antenna Enabled Multiuser Communications

    Authors: Zhenqiao Cheng, Nanxi Li, Jianchi Zhu, Chongjun Ouyang

    Abstract: A novel multiuser communication system with movable antennas (MAs) is proposed, where the antenna position optimization is exploited to enhance the downlink sum-rate. The joint optimization of the transmit beamforming vector and transmit MA positions is studied for a multiuser multiple-input single-input system. An efficient algorithm is proposed to tackle the formulated non-convex problem via cap… ▽ More

    Submitted 22 September, 2023; v1 submitted 20 September, 2023; originally announced September 2023.

    Comments: 11 pages

  31. arXiv:2309.05929  [pdf

    eess.IV cs.CV

    Introducing Shape Prior Module in Diffusion Model for Medical Image Segmentation

    Authors: Zhiqing Zhang, Guojia Fan, Tianyong Liu, Nan Li, Yuyang Liu, Ziyu Liu, Canwei Dong, Shoujun Zhou

    Abstract: Medical image segmentation is critical for diagnosing and treating spinal disorders. However, the presence of high noise, ambiguity, and uncertainty makes this task highly challenging. Factors such as unclear anatomical boundaries, inter-class similarities, and irrational annotations contribute to this challenge. Achieving both accurate and diverse segmentation templates is essential to support ra… ▽ More

    Submitted 11 September, 2023; originally announced September 2023.

  32. arXiv:2309.03471  [pdf, other

    cs.IT eess.SP

    Resource Management for IRS-assisted WP-MEC Networks with Practical Phase Shift Model

    Authors: Nana Li, Wanming Hao, Fuhui Zhou, Zheng Chu, Shouyi Yang, Pei Xiao

    Abstract: Wireless powered mobile edge computing (WP-MEC) has been recognized as a promising solution to enhance the computational capability and sustainable energy supply for low-power wireless devices (WDs). However, when the communication links between the hybrid access point (HAP) and WDs are hostile, the energy transfer efficiency and task offloading rate are compromised. To tackle this problem, we pro… ▽ More

    Submitted 7 September, 2023; originally announced September 2023.

    Comments: 15 pages, 14 figures

  33. arXiv:2308.03268  [pdf, other

    eess.SY

    Towards Carbon-Free Electricity: A Flow-Based Framework for Power Grid Carbon Accounting and Decarbonization

    Authors: Xin Chen, Hungpo Chao, Wenbo Shi, Na Li

    Abstract: This paper introduces a comprehensive framework aimed at advancing research and policy development in the realm of decarbonization within electric power systems. The framework focuses on three key aspects: carbon accounting, carbon-aware decision-making, and carbon-electricity market design. It addresses existing problems, methods, and proposes solutions. In contrast to traditional pool-based emis… ▽ More

    Submitted 28 November, 2023; v1 submitted 6 August, 2023; originally announced August 2023.

  34. arXiv:2308.03240  [pdf, other

    math.OC eess.SY

    Carbon-Aware Optimal Power Flow

    Authors: Xin Chen, Andy Sun, Wenbo Shi, Na Li

    Abstract: To facilitate effective decarbonization of the electric power sector, this paper introduces the generic Carbon-aware Optimal Power Flow (C-OPF) method for power system decision-making that considers demand-side carbon accounting and emission management. Built upon the classic optimal power flow (OPF) model, the C-OPF method incorporates carbon emission flow equations and constraints, as well as ca… ▽ More

    Submitted 17 July, 2024; v1 submitted 6 August, 2023; originally announced August 2023.

  35. arXiv:2307.00179  [pdf, other

    cs.CV eess.IV

    Unsupervised Coordinate-Based Video Denoising

    Authors: Mary Damilola Aiyetigbo, Dineshchandar Ravichandran, Reda Chalhoub, Peter Kalivas, Nianyi Li

    Abstract: In this paper, we introduce a novel unsupervised video denoising deep learning approach that can help to mitigate data scarcity issues and shows robustness against different noise patterns, enhancing its broad applicability. Our method comprises three modules: a Feature generator creating features maps, a Denoise-Net generating denoised but slightly blurry reference frames, and a Refine-Net re-int… ▽ More

    Submitted 30 June, 2023; originally announced July 2023.

  36. arXiv:2306.10369  [pdf, other

    math.OC eess.SY stat.ML

    Non-asymptotic System Identification for Linear Systems with Nonlinear Policies

    Authors: Yingying Li, Tianpeng Zhang, Subhro Das, Jeff Shamma, Na Li

    Abstract: This paper considers a single-trajectory system identification problem for linear systems under general nonlinear and/or time-varying policies with i.i.d. random excitation noises. The problem is motivated by safe learning-based control for constrained linear systems, where the safe policies during the learning process are usually nonlinear and time-varying for satisfying the state and input const… ▽ More

    Submitted 17 June, 2023; originally announced June 2023.

  37. arXiv:2305.17860  [pdf, other

    cs.SD eess.AS

    speech and noise dual-stream spectrogram refine network with speech distortion loss for robust speech recognition

    Authors: Haoyu Lu, Nan Li, Tongtong Song, Longbiao Wang, Jianwu Dang, Xiaobao Wang, Shiliang Zhang

    Abstract: In recent years, the joint training of speech enhancement front-end and automatic speech recognition (ASR) back-end has been widely used to improve the robustness of ASR systems. Traditional joint training methods only use enhanced speech as input for the backend. However, it is difficult for speech enhancement systems to directly separate speech from input due to the diverse types of noise with d… ▽ More

    Submitted 30 May, 2023; v1 submitted 28 May, 2023; originally announced May 2023.

  38. arXiv:2305.10821  [pdf, other

    eess.AS

    Locate and Beamform: Two-dimensional Locating All-neural Beamformer for Multi-channel Speech Separation

    Authors: Yanjie Fu, Meng Ge, Honglong Wang, Nan Li, Haoran Yin, Longbiao Wang, Gaoyan Zhang, Jianwu Dang, Chengyun Deng, Fei Wang

    Abstract: Recently, stunning improvements on multi-channel speech separation have been achieved by neural beamformers when direction information is available. However, most of them neglect to utilize speaker's 2-dimensional (2D) location cues contained in mixture signal, which limits the performance when two sources come from close directions. In this paper, we propose an end-to-end beamforming network for… ▽ More

    Submitted 2 June, 2023; v1 submitted 18 May, 2023; originally announced May 2023.

    Comments: Accepted by Interspeech 2023. arXiv admin note: substantial text overlap with arXiv:2212.03401

  39. arXiv:2305.09991  [pdf, ps, other

    eess.SY

    Angle-based formation stabilization and maneuvers in port-Hamiltonian form with bearing and velocity measurements

    Authors: Ningbo Li, Pablo Borja, Arjan van der Schaft, Jacquelien M. A. Scherpen

    Abstract: This paper proposes a port-Hamiltonian framework for angle-based formation stabilization and maneuvers using bearing and velocity measurements with an underlying triangulated Laman graph. The corresponding port-Hamiltonian controller is designed using virtual couplings on the errors of angle constraints in angle space and then the angle constraints and agent actuators are mapped by the constraint… ▽ More

    Submitted 17 May, 2023; originally announced May 2023.

  40. arXiv:2305.09964  [pdf, ps, other

    eess.SY

    A port-Hamiltonian framework for displacement-based and rigid formation tracking

    Authors: Ningbo Li, Zhiyong Sun, Arjan van der Schaft, Jacquelien M. A. Scherpen

    Abstract: This paper proposes a passivity-based port-Hamiltonian (pH) framework for multi-agent displacement-based and rigid formation control and velocity tracking. The control law consists of two parts, where the internal feedback is to track the velocity and the external feedback is to achieve formation stabilization by steering variables of neighboring agents that prescribe the desired geometric shape.… ▽ More

    Submitted 17 May, 2023; originally announced May 2023.

  41. arXiv:2304.07984  [pdf, other

    eess.SY math.OC

    A Unified Safety Protection and Extension Governor

    Authors: Nan Li, Yutong Li, Ilya Kolmanovsky

    Abstract: In this paper, we propose a supervisory control scheme that unifies the abilities of safety protection and safety extension. It produces a control that is able to keep the system safe indefinitely when such a control exists. When such a control does not exist due to abnormal system states, it optimizes the control to maximize the time before any safety violation, which translates into more time to… ▽ More

    Submitted 17 April, 2023; originally announced April 2023.

    Comments: 8 pages, 4 figures

  42. arXiv:2304.03677  [pdf, other

    eess.SY

    Scheduling Dosage of Proton Pump Inhibitors Using Constrained Optimization With Gastric Acid Secretion Model

    Authors: Yutong Li, Nan Li, Anouck Girard, Ilya Kolmanovsky

    Abstract: Dosage schedule of the Proton Pump Inhibitors (PPIs) is critical for gastric acid disorder treatment. In this paper, we develop a constrained optimization based approach for scheduling the PPIs dosage. In particular, we exploit a mathematical prediction model describing the gastric acid secretion, and use it within the optimization algorithm to predict the acid level. The dosage of the PPIs which… ▽ More

    Submitted 7 April, 2023; originally announced April 2023.

  43. arXiv:2303.06858  [pdf, other

    math.OC eess.SY

    Continuous-Time Zeroth-Order Dynamics with Projection Maps: Model-Free Feedback Optimization with Safety Guarantees

    Authors: Xin Chen, Jorge I. Poveda, Na Li

    Abstract: This paper introduces a class of model-free feedback methods for solving generic constrained optimization problems where the specific mathematical forms of the objective and constraint functions are not available. The proposed methods, termed Projected Zeroth-Order (P-ZO) dynamics, incorporate projection maps into a class of continuous-time model-free dynamics that make use of periodic dithering f… ▽ More

    Submitted 13 March, 2023; originally announced March 2023.

    Comments: 17 pages

  44. arXiv:2301.08445  [pdf, other

    math.OC eess.SY

    Online switching control with stability and regret guarantees

    Authors: Yingying Li, James A. Preiss, Na Li, Yiheng Lin, Adam Wierman, Jeff Shamma

    Abstract: This paper considers online switching control with a finite candidate controller pool, an unknown dynamical system, and unknown cost functions. The candidate controllers can be unstabilizing policies. We only require at least one candidate controller to satisfy certain stability properties, but we do not know which one is stabilizing. We design an online algorithm that guarantees finite-gain stabi… ▽ More

    Submitted 23 January, 2023; v1 submitted 20 January, 2023; originally announced January 2023.

  45. arXiv:2301.05599  [pdf, other

    q-bio.NC cs.AI cs.LG eess.SP

    Short-length SSVEP data extension by a novel generative adversarial networks based framework

    Authors: Yudong Pan, Ning Li, Yangsong Zhang, Peng Xu, Dezhong Yao

    Abstract: Steady-state visual evoked potentials (SSVEPs) based brain-computer interface (BCI) has received considerable attention due to its high information transfer rate (ITR) and available quantity of targets. However, the performance of frequency identification methods heavily hinges on the amount of user calibration data and data length, which hinders the deployment in real-world applications. Recently… ▽ More

    Submitted 2 October, 2023; v1 submitted 13 January, 2023; originally announced January 2023.

    Comments: 16 pages, 9 figures, 4 tables

  46. arXiv:2211.16291  [pdf, other

    math.OC eess.SY

    On Controller Reduction in Linear Quadratic Gaussian Control with Performance Bounds

    Authors: Zhaolin Ren, Yang Zheng, Maryam Fazel, Na Li

    Abstract: The problem of controller reduction has a rich history in control theory. Yet, many questions remain open. In particular, there exist very few results on the order reduction of general non-observer based controllers and the subsequent quantification of the closed-loop performance. Recent developments in model-free policy optimization for Linear Quadratic Gaussian (LQG) control have highlighted the… ▽ More

    Submitted 29 November, 2022; originally announced November 2022.

  47. arXiv:2211.12628  [pdf, other

    eess.SY cs.AI math.OC

    Safe Control and Learning Using Generalized Action Governor

    Authors: Nan Li, Yutong Li, Ilya Kolmanovsky, Anouck Girard, H. Eric Tseng, Dimitar Filev

    Abstract: This paper introduces the Generalized Action Governor, which is a supervisory scheme for augmenting a nominal closed-loop system with the capability of strictly handling constraints. After presenting its theory for general systems and introducing tailored design approaches for linear and discrete systems, we discuss its application to safe online learning, which aims to safely evolve control param… ▽ More

    Submitted 22 November, 2022; originally announced November 2022.

    Comments: 10 pages, 4 figures

  48. arXiv:2211.04767  [pdf, other

    eess.IV

    Multimodal Remote Sensing Image Registration Based on Adaptive Multi-scale PIIFD

    Authors: Ning Li, Yuxuan Li, Jichao jiao

    Abstract: In recent years, due to the wide application of multi-sensor vision systems, multimodal image acquisition technology has continued to develop, and the registration problem based on multimodal images has gradually emerged. Most of the existing multimodal image registration methods are only suitable for two modalities, and cannot uniformly register multiple modal image data. Therefore, this paper pr… ▽ More

    Submitted 9 November, 2022; originally announced November 2022.

  49. arXiv:2210.05254  [pdf, other

    cs.SD cs.AI eess.AS

    Deep Spectro-temporal Artifacts for Detecting Synthesized Speech

    Authors: Xiaohui Liu, Meng Liu, Lin Zhang, Linjuan Zhang, Chang Zeng, Kai Li, Nan Li, Kong Aik Lee, Longbiao Wang, Jianwu Dang

    Abstract: The Audio Deep Synthesis Detection (ADD) Challenge has been held to detect generated human-like speech. With our submitted system, this paper provides an overall assessment of track 1 (Low-quality Fake Audio Detection) and track 2 (Partially Fake Audio Detection). In this paper, spectro-temporal artifacts were detected using raw temporal signals, spectral features, as well as deep embedding featur… ▽ More

    Submitted 11 October, 2022; originally announced October 2022.

    Comments: 7 pages, 1 figures, Accecpted by Proceedings of the 1st International Workshop on Deepfake Detection for Audio Multimedia

  50. arXiv:2210.05092  [pdf, other

    cs.SD eess.AS

    The DKU-Tencent System for the VoxCeleb Speaker Recognition Challenge 2022

    Authors: Xiaoyi Qin, Na Li, Yuke Lin, Yiwei Ding, Chao Weng, Dan Su, Ming Li

    Abstract: This paper is the system description of the DKU-Tencent System for the VoxCeleb Speaker Recognition Challenge 2022 (VoxSRC22). In this challenge, we focus on track1 and track3. For track1, multiple backbone networks are adopted to extract frame-level features. Since track1 focus on the cross-age scenarios, we adopt the cross-age trials and perform QMF to calibrate score. The magnitude-based qualit… ▽ More

    Submitted 10 October, 2022; originally announced October 2022.