Zum Hauptinhalt springen

Showing 1–50 of 255 results for author: Sun, J

Searching in archive eess. Search in all archives.
.
  1. arXiv:2408.10110  [pdf

    physics.optics eess.SP physics.app-ph

    Electrically Reconfigurable Non-Volatile On-Chip Bragg Filter with Multilevel Operation

    Authors: Amged Alquliah, Jay Ke-Chieh Sun, Christopher Mekhiel, Chengkuan Gao, Guli Gulinihali, Yeshaiahu Fainman, Abdoulaye Ndao

    Abstract: Photonic integrated circuits (PICs) demand tailored spectral responses for various applications. On-chip Bragg filters offer a promising solution, yet their static nature hampers scalability. Current tunable filters rely on volatile switching mechanisms plagued by high static power consumption and thermal crosstalk. Here, we introduce, for the first time, a non-volatile, electrically programmable… ▽ More

    Submitted 19 August, 2024; originally announced August 2024.

    Comments: 20 pages, 4 figures,

  2. arXiv:2408.09027  [pdf, other

    cs.SD cs.AI eess.AS

    Efficient Autoregressive Audio Modeling via Next-Scale Prediction

    Authors: Kai Qiu, Xiang Li, Hao Chen, Jie Sun, Jinglu Wang, Zhe Lin, Marios Savvides, Bhiksha Raj

    Abstract: Audio generation has achieved remarkable progress with the advance of sophisticated generative models, such as diffusion models (DMs) and autoregressive (AR) models. However, due to the naturally significant sequence length of audio, the efficiency of audio generation remains an essential issue to be addressed, especially for AR models that are incorporated in large language models (LLMs). In this… ▽ More

    Submitted 16 August, 2024; originally announced August 2024.

    Comments: 7 pages, 6 figures, 7 tables

  3. arXiv:2408.04192  [pdf, ps, other

    eess.SP

    Pilot-Aided Joint Time Synchronization and Channel Estimation for OTFS

    Authors: Jiazheng Sun, Peng Yang, Xianbin Cao, Zehui Xiong, Haijun Zhang, Tony Q. S. Quek

    Abstract: This letter proposes a pilot-aided joint time synchronization and channel estimation (JTSCE) algorithm for orthogonal time frequency space (OTFS) systems. Unlike existing algorithms, JTSCE employs a maximum length sequence (MLS) rather than an isolated signal as the pilot. Distinctively, JTSCE explores MLS's autocorrelation properties to estimate timing offset and channel delay taps. After obtaini… ▽ More

    Submitted 13 August, 2024; v1 submitted 7 August, 2024; originally announced August 2024.

  4. arXiv:2407.18596  [pdf, ps, other

    eess.SY

    Piecewise constant tuning gain based singularity-free MRAC with application to aircraft control systems

    Authors: Zhipeng Zhang, Yanjun Zhang, Jian Sun

    Abstract: This paper introduces an innovative singularity-free output feedback model reference adaptive control (MRAC) method applicable to a wide range of continuous-time linear time-invariant (LTI) systems with general relative degrees. Unlike existing solutions such as Nussbaum and multiple-model-based methods, which manage unknown high-frequency gains through persistent switching and repeated parameter… ▽ More

    Submitted 26 July, 2024; originally announced July 2024.

    Comments: 9 pages, 6 figures

    MSC Class: 93A10; 93B52; 93C40; 93D20

  5. arXiv:2407.05259  [pdf, other

    eess.IV cs.AI cs.CV cs.LG

    Multi-scale Conditional Generative Modeling for Microscopic Image Restoration

    Authors: Luzhe Huang, Xiongye Xiao, Shixuan Li, Jiawen Sun, Yi Huang, Aydogan Ozcan, Paul Bogdan

    Abstract: The advance of diffusion-based generative models in recent years has revolutionized state-of-the-art (SOTA) techniques in a wide variety of image analysis and synthesis tasks, whereas their adaptation on image restoration, particularly within computational microscopy remains theoretically and empirically underexplored. In this research, we introduce a multi-scale generative model that enhances con… ▽ More

    Submitted 7 July, 2024; originally announced July 2024.

  6. arXiv:2407.04936  [pdf, other

    cs.SD eess.AS

    A Reference-free Metric for Language-Queried Audio Source Separation using Contrastive Language-Audio Pretraining

    Authors: Feiyang Xiao, Jian Guan, Qiaoxi Zhu, Xubo Liu, Wenbo Wang, Shuhan Qi, Kejia Zhang, Jianyuan Sun, Wenwu Wang

    Abstract: Language-queried audio source separation (LASS) aims to separate an audio source guided by a text query, with the signal-to-distortion ratio (SDR)-based metrics being commonly used to objectively measure the quality of the separated audio. However, the SDR-based metrics require a reference signal, which is often difficult to obtain in real-world scenarios. In addition, with the SDR-based metrics,… ▽ More

    Submitted 5 July, 2024; originally announced July 2024.

    Comments: Submitted to DCASE 2024 Workshop

  7. arXiv:2407.03245  [pdf, other

    cs.RO cs.AI eess.SY

    TieBot: Learning to Knot a Tie from Visual Demonstration through a Real-to-Sim-to-Real Approach

    Authors: Weikun Peng, Jun Lv, Yuwei Zeng, Haonan Chen, Siheng Zhao, Jichen Sun, Cewu Lu, Lin Shao

    Abstract: The tie-knotting task is highly challenging due to the tie's high deformation and long-horizon manipulation actions. This work presents TieBot, a Real-to-Sim-to-Real learning from visual demonstration system for the robots to learn to knot a tie. We introduce the Hierarchical Feature Matching approach to estimate a sequence of tie's meshes from the demonstration video. With these estimated meshes… ▽ More

    Submitted 3 July, 2024; v1 submitted 3 July, 2024; originally announced July 2024.

    Comments: fix few typos

  8. Analog or Digital In-memory Computing? Benchmarking through Quantitative Modeling

    Authors: Jiacong Sun, Pouya Houshmand, Marian Verhelst

    Abstract: In-Memory Computing (IMC) has emerged as a promising paradigm for energy-efficient, throughput-efficient and area-efficient machine learning at the edge. However, the differences in hardware architectures, array dimensions, and fabrication technologies among published IMC realizations have made it difficult to grasp their relative strengths. Moreover, previous studies have primarily focused on exp… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

  9. arXiv:2404.15341  [pdf, other

    eess.SP cs.LG

    Classifier-guided neural blind deconvolution: a physics-informed denoising module for bearing fault diagnosis under heavy noise

    Authors: Jing-Xiao Liao, Chao He, Jipu Li, Jinwei Sun, Shiping Zhang, Xiaoge Zhang

    Abstract: Blind deconvolution (BD) has been demonstrated as an efficacious approach for extracting bearing fault-specific features from vibration signals under strong background noise. Despite BD's desirable feature in adaptability and mathematical interpretability, a significant challenge persists: How to effectively integrate BD with fault-diagnosing classifiers? This issue arises because the traditional… ▽ More

    Submitted 10 April, 2024; originally announced April 2024.

  10. arXiv:2404.11313  [pdf, other

    eess.IV cs.AI

    NTIRE 2024 Challenge on Short-form UGC Video Quality Assessment: Methods and Results

    Authors: Xin Li, Kun Yuan, Yajing Pei, Yiting Lu, Ming Sun, Chao Zhou, Zhibo Chen, Radu Timofte, Wei Sun, Haoning Wu, Zicheng Zhang, Jun Jia, Zhichao Zhang, Linhan Cao, Qiubo Chen, Xiongkuo Min, Weisi Lin, Guangtao Zhai, Jianhui Sun, Tianyi Wang, Lei Li, Han Kong, Wenxuan Wang, Bing Li, Cheng Luo , et al. (43 additional authors not shown)

    Abstract: This paper reviews the NTIRE 2024 Challenge on Shortform UGC Video Quality Assessment (S-UGC VQA), where various excellent solutions are submitted and evaluated on the collected dataset KVQ from popular short-form video platform, i.e., Kuaishou/Kwai Platform. The KVQ database is divided into three parts, including 2926 videos for training, 420 videos for validation, and 854 videos for testing. The… ▽ More

    Submitted 17 April, 2024; originally announced April 2024.

    Comments: Accepted by CVPR2024 Workshop. The challenge report for CVPR NTIRE2024 Short-form UGC Video Quality Assessment Challenge

  11. arXiv:2403.16361  [pdf, other

    eess.IV cs.CV

    RSTAR: Rotational Streak Artifact Reduction in 4D CBCT using Separable and Circular Convolutions

    Authors: Ziheng Deng, Hua Chen, Haibo Hu, Zhiyong Xu, Jiayuan Sun, Tianling Lyu, Yan Xi, Yang Chen, Jun Zhao

    Abstract: Four-dimensional cone-beam computed tomography (4D CBCT) provides respiration-resolved images and can be used for image-guided radiation therapy. However, the ability to reveal respiratory motion comes at the cost of image artifacts. As raw projection data are sorted into multiple respiratory phases, the cone-beam projections become much sparser and the reconstructed 4D CBCT images will be covered… ▽ More

    Submitted 22 August, 2024; v1 submitted 24 March, 2024; originally announced March 2024.

  12. arXiv:2403.15448  [pdf, other

    eess.SP cs.LG

    What is Wrong with End-to-End Learning for Phase Retrieval?

    Authors: Wenjie Zhang, Yuxiang Wan, Zhong Zhuang, Ju Sun

    Abstract: For nonlinear inverse problems that are prevalent in imaging science, symmetries in the forward model are common. When data-driven deep learning approaches are used to solve such problems, these intrinsic symmetries can cause substantial learning difficulties. In this paper, we explain how such difficulties arise and, more importantly, how to overcome them by preprocessing the training set before… ▽ More

    Submitted 17 March, 2024; originally announced March 2024.

  13. arXiv:2403.06423  [pdf, other

    eess.SP cs.RO

    LiDAR Point Cloud-based Multiple Vehicle Tracking with Probabilistic Measurement-Region Association

    Authors: Guanhua Ding, Jianan Liu, Yuxuan Xia, Tao Huang, Bing Zhu, Jinping Sun

    Abstract: Multiple extended target tracking (ETT) has gained increasing attention due to the development of high-precision LiDAR and radar sensors in automotive applications. For LiDAR point cloud-based vehicle tracking, this paper presents a probabilistic measurement-region association (PMRA) ETT model, which can describe the complex measurement distribution by partitioning the target extent into different… ▽ More

    Submitted 18 May, 2024; v1 submitted 11 March, 2024; originally announced March 2024.

    Comments: 8 pages, 5 figures, accepted by the 27th International Conference on Information Fusion (FUSION 2024)

  14. arXiv:2401.09705  [pdf, other

    cs.RO eess.SY

    Learning Hybrid Policies for MPC with Application to Drone Flight in Unknown Dynamic Environments

    Authors: Zhaohan Feng, Jie Chen, Wei Xiao, Jian Sun, Bin Xin, Gang Wang

    Abstract: In recent years, drones have found increased applications in a wide array of real-world tasks. Model predictive control (MPC) has emerged as a practical method for drone flight control, owing to its robustness against modeling errors/uncertainties and external disturbances. However, MPC's sensitivity to manually tuned parameters can lead to rapid performance degradation when faced with unknown env… ▽ More

    Submitted 25 January, 2024; v1 submitted 17 January, 2024; originally announced January 2024.

    Comments: To be published in Unmanned Systems

  15. arXiv:2401.04660  [pdf, other

    eess.SY

    Distributed Data-driven Unknown-input Observers

    Authors: Yuzhou Wei, Giorgia Disarò, Wenjie Liu, Jian Sun, Maria Elena Valcher, Gang Wang

    Abstract: Unknown inputs related to, e.g., sensor aging, modeling errors, or device bias, represent a major concern in wireless sensor networks, as they degrade the state estimation performance. To improve the performance, unknown-input observers (UIOs) have been proposed. Most of the results available to design UIOs are based on explicit system models, which can be difficult or impossible to obtain in real… ▽ More

    Submitted 9 January, 2024; originally announced January 2024.

  16. arXiv:2401.03850  [pdf, other

    eess.AS cs.SD

    Inverse Nonlinearity Compensation of Hyperelastic Deformation in Dielectric Elastomer for Acoustic Actuation

    Authors: Jin Woo Lee, Gwang Seok An, Jeong-Yun Sun, Kyogu Lee

    Abstract: This paper delves into the analysis of nonlinear deformation induced by dielectric actuation in pre-stressed ideal dielectric elastomers. It formulates a nonlinear ordinary differential equation governing this deformation based on the hyperelastic model under dielectric stress. Through numerical integration and neural network approximations, the relationship between voltage and stretch is establis… ▽ More

    Submitted 8 January, 2024; originally announced January 2024.

  17. arXiv:2401.03697  [pdf, other

    cs.SD eess.AS

    An audio-quality-based multi-strategy approach for target speaker extraction in the MISP 2023 Challenge

    Authors: Runduo Han, Xiaopeng Yan, Weiming Xu, Pengcheng Guo, Jiayao Sun, He Wang, Quan Lu, Ning Jiang, Lei Xie

    Abstract: This paper describes our audio-quality-based multi-strategy approach for the audio-visual target speaker extraction (AVTSE) task in the Multi-modal Information based Speech Processing (MISP) 2023 Challenge. Specifically, our approach adopts different extraction strategies based on the audio quality, striking a balance between interference removal and speech preservation, which benifits the back-en… ▽ More

    Submitted 6 March, 2024; v1 submitted 8 January, 2024; originally announced January 2024.

    Comments: Accepted by ICASSP 2024

  18. arXiv:2401.03687  [pdf, other

    eess.AS cs.SD

    BS-PLCNet: Band-split Packet Loss Concealment Network with Multi-task Learning Framework and Multi-discriminators

    Authors: Zihan Zhang, Jiayao Sun, Xianjun Xia, Chuanzeng Huang, Yijian Xiao, Lei Xie

    Abstract: Packet loss is a common and unavoidable problem in voice over internet phone (VoIP) systems. To deal with the problem, we propose a band-split packet loss concealment network (BS-PLCNet). Specifically, we split the full-band signal into wide-band (0-8kHz) and high-band (8-24kHz). The wide-band signals are processed by a gated convolutional recurrent network (GCRN), while the high-band counterpart… ▽ More

    Submitted 8 January, 2024; originally announced January 2024.

    Comments: submitted to ICASSP 2024

  19. arXiv:2401.03473  [pdf, ps, other

    cs.SD cs.AI eess.AS

    ICMC-ASR: The ICASSP 2024 In-Car Multi-Channel Automatic Speech Recognition Challenge

    Authors: He Wang, Pengcheng Guo, Yue Li, Ao Zhang, Jiayao Sun, Lei Xie, Wei Chen, Pan Zhou, Hui Bu, Xin Xu, Binbin Zhang, Zhuo Chen, Jian Wu, Longbiao Wang, Eng Siong Chng, Sun Li

    Abstract: To promote speech processing and recognition research in driving scenarios, we build on the success of the Intelligent Cockpit Speech Recognition Challenge (ICSRC) held at ISCSLP 2022 and launch the ICASSP 2024 In-Car Multi-Channel Automatic Speech Recognition (ICMC-ASR) Challenge. This challenge collects over 100 hours of multi-channel speech data recorded inside a new energy vehicle and 40 hours… ▽ More

    Submitted 20 February, 2024; v1 submitted 7 January, 2024; originally announced January 2024.

    Comments: Accepted at ICASSP 2024

  20. arXiv:2312.15195  [pdf, other

    cs.AI cs.LG eess.SY

    Mutual Information as Intrinsic Reward of Reinforcement Learning Agents for On-demand Ride Pooling

    Authors: Xianjie Zhang, Jiahao Sun, Chen Gong, Kai Wang, Yifei Cao, Hao Chen, Hao Chen, Yu Liu

    Abstract: The emergence of on-demand ride pooling services allows each vehicle to serve multiple passengers at a time, thus increasing drivers' income and enabling passengers to travel at lower prices than taxi/car on-demand services (only one passenger can be assigned to a car at a time like UberX and Lyft). Although on-demand ride pooling services can bring so many benefits, ride pooling services need a w… ▽ More

    Submitted 7 January, 2024; v1 submitted 23 December, 2023; originally announced December 2023.

    Comments: Accepted by AAMAS 2024

  21. arXiv:2312.13045  [pdf, ps, other

    eess.SY

    Feasibility Conditions for Mobile LiFi

    Authors: Shuai Ma, Haihong Sheng, Junchang Sun, Hang Li, Xiaodong Liu, Chen Qiu, Majid Safari, Naofal Al-Dhahir, Shiyin Li

    Abstract: Light fidelity (LiFi) is a potential key technology for future 6G networks. However, its feasibility of supporting mobile communications has not been fundamentally discussed. In this paper, we investigate the time-varying channel characteristics of mobile LiFi based on measured mobile phone rotation and movement data. Specifically, we define LiFi channel coherence time to evaluate the correlation… ▽ More

    Submitted 20 December, 2023; originally announced December 2023.

  22. arXiv:2312.07631  [pdf, other

    physics.med-ph cs.AI eess.IV physics.bio-ph physics.optics

    AI-driven projection tomography with multicore fibre-optic cell rotation

    Authors: Jiawei Sun, Bin Yang, Nektarios Koukourakis, Jochen Guck, Juergen W. Czarske

    Abstract: Optical tomography has emerged as a non-invasive imaging method, providing three-dimensional insights into subcellular structures and thereby enabling a deeper understanding of cellular functions, interactions, and processes. Conventional optical tomography methods are constrained by a limited illumination scanning range, leading to anisotropic resolution and incomplete imaging of cellular structu… ▽ More

    Submitted 12 December, 2023; originally announced December 2023.

    Comments: 15 pages, 6 figures

  23. arXiv:2312.03097  [pdf, other

    eess.SY

    State of Health Estimation for Battery Modules with Parallel-Connected Cells Under Cell-to-Cell Variations

    Authors: Qinan Zhou, Dyche Anderson, Jing Sun

    Abstract: State of health (SOH) estimation for lithium-ion battery modules with cells connected in parallel is a challenging problem, especially with cell-to-cell variations. Incremental capacity analysis (ICA) and differential voltage analysis (DVA) are effective at the cell level, but a generalizable method to extend them to module-level SOH estimation remains missing, when only module-level measurements… ▽ More

    Submitted 19 May, 2024; v1 submitted 5 December, 2023; originally announced December 2023.

    Comments: Addressed reviewer comments: Combined two sections, revised dataset and module-level result sections, corrected a typo in Algorithm 2; Previous Edit Comments: Condensed abstract; Added details in Introduction, Dataset, Module-Level Result Sections; Revised Section I, III & VII, IX; Added the initialization of Phi in Algorithm 2

  24. arXiv:2312.00568  [pdf, ps, other

    eess.SP

    A WINNER+ Based 3-D Non-Stationary Wideband MIMO Channel Model

    Authors: Ji Bian, Jian Sun, Cheng-Xiang Wang, Rui Feng, Jie Huang, Yang Yang, Minggao Zhang

    Abstract: In this paper, a three-dimensional (3-D) non-stationary wideband multiple-input multiple-output (MIMO) channel model based on the WINNER+ channel model is proposed. The angular distributions of clusters in both the horizontal and vertical planes are jointly considered. The receiver and clusters can be moving, which makes the model more general. Parameters including number of clusters, powers, dela… ▽ More

    Submitted 1 December, 2023; originally announced December 2023.

  25. arXiv:2311.18508  [pdf, other

    eess.IV cs.CV

    DifAugGAN: A Practical Diffusion-style Data Augmentation for GAN-based Single Image Super-resolution

    Authors: Axi Niu, Kang Zhang, Joshua Tian Jin Tee, Trung X. Pham, Jinqiu Sun, Chang D. Yoo, In So Kweon, Yanning Zhang

    Abstract: It is well known the adversarial optimization of GAN-based image super-resolution (SR) methods makes the preceding SR model generate unpleasant and undesirable artifacts, leading to large distortion. We attribute the cause of such distortions to the poor calibration of the discriminator, which hampers its ability to provide meaningful feedback to the generator for learning high-quality images. To… ▽ More

    Submitted 30 November, 2023; originally announced November 2023.

  26. arXiv:2311.11300  [pdf, other

    eess.SY

    Robust Control of Unknown Switched Linear Systems from Noisy Data

    Authors: Wenjie Liu, Yifei Li, Jian Sun, Gang Wang, Jie Chen

    Abstract: This paper investigates the problem of data-driven stabilization for linear discrete-time switched systems with unknown switching dynamics. In the absence of noise, a data-based state feedback stabilizing controller can be obtained by solving a semi-definite program (SDP) on-the-fly, which automatically adapts to the changes of switching dynamics. However, when noise is present, the persistency of… ▽ More

    Submitted 19 November, 2023; originally announced November 2023.

  27. arXiv:2311.10416  [pdf, other

    eess.SP

    Meta-DSP: A Meta-Learning Approach for Data-Driven Nonlinear Compensation in High-Speed Optical Fiber Systems

    Authors: Xinyu Xiao, Zhennan Zhou, Bin Dong, Dingjiong Ma, Li Zhou, Jie Sun

    Abstract: Non-linear effects in long-haul, high-speed optical fiber systems significantly hinder channel capacity. While the Digital Backward Propagation algorithm (DBP) with adaptive filter (ADF) can mitigate these effects, it suffers from an overwhelming computational complexity. Recent solutions have incorporated deep neural networks in a data-driven strategy to alleviate this complexity in the DBP model… ▽ More

    Submitted 17 November, 2023; originally announced November 2023.

  28. arXiv:2311.08207  [pdf, other

    eess.SY

    Data-driven Control Against False Data Injection Attacks

    Authors: Wenjie Liu, Lidong Li, Jian Sun, Fang Deng, Gang Wang, Jie Chen

    Abstract: The rise of cyber-security concerns has brought significant attention to the analysis and design of cyber-physical systems (CPSs). Among the various types of cyberattacks, denial-of-service (DoS) attacks and false data injection (FDI) attacks can be easily launched and have become prominent threats. While resilient control against DoS attacks has received substantial research efforts, countermeasu… ▽ More

    Submitted 5 June, 2024; v1 submitted 14 November, 2023; originally announced November 2023.

  29. arXiv:2311.07872  [pdf, ps, other

    cs.NI eess.SP

    Cost-Efficient Computation Offloading and Service Chain Caching in LEO Satellite Networks

    Authors: Yantong Wang, Chuanfen Feng, Jiande Sun

    Abstract: The ever-increasing demand for ubiquitous, continuous, and high-quality services poses a great challenge to the traditional terrestrial network. To mitigate this problem, the mobile-edge-computing-enhanced low earth orbit (LEO) satellite network, which provides both communication connectivity and on-board processing services, has emerged as an effective method. The main issue in LEO satellites inc… ▽ More

    Submitted 13 November, 2023; originally announced November 2023.

    Comments: 10 pages, 3 figures

  30. arXiv:2311.05415  [pdf, other

    eess.SP

    EEG-DG: A Multi-Source Domain Generalization Framework for Motor Imagery EEG Classification

    Authors: Xiao-Cong Zhong, Qisong Wang, Dan Liu, Zhihuang Chen, Jing-Xiao Liao, Jinwei Sun, Yudong Zhang, Feng-Lei Fan

    Abstract: Motor imagery EEG classification plays a crucial role in non-invasive Brain-Computer Interface (BCI) research. However, the classification is affected by the non-stationarity and individual variations of EEG signals. Simply pooling EEG data with different statistical distributions to train a classification model can severely degrade the generalization performance. To address this issue, the existi… ▽ More

    Submitted 9 November, 2023; originally announced November 2023.

  31. arXiv:2311.02443  [pdf, ps, other

    eess.SP

    PIPO-Net: A Penalty-based Independent Parameters Optimization Deep Unfolding Network

    Authors: Xiumei Li, Zhijie Zhang, Huang Bai, Ljubiša Stanković, Junpeng Hao, Junmei Sun

    Abstract: Compressive sensing (CS) has been widely applied in signal and image processing fields. Traditional CS reconstruction algorithms have a complete theoretical foundation but suffer from the high computational complexity, while fashionable deep network-based methods can achieve high-accuracy reconstruction of CS but are short of interpretability. These facts motivate us to develop a deep unfolding ne… ▽ More

    Submitted 4 November, 2023; originally announced November 2023.

  32. arXiv:2310.14778  [pdf, other

    cs.MM cs.SD eess.AS

    Audio-Visual Speaker Tracking: Progress, Challenges, and Future Directions

    Authors: Jinzheng Zhao, Yong Xu, Xinyuan Qian, Davide Berghi, Peipei Wu, Meng Cui, Jianyuan Sun, Philip J. B. Jackson, Wenwu Wang

    Abstract: Audio-visual speaker tracking has drawn increasing attention over the past few years due to its academic values and wide application. Audio and visual modalities can provide complementary information for localization and tracking. With audio and visual information, the Bayesian-based filter can solve the problem of data association, audio-visual fusion and track management. In this paper, we condu… ▽ More

    Submitted 17 December, 2023; v1 submitted 23 October, 2023; originally announced October 2023.

  33. arXiv:2310.13883  [pdf, other

    eess.SY math.OC

    Robust Model Predictive Control for Enhanced Fast Charging on Electric Vehicles through Integrated Power and Thermal Management

    Authors: Qiuhao Hu, Mohammad Reza Amini, Ashley Wiese, Ilya Kolmanovsky, Jing Sun

    Abstract: This paper explores the synergies between integrated power and thermal management (iPTM) and battery charging in an electric vehicle (EV). A multi-objective model predictive control (MPC) framework is developed to optimize the fast charging performance while enforcing the constraints in the power and thermal loops. The approach takes into account the coupling of the battery and cabin thermal manag… ▽ More

    Submitted 20 October, 2023; originally announced October 2023.

    Comments: The 62nd Conference on Decision and Control (CDC), December 13-15, 2023, Singapore

  34. arXiv:2310.12795  [pdf, other

    eess.SY

    Self-triggered Consensus Control of Multi-agent Systems from Data

    Authors: Yifei Li, Xin Wang, Jian Sun, Gang Wang, Jie Chen

    Abstract: This paper considers self-triggered consensus control of unknown linear multi-agent systems (MASs). Self-triggering mechanisms (STMs) are widely used in MASs, thanks to their advantages in avoiding continuous monitoring and saving computing and communication resources. However, existing results require the knowledge of system matrices, which are difficult to obtain in real-world settings. To addre… ▽ More

    Submitted 19 October, 2023; originally announced October 2023.

  35. arXiv:2310.08364  [pdf, other

    cs.NI eess.SP

    Map2Schedule: An End-to-End Link Scheduling Method for Urban V2V Communications

    Authors: Lihao Zhang, Haijian Sun, Jin Sun, Ramviyas Parasuraman, Yinghui Ye, Rose Qingyang Hu

    Abstract: Urban vehicle-to-vehicle (V2V) link scheduling with shared spectrum is a challenging problem. Its main goal is to find the scheduling policy that can maximize system performance (usually the sum capacity of each link or their energy efficiency). Given that each link can experience interference from all other active links, the scheduling becomes a combinatorial integer programming problem and gener… ▽ More

    Submitted 12 October, 2023; originally announced October 2023.

    Comments: submitted to IEEE conference for future publication

  36. arXiv:2310.04817  [pdf, other

    cs.IT eess.SP

    A Grouping-based Scheduler for Efficient Channel Utilization under Age of Information Constraints

    Authors: Lehan Wang, Jingzhou Sun, Yuxuan Sun, Sheng Zhou, Zhisheng Niu

    Abstract: We consider a status information updating system where a fusion center collects the status information from a large number of sources and each of them has its own age of information (AoI) constraints. A novel grouping-based scheduler is proposed to solve this complex large-scale problem by dividing the sources into different scheduling groups. The problem is then transformed into deriving the opti… ▽ More

    Submitted 7 October, 2023; originally announced October 2023.

    Comments: 10 pages, 3 figures, presented at the 34th international teletraffic congress (ITC34)

  37. arXiv:2310.04813  [pdf, other

    cs.IT eess.SP

    Age of Information Guaranteed Scheduling for Asynchronous Status Updates in Collaborative Perception

    Authors: Lehan Wang, Jingzhou Sun, Yuxuan Sun, Sheng Zhou, Zhisheng Niu

    Abstract: We consider collaborative perception (CP) systems where a fusion center monitors various regions by multiple sources. The center has different age of information (AoI) constraints for different regions. Multi-view sensing data for a region generated by sources can be fused by the center for a reliable representation of the region. To ensure accurate perception, differences between generation time… ▽ More

    Submitted 7 October, 2023; originally announced October 2023.

    Comments: 9 pages, 5 figures, presented at 2023 Workshop on Modeling and Optimization in Semantic Communications (MOSC)

  38. arXiv:2310.04715  [pdf, other

    eess.AS cs.SD

    An Exploration of Task-decoupling on Two-stage Neural Post Filter for Real-time Personalized Acoustic Echo Cancellation

    Authors: Zihan Zhang, Jiayao Sun, Xianjun Xia, Ziqian Wang, Xiaopeng Yan, Yijian Xiao, Lei Xie

    Abstract: Deep learning based techniques have been popularly adopted in acoustic echo cancellation (AEC). Utilization of speaker representation has extended the frontier of AEC, thus attracting many researchers' interest in personalized acoustic echo cancellation (PAEC). Meanwhile, task-decoupling strategies are widely adopted in speech enhancement. To further explore the task-decoupling approach, we propos… ▽ More

    Submitted 7 October, 2023; originally announced October 2023.

    Comments: accepted to ASRU 2023

  39. arXiv:2309.15867  [pdf

    cs.LG eess.IV q-bio.QM

    Identifying factors associated with fast visual field progression in patients with ocular hypertension based on unsupervised machine learning

    Authors: Xiaoqin Huang, Asma Poursoroush, Jian Sun, Michael V. Boland, Chris Johnson, Siamak Yousefi

    Abstract: Purpose: To identify ocular hypertension (OHT) subtypes with different trends of visual field (VF) progression based on unsupervised machine learning and to discover factors associated with fast VF progression. Participants: A total of 3133 eyes of 1568 ocular hypertension treatment study (OHTS) participants with at least five follow-up VF tests were included in the study. Methods: We used a laten… ▽ More

    Submitted 26 September, 2023; originally announced September 2023.

  40. arXiv:2309.11745  [pdf, other

    eess.IV cs.CV cs.LG

    PIE: Simulating Disease Progression via Progressive Image Editing

    Authors: Kaizhao Liang, Xu Cao, Kuei-Da Liao, Tianren Gao, Wenqian Ye, Zhengyu Chen, Jianguo Cao, Tejas Nama, Jimeng Sun

    Abstract: Disease progression simulation is a crucial area of research that has significant implications for clinical diagnosis, prognosis, and treatment. One major challenge in this field is the lack of continuous medical imaging monitoring of individual patients over time. To address this issue, we develop a novel framework termed Progressive Image Editing (PIE) that enables controlled manipulation of dis… ▽ More

    Submitted 5 October, 2023; v1 submitted 20 September, 2023; originally announced September 2023.

    Comments: Code and checkpoints for replicating our results can be found at https://github.com/IrohXu/PIE and https://huggingface.co/IrohXu/stable-diffusion-mimic-cxr-v0.1

  41. arXiv:2309.11717  [pdf, other

    eess.SP

    A class-weighted supervised contrastive learning long-tailed bearing fault diagnosis approach using quadratic neural network

    Authors: Wei-En Yu, Jinwei Sun, Shiping Zhang, Xiaoge Zhang, Jing-Xiao Liao

    Abstract: Deep learning has achieved remarkable success in bearing fault diagnosis. However, its performance oftentimes deteriorates when dealing with highly imbalanced or long-tailed data, while such cases are prevalent in industrial settings because fault is a rare event that occurs with an extremely low probability. Conventional data augmentation methods face fundamental limitations due to the scarcity o… ▽ More

    Submitted 20 September, 2023; originally announced September 2023.

  42. arXiv:2309.07413  [pdf, other

    cs.CL cs.SD eess.AS

    CPPF: A contextual and post-processing-free model for automatic speech recognition

    Authors: Lei Zhang, Zhengkun Tian, Xiang Chen, Jiaming Sun, Hongyu Xiang, Ke Ding, Guanglu Wan

    Abstract: ASR systems have become increasingly widespread in recent years. However, their textual outputs often require post-processing tasks before they can be practically utilized. To address this issue, we draw inspiration from the multifaceted capabilities of LLMs and Whisper, and focus on integrating multiple ASR text processing tasks related to speech recognition into the ASR model. This integration n… ▽ More

    Submitted 20 September, 2023; v1 submitted 13 September, 2023; originally announced September 2023.

    Comments: Submitted to ICASSP2024

  43. arXiv:2309.06036  [pdf, other

    eess.SP

    Which Framework is Suitable for Online 3D Multi-Object Tracking for Autonomous Driving with Automotive 4D Imaging Radar?

    Authors: Jianan Liu, Guanhua Ding, Yuxuan Xia, Jinping Sun, Tao Huang, Lihua Xie, Bing Zhu

    Abstract: Online 3D multi-object tracking (MOT) has recently received significant research interests due to the expanding demand of 3D perception in advanced driver assistance systems (ADAS) and autonomous driving (AD). Among the existing 3D MOT frameworks for ADAS and AD, conventional point object tracking (POT) framework using the tracking-by-detection (TBD) strategy has been well studied and accepted for… ▽ More

    Submitted 25 May, 2024; v1 submitted 12 September, 2023; originally announced September 2023.

    Comments: 8 pages, 5 figures, accepted by IEEE 35th Intelligent Vehicles Symposium (IV 2024), oral presentation (top 5%), code is available at https://github.com/dinggh0817/4D_Radar_MOT

  44. arXiv:2308.01487  [pdf, ps, other

    eess.SY eess.SP

    Data-Driven Nonlinear TDOA for Accurate Source Localization in Complex Signal Dynamics

    Authors: Chinmay Sahu, Mahesh Banavar, Jie Sun

    Abstract: The complex and dynamic propagation of oscillations and waves is often triggered by sources at unknown locations. Accurate source localization enables the elimination of the rotor core in atrial fibrillation (AFib) as an effective treatment for such severe cardiac disorder; it also finds potential use in locating the spreading source in natural disasters such as forest fires and tsunamis. However,… ▽ More

    Submitted 12 August, 2024; v1 submitted 2 August, 2023; originally announced August 2023.

    Comments: 9 pages, 5 figures, Accepted to IEEE Sensors Journal

  45. arXiv:2307.12032  [pdf, other

    cs.CV cs.LG eess.IV

    Flight Contrail Segmentation via Augmented Transfer Learning with Novel SR Loss Function in Hough Space

    Authors: Junzi Sun, Esther Roosenbrand

    Abstract: Air transport poses significant environmental challenges, particularly regarding the role of flight contrails in climate change due to their potential global warming impact. Traditional computer vision techniques struggle under varying remote sensing image conditions, and conventional machine learning approaches using convolutional neural networks are limited by the scarcity of hand-labeled contra… ▽ More

    Submitted 25 September, 2023; v1 submitted 22 July, 2023; originally announced July 2023.

    Comments: Source code available at: https://github.com/junzis/contrail-net

  46. arXiv:2307.11950  [pdf, other

    eess.SP

    Accurate RSS-Based Localization Using an Opposition-Based Learning Simulated Annealing Algorithm

    Authors: Weizhong Ding, Shengming Chang, Shudi Bao, Meng Chen, Jie Sun

    Abstract: Wireless sensor networks require accurate target localization, often achieved through received signal strength (RSS) localization estimation based on maximum likelihood (ML). However, ML-based algorithms can suffer from issues such as low diversity, slow convergence, and local optima, which can significantly affect localization performance. In this paper, we propose a novel localization algorithm… ▽ More

    Submitted 21 July, 2023; originally announced July 2023.

  47. arXiv:2307.07128  [pdf, other

    eess.SY

    Data-driven Polytopic Output Synchronization of Heterogeneous Multi-agent Systems from Noisy Data

    Authors: Yifei Li, Wenjie Liu, Jian Sun, Gang Wang, Lihua Xie, Jie Chen

    Abstract: This paper proposes a novel approach to addressing the output synchronization problem in unknown heterogeneous multi-agent systems (MASs) using noisy data. Unlike existing studies that focus on noiseless data, we introduce a distributed data-driven controller that enables all heterogeneous followers to synchronize with a leader's trajectory. To handle the noise in the state-input-output data, we d… ▽ More

    Submitted 13 July, 2023; originally announced July 2023.

  48. arXiv:2307.00781  [pdf, other

    cs.CV eess.IV

    ACDMSR: Accelerated Conditional Diffusion Models for Single Image Super-Resolution

    Authors: Axi Niu, Pham Xuan Trung, Kang Zhang, Jinqiu Sun, Yu Zhu, In So Kweon, Yanning Zhang

    Abstract: Diffusion models have gained significant popularity in the field of image-to-image translation. Previous efforts applying diffusion models to image super-resolution (SR) have demonstrated that iteratively refining pure Gaussian noise using a U-Net architecture trained on denoising at various noise levels can yield satisfactory high-resolution images from low-resolution inputs. However, this iterat… ▽ More

    Submitted 3 July, 2023; originally announced July 2023.

    Comments: arXiv admin note: text overlap with arXiv:2302.12831

  49. arXiv:2306.17434  [pdf

    eess.IV

    A Motion Assessment Method for Reference Stack Selection in Fetal Brain MRI Reconstruction Based on Tensor Rank Approximation

    Authors: Haoan Xu, Wen Shi, Jiwei Sun, Tianshu Zheng, Cong Sun, Sun Yi, Guangbin Wang, Dan Wu

    Abstract: Purpose: Slice-to-volume registration and super-resolution reconstruction (SVR-SRR) is commonly used to generate 3D volumes of the fetal brain from 2D stacks of slices acquired in multiple orientations. A critical initial step in this pipeline is to select one stack with the minimum motion as a reference for registration. An accurate and unbiased motion assessment (MA) is thus crucial for successf… ▽ More

    Submitted 30 June, 2023; originally announced June 2023.

    Comments: 6 figures. Correspondence to: Dan Wu, Ph.D. E-mail: [email protected]

  50. arXiv:2306.16050  [pdf, other

    cs.CV cs.LG eess.IV

    Evaluating Similitude and Robustness of Deep Image Denoising Models via Adversarial Attack

    Authors: Jie Ning, Jiebao Sun, Yao Li, Zhichang Guo, Wangmeng Zuo

    Abstract: Deep neural networks (DNNs) have shown superior performance comparing to traditional image denoising algorithms. However, DNNs are inevitably vulnerable while facing adversarial attacks. In this paper, we propose an adversarial attack method named denoising-PGD which can successfully attack all the current deep denoising models while keep the noise distribution almost unchanged. We surprisingly fi… ▽ More

    Submitted 6 July, 2023; v1 submitted 28 June, 2023; originally announced June 2023.