Zum Hauptinhalt springen

Showing 1–50 of 310 results for author: Chen, L

Searching in archive eess. Search in all archives.
.
  1. arXiv:2408.03957  [pdf, other

    cs.NI cs.IT cs.LG eess.SP

    GNN-Based Joint Channel and Power Allocation in Heterogeneous Wireless Networks

    Authors: Lili Chen, Jingge Zhu, Jamie Evans

    Abstract: The optimal allocation of channels and power resources plays a crucial role in ensuring minimal interference, maximal data rates, and efficient energy utilisation. As a successful approach for tackling resource management problems in wireless networks, Graph Neural Networks (GNNs) have attracted a lot of attention. This article proposes a GNN-based algorithm to address the joint resource allocatio… ▽ More

    Submitted 28 July, 2024; originally announced August 2024.

  2. arXiv:2408.02293  [pdf, other

    cs.RO eess.SY

    OPENGRASP-LITE Version 1.0: A Tactile Artificial Hand with a Compliant Linkage Mechanism

    Authors: Sonja Groß, Michael Ratzel, Edgar Welte, Diego Hidalgo-Carvajal, Lingyun Chen, Edmundo Pozo Fortunić, Amartya Ganguly, Abdalla Swikir, Sami Haddadin

    Abstract: Recent research has seen notable progress in the development of linkage-based artificial hands. While previous designs have focused on adaptive grasping, dexterity and biomimetic artificial skin, only a few systems have proposed a lightweight, accessible solution integrating tactile sensing with a compliant linkage-based mechanism. This paper introduces OPENGRASP LITE, an open-source, highly integ… ▽ More

    Submitted 5 August, 2024; originally announced August 2024.

    Comments: Accepted at IEEE/RSJ International Conference on Intelligent Robots and Systems, 14-18 October 2024

  3. arXiv:2407.11620  [pdf

    eess.SP

    A Deep Learning-Based Target Radial Length Estimation Method through HRRP Sequence

    Authors: Lingfeng Chen, Panhe Hu, Zhiliang Pan, Xiao Sun, Zehao Wang

    Abstract: This paper introduces an innovative deep learning-based method for end-to-end target radial length estimation from HRRP (High Resolution Range Profile) sequences. Firstly, the HRRP sequences are normalized and transformed into GAF (Gram Angular Field) images to effectively capture and utilize the temporal information. Subsequently, these GAF images serve as the input for a pretrained ResNet-101 mo… ▽ More

    Submitted 16 July, 2024; originally announced July 2024.

    Comments: 2 pages, 2 figures. Accepted by APCAP 2024

  4. arXiv:2407.08236  [pdf, other

    eess.SP

    HRRPGraphNet: A Graph Neural Network Based Approach for HRRP Radar Target Recognition

    Authors: Lingfeng Chen, Panhe Hu, Zhiliang Pan, Xiao Sun, Zehao Wang

    Abstract: High Resolution Range Profiles (HRRP) have become a key area of focus in the domain of Radar Automatic Target Recognition (RATR). Despite the success of data-driven neural network-based HRRP recognition, challenges such as insufficient training samples persist in its real-world application. This letter introduces HRRPGraphNet, a novel Graph Neural Network (GNN) model designed specifically for HRRP… ▽ More

    Submitted 11 July, 2024; originally announced July 2024.

    Comments: 5 pages, 4 figures

  5. arXiv:2407.07744  [pdf, other

    cs.IT cs.AI eess.SP

    Belief Information based Deep Channel Estimation for Massive MIMO Systems

    Authors: Jialong Xu, Liu Liu, Xin Wang, Lan Chen

    Abstract: In the next generation wireless communication system, transmission rates should continue to rise to support emerging scenarios, e.g., the immersive communications. From the perspective of communication system evolution, multiple-input multiple-output (MIMO) technology remains pivotal for enhancing transmission rates. However, current MIMO systems rely on inserting pilot signals to achieve accurate… ▽ More

    Submitted 23 June, 2024; originally announced July 2024.

    Comments: 5 pages, 4 figures

  6. arXiv:2406.08266  [pdf, other

    eess.AS cs.SD

    Refining Self-Supervised Learnt Speech Representation using Brain Activations

    Authors: Hengyu Li, Kangdi Mei, Zhaoci Liu, Yang Ai, Liping Chen, Jie Zhang, Zhenhua Ling

    Abstract: It was shown in literature that speech representations extracted by self-supervised pre-trained models exhibit similarities with brain activations of human for speech perception and fine-tuning speech representation models on downstream tasks can further improve the similarity. However, it still remains unclear if this similarity can be used to optimize the pre-trained speech models. In this work,… ▽ More

    Submitted 13 June, 2024; v1 submitted 12 June, 2024; originally announced June 2024.

    Comments: accpeted by Interspeech2024

  7. arXiv:2406.08200  [pdf, other

    cs.SD cs.AI eess.AS

    Asynchronous Voice Anonymization Using Adversarial Perturbation On Speaker Embedding

    Authors: Rui Wang, Liping Chen, Kong AiK Lee, Zhen-Hua Ling

    Abstract: Voice anonymization has been developed as a technique for preserving privacy by replacing the speaker's voice in a speech signal with that of a pseudo-speaker, thereby obscuring the original voice attributes from machine recognition and human perception. In this paper, we focus on altering the voice attributes against machine recognition while retaining human perception. We referred to this as the… ▽ More

    Submitted 13 June, 2024; v1 submitted 12 June, 2024; originally announced June 2024.

    Comments: accpeted by Interspeech2024

  8. arXiv:2406.05325  [pdf, other

    eess.AS cs.SD

    LDM-SVC: Latent Diffusion Model Based Zero-Shot Any-to-Any Singing Voice Conversion with Singer Guidance

    Authors: Shihao Chen, Yu Gu, Jie Zhang, Na Li, Rilin Chen, Liping Chen, Lirong Dai

    Abstract: Any-to-any singing voice conversion (SVC) is an interesting audio editing technique, aiming to convert the singing voice of one singer into that of another, given only a few seconds of singing data. However, during the conversion process, the issue of timbre leakage is inevitable: the converted singing voice still sounds like the original singer's voice. To tackle this, we propose a latent diffusi… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: Accepted by Interspeech 2024

  9. arXiv:2405.20357  [pdf

    eess.IV physics.app-ph physics.optics

    Encryption in ghost imaging with Kronecker products of random matrices

    Authors: Yi-Ning Zhao, Lin-Shan Chen, Lingxin Kong, Chong Wang, Cheng Ren, De-Zhong Cao

    Abstract: By forming measurement matrices with the Kronecker product of two random matrices, image encryption in computational ghost imaging is investigated. The two-dimensional images are conveniently reconstructed with the pseudo-inverse matrices of the two random matrices. To suppress the noise, the method of truncated singular value decomposition can be applied to either or both of the two pseudo-invers… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: 5 pages, 4 figures

  10. arXiv:2405.16356  [pdf, other

    eess.SY econ.TH

    Prudent Price-Responsive Demands

    Authors: Liudong Chen, Bolun Xu

    Abstract: We investigate a flexible demand with a risk-neutral cost-saving objective in response to volatile electricity prices. We introduce the concept of prudent demand, which states that future price uncertainties will affect immediate consumption patterns, despite the price expectations remaining unchanged. We develop a theoretical framework and prove that demand exhibits prudence when the third-order… ▽ More

    Submitted 25 May, 2024; originally announced May 2024.

  11. arXiv:2405.09716  [pdf, other

    eess.IV cs.CV

    Illumination Histogram Consistency Metric for Quantitative Assessment of Video Sequences

    Authors: Long Chen, Mobarakol Islam, Matt Clarkson, Thomas Dowrick

    Abstract: The advances in deep generative models have greatly accelerate the process of video procession such as video enhancement and synthesis. Learning spatio-temporal video models requires to capture the temporal dynamics of a scene, in addition to the visual appearance of individual frames. Illumination consistency, which reflects the variations of illumination in the dynamic video sequences, play a vi… ▽ More

    Submitted 15 May, 2024; originally announced May 2024.

  12. arXiv:2405.03729  [pdf

    eess.IV physics.optics quant-ph

    Computational ghost imaging with hybrid transforms by integrating Hadamard, discrete cosine, and Haar matrices

    Authors: Yi-Ning Zhao, Lin-Shan Chen, Liu-Ya Chen, Lingxin Kong, Chong Wang, Cheng Ren, Su-Heng Zhang, De-Zhong Cao

    Abstract: A scenario of ghost imaging with hybrid transform approach is proposed by integrating Hadamard, discrete cosine, and Haar matrices. The measurement matrix is formed by the Kronecker product of the two different transform matrices. The image information can be conveniently reconstructed by the corresponding inverse matrices. In experiment, six hybridization sets are performed in computational ghost… ▽ More

    Submitted 6 May, 2024; originally announced May 2024.

    Comments: 5 pages, 4 figures

  13. arXiv:2404.17400  [pdf, other

    cs.CV cs.AI eess.IV

    Spatial-frequency Dual-Domain Feature Fusion Network for Low-Light Remote Sensing Image Enhancement

    Authors: Zishu Yao, Guodong Fan, Jinfu Fan, Min Gan, C. L. Philip Chen

    Abstract: Low-light remote sensing images generally feature high resolution and high spatial complexity, with continuously distributed surface features in space. This continuity in scenes leads to extensive long-range correlations in spatial domains within remote sensing images. Convolutional Neural Networks, which rely on local correlations for long-distance modeling, struggle to establish long-range corre… ▽ More

    Submitted 26 April, 2024; originally announced April 2024.

    Comments: 14 page

  14. arXiv:2404.15334  [pdf

    eess.SP

    Performance Enhancement via Real-time Image-based Beam Tracking for WA-OWC with Dynamic Waves and Mobile Receivers

    Authors: Yujie Di, Anzi Xu, Lian-Kuan Chen

    Abstract: Intensified underwater activities have driven the escalating demand for reliable, flexible, and high data-rate underwater communication links. Optical wireless communication (OWC) emerges as the most promising technology for short- to medium-range communication, facilitating the real-time high-speed transmission of information from undersea to an aerial vehicle which can subsequently relay the inf… ▽ More

    Submitted 9 April, 2024; originally announced April 2024.

  15. In-situ process monitoring and adaptive quality enhancement in laser additive manufacturing: a critical review

    Authors: Lequn Chen, Guijun Bi, Xiling Yao, Jinlong Su, Chaolin Tan, Wenhe Feng, Michalis Benakis, Youxiang Chew, Seung Ki Moon

    Abstract: Laser Additive Manufacturing (LAM) presents unparalleled opportunities for fabricating complex, high-performance structures and components with unique material properties. Despite these advancements, achieving consistent part quality and process repeatability remains challenging. This paper provides a comprehensive review of various state-of-the-art in-situ process monitoring techniques, including… ▽ More

    Submitted 21 April, 2024; originally announced April 2024.

    Comments: 107 Pages, 29 Figures. Paper Accepted At Journal of Manufacturing Systems

  16. arXiv:2404.03984  [pdf, other

    cs.MA cs.LG eess.SY

    ROMA-iQSS: An Objective Alignment Approach via State-Based Value Learning and ROund-Robin Multi-Agent Scheduling

    Authors: Chi-Hui Lin, Joewie J. Koh, Alessandro Roncone, Lijun Chen

    Abstract: Effective multi-agent collaboration is imperative for solving complex, distributed problems. In this context, two key challenges must be addressed: first, autonomously identifying optimal objectives for collective outcomes; second, aligning these objectives among agents. Traditional frameworks, often reliant on centralized learning, struggle with scalability and efficiency in large multi-agent sys… ▽ More

    Submitted 5 April, 2024; originally announced April 2024.

    Comments: 10 pages, 3 figures, extended version of our 2024 American Control Conference publication

    Journal ref: Proceedings of the 2024 American Control Conference (ACC), 2024

  17. arXiv:2404.01553  [pdf, other

    eess.IV

    A CT Image Denoising Method with Residual Encoder-Decoder Network

    Authors: Helena Shawn, Thompson Chyrikov, Jacob Lanet, Lam-chi Chen, Jim Zhao, Christina Chajo

    Abstract: Utilizing a low-dose CT approach significantly reduces the radiation exposure for patients, yet it introduces challenges, such as increased noise and artifacts in the resultant images, which can hinder accurate medical diagnostics. Traditional methods for noise reduction struggle with preserving image textures due to the complexity of modeling statistical properties directly within the image domai… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

    Comments: 7 pages, 2 figures, 1 table, work under preparation

  18. arXiv:2403.14135  [pdf, other

    eess.IV cs.CV

    Powerful Lossy Compression for Noisy Images

    Authors: Shilv Cai, Xiaoguo Liang, Shuning Cao, Luxin Yan, Sheng Zhong, Liqun Chen, Xu Zou

    Abstract: Image compression and denoising represent fundamental challenges in image processing with many real-world applications. To address practical demands, current solutions can be categorized into two main strategies: 1) sequential method; and 2) joint method. However, sequential methods have the disadvantage of error accumulation as there is information loss between multiple individual models. Recentl… ▽ More

    Submitted 26 March, 2024; v1 submitted 21 March, 2024; originally announced March 2024.

    Comments: Accepted by ICME 2024

  19. arXiv:2403.13941  [pdf, ps, other

    cs.RO eess.SY

    Sensory Glove-Based Surgical Robot User Interface

    Authors: Leonardo Borgioli, Ki-Hwan Oh, Alberto Mangano, Alvaro Ducas, Luciano Ambrosini, Federico Pinto, Paula A Lopez, Jessica Cassiani, Milos Zefran, Liaohai Chen, Pier Cristoforo Giulianotti

    Abstract: Robotic surgery has reached a high level of maturity and has become an integral part of standard surgical care. However, existing surgeon consoles are bulky and take up valuable space in the operating room, present challenges for surgical team coordination, and their proprietary nature makes it difficult to take advantage of recent technological advances, especially in virtual and augmented realit… ▽ More

    Submitted 20 March, 2024; originally announced March 2024.

    Comments: 6 pages, 5 figures, 7 tables, submitted to International Conference on Intelligent Robots and Systems (IROS)2024

  20. arXiv:2403.09223  [pdf, other

    cs.LG eess.SP

    MCformer: Multivariate Time Series Forecasting with Mixed-Channels Transformer

    Authors: Wenyong Han, Tao Zhu Member, Liming Chen, Huansheng Ning, Yang Luo, Yaping Wan

    Abstract: The massive generation of time-series data by largescale Internet of Things (IoT) devices necessitates the exploration of more effective models for multivariate time-series forecasting. In previous models, there was a predominant use of the Channel Dependence (CD) strategy (where each channel represents a univariate sequence). Current state-of-the-art (SOTA) models primarily rely on the Channel In… ▽ More

    Submitted 14 March, 2024; originally announced March 2024.

  21. Learning Barrier-Certified Polynomial Dynamical Systems for Obstacle Avoidance with Robots

    Authors: Martin Schonger, Hugo T. M. Kussaba, Lingyun Chen, Luis Figueredo, Abdalla Swikir, Aude Billard, Sami Haddadin

    Abstract: Established techniques that enable robots to learn from demonstrations are based on learning a stable dynamical system (DS). To increase the robots' resilience to perturbations during tasks that involve static obstacle avoidance, we propose incorporating barrier certificates into an optimization problem to learn a stable and barrier-certified DS. Such optimization problem can be very complex or ex… ▽ More

    Submitted 12 March, 2024; originally announced March 2024.

    Comments: 7 pages, 7 figures, accepted to the 2024 IEEE International Conference on Robotics and Automation (ICRA 2024)

    MSC Class: 68T40 ACM Class: I.2.9

  22. arXiv:2403.05834  [pdf, other

    cs.MM cs.SD eess.AS

    Enhancing Expressiveness in Dance Generation via Integrating Frequency and Music Style Information

    Authors: Qiaochu Huang, Xu He, Boshi Tang, Haolin Zhuang, Liyang Chen, Shuochen Gao, Zhiyong Wu, Haozhi Huang, Helen Meng

    Abstract: Dance generation, as a branch of human motion generation, has attracted increasing attention. Recently, a few works attempt to enhance dance expressiveness, which includes genre matching, beat alignment, and dance dynamics, from certain aspects. However, the enhancement is quite limited as they lack comprehensive consideration of the aforementioned three factors. In this paper, we propose Expressi… ▽ More

    Submitted 9 March, 2024; originally announced March 2024.

  23. arXiv:2403.02419  [pdf, other

    cs.LG cs.AI cs.CL eess.SY

    Are More LLM Calls All You Need? Towards Scaling Laws of Compound Inference Systems

    Authors: Lingjiao Chen, Jared Quincy Davis, Boris Hanin, Peter Bailis, Ion Stoica, Matei Zaharia, James Zou

    Abstract: Many recent state-of-the-art results in language tasks were achieved using compound systems that perform multiple Language Model (LM) calls and aggregate their responses. However, there is little understanding of how the number of LM calls - e.g., when asking the LM to answer each question multiple times and taking a majority vote - affects such a compound system's performance. In this paper, we i… ▽ More

    Submitted 4 June, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

  24. arXiv:2402.15756  [pdf, other

    cs.CV eess.SP

    Detection Is Tracking: Point Cloud Multi-Sweep Deep Learning Models Revisited

    Authors: Lingji Chen

    Abstract: Conventional tracking paradigm takes in instantaneous measurements such as range and bearing, and produces object tracks across time. In applications such as autonomous driving, lidar measurements in the form of point clouds are usually passed through a "virtual sensor" realized by a deep learning model, to produce "measurements" such as bounding boxes, which are in turn ingested by a tracking mod… ▽ More

    Submitted 6 April, 2024; v1 submitted 24 February, 2024; originally announced February 2024.

  25. arXiv:2402.03468  [pdf, other

    cs.LG eess.SP

    Exact Tensor Completion Powered by Slim Transforms

    Authors: Li Ge, Lin Chen, Yudong Chen, Xue Jiang

    Abstract: In this work, a tensor completion problem is studied, which aims to perfectly recover the tensor from partial observations. The existing theoretical guarantee requires the involved transform to be orthogonal, which hinders its applications. In this paper, jumping out of the constraints of isotropy and self-adjointness, the theoretical guarantee of exact tensor completion with arbitrary linear tran… ▽ More

    Submitted 15 August, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

  26. arXiv:2402.01271  [pdf, other

    eess.AS cs.SD

    An Intra-BRNN and GB-RVQ Based END-TO-END Neural Audio Codec

    Authors: Linping Xu, Jiawei Jiang, Dejun Zhang, Xianjun Xia, Li Chen, Yijian Xiao, Piao Ding, Shenyi Song, Sixing Yin, Ferdous Sohel

    Abstract: Recently, neural networks have proven to be effective in performing speech coding task at low bitrates. However, under-utilization of intra-frame correlations and the error of quantizer specifically degrade the reconstructed audio quality. To improve the coding quality, we present an end-to-end neural speech codec, namely CBRC (Convolutional and Bidirectional Recurrent neural Codec). An interleave… ▽ More

    Submitted 2 February, 2024; originally announced February 2024.

    Comments: INTERSPEECH 2023

  27. arXiv:2401.14717  [pdf, other

    cs.CL cs.AI cs.LG cs.SD eess.AS

    Turn-taking and Backchannel Prediction with Acoustic and Large Language Model Fusion

    Authors: Jinhan Wang, Long Chen, Aparna Khare, Anirudh Raju, Pranav Dheram, Di He, Minhua Wu, Andreas Stolcke, Venkatesh Ravichandran

    Abstract: We propose an approach for continuous prediction of turn-taking and backchanneling locations in spoken dialogue by fusing a neural acoustic model with a large language model (LLM). Experiments on the Switchboard human-human conversation dataset demonstrate that our approach consistently outperforms the baseline models with single modality. We also develop a novel multi-task instruction fine-tuning… ▽ More

    Submitted 26 January, 2024; originally announced January 2024.

    Comments: To appear in IEEE ICASSP 2024

  28. arXiv:2401.13442  [pdf, other

    cs.IT eess.SP

    Finite-Precision Arithmetic Transceiver for Massive MIMO Systems

    Authors: Yiming Fang, Li Chen, Yunfei Chen, Huarui Yin

    Abstract: Efficient implementation of massive multiple-input-multiple-output (MIMO) transceivers is essential for the next-generation wireless networks. To reduce the high computational complexity of the massive MIMO transceiver, in this paper, we propose a new massive MIMO architecture using finite-precision arithmetic. First, we conduct the rounding error analysis and derive the lower bound of the achieva… ▽ More

    Submitted 25 March, 2024; v1 submitted 24 January, 2024; originally announced January 2024.

    Comments: 16 pages, 8 figures. Submitted to IEEE JSAC for possible publication

  29. Adversarial speech for voice privacy protection from Personalized Speech generation

    Authors: Shihao Chen, Liping Chen, Jie Zhang, KongAik Lee, Zhenhua Ling, Lirong Dai

    Abstract: The rapid progress in personalized speech generation technology, including personalized text-to-speech (TTS) and voice conversion (VC), poses a challenge in distinguishing between generated and real speech for human listeners, resulting in an urgent demand in protecting speakers' voices from malicious misuse. In this regard, we propose a speaker protection method based on adversarial attacks. The… ▽ More

    Submitted 22 January, 2024; originally announced January 2024.

    Comments: Accepted by icassp 2024

  30. Two-pass Endpoint Detection for Speech Recognition

    Authors: Anirudh Raju, Aparna Khare, Di He, Ilya Sklyar, Long Chen, Sam Alptekin, Viet Anh Trinh, Zhe Zhang, Colin Vaz, Venkatesh Ravichandran, Roland Maas, Ariya Rastrow

    Abstract: Endpoint (EP) detection is a key component of far-field speech recognition systems that assist the user through voice commands. The endpoint detector has to trade-off between accuracy and latency, since waiting longer reduces the cases of users being cut-off early. We propose a novel two-pass solution for endpointing, where the utterance endpoint detected from a first pass endpointer is verified b… ▽ More

    Submitted 16 January, 2024; originally announced January 2024.

    Comments: ASRU 2023

  31. arXiv:2401.07422  [pdf, other

    eess.SP

    Multiperson Detection and Vital-Sign Sensing Empowered by Space-Time-Coding RISs

    Authors: Xinyu Li, Jian Wei You, Ze Gu, Qian Ma, Jingyuan Zhang, Long Chen, Tie Jun Cui

    Abstract: Passive human sensing using wireless signals has attracted increasing attention due to its superiorities of non-contact and robustness in various lighting conditions. However, when multiple human individuals are present, their reflected signals could be intertwined in the time, frequency and spatial domains, making it challenging to separate them. To address this issue, this paper proposes a novel… ▽ More

    Submitted 14 January, 2024; originally announced January 2024.

  32. arXiv:2401.05690  [pdf, other

    cs.IT eess.SP

    Sparse Array Enabled Near-Field Communications: Beam Pattern Analysis and Hybrid Beamforming Design

    Authors: Cong Zhou, Changsheng You, Haodong Zhang, Li Chen, Shuo Shi

    Abstract: Extremely large-scale array (XL-array) has emerged as a promising technology to enable near-field communications for achieving enhanced spectrum efficiency and spatial resolution, by drastically increasing the number of antennas. However, this also inevitably incurs higher hardware and energy cost, which may not be affordable in future wireless systems. To address this issue, we propose in this pa… ▽ More

    Submitted 11 January, 2024; originally announced January 2024.

    Comments: In this paper, we propose to exploit sparse arrays for enabling near-field communications and characterize its unique beam pattern for facilitating its hybrid beamforming design

  33. arXiv:2401.03150  [pdf, other

    eess.IV

    O-PRESS: Boosting OCT axial resolution with Prior guidance, Recurrence, and Equivariant Self-Supervision

    Authors: Kaiyan Li, Jingyuan Yang, Wenxuan Liang, Xingde Li, Chenxi Zhang, Lulu Chen, Chan Wu, Xiao Zhang, Zhiyan Xu, Yuelin Wang, Lihui Meng, Yue Zhang, Youxin Chen, S. Kevin Zhou

    Abstract: Optical coherence tomography (OCT) is a noninvasive technology that enables real-time imaging of tissue microanatomies. The axial resolution of OCT is intrinsically constrained by the spectral bandwidth of the employed light source while maintaining a fixed center wavelength for a specific application. Physically extending this bandwidth faces strong limitations and requires a substantial cost. We… ▽ More

    Submitted 6 January, 2024; originally announced January 2024.

  34. arXiv:2401.01794  [pdf, other

    eess.SP

    Joint Channel Estimation and Data Recovery for Millimeter Massive MIMO: Using Pilot to Capture Principal Components

    Authors: Shusen Cai, Li Chen, Yunfei Chen, Huarui Yin, Weidong Wang

    Abstract: Channel state information (CSI) is important to reap the full benefits of millimeter wave (mmWave) massive multiple-input multiple-output (MIMO) systems. The traditional channel estimation methods using pilot frames (PF) lead to excessive overhead. To reduce the demand for PF, data frames (DF) can be adopted for joint channel estimation and data recovery. However, the computational complexity of t… ▽ More

    Submitted 3 January, 2024; originally announced January 2024.

    Comments: 16 pages,11 figures,submitted to IEEE transactions on communications

  35. arXiv:2401.00283  [pdf, other

    cs.IT eess.SP

    Near-Space Communications: the Last Piece of 6G Space-Air-Ground-Sea Integrated Network Puzzle

    Authors: Hongshan Liu, Tong Qin, Zhen Gao, Tianqi Mao, Keke Ying, Ziwei Wan, Li Qiao, Rui Na, Zhongxiang Li, Chun Hu, Yikun Mei, Tuan Li, Guanghui Wen, Lei Chen, Zhonghuai Wu, Ruiqi Liu, Gaojie Chen, Shuo Wang, Dezhi Zheng

    Abstract: This article presents a comprehensive study on the emerging near-space communications (NS-COM) within the context of space-air-ground-sea integrated network (SAGSIN). Specifically, we firstly explore the recent technical developments of NS-COM, followed by the discussions about motivations behind integrating NS-COM into SAGSIN. To further demonstrate the necessity of NS-COM, a comparative analysis… ▽ More

    Submitted 4 March, 2024; v1 submitted 30 December, 2023; originally announced January 2024.

    Comments: 28 pages, 8 figures, 2 tables

  36. arXiv:2312.16149  [pdf, other

    cs.SD eess.AS

    SoundCount: Sound Counting from Raw Audio with Dyadic Decomposition Neural Network

    Authors: Yuhang He, Zhuangzhuang Dai, Long Chen, Niki Trigoni, Andrew Markham

    Abstract: In this paper, we study an underexplored, yet important and challenging problem: counting the number of distinct sounds in raw audio characterized by a high degree of polyphonicity. We do so by systematically proposing a novel end-to-end trainable neural network (which we call DyDecNet, consisting of a dyadic decomposition front-end and backbone network), and quantifying the difficulty level of co… ▽ More

    Submitted 26 December, 2023; originally announced December 2023.

    Comments: AAAI2024 Paper

  37. arXiv:2312.14392  [pdf, other

    eess.SP

    Wideband Sample Rate Converter Using Cascaded Parallel-serial Structure for Synthetic Instrumentation

    Authors: Ruiyuan Ming, Peng Ye, Kuojun Yang, Zhixiang Pan, Li chen, Xuetao Liu

    Abstract: A sample rate converter(SRC) is designed to adjust the sampling rate of digital signals flexibly for different application requirements in the broadband signal processing system. In this paper, a novel parallel-serial structure is proposed to improve the bandwidth and flexibility of SRC. The core of this structure is a parallel decimation filter followed by a serial counterpart, the parallel part… ▽ More

    Submitted 21 December, 2023; originally announced December 2023.

    Comments: 12 pages, 15 figures

  38. arXiv:2312.10992  [pdf

    eess.SY

    A Hybrid Intelligent Framework for Maximising SAG Mill Throughput: An Integration of Expert Knowledge, Machine Learning and Evolutionary Algorithms for Parameter Optimisation

    Authors: Zahra Ghasemi, Mehdi Neshat, Chris Aldrich, John Karageorgos, Max Zanin, Frank Neumann, Lei Chen

    Abstract: In mineral processing plants, grinding is a crucial step, accounting for approximately 50 percent of the total mineral processing costs. Semi-autogenous grinding mills are extensively employed in the grinding circuit of mineral processing plants. Maximizing SAG mill throughput is of significant importance considering its profound financial outcomes. However, the optimum process parameter setting a… ▽ More

    Submitted 18 December, 2023; originally announced December 2023.

  39. arXiv:2312.09022  [pdf, other

    eess.IV cs.CV q-bio.NC

    BDHT: Generative AI Enables Causality Analysis for Mild Cognitive Impairment

    Authors: Qiankun Zuo, Ling Chen, Yanyan Shen, Michael Kwok-Po Ng, Baiying Lei, Shuqiang Wang

    Abstract: Effective connectivity estimation plays a crucial role in understanding the interactions and information flow between different brain regions. However, the functional time series used for estimating effective connectivity is derived from certain software, which may lead to large computing errors because of different parameter settings and degrade the ability to model complex causal relationships b… ▽ More

    Submitted 28 May, 2024; v1 submitted 14 December, 2023; originally announced December 2023.

    Comments: 13pages, 14 figures

  40. arXiv:2312.04549  [pdf, other

    cs.RO cs.AI cs.CV cs.LG eess.SY

    PlayFusion: Skill Acquisition via Diffusion from Language-Annotated Play

    Authors: Lili Chen, Shikhar Bahl, Deepak Pathak

    Abstract: Learning from unstructured and uncurated data has become the dominant paradigm for generative approaches in language and vision. Such unstructured and unguided behavior data, commonly known as play, is also easier to collect in robotics but much more difficult to learn from due to its inherently multimodal, noisy, and suboptimal nature. In this paper, we study this problem of learning goal-directe… ▽ More

    Submitted 7 December, 2023; originally announced December 2023.

    Comments: In CoRL 2023. Website at https://play-fusion.github.io

  41. arXiv:2311.18245  [pdf, other

    eess.IV cs.CV

    Automatic Detection of Alzheimer's Disease with Multi-Modal Fusion of Clinical MRI Scans

    Authors: Long Chen, Liben Chen, Binfeng Xu, Wenxin Zhang, Narges Razavian

    Abstract: The aging population of the U.S. drives the prevalence of Alzheimer's disease. Brookmeyer et al. forecasts approximately 15 million Americans will have either clinical AD or mild cognitive impairment by 2060. In response to this urgent call, methods for early detection of Alzheimer's disease have been developed for prevention and pre-treatment. Notably, literature on the application of deep learni… ▽ More

    Submitted 29 November, 2023; originally announced November 2023.

  42. arXiv:2311.15582  [pdf, other

    cs.SD cs.LG eess.AS

    Lightly Weighted Automatic Audio Parameter Extraction for the Quality Assessment of Consensus Auditory-Perceptual Evaluation of Voice

    Authors: Yi-Heng Lin, Wen-Hsuan Tseng, Li-Chin Chen, Ching-Ting Tan, Yu Tsao

    Abstract: The Consensus Auditory-Perceptual Evaluation of Voice is a widely employed tool in clinical voice quality assessment that is significant for streaming communication among clinical professionals and benchmarking for the determination of further treatment. Currently, because the assessment relies on experienced clinicians, it tends to be inconsistent, and thus, difficult to standardize. To address t… ▽ More

    Submitted 27 November, 2023; originally announced November 2023.

    Comments: Published in IEEE 42th International Conference on Consumer Electronics (ICCE 2024)

  43. arXiv:2311.12071  [pdf, other

    eess.IV cs.CV cs.LG

    Enhancing Low-dose CT Image Reconstruction by Integrating Supervised and Unsupervised Learning

    Authors: Ling Chen, Zhishen Huang, Yong Long, Saiprasad Ravishankar

    Abstract: Traditional model-based image reconstruction (MBIR) methods combine forward and noise models with simple object priors. Recent application of deep learning methods for image reconstruction provides a successful data-driven approach to addressing the challenges when reconstructing images with undersampled measurements or various types of noise. In this work, we propose a hybrid supervised-unsupervi… ▽ More

    Submitted 19 November, 2023; originally announced November 2023.

    Comments: submitted to IEEE Transactions on Medical Imaging

  44. arXiv:2311.07873  [pdf, other

    eess.SP

    Passive Human Sensing Enhanced by Reconfigurable Intelligent Surface: Opportunities and Challenges

    Authors: Xinyu Li, Jian Wei You, Ze Gu, Qian Ma, Long Chen, Jingyuan Zhang, Shi Jin, Tie Jun Cui

    Abstract: Reconfigurable intelligent surfaces (RISs) have flexible and exceptional performance in manipulating electromagnetic waves and customizing wireless channels. These capabilities enable them to provide a plethora of valuable activity-related information for promoting wireless human sensing. In this article, we present a comprehensive review of passive human sensing using radio frequency signals with… ▽ More

    Submitted 13 November, 2023; originally announced November 2023.

  45. arXiv:2310.18498  [pdf, ps, other

    eess.IV cs.CV cs.LG

    GPT-4 Vision on Medical Image Classification -- A Case Study on COVID-19 Dataset

    Authors: Ruibo Chen, Tianyi Xiong, Yihan Wu, Guodong Liu, Zhengmian Hu, Lichang Chen, Yanshuo Chen, Chenxi Liu, Heng Huang

    Abstract: This technical report delves into the application of GPT-4 Vision (GPT-4V) in the nuanced realm of COVID-19 image classification, leveraging the transformative potential of in-context learning to enhance diagnostic processes.

    Submitted 27 October, 2023; originally announced October 2023.

  46. arXiv:2310.17327  [pdf, ps, other

    cs.IT eess.SP

    Near-Field Positioning and Attitude Sensing Based on Electromagnetic Propagation Modeling

    Authors: Ang Chen, Li Chen, Yunfei Chen, Nan Zhao, Changsheng You

    Abstract: Positioning and sensing over wireless networks are imperative for many emerging applications. However, since traditional wireless channel models over-simplify the user equipment (UE) as a point target, they cannot be used for sensing the attitude of the UE, which is typically described by the spatial orientation. In this paper, a comprehensive electromagnetic propagation modeling (EPM) based on el… ▽ More

    Submitted 13 May, 2024; v1 submitted 26 October, 2023; originally announced October 2023.

    Comments: 19 pages, 13 figures. Accepted by IEEE Journal on Selected Areas in Communications

  47. Contrastive Self-Supervised Learning for Spatio-Temporal Analysis of Lung Ultrasound Videos

    Authors: Li Chen, Jonathan Rubin, Jiahong Ouyang, Naveen Balaraju, Shubham Patil, Courosh Mehanian, Sourabh Kulhare, Rachel Millin, Kenton W Gregory, Cynthia R Gregory, Meihua Zhu, David O Kessler, Laurie Malia, Almaz Dessie, Joni Rabiner, Di Coneybeare, Bo Shopsin, Andrew Hersh, Cristian Madar, Jeffrey Shupp, Laura S Johnson, Jacob Avila, Kristin Dwyer, Peter Weimersheimer, Balasundar Raju , et al. (2 additional authors not shown)

    Abstract: Self-supervised learning (SSL) methods have shown promise for medical imaging applications by learning meaningful visual representations, even when the amount of labeled data is limited. Here, we extend state-of-the-art contrastive learning SSL methods to 2D+time medical ultrasound video data by introducing a modified encoder and augmentation method capable of learning meaningful spatio-temporal r… ▽ More

    Submitted 14 October, 2023; originally announced October 2023.

    Comments: ISBI 2023, 2023 IEEE 20th International Symposium on Biomedical Imaging (ISBI)

  48. arXiv:2310.09625  [pdf, other

    eess.IV cs.CV

    JSMoCo: Joint Coil Sensitivity and Motion Correction in Parallel MRI with a Self-Calibrating Score-Based Diffusion Model

    Authors: Lixuan Chen, Xuanyu Tian, Jiangjie Wu, Ruimin Feng, Guoyan Lao, Yuyao Zhang, Hongjiang Wei

    Abstract: Magnetic Resonance Imaging (MRI) stands as a powerful modality in clinical diagnosis. However, it is known that MRI faces challenges such as long acquisition time and vulnerability to motion-induced artifacts. Despite the success of many existing motion correction algorithms, there has been limited research focused on correcting motion artifacts on the estimated coil sensitivity maps for fast MRI… ▽ More

    Submitted 14 October, 2023; originally announced October 2023.

    Comments: 10 pages,8 figures, journal

  49. arXiv:2310.03749  [pdf

    eess.SP cs.AI cs.LG

    SCVCNet: Sliding cross-vector convolution network for cross-task and inter-individual-set EEG-based cognitive workload recognition

    Authors: Qi Wang, Li Chen, Zhiyuan Zhan, Jianhua Zhang, Zhong Yin

    Abstract: This paper presents a generic approach for applying the cognitive workload recognizer by exploiting common electroencephalogram (EEG) patterns across different human-machine tasks and individual sets. We propose a neural network called SCVCNet, which eliminates task- and individual-set-related interferences in EEGs by analyzing finer-grained frequency structures in the power spectral densities. Th… ▽ More

    Submitted 21 September, 2023; originally announced October 2023.

    Comments: 12 pages

  50. arXiv:2310.03443  [pdf, ps, other

    cs.CL cs.SD eess.AS

    The North System for Formosa Speech Recognition Challenge 2023

    Authors: Li-Wei Chen, Kai-Chen Cheng, Hung-Shin Lee

    Abstract: This report provides a concise overview of the proposed North system, which aims to achieve automatic word/syllable recognition for Taiwanese Hakka (Sixian). The report outlines three key components of the system: the acquisition, composition, and utilization of the training data; the architecture of the model; and the hardware specifications and operational statistics. The demonstration of the sy… ▽ More

    Submitted 5 October, 2023; v1 submitted 5 October, 2023; originally announced October 2023.