Zum Hauptinhalt springen

Showing 1–50 of 121 results for author: Chen, D

Searching in archive eess. Search in all archives.
.
  1. arXiv:2408.16277  [pdf

    eess.IV cs.CV

    Fine-grained Classification of Port Wine Stains Using Optical Coherence Tomography Angiography

    Authors: Xiaofeng Deng, Defu Chen, Bowen Liu, Xiwan Zhang, Haixia Qiu, Wu Yuan, Hongliang Ren

    Abstract: Accurate classification of port wine stains (PWS, vascular malformations present at birth), is critical for subsequent treatment planning. However, the current method of classifying PWS based on the external skin appearance rarely reflects the underlying angiopathological heterogeneity of PWS lesions, resulting in inconsistent outcomes with the common vascular-targeted photodynamic therapy (V-PDT)… ▽ More

    Submitted 29 August, 2024; originally announced August 2024.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  2. arXiv:2407.19763  [pdf, other

    eess.IV cs.CV

    TeleOR: Real-time Telemedicine System for Full-Scene Operating Room

    Authors: Yixuan Wu, Kaiyuan Hu, Qian Shao, Jintai Chen, Danny Z. Chen, Jian Wu

    Abstract: The advent of telemedicine represents a transformative development in leveraging technology to extend the reach of specialized medical expertise to remote surgeries, a field where the immediacy of expert guidance is paramount. However, the intricate dynamics of Operating Room (OR) scene pose unique challenges for telemedicine, particularly in achieving high-fidelity, real-time scene reconstruction… ▽ More

    Submitted 29 July, 2024; originally announced July 2024.

  3. arXiv:2407.10310  [pdf, other

    cs.CY eess.SY

    Impact of Different Infrastructures and Traffic Scenarios on Behavioral and Physiological Responses of E-scooter Users

    Authors: Dong Chen, Arman Hosseini, Arik Smith, David Xiang, Arsalan Heydarian, Omid Shoghli, Bradford Campbell

    Abstract: As micromobility devices such as e-scooters gain global popularity, emergency departments around the world have observed a rising trend in related injuries. However, the majority of current research on e-scooter safety relies heavily on surveys, news reports, and data from vendors, with a noticeable scarcity of naturalistic studies examining the effects of riders' behaviors and physiological respo… ▽ More

    Submitted 5 May, 2024; originally announced July 2024.

    Comments: 6 pages, 8 figures

  4. arXiv:2406.19485  [pdf, other

    eess.IV cs.CV

    GAPNet: Granularity Attention Network with Anatomy-Prior-Constraint for Carotid Artery Segmentation

    Authors: Lin Zhang, Chenggang Lu, Xin-yang Shi, Caifeng Shan, Jiong Zhang, Da Chen, Laurent D. Cohen

    Abstract: Atherosclerosis is a chronic, progressive disease that primarily affects the arterial walls. It is one of the major causes of cardiovascular disease. Magnetic Resonance (MR) black-blood vessel wall imaging (BB-VWI) offers crucial insights into vascular disease diagnosis by clearly visualizing vascular structures. However, the complex anatomy of the neck poses challenges in distinguishing the carot… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

  5. arXiv:2406.13340  [pdf, other

    cs.CL cs.SD eess.AS

    SD-Eval: A Benchmark Dataset for Spoken Dialogue Understanding Beyond Words

    Authors: Junyi Ao, Yuancheng Wang, Xiaohai Tian, Dekun Chen, Jun Zhang, Lu Lu, Yuxuan Wang, Haizhou Li, Zhizheng Wu

    Abstract: Speech encompasses a wealth of information, including but not limited to content, paralinguistic, and environmental information. This comprehensive nature of speech significantly impacts communication and is crucial for human-computer interaction. Chat-Oriented Large Language Models (LLMs), known for their general-purpose assistance capabilities, have evolved to handle multi-modal inputs, includin… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

  6. arXiv:2406.11653  [pdf, other

    eess.SY

    Communication-Efficient MARL for Platoon Stability and Energy-efficiency Co-optimization in Cooperative Adaptive Cruise Control of CAVs

    Authors: Min Hua, Dong Chen, Kun Jiang, Fanggang Zhang, Jinhai Wang, Bo Wang, Quan Zhou, Hongming Xu

    Abstract: Cooperative adaptive cruise control (CACC) has been recognized as a fundamental function of autonomous driving, in which platoon stability and energy efficiency are outstanding challenges that are difficult to accommodate in real-world operations. This paper studied the CACC of connected and autonomous vehicles (CAVs) based on the multi-agent reinforcement learning algorithm (MARL) to optimize pla… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  7. arXiv:2406.10358  [pdf, other

    cs.CR eess.SY

    I Still See You: Why Existing IoT Traffic Reshaping Fails

    Authors: Su Wang, Keyang Yu, Qi Li, Dong Chen

    Abstract: The Internet traffic data produced by the Internet of Things (IoT) devices are collected by Internet Service Providers (ISPs) and device manufacturers, and often shared with their third parties to maintain and enhance user services. Unfortunately, on-path adversaries could infer and fingerprint users' sensitive privacy information such as occupancy and user activities by analyzing these network tr… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: EWSN'24 paper accepted, to appear

  8. arXiv:2406.05924  [pdf, other

    eess.SP

    Imageless Contraband Detection Using a Millimeter-Wave Dynamic Antenna Array via Spatial Fourier Domain Sampling

    Authors: Daniel Chen, Anton Schlegel, Jeffrey A. Nanzer

    Abstract: We demonstrate an imageless method of concealed contraband detection using a real-time 75 GHz rotationally dynamic antenna array. The array measures information in the two-dimensional Fourier domain and captures a set of samples that is sufficient for detecting concealed objects yet insufficient for generating full image, thereby preserving the privacy of screened subjects. The small set of Fourie… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  9. arXiv:2406.00341  [pdf, other

    eess.IV cs.CV

    DSCA: A Digital Subtraction Angiography Sequence Dataset and Spatio-Temporal Model for Cerebral Artery Segmentation

    Authors: Qihang Xie, Mengguo Guo, Lei Mou, Dan Zhang, Da Chen, Caifeng Shan, Yitian Zhao, Ruisheng Su, Jiong Zhang

    Abstract: Cerebrovascular diseases (CVDs) remain a leading cause of global disability and mortality. Digital Subtraction Angiography (DSA) sequences, recognized as the golden standard for diagnosing CVDs, can clearly visualize the dynamic flow and reveal pathological conditions within the cerebrovasculature. Therefore, precise segmentation of cerebral arteries (CAs) and classification between their main tru… ▽ More

    Submitted 1 June, 2024; originally announced June 2024.

  10. arXiv:2405.03039  [pdf

    cs.CV eess.SY

    Performance Evaluation of Real-Time Object Detection for Electric Scooters

    Authors: Dong Chen, Arman Hosseini, Arik Smith, Amir Farzin Nikkhah, Arsalan Heydarian, Omid Shoghli, Bradford Campbell

    Abstract: Electric scooters (e-scooters) have rapidly emerged as a popular mode of transportation in urban areas, yet they pose significant safety challenges. In the United States, the rise of e-scooters has been marked by a concerning increase in related injuries and fatalities. Recently, while deep-learning object detection holds paramount significance in autonomous vehicles to avoid potential collisions,… ▽ More

    Submitted 5 May, 2024; originally announced May 2024.

    Comments: 10 pages, 3 figures

  11. arXiv:2404.19087  [pdf, other

    cs.RO cs.AI cs.LG eess.SY

    Deep Reinforcement Learning for Advanced Longitudinal Control and Collision Avoidance in High-Risk Driving Scenarios

    Authors: Dianwei Chen, Yaobang Gong, Xianfeng Yang

    Abstract: Existing Advanced Driver Assistance Systems primarily focus on the vehicle directly ahead, often overlooking potential risks from following vehicles. This oversight can lead to ineffective handling of high risk situations, such as high speed, closely spaced, multi vehicle scenarios where emergency braking by one vehicle might trigger a pile up collision. To overcome these limitations, this study i… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

  12. arXiv:2403.12852  [pdf, other

    eess.IV cs.CV

    Generative Enhancement for 3D Medical Images

    Authors: Lingting Zhu, Noel Codella, Dongdong Chen, Zhenchao Jin, Lu Yuan, Lequan Yu

    Abstract: The limited availability of 3D medical image datasets, due to privacy concerns and high collection or annotation costs, poses significant challenges in the field of medical imaging. While a promising alternative is the use of synthesized medical data, there are few solutions for realistic 3D medical image synthesis due to difficulties in backbone design and fewer 3D training samples compared to 2D… ▽ More

    Submitted 24 May, 2024; v1 submitted 19 March, 2024; originally announced March 2024.

    Comments: 20 pages, 8 figures

  13. arXiv:2403.08931  [pdf, ps, other

    eess.SY

    Unleashing the True Power of Age-of-Information: Service Aggregation in Connected and Autonomous Vehicles

    Authors: Anik Mallik, Dawei Chen, Kyungtae Han, Jiang Xie, Zhu Han

    Abstract: Connected and autonomous vehicles (CAVs) rely heavily upon time-sensitive information update services to ensure the safety of people and assets, and satisfactory entertainment applications. Therefore, the freshness of information is a crucial performance metric for CAV services. However, information from roadside sensors and nearby vehicles can get delayed in transmission due to the high mobility… ▽ More

    Submitted 13 March, 2024; originally announced March 2024.

    Comments: 6 pages, 8 figures, to appear in the Proceedings of IEEE International Conference on Communications (IEEE ICC, 9-13 June 2024, Denver, CO, USA)

  14. arXiv:2403.03390  [pdf, other

    cs.CV cs.LG eess.IV

    Performance Evaluation of Semi-supervised Learning Frameworks for Multi-Class Weed Detection

    Authors: Jiajia Li, Dong Chen, Xunyuan Yin, Zhaojian Li

    Abstract: Effective weed control plays a crucial role in optimizing crop yield and enhancing agricultural product quality. However, the reliance on herbicide application not only poses a critical threat to the environment but also promotes the emergence of resistant weeds. Fortunately, recent advances in precision weed management enabled by ML and DL provide a sustainable alternative. Despite great progress… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

    Comments: 11 pages, 7 figures

  15. Human Activity Recognition with Low-Resolution Infrared Array Sensor Using Semi-supervised Cross-domain Neural Networks for Indoor Environment

    Authors: Cunyi Yin, Xiren Miao, Jing Chen, Hao Jiang, Deying Chen, Yixuan Tong, Shaocong Zheng

    Abstract: Low-resolution infrared-based human activity recognition (HAR) attracted enormous interests due to its low-cost and private. In this paper, a novel semi-supervised crossdomain neural network (SCDNN) based on 8 $\times$ 8 low-resolution infrared sensor is proposed for accurately identifying human activity despite changes in the environment at a low-cost. The SCDNN consists of feature extractor, dom… ▽ More

    Submitted 4 March, 2024; originally announced March 2024.

  16. arXiv:2402.16907  [pdf, other

    eess.IV cs.CV cs.LG

    Diffusion Posterior Proximal Sampling for Image Restoration

    Authors: Hongjie Wu, Linchao He, Mingqin Zhang, Dongdong Chen, Kunming Luo, Mengting Luo, Ji-Zhe Zhou, Hu Chen, Jiancheng Lv

    Abstract: Diffusion models have demonstrated remarkable efficacy in generating high-quality samples. Existing diffusion-based image restoration algorithms exploit pre-trained diffusion models to leverage data priors, yet they still preserve elements inherited from the unconditional generation paradigm. These strategies initiate the denoising process with pure white noise and incorporate random noise at each… ▽ More

    Submitted 6 August, 2024; v1 submitted 24 February, 2024; originally announced February 2024.

    Comments: ACM Multimedia 2024 Oral

  17. arXiv:2402.03695  [pdf, other

    eess.IV cs.CV

    ConUNETR: A Conditional Transformer Network for 3D Micro-CT Embryonic Cartilage Segmentation

    Authors: Nishchal Sapkota, Yejia Zhang, Susan M. Motch Perrine, Yuhan Hsi, Sirui Li, Meng Wu, Greg Holmes, Abdul R. Abdulai, Ethylin W. Jabs, Joan T. Richtsmeier, Danny Z Chen

    Abstract: Studying the morphological development of cartilaginous and osseous structures is critical to the early detection of life-threatening skeletal dysmorphology. Embryonic cartilage undergoes rapid structural changes within hours, introducing biological variations and morphological shifts that limit the generalization of deep learning-based segmentation models that infer across multiple embryonic age… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

    Comments: Published in ISBI 2024

  18. arXiv:2312.09899  [pdf, other

    eess.IV cs.CV cs.LG

    SQA-SAM: Segmentation Quality Assessment for Medical Images Utilizing the Segment Anything Model

    Authors: Yizhe Zhang, Shuo Wang, Tao Zhou, Qi Dou, Danny Z. Chen

    Abstract: Segmentation quality assessment (SQA) plays a critical role in the deployment of a medical image based AI system. Users need to be informed/alerted whenever an AI system generates unreliable/incorrect predictions. With the introduction of the Segment Anything Model (SAM), a general foundation segmentation model, new research opportunities emerged in how one can utilize SAM for medical image segmen… ▽ More

    Submitted 15 December, 2023; originally announced December 2023.

    Comments: Work in progress;

  19. arXiv:2312.07212  [pdf, other

    cs.MM cs.AI cs.SD eess.AS

    More than Vanilla Fusion: a Simple, Decoupling-free, Attention Module for Multimodal Fusion Based on Signal Theory

    Authors: Peiwen Sun, Yifan Zhang, Zishan Liu, Donghao Chen, Honggang Zhang

    Abstract: The vanilla fusion methods still dominate a large percentage of mainstream audio-visual tasks. However, the effectiveness of vanilla fusion from a theoretical perspective is still worth discussing. Thus, this paper reconsiders the signal fused in the multimodal case from a bionics perspective and proposes a simple, plug-and-play, attention module for vanilla fusion based on fundamental signal theo… ▽ More

    Submitted 12 December, 2023; originally announced December 2023.

  20. arXiv:2312.05930  [pdf, other

    eess.IV cs.CV cs.LG

    A Comprehensive Dataset and Automated Pipeline for Nailfold Capillary Analysis

    Authors: Linxi Zhao, Jiankai Tang, Dongyu Chen, Xiaohong Liu, Yong Zhou, Yuanchun Shi, Guangyu Wang, Yuntao Wang

    Abstract: Nailfold capillaroscopy is widely used in assessing health conditions, highlighting the pressing need for an automated nailfold capillary analysis system. In this study, we present a pioneering effort in constructing a comprehensive nailfold capillary dataset-321 images, 219 videos from 68 subjects, with clinic reports and expert annotations-that serves as a crucial resource for training deep-lear… ▽ More

    Submitted 14 March, 2024; v1 submitted 10 December, 2023; originally announced December 2023.

    Comments: Dataset, code, pretrained models: https://github.com/THU-CS-PI-LAB/ANFC-Automated-Nailfold-Capillary

  21. arXiv:2311.17791  [pdf, other

    eess.IV cs.CV

    U-Net v2: Rethinking the Skip Connections of U-Net for Medical Image Segmentation

    Authors: Yaopeng Peng, Milan Sonka, Danny Z. Chen

    Abstract: In this paper, we introduce U-Net v2, a new robust and efficient U-Net variant for medical image segmentation. It aims to augment the infusion of semantic information into low-level features while simultaneously refining high-level features with finer details. For an input image, we begin by extracting multi-level features with a deep neural network encoder. Next, we enhance the feature map of eac… ▽ More

    Submitted 30 March, 2024; v1 submitted 29 November, 2023; originally announced November 2023.

  22. arXiv:2311.17243  [pdf, other

    cs.CV eess.IV

    PHG-Net: Persistent Homology Guided Medical Image Classification

    Authors: Yaopeng Peng, Hongxiao Wang, Milan Sonka, Danny Z. Chen

    Abstract: Modern deep neural networks have achieved great successes in medical image analysis. However, the features captured by convolutional neural networks (CNNs) or Transformers tend to be optimized for pixel intensities and neglect key anatomical structures such as connected components and loops. In this paper, we propose a persistent homology guided approach (PHG-Net) that explores topological feature… ▽ More

    Submitted 28 November, 2023; originally announced November 2023.

    Comments: Accepted by WACV 2024

  23. arXiv:2311.08225  [pdf, other

    eess.IV cs.CV

    Uni-COAL: A Unified Framework for Cross-Modality Synthesis and Super-Resolution of MR Images

    Authors: Zhiyun Song, Zengxin Qi, Xin Wang, Xiangyu Zhao, Zhenrong Shen, Sheng Wang, Manman Fei, Zhe Wang, Di Zang, Dongdong Chen, Linlin Yao, Qian Wang, Xuehai Wu, Lichi Zhang

    Abstract: Cross-modality synthesis (CMS), super-resolution (SR), and their combination (CMSR) have been extensively studied for magnetic resonance imaging (MRI). Their primary goals are to enhance the imaging quality by synthesizing the desired modality and reducing the slice thickness. Despite the promising synthetic results, these techniques are often tailored to specific tasks, thereby limiting their ada… ▽ More

    Submitted 14 November, 2023; originally announced November 2023.

  24. arXiv:2309.11000  [pdf, other

    cs.CL cs.SD eess.AS

    Towards Joint Modeling of Dialogue Response and Speech Synthesis based on Large Language Model

    Authors: Xinyu Zhou, Delong Chen, Yudong Chen

    Abstract: This paper explores the potential of constructing an AI spoken dialogue system that "thinks how to respond" and "thinks how to speak" simultaneously, which more closely aligns with the human speech production process compared to the current cascade pipeline of independent chatbot and Text-to-Speech (TTS) modules. We hypothesize that Large Language Models (LLMs) with billions of parameters possess… ▽ More

    Submitted 19 September, 2023; originally announced September 2023.

  25. arXiv:2308.02345  [pdf, other

    eess.SY

    Communication-Efficient Decentralized Multi-Agent Reinforcement Learning for Cooperative Adaptive Cruise Control

    Authors: Dong Chen, Kaixiang Zhang, Yongqiang Wang, Xunyuan Yin, Zhaojian Li, Dimitar Filev

    Abstract: Connected and autonomous vehicles (CAVs) promise next-gen transportation systems with enhanced safety, energy efficiency, and sustainability. One typical control strategy for CAVs is the so-called cooperative adaptive cruise control (CACC) where vehicles drive in platoons and cooperate to achieve safe and efficient transportation. In this study, we formulate CACC as a multi-agent reinforcement lea… ▽ More

    Submitted 18 February, 2024; v1 submitted 4 August, 2023; originally announced August 2023.

    Comments: 14 pages, 11 figures

  26. arXiv:2306.11021  [pdf, other

    eess.SP

    CloudBrain-MRS: An Intelligent Cloud Computing Platform for in vivo Magnetic Resonance Spectroscopy Preprocessing, Quantification, and Analysis

    Authors: Xiaodie Chen, Jiayu Li, Dicheng Chen, Yirong Zhou, Zhangren Tu, Meijin Lin, Taishan Kang, Jianzhong Lin, Tao Gong, Liuhong Zhu, Jianjun Zhou, Lin Ou-yang, Jiefeng Guo, Jiyang Dong, Di Guo, Xiaobo Qu

    Abstract: Magnetic resonance spectroscopy (MRS) is an important clinical imaging method for diagnosis of diseases. MRS spectrum is used to observe the signal intensity of metabolites or further infer their concentrations. Although the magnetic resonance vendors commonly provide basic functions of spectra plots and metabolite quantification, the widespread clinical research of MRS is still limited due to the… ▽ More

    Submitted 6 September, 2023; v1 submitted 19 June, 2023; originally announced June 2023.

    Comments: 11 pages, 12 figures

  27. arXiv:2306.10065  [pdf, other

    eess.AS cs.AI cs.MM cs.SD

    Taming Diffusion Models for Music-driven Conducting Motion Generation

    Authors: Zhuoran Zhao, Jinbin Bai, Delong Chen, Debang Wang, Yubo Pan

    Abstract: Generating the motion of orchestral conductors from a given piece of symphony music is a challenging task since it requires a model to learn semantic music features and capture the underlying distribution of real conducting motion. Prior works have applied Generative Adversarial Networks (GAN) to this task, but the promising diffusion model, which recently showed its advantages in terms of both tr… ▽ More

    Submitted 13 November, 2023; v1 submitted 14 June, 2023; originally announced June 2023.

    Comments: Accepted by AAAI 2023 Summer Symposium with Best Paper Award

  28. arXiv:2306.09274  [pdf, other

    cs.CV eess.IV

    Conditional Human Sketch Synthesis with Explicit Abstraction Control

    Authors: Dar-Yen Chen

    Abstract: This paper presents a novel free-hand sketch synthesis approach addressing explicit abstraction control in class-conditional and photo-to-sketch synthesis. Abstraction is a vital aspect of sketches, as it defines the fundamental distinction between a sketch and an image. Previous works relied on implicit control to achieve different levels of abstraction, leading to inaccurate control and synthesi… ▽ More

    Submitted 15 June, 2023; originally announced June 2023.

    Comments: Code is available at https://github.com/ChenDarYen/Conditional-Human-Sketch-Synthesis-with-Explicit-Abstraction-Control

  29. arXiv:2306.05297  [pdf

    eess.IV cs.CV

    Connectional-Style-Guided Contextual Representation Learning for Brain Disease Diagnosis

    Authors: Gongshu Wang, Ning Jiang, Yunxiao Ma, Tiantian Liu, Duanduan Chen, Jinglong Wu, Guoqi Li, Dong Liang, Tianyi Yan

    Abstract: Structural magnetic resonance imaging (sMRI) has shown great clinical value and has been widely used in deep learning (DL) based computer-aided brain disease diagnosis. Previous approaches focused on local shapes and textures in sMRI that may be significant only within a particular domain. The learned representations are likely to contain spurious information and have a poor generalization ability… ▽ More

    Submitted 8 June, 2023; originally announced June 2023.

  30. arXiv:2305.12311  [pdf, other

    cs.CL cs.AI cs.CV cs.LG eess.AS

    i-Code V2: An Autoregressive Generation Framework over Vision, Language, and Speech Data

    Authors: Ziyi Yang, Mahmoud Khademi, Yichong Xu, Reid Pryzant, Yuwei Fang, Chenguang Zhu, Dongdong Chen, Yao Qian, Mei Gao, Yi-Ling Chen, Robert Gmyr, Naoyuki Kanda, Noel Codella, Bin Xiao, Yu Shi, Lu Yuan, Takuya Yoshioka, Michael Zeng, Xuedong Huang

    Abstract: The convergence of text, visual, and audio data is a key step towards human-like artificial intelligence, however the current Vision-Language-Speech landscape is dominated by encoder-only models which lack generative abilities. We propose closing this gap with i-Code V2, the first model capable of generating natural language from any combination of Vision, Language, and Speech data. i-Code V2 is a… ▽ More

    Submitted 20 May, 2023; originally announced May 2023.

  31. arXiv:2305.00107  [pdf, other

    cs.CR eess.SY

    Unraveling Latch Locking Using Machine Learning, Boolean Analysis, and ILP

    Authors: Dake Chen, Xuan Zhou, Yinghua Hu, Yuke Zhang, Kaixin Yang, Andrew Rittenbach, Pierluigi Nuzzo, Peter A. Beerel

    Abstract: Logic locking has become a promising approach to provide hardware security in the face of a possibly insecure fabrication supply chain. While many techniques have focused on locking combinational logic (CL), an alternative latch-locking approach in which the sequential elements are locked has also gained significant attention. Latch (LAT) locking duplicates a subset of the flip-flops (FF) of a des… ▽ More

    Submitted 28 April, 2023; originally announced May 2023.

    Comments: 8 pages, 7 figures, accepted by ISQED 2023

    ACM Class: B.m; C.m

  32. arXiv:2303.01193  [pdf, other

    cs.LG eess.SY math.NA

    Interpretable System Identification and Long-term Prediction on Time-Series Data

    Authors: Xiaoyi Liu, Duxin Chen, Wenjia Wei, Xia Zhu, Wenwu Yu

    Abstract: Time-series prediction has drawn considerable attention during the past decades fueled by the emerging advances of deep learning methods. However, most neural network based methods lack interpretability and fail in extracting the hidden mechanism of the targeted physical system. To overcome these shortcomings, an interpretable sparse system identification method without any prior knowledge is prop… ▽ More

    Submitted 2 March, 2023; originally announced March 2023.

  33. arXiv:2302.03204  [pdf, other

    cs.NI eess.SP

    CoMap: Proactive Provision for Crowdsourcing Map in Automotive Edge Computing

    Authors: Yongjie Xue, Yuru Zhang, Qiang Liu, Dawei Chen, Kyungtae Han

    Abstract: Crowdsourcing data from connected and automated vehicles (CAVs) is a cost-efficient way to achieve high-definition maps with up-to-date transient road information. Achieving the map with deterministic latency performance is, however, challenging due to the unpredictable resource competition and distributional resource demands. In this paper, we propose CoMap, a new crowdsourcing high definition (H… ▽ More

    Submitted 6 February, 2023; originally announced February 2023.

    Comments: accepted by ICC 2023

  34. arXiv:2301.00843  [pdf, other

    eess.SP cs.IT q-bio.QM

    Explicitly Solvable Continuous-time Inference for Partially Observed Markov Processes

    Authors: Daniel Chen, Alexander G. Strang, Andrew W. Eckford, Peter J. Thomas

    Abstract: Many natural and engineered systems can be modeled as discrete state Markov processes. Often, only a subset of states are directly observable. Inferring the conditional probability that a system occupies a particular hidden state, given the partial observation, is a problem with broad application. In this paper, we introduce a continuous-time formulation of the sum-product algorithm, which is a we… ▽ More

    Submitted 2 January, 2023; originally announced January 2023.

    Comments: Accepted for publication in IEEE Transactions on Signal Processing

  35. arXiv:2212.05794  [pdf, other

    eess.IV cs.CV

    CTT-Net: A Multi-view Cross-token Transformer for Cataract Postoperative Visual Acuity Prediction

    Authors: Jinhong Wang, Jingwen Wang, Tingting Chen, Wenhao Zheng, Zhe Xu, Xingdi Wu, Wen Xu, Haochao Ying, Danny Chen, Jian Wu

    Abstract: Surgery is the only viable treatment for cataract patients with visual acuity (VA) impairment. Clinically, to assess the necessity of cataract surgery, accurately predicting postoperative VA before surgery by analyzing multi-view optical coherence tomography (OCT) images is crucially needed. Unfortunately, due to complicated fundus conditions, determining postoperative VA remains difficult for med… ▽ More

    Submitted 12 December, 2022; originally announced December 2022.

    Comments: 5 pages, 3 figures, accepted for publication in BIBM

  36. arXiv:2210.06696  [pdf, other

    cs.AR eess.SY

    CPSAA: Accelerating Sparse Attention using Crossbar-based Processing-In-Memory Architecture

    Authors: Huize Li, Hai Jin, Long Zheng, Yu Huang, Xiaofei Liao, Dan Chen, Zhuohui Duan, Cong Liu, Jiahong Xu, Chuanyi Gui

    Abstract: The attention mechanism requires huge computational efforts to process unnecessary calculations, significantly limiting the system's performance. Researchers propose sparse attention to convert some DDMM operations to SDDMM and SpMM operations. However, current sparse attention solutions introduce massive off-chip random memory access. We propose CPSAA, a novel crossbar-based PIM-featured sparse a… ▽ More

    Submitted 7 October, 2023; v1 submitted 12 October, 2022; originally announced October 2022.

    Comments: 14 pages, 19 figures

  37. arXiv:2209.01725  [pdf, other

    eess.SP cs.CV

    Imaging with Equivariant Deep Learning

    Authors: Dongdong Chen, Mike Davies, Matthias J. Ehrhardt, Carola-Bibiane Schönlieb, Ferdia Sherry, Julián Tachella

    Abstract: From early image processing to modern computational imaging, successful models and algorithms have relied on a fundamental property of natural signals: symmetry. Here symmetry refers to the invariance property of signal sets to transformations such as translation, rotation or scaling. Symmetry can also be incorporated into deep neural networks in the form of equivariance, allowing for more data-ef… ▽ More

    Submitted 4 September, 2022; originally announced September 2022.

    Comments: To appear in IEEE Signal Processing Magazine

  38. arXiv:2209.00196  [pdf, other

    eess.IV physics.optics

    Group frame neural network of moving object ghost imaging combined with frame merging algorithm

    Authors: Da Chen, Shan-Guo Feng, Hua-Hua Wang, Jia-Ning Cao, Zhi-Wei Zhang, Zhi-Xin Yang, Ao Yan, Lu Gao, Ze Zhang

    Abstract: The nature of multiple samples to extract correlation information limits the applications of ghost imaging of moving objects. A novel multi-to-one neural network is proposed and the concept of "batch frame" is introduced to improve the serial imaging method. The neural network extracts more correlation information from a small number of samples, thus reducing the sampling ratio of the ghost imagin… ▽ More

    Submitted 31 August, 2022; originally announced September 2022.

    Comments: 12 pages, 7 figures

  39. arXiv:2208.12599  [pdf

    physics.optics eess.IV

    SOFFLFM: Super-resolution optical fluctuation Fourier light-field microscopy

    Authors: Haixin Huang, Haoyuan Qiu, Hanzhe Wu, Yihong Ji, Heng Li, Bin Yu, Danni Chen, Junle Qu

    Abstract: Fourier light-field microscopy (FLFM) uses a micro-lens array (MLA) to segment the Fourier Plane of the microscopic objective lens to generate multiple two-dimensional perspective views, thereby reconstructing the three-dimensional(3D) structure of the sample using 3D deconvolution calculation without scanning. However, the resolution of FLFM is still limited by diffraction, and furthermore, depen… ▽ More

    Submitted 26 August, 2022; originally announced August 2022.

  40. arXiv:2208.10302  [pdf, other

    cs.RO eess.SY

    Event-Triggered Model Predictive Control with Deep Reinforcement Learning for Autonomous Driving

    Authors: Fengying Dang, Dong Chen, Jun Chen, Zhaojian Li

    Abstract: Event-triggered model predictive control (eMPC) is a popular optimal control method with an aim to alleviate the computation and/or communication burden of MPC. However, it generally requires priori knowledge of the closed-loop system behavior along with the communication characteristics for designing the event-trigger policy. This paper attempts to solve this challenge by proposing an efficient e… ▽ More

    Submitted 22 August, 2022; originally announced August 2022.

  41. arXiv:2207.10670  [pdf, other

    cs.LG cs.AI eess.SP

    ME-GAN: Learning Panoptic Electrocardio Representations for Multi-view ECG Synthesis Conditioned on Heart Diseases

    Authors: Jintai Chen, Kuanlun Liao, Kun Wei, Haochao Ying, Danny Z. Chen, Jian Wu

    Abstract: Electrocardiogram (ECG) is a widely used non-invasive diagnostic tool for heart diseases. Many studies have devised ECG analysis models (e.g., classifiers) to assist diagnosis. As an upstream task, researches have built generative models to synthesize ECG data, which are beneficial to providing training samples, privacy protection, and annotation reduction. However, previous generative methods for… ▽ More

    Submitted 29 May, 2023; v1 submitted 21 July, 2022; originally announced July 2022.

    Journal ref: In International Conference on Machine Learning, 3360--3370, (2022), PMLR

  42. arXiv:2207.08405  [pdf, other

    eess.IV

    ORB-based SLAM accelerator on SoC FPGA

    Authors: Vibhakar Vemulapati, Deming Chen

    Abstract: Simultaneous Localization and Mapping (SLAM) is one of the main components of autonomous navigation systems. With the increase in popularity of drones, autonomous navigation on low-power systems is seeing widespread application. Most SLAM algorithms are computationally intensive and struggle to run in real-time on embedded devices with reasonable accuracy. ORB-SLAM is an open-sourced feature-based… ▽ More

    Submitted 18 July, 2022; originally announced July 2022.

  43. arXiv:2207.00156  [pdf, other

    eess.IV cs.CV cs.LG

    Usable Region Estimate for Assessing Practical Usability of Medical Image Segmentation Models

    Authors: Yizhe Zhang, Suraj Mishra, Peixian Liang, Hao Zheng, Danny Z. Chen

    Abstract: We aim to quantitatively measure the practical usability of medical image segmentation models: to what extent, how often, and on which samples a model's predictions can be used/trusted. We first propose a measure, Correctness-Confidence Rank Correlation (CCRC), to capture how predictions' confidence estimates correlate with their correctness scores in rank. A model with a high value of CCRC means… ▽ More

    Submitted 30 June, 2022; originally announced July 2022.

    Comments: Accepted by MICCAI2022

  44. arXiv:2206.10592  [pdf, other

    cs.AI cs.LG eess.SP

    Identifying Electrocardiogram Abnormalities Using a Handcrafted-Rule-Enhanced Neural Network

    Authors: Yuexin Bian, Jintai Chen, Xiaojun Chen, Xiaoxian Yang, Danny Z. Chen, JIan Wu

    Abstract: A large number of people suffer from life-threatening cardiac abnormalities, and electrocardiogram (ECG) analysis is beneficial to determining whether an individual is at risk of such abnormalities. Automatic ECG classification methods, especially the deep learning based ones, have been proposed to detect cardiac abnormalities using ECG records, showing good potential to improve clinical diagnosis… ▽ More

    Submitted 16 June, 2022; originally announced June 2022.

    Journal ref: IEEE/ACM Transactions on Computational Biology and Bioinformatics, 2022

  45. arXiv:2205.13160  [pdf, other

    cs.CR eess.SP

    Integration of Blockchain and Edge Computing in Internet of Things: A Survey

    Authors: He Xue, Dajiang Chen, Ning Zhang, Hong-Ning Dai, Keping Yu

    Abstract: As an important technology to ensure data security, consistency, traceability, etc., blockchain has been increasingly used in Internet of Things (IoT) applications. The integration of blockchain and edge computing can further improve the resource utilization in terms of network, computing, storage, and security. This paper aims to present a survey on the integration of blockchain and edge computin… ▽ More

    Submitted 26 May, 2022; originally announced May 2022.

  46. arXiv:2205.12429  [pdf, other

    eess.IV cs.CV

    Interaction of a priori Anatomic Knowledge with Self-Supervised Contrastive Learning in Cardiac Magnetic Resonance Imaging

    Authors: Makiya Nakashima, Inyeop Jang, Ramesh Basnet, Mitchel Benovoy, W. H. Wilson Tang, Christopher Nguyen, Deborah Kwon, Tae Hyun Hwang, David Chen

    Abstract: Training deep learning models on cardiac magnetic resonance imaging (CMR) can be a challenge due to the small amount of expert generated labels and inherent complexity of data source. Self-supervised contrastive learning (SSCL) has recently been shown to boost performance in several medical imaging tasks. However, it is unclear how much the pre-trained representation reflects the primary organ of… ▽ More

    Submitted 24 May, 2022; originally announced May 2022.

    Comments: Under review at Machine Learning in Healthcare

  47. arXiv:2205.01818  [pdf, other

    cs.LG cs.AI cs.CL cs.CV eess.AS

    i-Code: An Integrative and Composable Multimodal Learning Framework

    Authors: Ziyi Yang, Yuwei Fang, Chenguang Zhu, Reid Pryzant, Dongdong Chen, Yu Shi, Yichong Xu, Yao Qian, Mei Gao, Yi-Ling Chen, Liyang Lu, Yujia Xie, Robert Gmyr, Noel Codella, Naoyuki Kanda, Bin Xiao, Lu Yuan, Takuya Yoshioka, Michael Zeng, Xuedong Huang

    Abstract: Human intelligence is multimodal; we integrate visual, linguistic, and acoustic signals to maintain a holistic worldview. Most current pretraining methods, however, are limited to one or two modalities. We present i-Code, a self-supervised pretraining framework where users may flexibly combine the modalities of vision, speech, and language into unified and general-purpose vector representations. I… ▽ More

    Submitted 5 May, 2022; v1 submitted 3 May, 2022; originally announced May 2022.

  48. arXiv:2204.08171  [pdf, other

    cs.IT eess.SP

    Distributed Neural Precoding for Hybrid mmWave MIMO Communications with Limited Feedback

    Authors: Kai Wei, Jindan Xu, Wei Xu, Ning Wang, Dong Chen

    Abstract: Hybrid precoding is a cost-efficient technique for millimeter wave (mmWave) massive multiple-input multiple-output (MIMO) communications. This paper proposes a deep learning approach by using a distributed neural network for hybrid analog-and-digital precoding design with limited feedback. The proposed distributed neural precoding network, called DNet, is committed to achieving two objectives. Fir… ▽ More

    Submitted 18 April, 2022; originally announced April 2022.

    Comments: 13 pages, 4 figures

  49. arXiv:2204.04707  [pdf, other

    cs.CV eess.IV

    Generative Adversarial Networks for Image Augmentation in Agriculture: A Systematic Review

    Authors: Ebenezer Olaniyi, Dong Chen, Yuzhen Lu, Yanbo Huang

    Abstract: In agricultural image analysis, optimal model performance is keenly pursued for better fulfilling visual recognition tasks (e.g., image classification, segmentation, object detection and localization), in the presence of challenges with biological variability and unstructured environments. Large-scale, balanced and ground-truthed image datasets, however, are often difficult to obtain to fuel the d… ▽ More

    Submitted 12 April, 2022; v1 submitted 10 April, 2022; originally announced April 2022.

    Comments: 32 pages, 15 figures

  50. arXiv:2203.12513  [pdf, other

    stat.ML cs.LG eess.IV

    Sensing Theorems for Unsupervised Learning in Linear Inverse Problems

    Authors: Julián Tachella, Dongdong Chen, Mike Davies

    Abstract: Solving an ill-posed linear inverse problem requires knowledge about the underlying signal model. In many applications, this model is a priori unknown and has to be learned from data. However, it is impossible to learn the model using observations obtained via a single incomplete measurement operator, as there is no information about the signal model in the nullspace of the operator, resulting in… ▽ More

    Submitted 11 October, 2022; v1 submitted 23 March, 2022; originally announced March 2022.

    Comments: arXiv admin note: substantial text overlap with arXiv:2201.12151

    MSC Class: 68U10 ACM Class: I.4.5; I.2.10; G.3