Zum Hauptinhalt springen

Showing 1–50 of 57 results for author: Fu, Z

Searching in archive eess. Search in all archives.
.
  1. arXiv:2408.07932  [pdf, other

    eess.IV cs.CV cs.LG

    MobileMEF: Fast and Efficient Method for Multi-Exposure Fusion

    Authors: Lucas Nedel Kirsten, Zhicheng Fu, Nikhil Ambha Madhusudhana

    Abstract: Recent advances in camera design and imaging technology have enabled the capture of high-quality images using smartphones. However, due to the limited dynamic range of digital cameras, the quality of photographs captured in environments with highly imbalanced lighting often results in poor-quality images. To address this issue, most devices capture multi-exposure frames and then use some multi-exp… ▽ More

    Submitted 15 August, 2024; originally announced August 2024.

  2. arXiv:2406.13292  [pdf, other

    q-bio.QM cs.AI eess.IV

    An interpretable generative multimodal neuroimaging-genomics framework for decoding Alzheimer's disease

    Authors: Giorgio Dolci, Federica Cruciani, Md Abdur Rahaman, Anees Abrol, Jiayu Chen, Zening Fu, Ilaria Boscolo Galazzo, Gloria Menegaz, Vince D. Calhoun

    Abstract: Alzheimer's disease (AD) is the most prevalent form of dementia with a progressive decline in cognitive abilities. The AD continuum encompasses a prodormal stage known as Mild Cognitive Impairment (MCI), where patients may either progress to AD or remain stable. In this study, we leveraged structural and functional MRI to investigate the disease-induced grey matter and functional network connectiv… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: 27 pages, 7 figures, submitted to a journal

  3. arXiv:2406.10454  [pdf, other

    cs.RO cs.AI cs.CV cs.LG eess.SY

    HumanPlus: Humanoid Shadowing and Imitation from Humans

    Authors: Zipeng Fu, Qingqing Zhao, Qi Wu, Gordon Wetzstein, Chelsea Finn

    Abstract: One of the key arguments for building robots that have similar form factors to human beings is that we can leverage the massive human data for training. Yet, doing so has remained challenging in practice due to the complexities in humanoid perception and control, lingering physical gaps between humanoids and humans in morphologies and actuation, and lack of a data pipeline for humanoids to learn a… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: project website: https://humanoid-ai.github.io/

  4. arXiv:2405.01119  [pdf

    cs.SI eess.SY

    Towards Understanding Worldwide Cross-cultural Differences in Implicit Driving Cues: Review, Comparative Analysis, and Research Roadmap

    Authors: Yongqi Dong, Chang Liu, Yiyun Wang, Zhe Fu

    Abstract: Recognizing and understanding implicit driving cues across diverse cultures is imperative for fostering safe and efficient global transportation systems, particularly when training new immigrants holding driving licenses from culturally disparate countries. Additionally, it is essential to consider cross-cultural differences in the development of Automated Driving features tailored to different co… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.

    Comments: 7 pages, 1 figure, under review by the 27th IEEE International Conference on Intelligent Transportation Systems (IEEE ITSC 2024)

  5. arXiv:2405.00472  [pdf, other

    eess.IV cs.CV

    DmADs-Net: Dense multiscale attention and depth-supervised network for medical image segmentation

    Authors: Zhaojin Fu, Zheng Chen, Jinjiang Li, Lu Ren

    Abstract: Deep learning has made important contributions to the development of medical image segmentation. Convolutional neural networks, as a crucial branch, have attracted strong attention from researchers. Through the tireless efforts of numerous researchers, convolutional neural networks have yielded numerous outstanding algorithms for processing medical images. The ideas and architectures of these algo… ▽ More

    Submitted 1 May, 2024; originally announced May 2024.

  6. arXiv:2404.00144  [pdf, other

    eess.IV cs.CV

    An Interpretable Cross-Attentive Multi-modal MRI Fusion Framework for Schizophrenia Diagnosis

    Authors: Ziyu Zhou, Anton Orlichenko, Gang Qu, Zening Fu, Vince D Calhoun, Zhengming Ding, Yu-Ping Wang

    Abstract: Both functional and structural magnetic resonance imaging (fMRI and sMRI) are widely used for the diagnosis of mental disorder. However, combining complementary information from these two modalities is challenging due to their heterogeneity. Many existing methods fall short of capturing the interaction between these modalities, frequently defaulting to a simple combination of latent features. In t… ▽ More

    Submitted 29 March, 2024; originally announced April 2024.

  7. arXiv:2402.17043  [pdf, other

    eess.SY

    Traffic Control via Connected and Automated Vehicles: An Open-Road Field Experiment with 100 CAVs

    Authors: Jonathan W. Lee, Han Wang, Kathy Jang, Amaury Hayat, Matthew Bunting, Arwa Alanqary, William Barbour, Zhe Fu, Xiaoqian Gong, George Gunter, Sharon Hornstein, Abdul Rahman Kreidieh, Nathan Lichtlé, Matthew W. Nice, William A. Richardson, Adit Shah, Eugene Vinitsky, Fangyu Wu, Shengquan Xiang, Sulaiman Almatrudi, Fahd Althukair, Rahul Bhadani, Joy Carpio, Raphael Chekroun, Eric Cheng , et al. (39 additional authors not shown)

    Abstract: The CIRCLES project aims to reduce instabilities in traffic flow, which are naturally occurring phenomena due to human driving behavior. These "phantom jams" or "stop-and-go waves,"are a significant source of wasted energy. Toward this goal, the CIRCLES project designed a control system referred to as the MegaController by the CIRCLES team, that could be deployed in real traffic. Our field experim… ▽ More

    Submitted 26 February, 2024; originally announced February 2024.

  8. arXiv:2402.16993  [pdf, other

    eess.SY

    Hierarchical Speed Planner for Automated Vehicles: A Framework for Lagrangian Variable Speed Limit in Mixed Autonomy Traffic

    Authors: Han Wang, Zhe Fu, Jonathan Lee, Hossein Nick Zinat Matin, Arwa Alanqary, Daniel Urieli, Sharon Hornstein, Abdul Rahman Kreidieh, Raphael Chekroun, William Barbour, William A. Richardson, Dan Work, Benedetto Piccoli, Benjamin Seibold, Jonathan Sprinkle, Alexandre M. Bayen, Maria Laura Delle Monache

    Abstract: This paper introduces a novel control framework for Lagrangian variable speed limits in hybrid traffic flow environments utilizing automated vehicles (AVs). The framework was validated using a fleet of 100 connected automated vehicles as part of the largest coordinated open-road test designed to smooth traffic flow. The framework includes two main components: a high-level controller deployed on th… ▽ More

    Submitted 26 February, 2024; originally announced February 2024.

  9. arXiv:2401.02117  [pdf, other

    cs.RO cs.AI cs.CV cs.LG eess.SY

    Mobile ALOHA: Learning Bimanual Mobile Manipulation with Low-Cost Whole-Body Teleoperation

    Authors: Zipeng Fu, Tony Z. Zhao, Chelsea Finn

    Abstract: Imitation learning from human demonstrations has shown impressive performance in robotics. However, most results focus on table-top manipulation, lacking the mobility and dexterity necessary for generally useful tasks. In this work, we develop a system for imitating mobile manipulation tasks that are bimanual and require whole-body control. We first present Mobile ALOHA, a low-cost and whole-body… ▽ More

    Submitted 4 January, 2024; originally announced January 2024.

    Comments: Project website: https://mobile-aloha.github.io (Zipeng Fu and Tony Z. Zhao are project co-leads, Chelsea Finn is the advisor)

  10. arXiv:2309.01112  [pdf

    cs.RO eess.SY

    Swing Leg Motion Strategy for Heavy-load Legged Robot Based on Force Sensing

    Authors: Ze Fu, Yinghui Li, Weizhong Guo

    Abstract: The heavy-load legged robot has strong load carrying capacity and can adapt to various unstructured terrains. But the large weight results in higher requirements for motion stability and environmental perception ability. In order to utilize force sensing information to improve its motion performance, in this paper, we propose a finite state machine model for the swing leg in the static gait by imi… ▽ More

    Submitted 3 September, 2023; originally announced September 2023.

  11. arXiv:2308.12797  [pdf, other

    cs.RO cs.MA eess.SY

    TrafficMCTS: A Closed-Loop Traffic Flow Generation Framework with Group-Based Monte Carlo Tree Search

    Authors: Licheng Wen, Ze Fu, Pinlong Cai, Daocheng Fu, Song Mao, Botian Shi

    Abstract: Digital twins for intelligent transportation systems are currently attracting great interests, in which generating realistic, diverse, and human-like traffic flow in simulations is a formidable challenge. Current approaches often hinge on predefined driver models, objective optimization, or reliance on pre-recorded driving datasets, imposing limitations on their scalability, versatility, and adapt… ▽ More

    Submitted 31 August, 2023; v1 submitted 24 August, 2023; originally announced August 2023.

  12. arXiv:2302.07453  [pdf, other

    eess.SY

    Cooperative Driving for Speed Harmonization in Mixed-Traffic Environments

    Authors: Zhe Fu, Abdul Rahman Kreidieh, Han Wang, Jonathan W. Lee, Maria Laura Delle Monache, Alexandre M. Bayen

    Abstract: Autonomous driving systems present promising methods for congestion mitigation in mixed autonomy traffic control settings. In particular, when coupled with even modest traffic state estimates, such systems can plan and coordinate the behaviors of automated vehicles (AVs) in response to observed downstream events, thereby inhibiting the continued propagation of congestion. In this paper, we present… ▽ More

    Submitted 3 June, 2023; v1 submitted 14 February, 2023; originally announced February 2023.

    Comments: Accepted by IEEE IV 2023

  13. arXiv:2211.16398  [pdf, other

    cs.LG eess.IV

    Self-Supervised Mental Disorder Classifiers via Time Reversal

    Authors: Zafar Iqbal, Usman Mahmood, Zening Fu, Sergey Plis

    Abstract: Data scarcity is a notable problem, especially in the medical domain, due to patient data laws. Therefore, efficient Pre-Training techniques could help in combating this problem. In this paper, we demonstrate that a model trained on the time direction of functional neuro-imaging data could help in any downstream task, for example, classifying diseases from healthy controls in fMRI data. We train a… ▽ More

    Submitted 30 November, 2022; v1 submitted 29 November, 2022; originally announced November 2022.

    Comments: 10 pages, 7 figures

  14. arXiv:2210.10044  [pdf, other

    cs.RO cs.AI cs.CV cs.LG eess.SY

    Deep Whole-Body Control: Learning a Unified Policy for Manipulation and Locomotion

    Authors: Zipeng Fu, Xuxin Cheng, Deepak Pathak

    Abstract: An attached arm can significantly increase the applicability of legged robots to several mobile manipulation tasks that are not possible for the wheeled or tracked counterparts. The standard hierarchical control pipeline for such legged manipulators is to decouple the controller into that of manipulation and locomotion. However, this is ineffective. It requires immense engineering to support coord… ▽ More

    Submitted 18 October, 2022; originally announced October 2022.

    Comments: CoRL 2022 (Oral). Project website at https://maniploco.github.io

  15. arXiv:2209.07654  [pdf, ps, other

    cs.RO eess.SY

    Cerberus: Low-Drift Visual-Inertial-Leg Odometry For Agile Locomotion

    Authors: Shuo Yang, Zixin Zhang, Zhengyu Fu, Zachary Manchester

    Abstract: We present an open-source Visual-Inertial-Leg Odometry (VILO) state estimation solution, Cerberus, for legged robots that estimates position precisely on various terrains in real time using a set of standard sensors, including stereo cameras, IMU, joint encoders, and contact sensors. In addition to estimating robot states, we also perform online kinematic parameter calibration and contact outlier… ▽ More

    Submitted 15 September, 2022; originally announced September 2022.

    Comments: 7 pages, 6 figures, submitted to IEEE ICRA 2023

  16. arXiv:2209.07590   

    eess.IV cs.CV cs.LG q-bio.NC

    Prediction of Gender from Longitudinal MRI data via Deep Learning on Adolescent Data Reveals Unique Patterns Associated with Brain Structure and Change over a Two-year Period

    Authors: Yuda Bi, Anees Abrol, Zening Fu, Jiayu Chen, Jingyu Liu, Vince Calhoun

    Abstract: Deep learning algorithms for predicting neuroimaging data have shown considerable promise in various applications. Prior work has demonstrated that deep learning models that take advantage of the data's 3D structure can outperform standard machine learning on several learning tasks. However, most prior research in this area has focused on neuroimaging data from adults. Within the Adolescent Brain… ▽ More

    Submitted 5 March, 2023; v1 submitted 15 September, 2022; originally announced September 2022.

    Comments: I submitted the wrong paper

  17. arXiv:2208.12534  [pdf, other

    cs.RO cs.AI cs.LG eess.SY

    Learning energy-efficient driving behaviors by imitating experts

    Authors: Abdul Rahman Kreidieh, Zhe Fu, Alexandre M. Bayen

    Abstract: The rise of vehicle automation has generated significant interest in the potential role of future automated vehicles (AVs). In particular, in highly dense traffic settings, AVs are expected to serve as congestion-dampeners, mitigating the presence of instabilities that arise from various sources. However, in many applications, such maneuvers rely heavily on non-local sensing or coordination by int… ▽ More

    Submitted 28 June, 2022; originally announced August 2022.

  18. arXiv:2208.10642  [pdf, other

    cs.CV eess.IV

    Anatomy-Aware Contrastive Representation Learning for Fetal Ultrasound

    Authors: Zeyu Fu, Jianbo Jiao, Robail Yasrab, Lior Drukker, Aris T. Papageorghiou, J. Alison Noble

    Abstract: Self-supervised contrastive representation learning offers the advantage of learning meaningful visual representations from unlabeled medical datasets for transfer learning. However, applying current contrastive learning approaches to medical data without considering its domain-specific anatomical characteristics may lead to visual representations that are inconsistent in appearance and semantics.… ▽ More

    Submitted 22 August, 2022; originally announced August 2022.

    Comments: ECCV-MCV 2022

  19. A Codebook Design for FD-MIMO Systems with Multi-Panel Array

    Authors: Zhilin Fu, Sangwon Hwang, Jihwan Moon, Haibao Ren, Inkyu Lee

    Abstract: In this work, we study codebook designs for full-dimension multiple-input multiple-output (FD-MIMO) systems with a multi-panel array (MPA). We propose novel codebooks which allow precise beam structures for MPA FD-MIMO systems by investigating the physical properties and alignments of the panels. We specifically exploit the characteristic that a group of antennas in a vertical direction exhibit mo… ▽ More

    Submitted 9 August, 2022; originally announced August 2022.

  20. arXiv:2205.13109  [pdf

    cs.CV cs.AI cs.LG eess.IV

    Learning to segment with limited annotations: Self-supervised pretraining with regression and contrastive loss in MRI

    Authors: Lavanya Umapathy, Zhiyang Fu, Rohit Philip, Diego Martin, Maria Altbach, Ali Bilgin

    Abstract: Obtaining manual annotations for large datasets for supervised training of deep learning (DL) models is challenging. The availability of large unlabeled datasets compared to labeled ones motivate the use of self-supervised pretraining to initialize DL models for subsequent segmentation tasks. In this work, we consider two pre-training approaches for driving a DL model to learn different representa… ▽ More

    Submitted 25 May, 2022; originally announced May 2022.

    Comments: Presented at the Annual Conference of International Society for Magnetic Resonance in Medicine, London, UK. May 2022

  21. arXiv:2204.08114  [pdf, ps, other

    eess.SY

    A Distributed control framework for the optimal operation of DC microgrids

    Authors: Zao Fu, Michele Cucuzzella, Carlo Cenedese, Wenwu Yu, Jacquelien M. A. Scherpen

    Abstract: In this paper we propose an original distributed control framework for DC mcirogrids. We first formulate the (optimal) control objectives as an aggregative game suitable for the energy trading market. Then, based on the dual theory, we analyze the equivalent distributed optimal condition for the proposed aggregative game and design a distributed control scheme to solve it. By interconnecting the D… ▽ More

    Submitted 11 January, 2023; v1 submitted 17 April, 2022; originally announced April 2022.

  22. arXiv:2203.09487  [pdf, other

    eess.SP cs.CR cs.LG

    Defending Against Adversarial Attack in ECG Classification with Adversarial Distillation Training

    Authors: Jiahao Shao, Shijia Geng, Zhaoji Fu, Weilun Xu, Tong Liu, Shenda Hong

    Abstract: In clinics, doctors rely on electrocardiograms (ECGs) to assess severe cardiac disorders. Owing to the development of technology and the increase in health awareness, ECG signals are currently obtained by using medical and commercial devices. Deep neural networks (DNNs) can be used to analyze these signals because of their high accuracy rate. However, researchers have found that adversarial attack… ▽ More

    Submitted 14 March, 2022; originally announced March 2022.

  23. arXiv:2203.00512  [pdf, other

    eess.SP cs.AI cs.LG

    A Deep Bayesian Neural Network for Cardiac Arrhythmia Classification with Rejection from ECG Recordings

    Authors: Wenrui Zhang, Xinxin Di, Guodong Wei, Shijia Geng, Zhaoji Fu, Shenda Hong

    Abstract: With the development of deep learning-based methods, automated classification of electrocardiograms (ECGs) has recently gained much attention. Although the effectiveness of deep neural networks has been encouraging, the lack of information given by the outputs restricts clinicians' reexamination. If the uncertainty estimation comes along with the classification results, cardiologists can pay more… ▽ More

    Submitted 25 February, 2022; originally announced March 2022.

  24. arXiv:2112.01697  [pdf, other

    cs.CV cs.CL cs.LG cs.SD eess.AS

    LMR-CBT: Learning Modality-fused Representations with CB-Transformer for Multimodal Emotion Recognition from Unaligned Multimodal Sequences

    Authors: Ziwang Fu, Feng Liu, Hanyang Wang, Siyuan Shen, Jiahao Zhang, Jiayin Qi, Xiangling Fu, Aimin Zhou

    Abstract: Learning modality-fused representations and processing unaligned multimodal sequences are meaningful and challenging in multimodal emotion recognition. Existing approaches use directional pairwise attention or a message hub to fuse language, visual, and audio modalities. However, those approaches introduce information redundancy when fusing features and are inefficient without considering the comp… ▽ More

    Submitted 2 December, 2021; originally announced December 2021.

    Comments: 9 pages ,Figure 2, Table 5

  25. arXiv:2111.09103  [pdf, other

    eess.IV cs.CV

    Fast and Light-Weight Network for Single Frame Structured Illumination Microscopy Super-Resolution

    Authors: Xi Cheng, Jun Li, Qiang Dai, Zhenyong Fu, Jian Yang

    Abstract: Structured illumination microscopy (SIM) is an important super-resolution based microscopy technique that breaks the diffraction limit and enhances optical microscopy systems. With the development of biology and medical engineering, there is a high demand for real-time and robust SIM imaging under extreme low light and short exposure environments. Existing SIM techniques typically require multiple… ▽ More

    Submitted 17 November, 2021; originally announced November 2021.

    Comments: 9 pages

  26. arXiv:2109.15262  [pdf

    physics.optics eess.SY physics.app-ph physics.class-ph quant-ph

    Non-Hermitian physics and engineering in silicon photonics

    Authors: Changqing Wang, Zhoutian Fu, Lan Yang

    Abstract: Silicon photonics has been studied as an integratable optical platform where numerous applicable devices and systems are created based on modern physics and state-of-the-art nanotechnologies. The implementation of quantum mechanics has been the driving force of the most intriguing design of photonic structures, since the optical systems are found of great capability and potential in realizing the… ▽ More

    Submitted 30 September, 2021; originally announced September 2021.

    Comments: 30 pages, 12 figures, 225 references. Link to the published version: https://link.springer.com/chapter/10.1007%2F978-3-030-68222-4_7

    Journal ref: Wang C., Fu Z., Yang L. (2021) Non-Hermitian Physics and Engineering in Silicon Photonics. In: Lockwood D.J., Pavesi L. (eds) Silicon Photonics IV. Topics in Applied Physics, vol 139. Springer, Cham

  27. arXiv:2109.14671  [pdf, other

    cs.CV cs.LG eess.IV

    Segmentation of Roads in Satellite Images using specially modified U-Net CNNs

    Authors: Jonas Bokstaller, Yihang She, Zhehan Fu, Tommaso Macrì

    Abstract: The image classification problem has been deeply investigated by the research community, with computer vision algorithms and with the help of Neural Networks. The aim of this paper is to build an image classifier for satellite images of urban scenes that identifies the portions of the images in which a road is located, separating these portions from the rest. Unlike conventional computer vision al… ▽ More

    Submitted 29 September, 2021; originally announced September 2021.

    Comments: 4 pages, 4 figures

  28. arXiv:2109.05485  [pdf, other

    cs.CV cs.LG eess.IV

    Facial Anatomical Landmark Detection using Regularized Transfer Learning with Application to Fetal Alcohol Syndrome Recognition

    Authors: Zeyu Fu, Jianbo Jiao, Michael Suttie, J. Alison Noble

    Abstract: Fetal alcohol syndrome (FAS) caused by prenatal alcohol exposure can result in a series of cranio-facial anomalies, and behavioral and neurocognitive problems. Current diagnosis of FAS is typically done by identifying a set of facial characteristics, which are often obtained by manual examination. Anatomical landmark detection, which provides rich geometric information, is important to detect the… ▽ More

    Submitted 12 September, 2021; originally announced September 2021.

    Comments: To appear in IEEE journal of Biomedical and Health Informatics 2021

  29. arXiv:2108.06652  [pdf, other

    cs.RO eess.SY

    Force-feedback based Whole-body Stabilizer for Position-Controlled Humanoid Robots

    Authors: Shunpeng Yang, Hua Chen, Zhen Fu, Wei Zhang

    Abstract: This paper studies stabilizer design for position-controlled humanoid robots. Stabilizers are an essential part for position-controlled humanoids, whose primary objective is to adjust the control input sent to the robot to assist the tracking controller to better follow the planned reference trajectory. To achieve this goal, this paper develops a novel force-feedback based whole-body stabilizer th… ▽ More

    Submitted 14 August, 2021; originally announced August 2021.

    Comments: IROS 2021, 8 pages

  30. arXiv:2105.08629  [pdf, other

    eess.IV cs.CV cs.LG

    Fast Camera Image Denoising on Mobile GPUs with Deep Learning, Mobile AI 2021 Challenge: Report

    Authors: Andrey Ignatov, Kim Byeoung-su, Radu Timofte, Angeline Pouget, Fenglong Song, Cheng Li, Shuai Xiao, Zhongqian Fu, Matteo Maggioni, Yibin Huang, Shen Cheng, Xin Lu, Yifeng Zhou, Liangyu Chen, Donghao Liu, Xiangyu Zhang, Haoqiang Fan, Jian Sun, Shuaicheng Liu, Minsu Kwon, Myungje Lee, Jaeyoon Yoo, Changbeom Kang, Shinjo Wang, Bin Huang , et al. (7 additional authors not shown)

    Abstract: Image denoising is one of the most critical problems in mobile photo processing. While many solutions have been proposed for this task, they are usually working with synthetic data and are too computationally expensive to run on mobile devices. To address this problem, we introduce the first Mobile AI challenge, where the target is to develop an end-to-end deep learning-based image denoising solut… ▽ More

    Submitted 17 May, 2021; originally announced May 2021.

    Comments: Mobile AI 2021 Workshop and Challenges: https://ai-benchmark.com/workshops/mai/2021/. arXiv admin note: substantial text overlap with arXiv:2105.07809, arXiv:2105.07825

  31. arXiv:2105.01128  [pdf, other

    cs.LG eess.SP

    Fusing multimodal neuroimaging data with a variational autoencoder

    Authors: Eloy Geenjaar, Noah Lewis, Zening Fu, Rohan Venkatdas, Sergey Plis, Vince Calhoun

    Abstract: Neuroimaging studies often involve the collection of multiple data modalities. These modalities contain both shared and mutually exclusive information about the brain. This work aims at finding a scalable and interpretable method to fuse the information of multiple neuroimaging modalities using a variational autoencoder (VAE). To provide an initial assessment, this work evaluates the representatio… ▽ More

    Submitted 3 May, 2021; originally announced May 2021.

  32. arXiv:2104.10781  [pdf, other

    eess.IV cs.CV

    NTIRE 2021 Challenge on Quality Enhancement of Compressed Video: Methods and Results

    Authors: Ren Yang, Radu Timofte, Jing Liu, Yi Xu, Xinjian Zhang, Minyi Zhao, Shuigeng Zhou, Kelvin C. K. Chan, Shangchen Zhou, Xiangyu Xu, Chen Change Loy, Xin Li, Fanglong Liu, He Zheng, Lielin Jiang, Qi Zhang, Dongliang He, Fu Li, Qingqing Dang, Yibin Huang, Matteo Maggioni, Zhongqian Fu, Shuai Xiao, Cheng li, Thomas Tanay , et al. (47 additional authors not shown)

    Abstract: This paper reviews the first NTIRE challenge on quality enhancement of compressed video, with a focus on the proposed methods and results. In this challenge, the new Large-scale Diverse Video (LDV) dataset is employed. The challenge has three tracks. Tracks 1 and 2 aim at enhancing the videos compressed by HEVC at a fixed QP, while Track 3 is designed for enhancing the videos compressed by x265 at… ▽ More

    Submitted 31 August, 2022; v1 submitted 21 April, 2021; originally announced April 2021.

    Comments: Corrected the MOS values in Table 2, and corrected some minor typos

  33. Efficient Multi-Stage Video Denoising with Recurrent Spatio-Temporal Fusion

    Authors: Matteo Maggioni, Yibin Huang, Cheng Li, Shuai Xiao, Zhongqian Fu, Fenglong Song

    Abstract: In recent years, denoising methods based on deep learning have achieved unparalleled performance at the cost of large computational complexity. In this work, we propose an Efficient Multi-stage Video Denoising algorithm, called EMVD, to drastically reduce the complexity while maintaining or even improving the performance. First, a fusion stage reduces the noise through a recursive combination of a… ▽ More

    Submitted 30 March, 2023; v1 submitted 9 March, 2021; originally announced March 2021.

    Journal ref: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2021, pp. 3465-3474

  34. arXiv:2103.02186  [pdf

    eess.SP cs.CV cs.HC

    Eye-gaze Estimation with HEOG and Neck EMG using Deep Neural Networks

    Authors: Zhen Fu, Bo Wang, Fei Chen, Xihong Wu, Jing Chen

    Abstract: Hearing-impaired listeners usually have troubles attending target talker in multi-talker scenes, even with hearing aids (HAs). The problem can be solved with eye-gaze steering HAs, which requires listeners eye-gazing on the target. In a situation where head rotates, eye-gaze is subject to both behaviors of saccade and head rotation. However, existing methods of eye-gaze estimation did not work rel… ▽ More

    Submitted 3 March, 2021; originally announced March 2021.

    Comments: 5 pages, 5 figures, submitted to EUSIPCO 2021

  35. arXiv:2103.02183  [pdf

    eess.SP cs.AI cs.SD eess.AS

    Auditory Attention Decoding from EEG using Convolutional Recurrent Neural Network

    Authors: Zhen Fu, Bo Wang, Xihong Wu, Jing Chen

    Abstract: The auditory attention decoding (AAD) approach was proposed to determine the identity of the attended talker in a multi-talker scenario by analyzing electroencephalography (EEG) data. Although the linear model-based method has been widely used in AAD, the linear assumption was considered oversimplified and the decoding accuracy remained lower for shorter decoding windows. Recently, nonlinear model… ▽ More

    Submitted 3 March, 2021; originally announced March 2021.

    Comments: 5 pages, 4 figures, submitted to EUSIPCO 2021

  36. arXiv:2102.00676  [pdf, other

    cs.CV eess.IV

    Underwater Image Enhancement via Learning Water Type Desensitized Representations

    Authors: Zhenqi Fu, Xiaopeng Lin, Wu Wang, Yue Huang, Xinghao Ding

    Abstract: We present a novel underwater image enhancement method termed SCNet to improve the image quality meanwhile cope with the degradation diversity caused by the water. SCNet is based on normalization schemes across both spatial and channel dimensions with the key idea of learning water type desensitized features. Specifically, we apply whitening to de-correlate activations across spatial dimensions fo… ▽ More

    Submitted 14 March, 2022; v1 submitted 1 February, 2021; originally announced February 2021.

  37. arXiv:2012.03673  [pdf, other

    eess.IV cs.CV

    Efficient Medical Image Segmentation with Intermediate Supervision Mechanism

    Authors: Di Yuan, Junyang Chen, Zhenghua Xu, Thomas Lukasiewicz, Zhigang Fu, Guizhi Xu

    Abstract: Because the expansion path of U-Net may ignore the characteristics of small targets, intermediate supervision mechanism is proposed. The original mask is also entered into the network as a label for intermediate output. However, U-Net is mainly engaged in segmentation, and the extracted features are also targeted at segmentation location information, and the input and output are different. The lab… ▽ More

    Submitted 15 November, 2020; originally announced December 2020.

  38. arXiv:2011.00940  [pdf, other

    eess.IV cs.CV

    Deep Learning in Computer-Aided Diagnosis and Treatment of Tumors: A Survey

    Authors: Dan Zhao, Guizhi Xu, Zhenghua XU, Thomas Lukasiewicz, Minmin Xue, Zhigang Fu

    Abstract: Computer-Aided Diagnosis and Treatment of Tumors is a hot topic of deep learning in recent years, which constitutes a series of medical tasks, such as detection of tumor markers, the outline of tumor leisures, subtypes and stages of tumors, prediction of therapeutic effect, and drug development. Meanwhile, there are some deep learning models with precise positioning and excellent performance produ… ▽ More

    Submitted 2 November, 2020; originally announced November 2020.

  39. arXiv:2010.09776  [pdf, other

    cs.MA cs.AI cs.GT cs.LG eess.SY

    SMARTS: Scalable Multi-Agent Reinforcement Learning Training School for Autonomous Driving

    Authors: Ming Zhou, Jun Luo, Julian Villella, Yaodong Yang, David Rusu, Jiayu Miao, Weinan Zhang, Montgomery Alban, Iman Fadakar, Zheng Chen, Aurora Chongxi Huang, Ying Wen, Kimia Hassanzadeh, Daniel Graves, Dong Chen, Zhengbang Zhu, Nhat Nguyen, Mohamed Elsayed, Kun Shao, Sanjeevan Ahilan, Baokuan Zhang, Jiannan Wu, Zhengang Fu, Kasra Rezaee, Peyman Yadmellat , et al. (12 additional authors not shown)

    Abstract: Multi-agent interaction is a fundamental aspect of autonomous driving in the real world. Despite more than a decade of research and development, the problem of how to competently interact with diverse road users in diverse scenarios remains largely unsolved. Learning methods have much to offer towards solving this problem. But they require a realistic multi-agent simulator that generates diverse a… ▽ More

    Submitted 31 October, 2020; v1 submitted 19 October, 2020; originally announced October 2020.

    Comments: 20 pages, 11 figures. Paper accepted to CoRL 2020

  40. arXiv:2010.08942  [pdf, other

    cs.CV cs.LG eess.IV

    Distortion-aware Monocular Depth Estimation for Omnidirectional Images

    Authors: Hong-Xiang Chen, Kunhong Li, Zhiheng Fu, Mengyi Liu, Zonghao Chen, Yulan Guo

    Abstract: A main challenge for tasks on panorama lies in the distortion of objects among images. In this work, we propose a Distortion-Aware Monocular Omnidirectional (DAMO) dense depth estimation network to address this challenge on indoor panoramas with two steps. First, we introduce a distortion-aware module to extract calibrated semantic features from omnidirectional images. Specifically, we exploit def… ▽ More

    Submitted 29 November, 2020; v1 submitted 18 October, 2020; originally announced October 2020.

    Comments: Preprint

  41. arXiv:2009.13635  [pdf, other

    cs.CV cs.LG eess.IV

    Cross-Task Representation Learning for Anatomical Landmark Detection

    Authors: Zeyu Fu, Jianbo Jiao, Michael Suttie, J. Alison Noble

    Abstract: Recently, there is an increasing demand for automatically detecting anatomical landmarks which provide rich structural information to facilitate subsequent medical image analysis. Current methods related to this task often leverage the power of deep neural networks, while a major challenge in fine tuning such models in medical applications arises from insufficient number of labeled samples. To add… ▽ More

    Submitted 28 September, 2020; originally announced September 2020.

    Comments: MICCAI-MLMI 2020

  42. arXiv:2009.13634  [pdf, other

    eess.IV cs.CV cs.LG

    MPG-Net: Multi-Prediction Guided Network for Segmentation of Retinal Layers in OCT Images

    Authors: Zeyu Fu, Yang Sun, Xiangyu Zhang, Scott Stainton, Shaun Barney, Jeffry Hogg, William Innes, Satnam Dlay

    Abstract: Optical coherence tomography (OCT) is a commonly-used method of extracting high resolution retinal information. Moreover there is an increasing demand for the automated retinal layer segmentation which facilitates the retinal disease diagnosis. In this paper, we propose a novel multiprediction guided attention network (MPG-Net) for automated retinal layer segmentation in OCT images. The proposed m… ▽ More

    Submitted 28 September, 2020; originally announced September 2020.

    Comments: EUSIPCO2020

  43. arXiv:2007.13135  [pdf, other

    cs.CV eess.IV

    Contrastive Visual-Linguistic Pretraining

    Authors: Lei Shi, Kai Shuang, Shijie Geng, Peng Su, Zhengkai Jiang, Peng Gao, Zuohui Fu, Gerard de Melo, Sen Su

    Abstract: Several multi-modality representation learning approaches such as LXMERT and ViLBERT have been proposed recently. Such approaches can achieve superior performance due to the high-level semantic information captured during large-scale multimodal pretraining. However, as ViLBERT and LXMERT adopt visual region regression and classification loss, they often suffer from domain gap and noisy label probl… ▽ More

    Submitted 26 July, 2020; originally announced July 2020.

  44. arXiv:2007.02165  [pdf, other

    eess.SP cs.LG

    CardioLearn: A Cloud Deep Learning Service for Cardiac Disease Detection from Electrocardiogram

    Authors: Shenda Hong, Zhaoji Fu, Rongbo Zhou, Jie Yu, Yongkui Li, Kai Wang, Guanlin Cheng

    Abstract: Electrocardiogram (ECG) is one of the most convenient and non-invasive tools for monitoring peoples' heart condition, which can use for diagnosing a wide range of heart diseases, including Cardiac Arrhythmia, Acute Coronary Syndrome, et al. However, traditional ECG disease detection models show substantial rates of misdiagnosis due to the limitations of the abilities of extracted features. Recent… ▽ More

    Submitted 4 July, 2020; originally announced July 2020.

    Comments: WWW 2020 Demo

  45. arXiv:2006.08939  [pdf, other

    cs.CV cs.LG eess.IV stat.ML

    Learning the Redundancy-free Features for Generalized Zero-Shot Object Recognition

    Authors: Zongyan Han, Zhenyong Fu, Jian Yang

    Abstract: Zero-shot object recognition or zero-shot learning aims to transfer the object recognition ability among the semantically related categories, such as fine-grained animal or bird species. However, the images of different fine-grained objects tend to merely exhibit subtle differences in appearance, which will severely deteriorate zero-shot object recognition. To reduce the superfluous information in… ▽ More

    Submitted 23 May, 2021; v1 submitted 16 June, 2020; originally announced June 2020.

    Comments: Some researchers and we have found KNN results in 1st version are incorrect, due to a careless mistake in the code. Concretely, the parameters for accuracy function of KNN were organized in the wrong order by mistake. The softmax results are correct. We have removed all KNN results and remove the SOTA claims. According to the Program Chairs' suggestion, we have made errata request to CVF and IEEE

  46. arXiv:2005.08646  [pdf, other

    cs.CV eess.IV

    Character Matters: Video Story Understanding with Character-Aware Relations

    Authors: Shijie Geng, Ji Zhang, Zuohui Fu, Peng Gao, Hang Zhang, Gerard de Melo

    Abstract: Different from short videos and GIFs, video stories contain clear plots and lists of principal characters. Without identifying the connection between appearing people and character names, a model is not able to obtain a genuine understanding of the plots. Video Story Question Answering (VSQA) offers an effective way to benchmark higher-level comprehension abilities of a model. However, current VSQ… ▽ More

    Submitted 9 May, 2020; originally announced May 2020.

  47. arXiv:1911.06813  [pdf, ps, other

    eess.IV cs.LG stat.ML

    Transfer Learning of fMRI Dynamics

    Authors: Usman Mahmood, Md Mahfuzur Rahman, Alex Fedorov, Zening Fu, Sergey Plis

    Abstract: As a mental disorder progresses, it may affect brain structure, but brain function expressed in brain dynamics is affected much earlier. Capturing the moment when brain dynamics express the disorder is crucial for early diagnosis. The traditional approach to this problem via training classifiers either proceeds from handcrafted features or requires large datasets to combat the $m>>n$ problem when… ▽ More

    Submitted 16 November, 2019; originally announced November 2019.

    Comments: Machine Learning for Health (ML4H) at NeurIPS 2019 - Extended Abstract

  48. arXiv:1911.03461  [pdf, other

    eess.IV cs.CV

    AIM 2019 Challenge on Image Demoireing: Methods and Results

    Authors: Shanxin Yuan, Radu Timofte, Gregory Slabaugh, Ales Leonardis, Bolun Zheng, Xin Ye, Xiang Tian, Yaowu Chen, Xi Cheng, Zhenyong Fu, Jian Yang, Ming Hong, Wenying Lin, Wenjin Yang, Yanyun Qu, Hong-Kyu Shin, Joon-Yeon Kim, Sung-Jea Ko, Hang Dong, Yu Guo, Jie Wang, Xuan Ding, Zongyan Han, Sourya Dipta Das, Kuldeep Purohit , et al. (3 additional authors not shown)

    Abstract: This paper reviews the first-ever image demoireing challenge that was part of the Advances in Image Manipulation (AIM) workshop, held in conjunction with ICCV 2019. This paper describes the challenge, and focuses on the proposed solutions and their results. Demoireing is a difficult task of removing moire patterns from an image to reveal an underlying clean image. A new dataset, called LCDMoire wa… ▽ More

    Submitted 8 November, 2019; originally announced November 2019.

    Comments: arXiv admin note: text overlap with arXiv:1911.02498

  49. arXiv:1910.06761  [pdf, other

    cs.LG eess.SY

    Causal Mechanism Transfer Network for Time Series Domain Adaptation in Mechanical Systems

    Authors: Zijian Li, Ruichu Cai, Kok Soon Chai, Hong Wei Ng, Hoang Dung Vu, Marianne Winslett, Tom Z. J. Fu, Boyan Xu, Xiaoyan Yang, Zhenjie Zhang

    Abstract: Data-driven models are becoming essential parts in modern mechanical systems, commonly used to capture the behavior of various equipment and varying environmental characteristics. Despite the advantages of these data-driven models on excellent adaptivity to high dynamics and aging equipment, they are usually hungry to massive labels over historical data, mostly contributed by human engineers at an… ▽ More

    Submitted 13 October, 2019; originally announced October 2019.

  50. arXiv:1811.08064  [pdf, other

    cs.SE cs.FL cs.LO eess.SY

    Model and Integrate Medical Resource Availability into Verifiably Correct Executable Medical Guidelines - Technical Report

    Authors: Chunhui Guo, Zhicheng Fu, Zhenyu Zhang, Shangping Ren, Lui Sha

    Abstract: Improving effectiveness and safety of patient care is an ultimate objective for medical cyber-physical systems. A recent study shows that the patients' death rate can be reduced by computerizing medical guidelines. Most existing medical guideline models are validated and/or verified based on the assumption that all necessary medical resources needed for a patient care are always available. However… ▽ More

    Submitted 19 November, 2018; originally announced November 2018.

    Comments: full version, 8 pages. arXiv admin note: substantial text overlap with arXiv:1811.08061

    Journal ref: IEEE/ACM 36th International Conference on Computer-Aided Design (ICCAD), 2017