Showing 1–39 of 39 results for author: Zou, X

Searching in archive eess.
  1. arXiv:2406.16020  [pdf, other]

    cs.SD cs.CL eess.AS

    AudioBench: A Universal Benchmark for Audio Large Language Models

    Authors: Bin Wang, Xunlong Zou, Geyu Lin, Shuo Sun, Zhuohan Liu, Wenyu Zhang, Zhengyuan Liu, AiTi Aw, Nancy F. Chen

    Abstract: We introduce AudioBench, a new benchmark designed to evaluate audio large language models (AudioLLMs). AudioBench encompasses 8 distinct tasks and 26 carefully selected or newly curated datasets, focusing on speech understanding, voice interpretation, and audio scene understanding. Despite the rapid advancement of large language models, including multimodal versions, a significant gap exists in co…

    Submitted 25 June, 2024; v1 submitted 23 June, 2024; originally announced June 2024.

    Comments: 20 pages; v2 - typo update; Code: https://github.com/AudioLLMs/AudioBench

  2. arXiv:2406.14069  [pdf, other]

    eess.IV cs.CV

    Towards Multi-modality Fusion and Prototype-based Feature Refinement for Clinically Significant Prostate Cancer Classification in Transrectal Ultrasound

    Authors: Hong Wu, Juan Fu, Hongsheng Ye, Yuming Zhong, Xuebin Zou, Jianhua Zhou, Yi Wang

    Abstract: Prostate cancer is a highly prevalent cancer and ranks as the second leading cause of cancer-related deaths in men globally. Recently, the utilization of multi-modality transrectal ultrasound (TRUS) has gained significant traction as a valuable technique for guiding prostate biopsies. In this study, we propose a novel learning framework for clinically significant prostate cancer (csPCa) classifica…

    Submitted 20 June, 2024; originally announced June 2024.

  3. arXiv:2406.12943  [pdf]

    eess.IV

    A square cross-section FOV rotational CL (SC-CL) and its analytical reconstruction method

    Authors: Xiang Zou, Wuliang Shi, Muge Du, Yuxiang Xing

    Abstract: Rotational computed laminography (CL) has broad application potential in three-dimensional imaging of plate-like objects, as it only needs x-ray to pass through the tested object in the thickness direction during the imaging process. In this study, a square cross-section FOV rotational CL (SC-CL) was proposed. Then, the FDK-type analytical reconstruction algorithm applicable to the SC-CL was deriv…

    Submitted 18 June, 2024; originally announced June 2024.

  4. arXiv:2404.09433  [pdf, other]

    eess.IV

    MarsQE: Semantic-Informed Quality Enhancement for Compressed Martian Image

    Authors: Chengfeng Liu, Mai Xu, Qunliang Xing, Xin Zou

    Abstract: Lossy image compression is essential for Mars exploration missions, due to the limited bandwidth between Earth and Mars. However, the compression may introduce visual artifacts that complicate the geological analysis of the Martian surface. Existing quality enhancement approaches, primarily designed for Earth images, fall short for Martian images due to a lack of consideration for the unique Marti…

    Submitted 14 April, 2024; originally announced April 2024.

  5. arXiv:2403.14135  [pdf, other]

    eess.IV cs.CV

    Powerful Lossy Compression for Noisy Images

    Authors: Shilv Cai, Xiaoguo Liang, Shuning Cao, Luxin Yan, Sheng Zhong, Liqun Chen, Xu Zou

    Abstract: Image compression and denoising represent fundamental challenges in image processing with many real-world applications. To address practical demands, current solutions can be categorized into two main strategies: 1) sequential method; and 2) joint method. However, sequential methods have the disadvantage of error accumulation as there is information loss between multiple individual models. Recentl…

    Submitted 26 March, 2024; v1 submitted 21 March, 2024; originally announced March 2024.

    Comments: Accepted by ICME 2024

  6. arXiv:2403.02601  [pdf, other]

    eess.IV cs.CV

    Low-Res Leads the Way: Improving Generalization for Super-Resolution by Self-Supervised Learning

    Authors: Haoyu Chen, Wenbo Li, Jinjin Gu, Jingjing Ren, Haoze Sun, Xueyi Zou, Zhensong Zhang, Youliang Yan, Lei Zhu

    Abstract: For image super-resolution (SR), bridging the gap between the performance on synthetic datasets and real-world degradation scenarios remains a challenge. This work introduces a novel "Low-Res Leads the Way" (LWay) training framework, merging Supervised Pre-training with Self-supervised Learning to enhance the adaptability of SR models to real-world images. Our approach utilizes a low-resolution (L…

    Submitted 4 March, 2024; originally announced March 2024.

    Comments: Accepted to CVPR 2024

  7. arXiv:2311.18327  [pdf]

    eess.SY

    Deep Reinforcement Learning Based Optimal Energy Management of Multi-energy Microgrids with Uncertainties

    Authors: Yang Cui, Yang Xu, Yang Li, Yijian Wang, Xinpeng Zou

    Abstract: Multi-energy microgrid (MEMG) offers an effective approach to deal with energy demand diversification and new energy consumption on the consumer side. In MEMG, it is critical to deploy an energy management system (EMS) for efficient utilization of energy and reliable operation of the system. To help EMS formulate optimal dispatching schemes, a deep reinforcement learning (DRL)-based MEMG energy ma…

    Submitted 30 November, 2023; originally announced November 2023.

    Comments: Accepted by CSEE Journal of Power and Energy Systems

  8. arXiv:2311.12083  [pdf, other]

    cs.CV eess.IV

    PanBench: Towards High-Resolution and High-Performance Pansharpening

    Authors: Shiying Wang, Xuechao Zou, Kai Li, Junliang Xing, Pin Tao

    Abstract: Pansharpening, a pivotal task in remote sensing, involves integrating low-resolution multispectral images with high-resolution panchromatic images to synthesize an image that is both high-resolution and retains multispectral information. These pansharpened images enhance precision in land cover classification, change detection, and environmental monitoring within remote sensing data analysis. Whil…

    Submitted 20 November, 2023; originally announced November 2023.

    Comments: 10 pages, 5 figures

  9. arXiv:2309.15367  [pdf]

    cs.RO eess.SY

    Analysis on Multi-robot Relative 6-DOF Pose Estimation Error Based on UWB Range

    Authors: Xinran Li, Shuaikang Zheng, Pengcheng Zheng, Haifeng Zhang, Zhitian Li, Xudong Zou

    Abstract: Relative pose estimation is the foundational requirement for multi-robot system, while it is a challenging research topic in infrastructure-free scenes. In this study, we analyze the relative 6-DOF pose estimation error of multi-robot system in GNSS-denied and anchor-free environment. An analytical lower bound of position and orientation estimation error is given under the assumption that distance…

    Submitted 26 September, 2023; originally announced September 2023.

    Comments: 7 pages, 9 figures

  10. arXiv:2308.04417  [pdf, other]

    cs.CV cs.LG eess.IV

    DiffCR: A Fast Conditional Diffusion Framework for Cloud Removal from Optical Satellite Images

    Authors: Xuechao Zou, Kai Li, Junliang Xing, Yu Zhang, Shiying Wang, Lei Jin, Pin Tao

    Abstract: Optical satellite images are a critical data source; however, cloud cover often compromises their quality, hindering image applications and analysis. Consequently, effectively removing clouds from optical satellite images has emerged as a prominent research direction. While recent advancements in cloud removal primarily rely on generative adversarial networks, which may yield suboptimal image qual…

    Submitted 8 August, 2023; originally announced August 2023.

    Comments: 13 pages, 7 figures

  11. arXiv:2307.13953  [pdf, other]

    cs.CV cs.SD eess.AS

    The Hidden Dance of Phonemes and Visage: Unveiling the Enigmatic Link between Phonemes and Facial Features

    Authors: Liao Qu, Xianwei Zou, Xiang Li, Yandong Wen, Rita Singh, Bhiksha Raj

    Abstract: This work unveils the enigmatic link between phonemes and facial features. Traditional studies on voice-face correlations typically involve using a long period of voice input, including generating face images from voices and reconstructing 3D face meshes from voices. However, in situations like voice-based crimes, the available voice evidence may be short and limited. Additionally, from a physiolo…

    Submitted 26 July, 2023; originally announced July 2023.

    Comments: Interspeech 2023

  12. arXiv:2307.09740  [pdf]

    eess.SY

    A Physics-Informed Data-Driven Fault Location Method for Transmission Lines Using Single-Ended Measurements with Field Data Validation

    Authors: Yiqi Xing, Yu Liu, Dayou Lu, Xinchen Zou, Xuming He

    Abstract: Data driven transmission line fault location methods have the potential to more accurately locate faults by extracting fault information from available data. However, most of the data driven fault location methods in the literature are not validated by field data for the following reasons. On one hand, the available field data during faults are very limited for one specific transmission line, and…

    Submitted 18 July, 2023; originally announced July 2023.

    Comments: 10 pages, 27 figures

  13. arXiv:2305.15030  [pdf, other]

    cs.CV eess.IV

    Make Lossy Compression Meaningful for Low-Light Images

    Authors: Shilv Cai, Liqun Chen, Sheng Zhong, Luxin Yan, Jiahuan Zhou, Xu Zou

    Abstract: Low-light images frequently occur due to unavoidable environmental influences or technical limitations, such as insufficient lighting or limited exposure time. To achieve better visibility for visual perception, low-light image enhancement is usually adopted. Besides, lossy image compression is vital for meeting the requirements of storage and transmission in computer vision applications. To touch…

    Submitted 24 February, 2024; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: Accepted by AAAI 2024

    ACM Class: I.4.2; I.4.3

  14. arXiv:2305.03387  [pdf, other]

    eess.IV cs.CV

    AsConvSR: Fast and Lightweight Super-Resolution Network with Assembled Convolutions

    Authors: Jiaming Guo, Xueyi Zou, Yuyi Chen, Yi Liu, Jia Hao, Jianzhuang Liu, Youliang Yan

    Abstract: In recent years, videos and images in 720p (HD), 1080p (FHD) and 4K (UHD) resolution have become more popular for display devices such as TVs, mobile phones and VR. However, these high resolution images cannot achieve the expected visual effect due to the limitation of the internet bandwidth, and bring a great challenge for super-resolution networks to achieve real-time performance. Following this…

    Submitted 5 May, 2023; originally announced May 2023.

  15. arXiv:2303.16565  [pdf, other]

    cs.CV cs.LG eess.IV

    PMAA: A Progressive Multi-scale Attention Autoencoder Model for High-performance Cloud Removal from Multi-temporal Satellite Imagery

    Authors: Xuechao Zou, Kai Li, Junliang Xing, Pin Tao, Yachao Cui

    Abstract: Satellite imagery analysis plays a pivotal role in remote sensing; however, information loss due to cloud cover significantly impedes its application. Although existing deep cloud removal models have achieved notable outcomes, they scarcely consider contextual information. This study introduces a high-performance cloud removal architecture, termed Progressive Multi-scale Attention Autoencoder (PMA…

    Submitted 8 August, 2023; v1 submitted 29 March, 2023; originally announced March 2023.

    Comments: Accepted by ECAI 2023

  16. arXiv:2302.06294  [pdf, other]

    eess.IV cs.CV cs.LG

    CholecTriplet2022: Show me a tool and tell me the triplet -- an endoscopic vision challenge for surgical action triplet detection

    Authors: Chinedu Innocent Nwoye, Tong Yu, Saurav Sharma, Aditya Murali, Deepak Alapatt, Armine Vardazaryan, Kun Yuan, Jonas Hajek, Wolfgang Reiter, Amine Yamlahi, Finn-Henri Smidt, Xiaoyang Zou, Guoyan Zheng, Bruno Oliveira, Helena R. Torres, Satoshi Kondo, Satoshi Kasai, Felix Holm, Ege Özsoy, Shuangchun Gui, Han Li, Sista Raviteja, Rachana Sathish, Pranav Poudel, Binod Bhattarai, et al. (24 additional authors not shown)

    Abstract: Formalizing surgical activities as triplets of the used instruments, actions performed, and target anatomies is becoming a gold standard approach for surgical activity modeling. The benefit is that this formalization helps to obtain a more detailed understanding of tool-tissue interaction which can be used to develop better Artificial Intelligence assistance for image-guided surgery. Earlier effor…

    Submitted 14 July, 2023; v1 submitted 13 February, 2023; originally announced February 2023.

    Comments: MICCAI EndoVis CholecTriplet2022 challenge report. Published at Elsevier journal of Medical Image Analysis. 25 pages, 15 figures, 8 tables

    Journal ref: Medical Image Analysis, Volume 89, 2023, 102888, ISSN 1361-8415

  17. arXiv:2212.01560  [pdf]

    eess.SP

    High-resolution and reliable automatic target recognition based on photonic ISAR imaging system with explainable deep learning

    Authors: Xiuting Zou, Anyi Deng, Yiheng Hu, Shiyu Hua, Linbo Zhang, Shaofu Xu, Weiwen Zou

    Abstract: Automatic target recognition (ATR) based on inverse synthetic aperture radar (ISAR) images, which is extensively utilized to surveil environment in military and civil fields, must be high-precision and reliable. Photonic technologies' advantage of broad bandwidth enables ISAR systems to realize high-resolution imaging, which is in favor of achieving high-performance ATR. Deep learning (DL) algorit…

    Submitted 3 December, 2022; originally announced December 2022.

  18. arXiv:2210.17530  [pdf, other]

    eess.SP

    Joint Localization and Beamforming for Reconfigurable Intelligent Surface Aided 5G mmWave Communication Systems

    Authors: Yunis Xanthos, Wanting Lyu, Songjie Yang, Chadi Assi, Xianbing Zou, Ning Wei

    Abstract: Reconfigurable intelligent surface (RIS) is an attractive technology to improve the transmission rate of millimetre-wave (mmWave) communication systems. The previous research on RIS technology mainly focused on improving the transmission rate and security rate of the mmWave communication systems. Since the emergence of RIS technology creates the conditions for generating an intelligent radio env…

    Submitted 26 October, 2022; originally announced October 2022.

  19. arXiv:2210.11410  [pdf]

    eess.SP physics.optics

    Millimeter-level Resolution Photonic Multiband Radar Using a Single MZM and Sub-GHz-Bandwidth Electronics

    Authors: Peixuan Li, Wenlin Bai, Xihua Zou, Ningyuan Zhong, Wei Pan, Lianshan Yan

    Abstract: We here propose a novel cost-effective millimeter-level resolution photonic multiband radar system using a single MZM driven by a 1-GHz-bandwidth LFM signal. It experimentally shows an ~8.5-mm range resolution through coherence-processing-free multiband data fusion.

    Submitted 18 October, 2022; originally announced October 2022.

  20. Cost-effective photonic super-resolution millimeter-wave joint radar-communication system using self-coherent detection

    Authors: Wenlin Bai, Peixuan Li, Xihua Zou, Ningyuan Zhong, Wei Pan, Lianshan Yan, Bin Luo

    Abstract: A cost-effective millimeter-wave (MMW) joint radar-communication (JRC) system with super resolution is proposed and experimentally demonstrated, using optical heterodyne up-conversion and self-coherent detection down-conversion techniques. The point lies in the designed coherent dual-band constant envelope linear frequency modulation-orthogonal frequency division multiplexing (LFM-OFDM) signal wit…

    Submitted 9 October, 2022; originally announced October 2022.

  21. High-Fidelity Variable-Rate Image Compression via Invertible Activation Transformation

    Authors: Shilv Cai, Zhijun Zhang, Liqun Chen, Luxin Yan, Sheng Zhong, Xu Zou

    Abstract: Learning-based methods have effectively promoted the community of image compression. Meanwhile, variational autoencoder (VAE) based variable-rate approaches have recently gained much attention to avoid the usage of a set of different networks for various compression rates. Despite the remarkable performance that has been achieved, these approaches would be readily corrupted once multiple compressi…

    Submitted 12 September, 2022; originally announced September 2022.

    Comments: Accepted to ACMMM2022

    MSC Class: 68P30 ACM Class: I.4.2

  22. arXiv:2208.11184  [pdf, other]

    eess.IV cs.CV

    AIM 2022 Challenge on Super-Resolution of Compressed Image and Video: Dataset, Methods and Results

    Authors: Ren Yang, Radu Timofte, Xin Li, Qi Zhang, Lin Zhang, Fanglong Liu, Dongliang He, Fu li, He Zheng, Weihang Yuan, Pavel Ostyakov, Dmitry Vyal, Magauiya Zhussip, Xueyi Zou, Youliang Yan, Lei Li, Jingzhu Tang, Ming Chen, Shijie Zhao, Yu Zhu, Xiaoran Qin, Chenghua Li, Cong Leng, Jian Cheng, Claudio Rota, et al. (28 additional authors not shown)

    Abstract: This paper reviews the Challenge on Super-Resolution of Compressed Image and Video at AIM 2022. This challenge includes two tracks. Track 1 aims at the super-resolution of compressed image, and Track 2 targets the super-resolution of compressed video. In Track 1, we use the popular dataset DIV2K as the training, validation and test sets. In Track 2, we propose the LDV 3.0 dataset, which contains 3…

    Submitted 25 August, 2022; v1 submitted 23 August, 2022; originally announced August 2022.

    Comments: Camera-ready version

  23. arXiv:2208.05772  [pdf, other]

    eess.IV cs.CV cs.LG

    KiPA22 Report: U-Net with Contour Regularization for Renal Structures Segmentation

    Authors: Kangqing Ye, Peng Liu, Xiaoyang Zou, Qin Zhou, Guoyan Zheng

    Abstract: Three-dimensional (3D) integrated renal structures (IRS) segmentation is important in clinical practice. With the advancement of deep learning techniques, many powerful frameworks focusing on medical image segmentation are proposed. In this challenge, we utilized the nnU-Net framework, which is the state-of-the-art method for medical image segmentation. To reduce the outlier prediction for the tum…

    Submitted 6 September, 2022; v1 submitted 10 August, 2022; originally announced August 2022.

  24. arXiv:2208.04318  [pdf, other]

    eess.IV cs.AI cs.CV cs.LG

    Adaptive Local Implicit Image Function for Arbitrary-scale Super-resolution

    Authors: Hongwei Li, Tao Dai, Yiming Li, Xueyi Zou, Shu-Tao Xia

    Abstract: Image representation is critical for many visual tasks. Instead of representing images discretely with 2D arrays of pixels, a recent study, namely local implicit image function (LIIF), denotes images as a continuous function where pixel values are expansion by using the corresponding coordinates as inputs. Due to its continuous nature, LIIF can be adopted for arbitrary-scale image super-resolution…

    Submitted 7 August, 2022; originally announced August 2022.

    Comments: This paper is accepted by ICIP 2022. 5 pages

  25. arXiv:2203.14823  [pdf]

    physics.optics eess.SP

    Reciprocal phase transition-enabled electro-optic modulation

    Authors: Fang Zou, Lei Zou, Ye Tian, Yiming Zhang, Erwin Bente, Weigang Hou, Yu Liu, Siming Chen, Victoria Cao, Lei Guo, Songsui Li, Lianshan Yan, Wei Pan, Dusan Milosevic, Zizheng Cao, A. M. J. Koonen, Huiyun Liu, Xihua Zou

    Abstract: Electro-optic (EO) modulation is a well-known and essential topic in the field of communications and sensing. Its ultrahigh efficiency is unprecedentedly desired in the current green and data era. However, dramatically increasing the modulation efficiency is difficult due to the monotonic mapping relationship between the electrical signal and modulated optical signal. Here, a new mechanism termed…

    Submitted 22 November, 2022; v1 submitted 28 March, 2022; originally announced March 2022.

    Comments: 27 pages, 14 figures

  26. arXiv:2201.01893  [pdf, other]

    eess.IV cs.CV

    Flow-Guided Sparse Transformer for Video Deblurring

    Authors: Jing Lin, Yuanhao Cai, Xiaowan Hu, Haoqian Wang, Youliang Yan, Xueyi Zou, Henghui Ding, Yulun Zhang, Radu Timofte, Luc Van Gool

    Abstract: Exploiting similar and sharper scene patches in spatio-temporal neighborhoods is critical for video deblurring. However, CNN-based methods show limitations in capturing long-range dependencies and modeling non-local self-similarity. In this paper, we propose a novel framework, Flow-Guided Sparse Transformer (FGST), for video deblurring. In FGST, we customize a self-attention module, Flow-Guided Sp…

    Submitted 29 May, 2022; v1 submitted 5 January, 2022; originally announced January 2022.

    Comments: ICML 2022; The First Transformer-based method for Video Deblurring

  27. Locally Adaptive Structure and Texture Similarity for Image Quality Assessment

    Authors: Keyan Ding, Yi Liu, Xueyi Zou, Shiqi Wang, Kede Ma

    Abstract: The latest advances in full-reference image quality assessment (IQA) involve unifying structure and texture similarity based on deep representations. The resulting Deep Image Structure and Texture Similarity (DISTS) metric, however, makes rather global quality measurements, ignoring the fact that natural photographic images are locally structured and textured across space and scale. In this paper,…

    Submitted 16 October, 2021; originally announced October 2021.

    Journal ref: Proceedings of the 29th ACM International Conference on Multimedia, 2021

  28. arXiv:2104.10781  [pdf, other]

    eess.IV cs.CV

    NTIRE 2021 Challenge on Quality Enhancement of Compressed Video: Methods and Results

    Authors: Ren Yang, Radu Timofte, Jing Liu, Yi Xu, Xinjian Zhang, Minyi Zhao, Shuigeng Zhou, Kelvin C. K. Chan, Shangchen Zhou, Xiangyu Xu, Chen Change Loy, Xin Li, Fanglong Liu, He Zheng, Lielin Jiang, Qi Zhang, Dongliang He, Fu Li, Qingqing Dang, Yibin Huang, Matteo Maggioni, Zhongqian Fu, Shuai Xiao, Cheng li, Thomas Tanay, et al. (47 additional authors not shown)

    Abstract: This paper reviews the first NTIRE challenge on quality enhancement of compressed video, with a focus on the proposed methods and results. In this challenge, the new Large-scale Diverse Video (LDV) dataset is employed. The challenge has three tracks. Tracks 1 and 2 aim at enhancing the videos compressed by HEVC at a fixed QP, while Track 3 is designed for enhancing the videos compressed by x265 at…

    Submitted 31 August, 2022; v1 submitted 21 April, 2021; originally announced April 2021.

    Comments: Corrected the MOS values in Table 2, and corrected some minor typos

  29. arXiv:2102.02640  [pdf, ps, other]

    cs.SD cs.LG cs.MM eess.AS

    Low Bit-Rate Wideband Speech Coding: A Deep Generative Model based Approach

    Authors: Gang Min, Xiongwei Zhang, Xia Zou, Xiangyang Liu

    Abstract: Traditional low bit-rate speech coding approach only handles narrowband speech at 8kHz, which limits further improvements in speech quality. Motivated by recent successful exploration of deep learning methods for image and speech compression, this paper presents a new approach through vector quantization (VQ) of mel-frequency cepstral coefficients (MFCCs) and using a deep generative model called W…

    Submitted 4 February, 2021; originally announced February 2021.

    Comments: 6 pages

  30. arXiv:2009.09910  [pdf]

    eess.IV physics.optics

    Detail reconstruction in binary ghost imaging by using point-by-point method

    Authors: Ning Zhang, Yanfeng Bai, Xuanpengfan Zou, Xiquan Fu

    Abstract: We propose a new local-binary ghost imaging by using point-by-point method. This method can compensate the degradation of imaging quality due to the loss of information during binarization process. The numerical and experimental results show that the target details can be reconstructed well by this method when compared with traditional ghost imaging. By comparing the differences of the speckle pat…

    Submitted 21 September, 2020; originally announced September 2020.

  31. arXiv:2006.13443  [pdf]

    eess.SP

    Hardware-irrelevant parallel processing system

    Authors: Xiuting Zou, Shaofu Xu, Anyi Deng, Rui Wang, Weiwen Zou

    Abstract: Parallel processing technology has been a primary tool for achieving high-speed, high-accuracy, and broadband processing for many years across modern information systems and data processing such as optical and radar, synthetic aperture radar imaging, digital beam forming, and digital filtering systems. However, hardware deviations in a parallel processing system (PPS) severely degrade system perfo…

    Submitted 23 June, 2020; originally announced June 2020.

  32. arXiv:2005.01996  [pdf, other]

    eess.IV cs.CV

    NTIRE 2020 Challenge on Real-World Image Super-Resolution: Methods and Results

    Authors: Andreas Lugmayr, Martin Danelljan, Radu Timofte, Namhyuk Ahn, Dongwoon Bai, Jie Cai, Yun Cao, Junyang Chen, Kaihua Cheng, SeYoung Chun, Wei Deng, Mostafa El-Khamy, Chiu Man Ho, Xiaozhong Ji, Amin Kheradmand, Gwantae Kim, Hanseok Ko, Kanghyu Lee, Jungwon Lee, Hao Li, Ziluan Liu, Zhi-Song Liu, Shuai Liu, Yunhua Lu, Zibo Meng, et al. (21 additional authors not shown)

    Abstract: This paper reviews the NTIRE 2020 challenge on real world super-resolution. It focuses on the participating methods and final results. The challenge addresses the real world setting, where paired true high and low-resolution images are unavailable. For training, only one set of source input images is therefore provided along with a set of unpaired high-quality target images. In Track 1: Image Proc…

    Submitted 5 May, 2020; originally announced May 2020.

  33. arXiv:2003.03460  [pdf]

    quant-ph cs.ET eess.SY

    Enhancing a Near-Term Quantum Accelerator's Instruction Set Architecture for Materials Science Applications

    Authors: Xiang Zou, Shavindra P. Premaratne, M. Adriaan Rol, Sonika Johri, Viacheslav Ostroukh, David J. Michalak, Roman Caudillo, James S. Clarke, Leonardo Dicarlo, A. Y. Matsuura

    Abstract: Quantum computers with tens to hundreds of noisy qubits are being developed today. To be useful for real-world applications, we believe that these near-term systems cannot simply be scaled-down non-error-corrected versions of future fault-tolerant large-scale quantum computers. These near-term systems require specific architecture and design attributes to realize their full potential. To efficient…

    Submitted 6 March, 2020; originally announced March 2020.

    Comments: Received August 15, 2019; revised December 9, 2019; accepted December 13, 2019; date of publication January 28, 2020; date of current version February 14, 2020

    Journal ref: in IEEE Transactions on Quantum Engineering, vol. 1, pp. 1-7, 2020, Art no. 4500307

  34. arXiv:1912.10074  [pdf, ps, other]

    cs.IT eess.SP

    Trellis-Coded Non-Orthogonal Multiple Access

    Authors: Xun Zou, Mehdi Ganji, Hamid Jafarkhani

    Abstract: In this letter, we propose a trellis-coded nonorthogonal multiple access (NOMA) scheme. The signals for different users are produced by trellis coded modulation (TCM) and then superimposed on different power levels. By interpreting the encoding process via the tensor product of trellises, we introduce a joint detection method based on the Viterbi algorithm. Then, we determine the optimal power all…

    Submitted 20 December, 2019; originally announced December 2019.

  35. arXiv:1911.01249  [pdf, other]

    eess.IV cs.CV

    AIM 2019 Challenge on Constrained Super-Resolution: Methods and Results

    Authors: Kai Zhang, Shuhang Gu, Radu Timofte, Zheng Hui, Xiumei Wang, Xinbo Gao, Dongliang Xiong, Shuai Liu, Ruipeng Gang, Nan Nan, Chenghua Li, Xueyi Zou, Ning Kang, Zhan Wang, Hang Xu, Chaofeng Wang, Zheng Li, Linlin Wang, Jun Shi, Wenyu Sun, Zhiqiang Lang, Jiangtao Nie, Wei Wei, Lei Zhang, Yazhe Niu, et al. (4 additional authors not shown)

    Abstract: This paper reviews the AIM 2019 challenge on constrained example-based single image super-resolution with focus on proposed solutions and results. The challenge had 3 tracks. Taking the three main aspects (i.e., number of parameters, inference/running time, fidelity (PSNR)) of MSRResNet as the baseline, Track 1 aims to reduce the amount of parameters while being constrained to maintain or improve…

    Submitted 4 November, 2019; originally announced November 2019.

  36. arXiv:1810.08906  [pdf]

    eess.SP physics.app-ph

    Analog-to-digital conversion revolutionized by deep learning

    Authors: Shaofu Xu, Xiuting Zou, Bowen Ma, Jianping Chen, Lei Yu, Weiwen Zou

    Abstract: As the bridge between the analog world and digital computers, analog-to-digital converters are generally used in modern information systems such as radar, surveillance, and communications. For the configuration of analog-to-digital converters in future high-frequency broadband systems, we introduce a revolutionary architecture that adopts deep learning technology to overcome tradeoffs between band…

    Submitted 21 October, 2018; originally announced October 2018.

  37. arXiv:1808.05410  [pdf, ps, other]

    cs.IT eess.SP

    Interleaving Channel Estimation and Limited Feedback for Point-to-Point Systems with a Large Number of Transmit Antennas

    Authors: Erdem Koyuncu, Xun Zou, Hamid Jafarkhani

    Abstract: We introduce and investigate the opportunities of multi-antenna communication schemes whose training and feedback stages are interleaved and mutually interacting. Specifically, unlike the traditional schemes where the transmitter first trains all of its antennas at once and then receives a single feedback message, we consider a scenario where the transmitter instead trains its antennas one by one…

    Submitted 16 August, 2018; originally announced August 2018.

    Comments: To appear in IEEE Transactions on Wireless Communications

  38. arXiv:1711.03197  [pdf, other]

    eess.SP

    Asynchronous Channel Training in Multi-Cell Massive MIMO

    Authors: Xun Zou, Hamid Jafarkhani

    Abstract: Pilot contamination has been regarded as the main bottleneck in time division duplexing (TDD) multi-cell massive multiple-input multiple-output (MIMO) systems. The pilot contamination problem cannot be addressed with large-scale antenna arrays. We provide a novel asynchronous channel training scheme to obtain precise channel matrices without the cooperation of base stations. The scheme takes advan…

    Submitted 8 November, 2017; originally announced November 2017.

  39. arXiv:1506.06419  [pdf, other]

    cs.LO eess.SY

    Verification and Control of Partially Observable Probabilistic Real-Time Systems

    Authors: Gethin Norman, David Parker, Xueyi Zou

    Abstract: We propose automated techniques for the verification and control of probabilistic real-time systems that are only partially observable. To formally model such systems, we define an extension of probabilistic timed automata in which local states are partially visible to an observer or controller. We give a probabilistic temporal logic that can express a range of quantitative properties of these mod…

    Submitted 22 June, 2015; v1 submitted 21 June, 2015; originally announced June 2015.