Zum Hauptinhalt springen

Showing 1–39 of 39 results for author: Zeng, H

Searching in archive eess. Search in all archives.
.
  1. arXiv:2408.00629  [pdf, other

    cs.CV eess.IV

    Empowering Snapshot Compressive Imaging: Spatial-Spectral State Space Model with Across-Scanning and Local Enhancement

    Authors: Wenzhe Tian, Haijin Zeng, Yin-Ping Zhao, Yongyong Chen, Zhen Wang, Xuelong Li

    Abstract: Snapshot Compressive Imaging (SCI) relies on decoding algorithms such as CNN or Transformer to reconstruct the hyperspectral image (HSI) from its compressed measurement. Although existing CNN and Transformer-based methods have proven effective, CNNs are limited by their inadequate modeling of long-range dependencies, while Transformer ones face high computational costs due to quadratic complexity.… ▽ More

    Submitted 1 August, 2024; originally announced August 2024.

    Comments: 12 pages,6 figures

  2. arXiv:2406.09317  [pdf, other

    eess.IV cs.CV

    Common and Rare Fundus Diseases Identification Using Vision-Language Foundation Model with Knowledge of Over 400 Diseases

    Authors: Meng Wang, Tian Lin, Aidi Lin, Kai Yu, Yuanyuan Peng, Lianyu Wang, Cheng Chen, Ke Zou, Huiyu Liang, Man Chen, Xue Yao, Meiqin Zhang, Binwei Huang, Chaoxin Zheng, Peixin Zhang, Wei Chen, Yilong Luo, Yifan Chen, Honghe Xia, Tingkun Shi, Qi Zhang, Jinming Guo, Xiaolin Chen, Jingcheng Wang, Yih Chung Tham , et al. (24 additional authors not shown)

    Abstract: Previous foundation models for retinal images were pre-trained with limited disease categories and knowledge base. Here we introduce RetiZero, a vision-language foundation model that leverages knowledge from over 400 fundus diseases. To RetiZero's pre-training, we compiled 341,896 fundus images paired with text descriptions, sourced from public datasets, ophthalmic literature, and online resources… ▽ More

    Submitted 30 June, 2024; v1 submitted 13 June, 2024; originally announced June 2024.

  3. arXiv:2405.16102  [pdf, other

    eess.IV cs.CV

    Reliable Source Approximation: Source-Free Unsupervised Domain Adaptation for Vestibular Schwannoma MRI Segmentation

    Authors: Hongye Zeng, Ke Zou, Zhihao Chen, Rui Zheng, Huazhu Fu

    Abstract: Source-Free Unsupervised Domain Adaptation (SFUDA) has recently become a focus in the medical image domain adaptation, as it only utilizes the source model and does not require annotated target data. However, current SFUDA approaches cannot tackle the complex segmentation task across different MRI sequences, such as the vestibular schwannoma segmentation. To address this problem, we proposed Relia… ▽ More

    Submitted 25 May, 2024; originally announced May 2024.

    Comments: Early accepted by MICCAI 2024

  4. arXiv:2405.09923  [pdf, other

    cs.CV eess.IV

    NTIRE 2024 Restore Any Image Model (RAIM) in the Wild Challenge

    Authors: Jie Liang, Radu Timofte, Qiaosi Yi, Shuaizheng Liu, Lingchen Sun, Rongyuan Wu, Xindong Zhang, Hui Zeng, Lei Zhang

    Abstract: In this paper, we review the NTIRE 2024 challenge on Restore Any Image Model (RAIM) in the Wild. The RAIM challenge constructed a benchmark for image restoration in the wild, including real-world images with/without reference ground truth in various scenarios from real applications. The participants were required to restore the real-captured images from complex and unknown degradation, where gener… ▽ More

    Submitted 16 May, 2024; originally announced May 2024.

  5. arXiv:2405.04867  [pdf, other

    eess.IV cs.CV

    MIPI 2024 Challenge on Demosaic for HybridEVS Camera: Methods and Results

    Authors: Yaqi Wu, Zhihao Fan, Xiaofeng Chu, Jimmy S. Ren, Xiaoming Li, Zongsheng Yue, Chongyi Li, Shangcheng Zhou, Ruicheng Feng, Yuekun Dai, Peiqing Yang, Chen Change Loy, Senyan Xu, Zhijing Sun, Jiaying Zhu, Yurui Zhu, Xueyang Fu, Zheng-Jun Zha, Jun Cao, Cheng Li, Shu Chen, Liang Ma, Shiyang Zhou, Haijin Zeng, Kai Feng , et al. (24 additional authors not shown)

    Abstract: The increasing demand for computational photography and imaging on mobile platforms has led to the widespread development and integration of advanced image sensors with novel algorithms in camera systems. However, the scarcity of high-quality data for research and the rare opportunity for in-depth exchange of views from industry and academia constrain the development of mobile intelligent photogra… ▽ More

    Submitted 8 May, 2024; originally announced May 2024.

    Comments: MIPI@CVPR2024. Website: https://mipi-challenge.org/MIPI2024/

  6. arXiv:2404.16920  [pdf, other

    cs.NI cs.IT cs.LG eess.SP

    Structured Reinforcement Learning for Delay-Optimal Data Transmission in Dense mmWave Networks

    Authors: Shufan Wang, Guojun Xiong, Shichen Zhang, Huacheng Zeng, Jian Li, Shivendra Panwar

    Abstract: We study the data packet transmission problem (mmDPT) in dense cell-free millimeter wave (mmWave) networks, i.e., users sending data packet requests to access points (APs) via uplinks and APs transmitting requested data packets to users via downlinks. Our objective is to minimize the average delay in the system due to APs' limited service capacity and unreliable wireless channels between APs and u… ▽ More

    Submitted 25 April, 2024; originally announced April 2024.

    Comments: IEEE Transactions on Wireless Communications

  7. arXiv:2402.11211  [pdf, other

    eess.IV cs.CV

    Training-free image style alignment for self-adapting domain shift on handheld ultrasound devices

    Authors: Hongye Zeng, Ke Zou, Zhihao Chen, Yuchong Gao, Hongbo Chen, Haibin Zhang, Kang Zhou, Meng Wang, Rick Siow Mong Goh, Yong Liu, Chang Jiang, Rui Zheng, Huazhu Fu

    Abstract: Handheld ultrasound devices face usage limitations due to user inexperience and cannot benefit from supervised deep learning without extensive expert annotations. Moreover, the models trained on standard ultrasound device data are constrained by training data distribution and perform poorly when directly applied to handheld device data. In this study, we propose the Training-free Image Style Align… ▽ More

    Submitted 17 February, 2024; originally announced February 2024.

  8. arXiv:2401.11620  [pdf, other

    eess.SY

    Real-Time Systems Optimization with Black-box Constraints and Hybrid Variables

    Authors: Sen Wang, Dong Li, Shao-Yu Huang, Xuanliang Deng, Ashrarul H. Sifat, Changhee Jung, Ryan Williams, Haibo Zeng

    Abstract: When optimizing real-time systems, designers often face a challenging problem where the schedulability constraints are non-convex, non-continuous, or lack an analytical form to understand their properties. Although the optimization framework NORTH proposed in previous work is general (it works with arbitrary schedulability analysis) and scalable, it can only handle problems with continuous variabl… ▽ More

    Submitted 21 January, 2024; originally announced January 2024.

    Comments: Workshop on OPtimization for Embedded and ReAl-time systems (OPERA 2023) co-located with the 44th IEEE Real-Time Systems Symposium (RTSS)

  9. arXiv:2401.03284  [pdf, other

    eess.SY

    A General and Scalable Method for Optimizing Real-Time Systems

    Authors: Sen Wang, Dong Li, Shao-Yu Huang, Xuanliang Deng, Ashrarul H. Sifat, Changhee Jung, Ryan Williams, Haibo Zeng

    Abstract: In real-time systems optimization, designers often face a challenging problem posed by the non-convex and non-continuous schedulability conditions, which may even lack an analytical form to understand their properties. To tackle this challenging problem, we treat the schedulability analysis as a black box that only returns true/false results. We propose a general and scalable framework to optimize… ▽ More

    Submitted 6 January, 2024; originally announced January 2024.

    Comments: Extension of a conference paper

  10. arXiv:2310.19699  [pdf, other

    eess.SY cs.OS cs.SC

    Optimizing Logical Execution Time Model for Both Determinism and Low Latency

    Authors: Sen Wang, Dong Li, Ashrarul H. Sifat, Shao-Yu Huang, Xuanliang Deng, Changhee Jung, Ryan Williams, Haibo Zeng

    Abstract: The Logical Execution Time (LET) programming model has recently received considerable attention, particularly because of its timing and dataflow determinism. In LET, task computation appears always to take the same amount of time (called the task's LET interval), and the task reads (resp. writes) at the beginning (resp. end) of the interval. Compared to other communication mechanisms, such as impl… ▽ More

    Submitted 7 March, 2024; v1 submitted 30 October, 2023; originally announced October 2023.

    Comments: accepted in RTAS'24

  11. arXiv:2307.01990  [pdf

    eess.IV cs.CV

    Unsupervised Spectral Demosaicing with Lightweight Spectral Attention Networks

    Authors: Kai Feng, Yongqiang Zhao, Seong G. Kong, Haijin Zeng

    Abstract: This paper presents a deep learning-based spectral demosaicing technique trained in an unsupervised manner. Many existing deep learning-based techniques relying on supervised learning with synthetic images, often underperform on real-world images especially when the number of spectral bands increases. According to the characteristics of the spectral mosaic image, this paper proposes a mosaic loss… ▽ More

    Submitted 4 July, 2023; originally announced July 2023.

  12. arXiv:2305.04047  [pdf, other

    eess.IV cs.CV

    Degradation-Noise-Aware Deep Unfolding Transformer for Hyperspectral Image Denoising

    Authors: Haijin Zeng, Jiezhang Cao, Kai Feng, Shaoguang Huang, Hongyan Zhang, Hiep Luong, Wilfried Philips

    Abstract: Hyperspectral imaging (HI) has emerged as a powerful tool in diverse fields such as medical diagnosis, industrial inspection, and agriculture, owing to its ability to detect subtle differences in physical properties through high spectral resolution. However, hyperspectral images (HSIs) are often quite noisy because of narrow band spectral filtering. To reduce the noise in HSI data cubes, both mode… ▽ More

    Submitted 6 May, 2023; originally announced May 2023.

  13. arXiv:2303.13571  [pdf, other

    cs.CV eess.IV

    Inheriting Bayer's Legacy-Joint Remosaicing and Denoising for Quad Bayer Image Sensor

    Authors: Haijin Zeng, Kai Feng, Jiezhang Cao, Shaoguang Huang, Yongqiang Zhao, Hiep Luong, Jan Aelterman, Wilfried Philips

    Abstract: Pixel binning based Quad sensors have emerged as a promising solution to overcome the hardware limitations of compact cameras in low-light imaging. However, binning results in lower spatial resolution and non-Bayer CFA artifacts. To address these challenges, we propose a dual-head joint remosaicing and denoising network (DJRD), which enables the conversion of noisy Quad Bayer and standard noise-fr… ▽ More

    Submitted 23 March, 2023; originally announced March 2023.

  14. arXiv:2303.13404  [pdf, other

    eess.IV cs.CV

    MSFA-Frequency-Aware Transformer for Hyperspectral Images Demosaicing

    Authors: Haijin Zeng, Kai Feng, Shaoguang Huang, Jiezhang Cao, Yongyong Chen, Hongyan Zhang, Hiep Luong, Wilfried Philips

    Abstract: Hyperspectral imaging systems that use multispectral filter arrays (MSFA) capture only one spectral component in each pixel. Hyperspectral demosaicing is used to recover the non-measured components. While deep learning methods have shown promise in this area, they still suffer from several challenges, including limited modeling of non-local dependencies, lack of consideration of the periodic MSFA… ▽ More

    Submitted 23 March, 2023; originally announced March 2023.

  15. arXiv:2302.03839  [pdf, other

    eess.IV cs.CV cs.LG

    Futuristic Variations and Analysis in Fundus Images Corresponding to Biological Traits

    Authors: Muhammad Hassan, Hao Zhang, Ahmed Fateh Ameen, Home Wu Zeng, Shuye Ma, Wen Liang, Dingqi Shang, Jiaming Ding, Ziheng Zhan, Tsz Kwan Lam, Ming Xu, Qiming Huang, Dongmei Wu, Can Yang Zhang, Zhou You, Awiwu Ain, Pei Wu Qin

    Abstract: Fundus image captures rear of an eye, and which has been studied for the diseases identification, classification, segmentation, generation, and biological traits association using handcrafted, conventional, and deep learning methods. In biological traits estimation, most of the studies have been carried out for the age prediction and gender classification with convincing results. However, the curr… ▽ More

    Submitted 7 February, 2023; originally announced February 2023.

    Comments: 10 pages, 4 figures, 3 tables

  16. arXiv:2301.06132  [pdf, other

    cs.CV eess.IV

    Deep Diversity-Enhanced Feature Representation of Hyperspectral Images

    Authors: Jinhui Hou, Zhiyu Zhu, Junhui Hou, Hui Liu, Huanqiang Zeng, Deyu Meng

    Abstract: In this paper, we study the problem of efficiently and effectively embedding the high-dimensional spatio-spectral information of hyperspectral (HS) images, guided by feature diversity. Specifically, based on the theoretical formulation that feature diversity is correlated with the rank of the unfolded kernel matrix, we rectify 3D convolution by modifying its topology to enhance the rank upper-boun… ▽ More

    Submitted 9 May, 2024; v1 submitted 15 January, 2023; originally announced January 2023.

    Comments: 17 pages, 12 figures. Accepted in TPAMI 2024. arXiv admin note: substantial text overlap with arXiv:2207.04266

  17. arXiv:2301.01420  [pdf

    cs.MM eess.IV

    Improved CNN Prediction Based Reversible Data Hiding

    Authors: Yingqiang Qiu, Wanli Peng, Xiaodan Lin, Huanqiang Zeng, Zhenxing Qian

    Abstract: This letter proposes an improved CNN predictor (ICNNP) for reversible data hiding (RDH) in images, which consists of a feature extraction module, a pixel prediction module, and a complexity prediction module. Due to predicting the complexity of each pixel with the ICNNP during the embedding process, the proposed method can achieve superior performance than the CNN predictor-based method. Specifica… ▽ More

    Submitted 3 January, 2023; originally announced January 2023.

  18. arXiv:2212.14747  [pdf, other

    eess.IV cs.CV

    VertMatch: A Semi-supervised Framework for Vertebral Structure Detection in 3D Ultrasound Volume

    Authors: Hongye Zeng, kang Zhou, Songhan Ge, Yuchong Gao, Jianhao Zhao, Shenghua Gao, Rui Zheng

    Abstract: Three-dimensional (3D) ultrasound imaging technique has been applied for scoliosis assessment, but current assessment method only uses coronal projection image and cannot illustrate the 3D deformity and vertebra rotation. The vertebra detection is essential to reveal 3D spine information, but the detection task is challenging due to complex data and limited annotations. We propose VertMatch, a two… ▽ More

    Submitted 28 December, 2022; originally announced December 2022.

    Comments: 15 pages, 8 figures

  19. Deep Posterior Distribution-based Embedding for Hyperspectral Image Super-resolution

    Authors: Jinhui Hou, Zhiyu Zhu, Junhui Hou, Huanqiang Zeng, Jinjian Wu, Jiantao Zhou

    Abstract: In this paper, we investigate the problem of hyperspectral (HS) image spatial super-resolution via deep learning. Particularly, we focus on how to embed the high-dimensional spatial-spectral information of HS images efficiently and effectively. Specifically, in contrast to existing methods adopting empirically-designed network modules, we formulate HS embedding as an approximation of the posterior… ▽ More

    Submitted 23 August, 2022; v1 submitted 30 May, 2022; originally announced May 2022.

    Comments: Accepted by IEEE Transactions on Image Processing

  20. arXiv:2204.12879  [pdf, other

    cs.CV eess.IV

    Low-rank Meets Sparseness: An Integrated Spatial-Spectral Total Variation Approach to Hyperspectral Denoising

    Authors: Haijin Zeng, Shaoguang Huang, Yongyong Chen, Hiep Luong, Wilfried Philips

    Abstract: Spatial-Spectral Total Variation (SSTV) can quantify local smoothness of image structures, so it is widely used in hyperspectral image (HSI) processing tasks. Essentially, SSTV assumes a sparse structure of gradient maps calculated along the spatial and spectral directions. In fact, these gradient tensors are not only sparse, but also (approximately) low-rank under FFT, which we have verified by n… ▽ More

    Submitted 27 April, 2022; originally announced April 2022.

  21. arXiv:2204.07228  [pdf

    cs.CL cs.SD eess.AS

    Applying Feature Underspecified Lexicon Phonological Features in Multilingual Text-to-Speech

    Authors: Cong Zhang, Huinan Zeng, Huang Liu, Jiewen Zheng

    Abstract: This study investigates whether the phonological features derived from the Featurally Underspecified Lexicon model can be applied in text-to-speech systems to generate native and non-native speech in English and Mandarin. We present a mapping of ARPABET/pinyin to SAMPA/SAMPA-SC and then to phonological features. This mapping was tested for whether it could lead to the successful generation of nati… ▽ More

    Submitted 14 April, 2022; originally announced April 2022.

    Comments: submitted to Interspeech 2022. arXiv admin note: substantial text overlap with arXiv:2110.03609

  22. arXiv:2203.16537  [pdf, other

    cs.LG cs.AI eess.SP

    Efficient Localness Transformer for Smart Sensor-Based Energy Disaggregation

    Authors: Zhenrui Yue, Huimin Zeng, Ziyi Kou, Lanyu Shang, Dong Wang

    Abstract: Modern smart sensor-based energy management systems leverage non-intrusive load monitoring (NILM) to predict and optimize appliance load distribution in real-time. NILM, or energy disaggregation, refers to the decomposition of electricity usage conditioned on the aggregated power signals (i.e., smart sensor on the main channel). Based on real-time appliance power prediction using sensory technolog… ▽ More

    Submitted 29 March, 2022; originally announced March 2022.

    Comments: Accepted to DCOSS 2022

  23. arXiv:2203.14216  [pdf, other

    cs.CV eess.IV

    Efficient and Degradation-Adaptive Network for Real-World Image Super-Resolution

    Authors: Jie Liang, Hui Zeng, Lei Zhang

    Abstract: Efficient and effective real-world image super-resolution (Real-ISR) is a challenging task due to the unknown complex degradation of real-world images and the limited computation resources in practical applications. Recent research on Real-ISR has achieved significant progress by modeling the image degradation space; however, these methods largely rely on heavy backbone networks and they are infle… ▽ More

    Submitted 27 March, 2022; originally announced March 2022.

  24. arXiv:2203.09195  [pdf, other

    eess.IV cs.CV

    Details or Artifacts: A Locally Discriminative Learning Approach to Realistic Image Super-Resolution

    Authors: Jie Liang, Hui Zeng, Lei Zhang

    Abstract: Single image super-resolution (SISR) with generative adversarial networks (GAN) has recently attracted increasing attention due to its potentials to generate rich details. However, the training of GAN is unstable, and it often introduces many perceptually unpleasant artifacts along with the generated details. In this paper, we demonstrate that it is possible to train a GAN-based SISR model which c… ▽ More

    Submitted 17 March, 2022; originally announced March 2022.

    Comments: To appear at CVPR 2022

  25. arXiv:2203.07659  [pdf

    eess.IV cs.CV

    Breast Cancer Molecular Subtypes Prediction on Pathological Images with Discriminative Patch Selecting and Multi-Instance Learning

    Authors: Hong Liu, Wen-Dong Xu, Zi-Hao Shang, Xiang-Dong Wang, Hai-Yan Zhou, Ke-Wen Ma, Huan Zhou, Jia-Lin Qi, Jia-Rui Jiang, Li-Lan Tan, Hui-Min Zeng, Hui-Juan Cai, Kuan-Song Wang, Yue-Liang Qian

    Abstract: Molecular subtypes of breast cancer are important references to personalized clinical treatment. For cost and labor savings, only one of the patient's paraffin blocks is usually selected for subsequent immunohistochemistry (IHC) to obtain molecular subtypes. Inevitable sampling error is risky due to tumor heterogeneity and could result in a delay in treatment. Molecular subtype prediction from con… ▽ More

    Submitted 15 March, 2022; originally announced March 2022.

  26. arXiv:2112.02858  [pdf

    eess.IV cs.CV cs.MM

    A comparison study of CNN denoisers on PRNU extraction

    Authors: Hui Zeng, Morteza Darvish Morshedi Hosseini, Kang Deng, Anjie Peng, Miroslav Goljan

    Abstract: Performance of the sensor-based camera identification (SCI) method heavily relies on the denoising filter in estimating Photo-Response Non-Uniformity (PRNU). Given various attempts on enhancing the quality of the extracted PRNU, it still suffers from unsatisfactory performance in low-resolution images and high computational demand. Leveraging the similarity of PRNU estimation and image denoising,… ▽ More

    Submitted 6 December, 2021; originally announced December 2021.

    Comments: 12 pages, 6 figures, 4 tables

  27. arXiv:2111.14474  [pdf, other

    eess.IV cs.CV

    Learning-Based Video Coding with Joint Deep Compression and Enhancement

    Authors: Tiesong Zhao, Weize Feng, Hongji Zeng, Yuzhen Niu, Jiaying Liu

    Abstract: The end-to-end learning-based video compression has attracted substantial attentions by paving another way to compress video signals as stacked visual features. This paper proposes an efficient end-to-end deep video codec with jointly optimized compression and enhancement modules (JCEVC). First, we propose a dual-path generative adversarial network (DPEG) to reconstruct video details after compres… ▽ More

    Submitted 30 April, 2022; v1 submitted 29 November, 2021; originally announced November 2021.

    Comments: 10 pages, 9 figures

  28. arXiv:2111.08233  [pdf, ps, other

    eess.SP

    Toward UL-DL Rate Balancing: Joint Resource Allocation and Hybrid-Mode Multiple Access for UAV-BS Assisted Communication Systems

    Authors: Haiyong Zeng, Xu Zhu, Yufei Jiang, Zhongxiang Wei, Sumei Sun

    Abstract: In this paper, we investigate unmanned aerial vehicle (UAV) assisted communication systems that require quasi-balanced data rates in uplink (UL) and downlink (DL), as well as users' heterogeneous traffic. To the best of our knowledge, this is the first work to explicitly investigate joint UL-DL optimization for UAV assisted systems under heterogeneous requirements. A hybrid-mode multiple access (H… ▽ More

    Submitted 16 November, 2021; originally announced November 2021.

    Comments: 32 pages, 9 figures

  29. arXiv:2110.03609  [pdf

    cs.CL cs.LG cs.SD eess.AS

    Applying Phonological Features in Multilingual Text-To-Speech

    Authors: Cong Zhang, Huinan Zeng, Huang Liu, Jiewen Zheng

    Abstract: This study investigates whether phonological features can be applied in text-to-speech systems to generate native and non-native speech in English and Mandarin. We present a mapping of ARPABET/pinyin to SAMPA/SAMPA-SC and then to phonological features. We tested whether this mapping could lead to the successful generation of native, non-native, and code-switched speech in the two languages. We ran… ▽ More

    Submitted 10 October, 2021; v1 submitted 7 October, 2021; originally announced October 2021.

    Comments: demo webpage: https://congzhang365.github.io/feature_tts/

  30. arXiv:2105.07825  [pdf, other

    eess.IV cs.CV cs.LG

    Real-Time Quantized Image Super-Resolution on Mobile NPUs, Mobile AI 2021 Challenge: Report

    Authors: Andrey Ignatov, Radu Timofte, Maurizio Denna, Abdel Younes, Andrew Lek, Mustafa Ayazoglu, Jie Liu, Zongcai Du, Jiaming Guo, Xueyi Zhou, Hao Jia, Youliang Yan, Zexin Zhang, Yixin Chen, Yunbo Peng, Yue Lin, Xindong Zhang, Hui Zeng, Kun Zeng, Peirong Li, Zhihuang Liu, Shiqi Xue, Shengpeng Wang

    Abstract: Image super-resolution is one of the most popular computer vision problems with many important applications to mobile devices. While many solutions have been proposed for this task, they are usually not optimized even for common smartphone AI hardware, not to mention more constrained smart TV platforms that are often supporting INT8 inference only. To address this problem, we introduce the first M… ▽ More

    Submitted 17 May, 2021; originally announced May 2021.

    Comments: Mobile AI 2021 Workshop and Challenges: https://ai-benchmark.com/workshops/mai/2021/

  31. arXiv:2105.03847  [pdf

    eess.IV cs.CV

    Automatic segmentation of vertebral features on ultrasound spine images using Stacked Hourglass Network

    Authors: Hong-Ye Zeng, Song-Han Ge, Yu-Chong Gao, De-Sen Zhou, Kang Zhou, Xu-Ming He, Edmond Lou, Rui Zheng

    Abstract: Objective: The spinous process angle (SPA) is one of the essential parameters to denote three-dimensional (3-D) deformity of spine. We propose an automatic segmentation method based on Stacked Hourglass Network (SHN) to detect the spinous processes (SP) on ultrasound (US) spine images and to measure the SPAs of clinical scoliotic subjects. Methods: The network was trained to detect vertebral SP an… ▽ More

    Submitted 23 May, 2021; v1 submitted 9 May, 2021; originally announced May 2021.

    Comments: 9 pages,5 figures

  32. arXiv:2104.14655  [pdf

    eess.IV cs.CV

    Lung Cancer Diagnosis Using Deep Attention Based on Multiple Instance Learning and Radiomics

    Authors: Junhua Chen, Haiyan Zeng, Chong Zhang, Zhenwei Shi, Andre Dekker, Leonard Wee, Inigo Bermejo

    Abstract: Early diagnosis of lung cancer is a key intervention for the treatment of lung cancer computer aided diagnosis (CAD) can play a crucial role. However, most published CAD methods treat lung cancer diagnosis as a lung nodule classification problem, which does not reflect clinical practice, where clinicians diagnose a patient based on a set of images of nodules, instead of one specific nodule. Beside… ▽ More

    Submitted 12 February, 2022; v1 submitted 29 April, 2021; originally announced April 2021.

  33. Learning Image-adaptive 3D Lookup Tables for High Performance Photo Enhancement in Real-time

    Authors: Hui Zeng, Jianrui Cai, Lida Li, Zisheng Cao, Lei Zhang

    Abstract: Recent years have witnessed the increasing popularity of learning based methods to enhance the color and tone of photos. However, many existing photo enhancement methods either deliver unsatisfactory results or consume too much computational and memory resources, hindering their application to high-resolution images (usually with more than 12 megapixels) in practice. In this paper, we learn image-… ▽ More

    Submitted 30 September, 2020; originally announced September 2020.

    Comments: High quality adaptive photo enhancement in real-time (<2ms for 4K resolution images)! Accepted by IEEE Transactions on Pattern Analysis and Machine Intelligence

  34. Hyperspectral Image Super-resolution via Deep Progressive Zero-centric Residual Learning

    Authors: Zhiyu Zhu, Junhui Hou, Jie Chen, Huanqiang Zeng, Jiantao Zhou

    Abstract: This paper explores the problem of hyperspectral image (HSI) super-resolution that merges a low resolution HSI (LR-HSI) and a high resolution multispectral image (HR-MSI). The cross-modality distribution of the spatial and spectral information makes the problem challenging. Inspired by the classic wavelet decomposition-based image fusion, we propose a novel \textit{lightweight} deep neural network… ▽ More

    Submitted 5 December, 2020; v1 submitted 18 June, 2020; originally announced June 2020.

  35. Hyperspectral Image Denoising via Global Spatial-Spectral Total Variation Regularized Nonconvex Local Low-Rank Tensor Approximation

    Authors: Haijin Zeng, Xiaozhen Xie, Jifeng Ning

    Abstract: Hyperspectral image (HSI) denoising aims to restore clean HSI from the noise-contaminated one. Noise contamination can often be caused during data acquisition and conversion. In this paper, we propose a novel spatial-spectral total variation (SSTV) regularized nonconvex local low-rank (LR) tensor approximation method to remove mixed noise in HSIs. From one aspect, the clean HSI data have its under… ▽ More

    Submitted 30 May, 2020; originally announced June 2020.

    MSC Class: 94A12

    Journal ref: Signal Processing Volume 178, January 2021, 107805

  36. arXiv:1910.11103  [pdf, ps, other

    cs.CV eess.SP

    SPEC2: SPECtral SParsE CNN Accelerator on FPGAs

    Authors: Yue Niu, Hanqing Zeng, Ajitesh Srivastava, Kartik Lakhotia, Rajgopal Kannan, Yanzhi Wang, Viktor Prasanna

    Abstract: To accelerate inference of Convolutional Neural Networks (CNNs), various techniques have been proposed to reduce computation redundancy. Converting convolutional layers into frequency domain significantly reduces the computation complexity of the sliding window operations in space domain. On the other hand, weight pruning techniques address the redundancy in model parameters by converting dense co… ▽ More

    Submitted 10 October, 2023; v1 submitted 16 October, 2019; originally announced October 2019.

    Comments: This is a 10-page conference paper in 26TH IEEE International Conference On High Performance Computing, Data, and Analytics (HiPC)

  37. arXiv:1909.01341  [pdf, other

    eess.IV cs.CV

    Deep Coarse-to-fine Dense Light Field Reconstruction with Flexible Sampling and Geometry-aware Fusion

    Authors: Jing Jin, Junhui Hou, Jie Chen, Huanqiang Zeng, Sam Kwong, Jingyi Yu

    Abstract: A densely-sampled light field (LF) is highly desirable in various applications, such as 3-D reconstruction, post-capture refocusing and virtual reality. However, it is costly to acquire such data. Although many computational methods have been proposed to reconstruct a densely-sampled LF from a sparsely-sampled one, they still suffer from either low reconstruction quality, low computational efficie… ▽ More

    Submitted 26 September, 2020; v1 submitted 31 August, 2019; originally announced September 2019.

    Comments: 17 pages, 11 figures, 10 tables

  38. arXiv:1907.06141  [pdf, ps, other

    eess.SP cs.NI

    A Real-Time mmWave Communication Testbed with Phase Noise Cancellation

    Authors: Adnan Quadri, Huacheng Zeng, Y. Thomas Hou

    Abstract: As the spectrum under 6 GHz is being depleted, pushing wireless communications onto millimeter wave (mmWave) frequencies is a trend that promises multi-Gbps data rate. mmWave is therefore considered as a key technology for 5G wireless systems and has attracted tremendous research efforts. The booming research on mmWave necessitates a reconfigurable mmWave testbed that can be used to prototype and… ▽ More

    Submitted 13 July, 2019; originally announced July 2019.

  39. arXiv:1905.10940  [pdf, ps, other

    cs.NI eess.SP

    A Practical Spectrum Sharing Scheme for Cognitive Radio Networks: Design and Experiments

    Authors: Pedram Kheirkhah Sangdeh, Hossein Pirayesh, Adnan Quadri, Huacheng Zeng

    Abstract: Spectrum shortage is a fundamental problem in wireless networks and this problem becomes increasingly acute with the rapid proliferation of wireless devices. To address this problem, spectrum sharing in the context of cognitive radio networks (CRNs) has been considered a promising solution. In this paper, we propose a practical spectrum sharing scheme for a small CRN that comprises a pair of prima… ▽ More

    Submitted 26 May, 2019; originally announced May 2019.