Showing 1–50 of 75 results for author: Guo, C

Searching in archive eess.

  1. arXiv:2406.00516  [pdf, other]

    eess.SY

    Deep Learning based Performance Testing for Analog Integrated Circuits

    Authors: Jiawei Cao, Chongtao Guo, Hao Li, Zhigang Wang, Houjun Wang, Geoffrey Ye Li

    Abstract: In this paper, we propose a deep learning based performance testing framework to minimize the number of required test modules while guaranteeing the accuracy requirement, where a test module corresponds to a combination of one circuit and one stimulus. First, we apply a deep neural network (DNN) to establish the mapping from the response of the circuit under test (CUT) in each module to all specif…

    Submitted 1 June, 2024; originally announced June 2024.

  2. arXiv:2405.00739  [pdf, other]

    cs.LG cs.CV eess.IV

    Why does Knowledge Distillation Work? Rethink its Attention and Fidelity Mechanism

    Authors: Chenqi Guo, Shiwei Zhong, Xiaofeng Liu, Qianli Feng, Yinglong Ma

    Abstract: Does Knowledge Distillation (KD) really work? Conventional wisdom viewed it as a knowledge transfer procedure where a perfect mimicry of the student to its teacher is desired. However, paradoxical studies indicate that closely replicating the teacher's behavior does not consistently improve student generalization, posing questions on its possible causes. Confronted with this gap, we hypothesize th…

    Submitted 29 April, 2024; originally announced May 2024.

  3. arXiv:2404.10343  [pdf, other]

    cs.CV eess.IV

    The Ninth NTIRE 2024 Efficient Super-Resolution Challenge Report

    Authors: Bin Ren, Yawei Li, Nancy Mehta, Radu Timofte, Hongyuan Yu, Cheng Wan, Yuxin Hong, Bingnan Han, Zhuoyuan Wu, Yajun Zou, Yuqing Liu, Jizhe Li, Keji He, Chao Fan, Heng Zhang, Xiaolin Zhang, Xuanwu Yin, Kunlong Zuo, Bohao Liao, Peizhe Xia, Long Peng, Zhibo Du, Xin Di, Wangkai Li, Yang Wang, et al. (109 additional authors not shown)

    Abstract: This paper provides a comprehensive review of the NTIRE 2024 challenge, focusing on efficient single-image super-resolution (ESR) solutions and their outcomes. The task of this challenge is to super-resolve an input image with a magnification factor of x4 based on pairs of low and corresponding high-resolution images. The primary objective is to develop networks that optimize various aspects such…

    Submitted 25 June, 2024; v1 submitted 16 April, 2024; originally announced April 2024.

    Comments: The report paper of NTIRE2024 Efficient Super-resolution, accepted by CVPRW2024

  4. arXiv:2402.10686  [pdf, other]

    cs.IT cs.CR cs.LG eess.SP

    On the Impact of Uncertainty and Calibration on Likelihood-Ratio Membership Inference Attacks

    Authors: Meiyi Zhu, Caili Guo, Chunyan Feng, Osvaldo Simeone

    Abstract: In a membership inference attack (MIA), an attacker exploits the overconfidence exhibited by typical machine learning models to determine whether a specific data point was used to train a target model. In this paper, we analyze the performance of the state-of-the-art likelihood ratio attack (LiRA) within an information-theoretical framework that allows the investigation of the impact of the aleato…

    Submitted 15 August, 2024; v1 submitted 16 February, 2024; originally announced February 2024.

    Comments: 13 pages, 20 figures

  5. arXiv:2401.13893  [pdf, ps, other]

    eess.SP

    A Survey on Indoor Visible Light Positioning Systems: Fundamentals, Applications, and Challenges

    Authors: Zhiyu Zhu, Yang Yang, Mingzhe Chen, Caili Guo, Julian Cheng, Shuguang Cui

    Abstract: The growing demand for location-based services in areas like virtual reality, robot control, and navigation has intensified the focus on indoor localization. Visible light positioning (VLP), leveraging visible light communications (VLC), becomes a promising indoor positioning technology due to its high accuracy and low cost. This paper provides a comprehensive survey of VLP systems. In particular,…

    Submitted 24 January, 2024; originally announced January 2024.

  6. arXiv:2401.05365  [pdf, other]

    eess.SP cs.LG

    Online Action Recognition for Human Risk Prediction with Anticipated Haptic Alert via Wearables

    Authors: Cheng Guo, Lorenzo Rapetti, Kourosh Darvish, Riccardo Grieco, Francesco Draicchio, Daniele Pucci

    Abstract: This paper proposes a framework that combines online human state estimation, action recognition and motion prediction to enable early assessment and prevention of worker biomechanical risk during lifting tasks. The framework leverages the NIOSH index to perform online risk assessment, thus fitting real-time applications. In particular, the human state is retrieved via inverse kinematics/dynamics a…

    Submitted 14 December, 2023; originally announced January 2024.

    Comments: 8 pages, 7 figures, accepted at 2023 IEEE-RAS International Conference on Humanoid Robots (Humanoids)

  7. arXiv:2401.02178  [pdf, other]

    eess.SP

    OFDM-Based Digital Semantic Communication with Importance Awareness

    Authors: Chuanhong Liu, Caili Guo, Yang Yang, Wanli Ni, Tony Q. S. Quek

    Abstract: Semantic communication (SemCom) has received considerable attention for its ability to reduce data transmission size while maintaining task performance. However, existing works mainly focus on analog SemCom with simple channel models, which may limit its practical application. To reduce this gap, we propose an orthogonal frequency division multiplexing (OFDM)-based SemCom system that is compatible…

    Submitted 4 January, 2024; originally announced January 2024.

  8. arXiv:2312.12789  [pdf, other]

    eess.IV cs.CV cs.LG

    SLP-Net: An efficient lightweight network for segmentation of skin lesions

    Authors: Bo Yang, Hong Peng, Chenggang Guo, Xiaohui Luo, Jun Wang, Xianzhong Long

    Abstract: Prompt treatment for melanoma is crucial. To assist physicians in identifying lesion areas quickly and precisely, we propose a novel skin lesion segmentation technique, namely SLP-Net, an ultra-lightweight segmentation network based on the spiking neural P (SNP) systems type mechanism. Most existing convolutional neural networks achieve high segmentation accuracy while neglecting the high hard…

    Submitted 4 January, 2024; v1 submitted 20 December, 2023; originally announced December 2023.

  9. arXiv:2310.07130  [pdf, other]

    cs.NI eess.SP

    Edge Cloud Collaborative Stream Computing for Real-Time Structural Health Monitoring

    Authors: Wenzhao Zhang, Cheng Guo, Yi Gao, Wei Dong

    Abstract: Structural Health Monitoring (SHM) is crucial for the safety and maintenance of various infrastructures. Due to the large amount of data generated by numerous sensors and the high real-time requirements of many applications, SHM poses significant challenges. Although the cloud-centric stream computing paradigm opens new opportunities for real-time data processing, it consumes too much network band…

    Submitted 10 October, 2023; originally announced October 2023.

  10. arXiv:2310.04644  [pdf, other]

    cs.SD eess.AS q-bio.NC

    Neural2Speech: A Transfer Learning Framework for Neural-Driven Speech Reconstruction

    Authors: Jiawei Li, Chunxu Guo, Li Fu, Lu Fan, Edward F. Chang, Yuanning Li

    Abstract: Reconstructing natural speech from neural activity is vital for enabling direct communication via brain-computer interfaces. Previous efforts have explored the conversion of neural recordings into speech using complex deep neural network (DNN) models trained on extensive neural recording data, which is resource-intensive under regular clinical constraints. However, achieving satisfactory performan…

    Submitted 31 January, 2024; v1 submitted 6 October, 2023; originally announced October 2023.

    Comments: To appear in 2024 IEEE International Conference on Acoustics, Speech and Signal Processing

  11. arXiv:2309.10263  [pdf, other]

    cs.CR cs.IT eess.IV eess.SP

    Disentangled Information Bottleneck guided Privacy-Protective JSCC for Image Transmission

    Authors: Lunan Sun, Yang Yang, Mingzhe Chen, Caili Guo

    Abstract: Joint source and channel coding (JSCC) has attracted increasing attention due to its robustness and high efficiency. However, JSCC is vulnerable to privacy leakage due to the high relevance between the source image and channel input. In this paper, we propose a disentangled information bottleneck guided privacy-protective JSCC (DIB-PPJSCC) for image transmission, which aims at protecting private i…

    Submitted 18 September, 2023; originally announced September 2023.

  12. arXiv:2309.08402  [pdf, other]

    eess.IV cs.CV

    3D SA-UNet: 3D Spatial Attention UNet with 3D ASPP for White Matter Hyperintensities Segmentation

    Authors: Changlu Guo

    Abstract: White Matter Hyperintensity (WMH) is an imaging feature related to various diseases such as dementia and stroke. Accurately segmenting WMH using computer technology is crucial for early disease diagnosis. However, this task remains challenging due to the small lesions with low contrast and high discontinuity in the images, which contain limited contextual and spatial information. To address this c…

    Submitted 20 November, 2023; v1 submitted 15 September, 2023; originally announced September 2023.

  13. arXiv:2309.08188  [pdf, other]

    cs.CR eess.SP

    Privacy-Aware Joint Source-Channel Coding for image transmission based on Disentangled Information Bottleneck

    Authors: Lunan Sun, Caili Guo, Mingzhe Chen, Yang Yang

    Abstract: Current privacy-aware joint source-channel coding (JSCC) works aim at avoiding private information transmission by adversarially training the JSCC encoder and decoder under specific signal-to-noise ratios (SNRs) of eavesdroppers. However, these approaches incur additional computational and storage requirements as multiple neural networks must be trained for various eavesdroppers' SNRs to determine…

    Submitted 15 September, 2023; originally announced September 2023.

  14. arXiv:2309.01072  [pdf, other]

    eess.IV cs.CV

    Channel Attention Separable Convolution Network for Skin Lesion Segmentation

    Authors: Changlu Guo, Jiangyan Dai, Marton Szemenyei, Yugen Yi

    Abstract: Skin cancer is a frequently occurring cancer in the human population, and it is very important to be able to diagnose malignant tumors in the body early. Lesion segmentation is crucial for monitoring the morphological changes of skin lesions, extracting features to localize and identify diseases to assist doctors in early diagnosis. Manual de-segmentation of dermoscopic images is error-prone and t…

    Submitted 3 September, 2023; originally announced September 2023.

    Comments: Accepted by ICONIP 2023

  15. arXiv:2308.03448  [pdf, other]

    cs.CV eess.IV

    Make Explicit Calibration Implicit: Calibrate Denoiser Instead of the Noise Model

    Authors: Xin Jin, Jia-Wen Xiao, Ling-Hao Han, Chunle Guo, Xialei Liu, Chongyi Li, Ming-Ming Cheng

    Abstract: Explicit calibration-based methods have dominated RAW image denoising under extremely low-light environments. However, these methods are impeded by several critical limitations: a) the explicit calibration process is both labor- and time-intensive, b) challenge exists in transferring denoisers across different camera models, and c) the disparity between synthetic and real noise is exacerbated by d…

    Submitted 25 December, 2023; v1 submitted 7 August, 2023; originally announced August 2023.

  16. arXiv:2306.08918  [pdf, other]

    eess.IV cs.CV

    PUGAN: Physical Model-Guided Underwater Image Enhancement Using GAN with Dual-Discriminators

    Authors: Runmin Cong, Wenyu Yang, Wei Zhang, Chongyi Li, Chun-Le Guo, Qingming Huang, Sam Kwong

    Abstract: Due to the light absorption and scattering induced by the water medium, underwater images usually suffer from some degradation problems, such as low contrast, color distortion, and blurring details, which aggravate the difficulty of downstream underwater understanding tasks. Therefore, how to obtain clear and visually pleasant images has become a common concern of people, and the task of underwate…

    Submitted 15 June, 2023; originally announced June 2023.

    Comments: 8 pages, 4 figures, Accepted by IEEE Transactions on Image Processing 2023

  17. arXiv:2305.16055  [pdf, ps, other]

    eess.SP

    Machine Learning-Based Automatic Cardiovascular Disease Diagnosis Using Two ECG Leads

    Authors: Cheng Guo, Sajid Ahmed, Mohamed-Slim Alouini

    Abstract: The state-of-the-art cardiovascular disease diagnosis techniques use machine-learning algorithms based on feature extraction and classification. In this work, in contrast to a conventional single Electrocardiogram (ECG) lead, two leads are used, and autoregressive (AR) coefficients and statistical parameters are extracted to be used as features. Four machine-learning classifiers support-vector-mac…

    Submitted 25 May, 2023; originally announced May 2023.

    Comments: 15 pages, 11 figures

    MSC Class: 53A45

  18. arXiv:2305.00505  [pdf, ps, other]

    eess.SY math-ph

    Fixed-time safe tracking control of uncertain high-order nonlinear pure-feedback systems via unified transformation functions

    Authors: Chaoqun Guo, Jiangping Hu, Jiasheng Hao, Sergej Celikovsky, Xiaoming Hu

    Abstract: In this paper, a fixed-time safe control problem is investigated for an uncertain high-order nonlinear pure-feedback system with state constraints. A new nonlinear transformation function is firstly proposed to handle both the constrained and unconstrained cases in a unified way. Further, a radial basis function neural network is constructed to approximate the unknown dynamics in the system and a…

    Submitted 30 April, 2023; originally announced May 2023.

  19. arXiv:2303.12286  [pdf, other]

    eess.SP

    Explainable Semantic Communication for Text Tasks

    Authors: Chuanhong Liu, Caili Guo, Yang Yang, Wanli Ni, Yanquan Zhou, Lei Li, Tony Q. S. Quek

    Abstract: Task-oriented semantic communication has gained increasing attention due to its ability to reduce the amount of transmitted data without sacrificing task performance. Although some prior efforts have been dedicated to developing semantic communications, the semantics in these works remain unexplainable. Challenges related to explainable semantic representation and knowledge-based semantic c…

    Submitted 17 May, 2024; v1 submitted 21 March, 2023; originally announced March 2023.

  20. arXiv:2303.04854  [pdf, other]

    eess.IV

    Structural Similarity: When to Use Deep Generative Models on Imbalanced Image Dataset Augmentation

    Authors: Chenqi Guo, Fabian Benitez-Quiroz, Qianli Feng, Aleix Martinez

    Abstract: Improving the performance on an imbalanced training set is one of the main challenges in machine learning today. One way to augment and thus re-balance the image dataset is through existing deep generative models, like class-conditional Generative Adversarial Networks (cGAN) or Diffusion Models, by synthesizing images for each tail class. Our experiments on imbalanced image dataset classif…

    Submitted 8 March, 2023; originally announced March 2023.

  21. arXiv:2302.02287  [pdf, ps, other]

    eess.IV

    Deep Joint Source-Channel Coding for Wireless Image Transmission with Semantic Importance

    Authors: Qizheng Sun, Caili Guo, Yang Yang, Jiujiu Chen, Rui Tang, Chuanhong Liu

    Abstract: The sixth-generation mobile communication system proposes the vision of smart interconnection of everything, which requires accomplishing communication tasks while ensuring the performance of intelligent tasks. A joint source-channel coding method based on semantic importance is proposed, which aims at preserving semantic information during wireless image transmission and thereby boosting the perf…

    Submitted 4 February, 2023; originally announced February 2023.

    Comments: arXiv admin note: text overlap with arXiv:2208.11375

  22. arXiv:2212.12097  [pdf, other]

    math.OC eess.SY

    Tightening Quadratic Convex Relaxations for the AC Optimal Transmission Switching Problem

    Authors: Cheng Guo, Harsha Nagarajan, Merve Bodur

    Abstract: The Alternating Current Optimal Transmission Switching (ACOTS) problem incorporates line switching decisions into the fundamental AC optimal power flow (ACOPF) problem. The advantages of the ACOTS problem are well-known in terms of reducing the operational cost and improving system reliability. ACOTS optimization models contain discrete variables and nonlinear, non-convex structures, which make it…

    Submitted 22 December, 2022; originally announced December 2022.

    Report number: LA-UR-22-33111

  23. Information Bottleneck-Inspired Type Based Multiple Access for Remote Estimation in IoT Systems

    Authors: Meiyi Zhu, Chunyan Feng, Caili Guo, Nan Jiang, Osvaldo Simeone

    Abstract: Type-based multiple access (TBMA) is a semantics-aware multiple access protocol for remote inference. In TBMA, codewords are reused across transmitting sensors, with each codeword being assigned to a different observation value. Existing TBMA protocols are based on fixed shared codebooks and on conventional maximum-likelihood or Bayesian decoders, which require knowledge of the distributions of ob…

    Submitted 5 April, 2023; v1 submitted 19 December, 2022; originally announced December 2022.

    Comments: 5 pages, 3 figures, accepted by IEEE Signal Processing Letters (SPL)

  24. U2Net: A General Framework with Spatial-Spectral-Integrated Double U-Net for Image Fusion

    Authors: Siran Peng, Chenhao Guo, Xiao Wu, Liang-Jian Deng

    Abstract: In image fusion tasks, images obtained from different sources exhibit distinct properties. Consequently, treating them uniformly with a single-branch network can lead to inadequate feature extraction. Additionally, numerous works have demonstrated that multi-scaled networks capture information more sufficiently than single-scaled models in pixel-level computer vision problems. Considering these fa…

    Submitted 2 October, 2023; v1 submitted 13 December, 2022; originally announced December 2022.

    Comments: Accepted by the 31st ACM International Conference on Multimedia (ACM MM '23)

  25. arXiv:2210.13004  [pdf, other]

    cs.CV cs.LG eess.IV q-bio.NC

    Efficient Representation of Natural Image Patches

    Authors: Cheng Guo

    Abstract: Utilizing an abstract information processing model based on minimal yet realistic assumptions inspired by biological systems, we study how to achieve the early visual system's two ultimate objectives: efficient information transmission and accurate sensor probability distribution modeling. We prove that optimizing for information transmission does not guarantee optimal probability distribution mod…

    Submitted 11 April, 2024; v1 submitted 24 October, 2022; originally announced October 2022.

  26. arXiv:2209.03918  [pdf, other]

    eess.IV cs.CV cs.LG

    A multi view multi stage and multi window framework for pulmonary artery segmentation from CT scans

    Authors: ZeYu Liu, Yi Wang, Jing Wen, Yong Zhang, Hao Yin, Chao Guo, ZhongYu Wang

    Abstract: This is the technical report of the 9th place in the final result of PARSE2022 Challenge. We solve the segmentation problem of the pulmonary artery by using a two-stage method based on a 3D CNN network. The coarse model is used to locate the ROI, and the fine model is used to refine the segmentation result. In addition, in order to improve the segmentation performance, we adopt multi-view and mult…

    Submitted 14 September, 2022; v1 submitted 8 September, 2022; originally announced September 2022.

  27. arXiv:2208.11375  [pdf, other]

    eess.IV

    Deep Joint Source-Channel Coding Based on Semantics of Pixels

    Authors: Qizheng Sun, Caili Guo, Yang Yang, Jiujiu Chen, Rui Tang, Chuanhong Liu

    Abstract: The semantic information of the image for intelligent tasks is hidden behind the pixels, and slight changes in the pixels will affect the performance of intelligent tasks. In order to preserve semantic information behind pixels for intelligent tasks during wireless image transmission, we propose a joint source-channel coding method based on semantics of pixels, which can improve the performance of…

    Submitted 24 August, 2022; originally announced August 2022.

  28. arXiv:2204.08910  [pdf, other]

    eess.SP

    Adaptable Semantic Compression and Resource Allocation for Task-Oriented Communications

    Authors: Chuanhong Liu, Caili Guo, Yang Yang, Nan Jiang

    Abstract: Task-oriented communication is a new paradigm that aims at providing efficient connectivity for accomplishing intelligent tasks rather than the reception of every transmitted bit. In this paper, a deep learning-based task-oriented communication architecture is proposed where the user extracts, compresses and transmits semantics in an end-to-end (E2E) manner. Furthermore, an approach is proposed to…

    Submitted 19 April, 2022; originally announced April 2022.

  29. arXiv:2204.08131  [pdf, ps, other]

    eess.SP

    Positioning Using Visible Light Communications: A Perspective Arcs Approach

    Authors: Zhiyu Zhu, Caili Guo, Rongzhen Bao, Mingzhe Chen, Walid Saad, Yang Yang

    Abstract: Visible light positioning (VLP) is an accurate indoor positioning technology that uses luminaires as transmitters. In particular, circular luminaires are a common source type for VLP, that are typically treated only as point sources for positioning, while ignoring their geometry characteristics. In this paper, the arc feature of the circular luminaire and the coordinate information obtained via vi…

    Submitted 17 April, 2022; originally announced April 2022.

  30. arXiv:2204.02663  [pdf, other]

    eess.IV cs.CV

    Towards An End-to-End Framework for Flow-Guided Video Inpainting

    Authors: Zhen Li, Cheng-Ze Lu, Jianhua Qin, Chun-Le Guo, Ming-Ming Cheng

    Abstract: Optical flow, which captures motion information across frames, is exploited in recent video inpainting methods through propagating pixels along its trajectories. However, the hand-crafted flow-based processes in these methods are applied separately to form the whole inpainting pipeline. Thus, these methods are less efficient and rely heavily on the intermediate results from earlier stages. In this…

    Submitted 7 April, 2022; v1 submitted 6 April, 2022; originally announced April 2022.

    Comments: Accepted to CVPR 2022

  31. arXiv:2202.06369  [pdf, ps, other]

    cs.LG cs.CL eess.AS eess.SP

    Incremental user embedding modeling for personalized text classification

    Authors: Ruixue Lian, Che-Wei Huang, Yuqing Tang, Qilong Gu, Chengyuan Ma, Chenlei Guo

    Abstract: Individual user profiles and interaction histories play a significant role in providing customized experiences in real-world applications such as chatbots, social media, retail, and education. Adaptive user representation learning by utilizing user personalized information has become increasingly challenging due to ever-growing history data. In this work, we propose an incremental user embedding m…

    Submitted 13 February, 2022; originally announced February 2022.

    Comments: Accepted to International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2022

  32. arXiv:2201.12599  [pdf, other]

    cs.CV eess.IV

    Semantic-assisted image compression

    Authors: Qizheng Sun, Caili Guo, Yang Yang, Jiujiu Chen, Xijun Xue

    Abstract: Conventional image compression methods typically aim at pixel-level consistency while ignoring the performance of downstream AI tasks. To solve this problem, this paper proposes a Semantic-Assisted Image Compression method (SAIC), which can maintain semantic-level consistency to enable high performance of downstream AI tasks. To this end, we train the compression network using semantic-level loss fu…

    Submitted 29 January, 2022; originally announced January 2022.

  33. arXiv:2201.10929  [pdf, other]

    cs.IT eess.SP

    Task-Oriented Image Semantic Communication Based on Rate-Distortion Theory

    Authors: Fangfang Liu, Wanjie Tong, Yang Yang, Zhengfen Sun, Caili Guo

    Abstract: Task-oriented image semantic communication is a new communication paradigm, which aims to transmit semantics for artificial intelligence (AI) tasks while ignoring the reconstruction quality of the images. However, in some applications, such as autonomous driving, both image reconstruction quality and the performance of the subsequent AI tasks must be simultaneously considered. To tackle this challeng…

    Submitted 1 December, 2022; v1 submitted 26 January, 2022; originally announced January 2022.

    Comments: 17 pages, 8 figures

  34. arXiv:2201.10795  [pdf, other]

    eess.SP

    Bandwidth and Power Allocation for Task-Oriented Semantic Communication

    Authors: Chuanhong Liu, Caili Guo, Yang Yang, Jiujiu Chen

    Abstract: Deep learning enabled semantic communication has been studied to improve communication efficiency while guaranteeing intelligent task performance. Different from conventional communications systems, the resource allocation in semantic communications no longer just pursues the bit transmission rate, but focuses on how to better compress and transmit semantics to complete subsequent intelligent tasks…

    Submitted 26 January, 2022; originally announced January 2022.

  35. arXiv:2112.08133  [pdf]

    physics.ins-det eess.IV physics.optics

    Ptychographic sensor for large-scale lensless microbial monitoring with high spatiotemporal resolution

    Authors: Shaowei Jiang, Chengfei Guo, Zichao Bian, Ruihai Wang, Jiakai Zhu, Pengming Song, Patrick Hu, Derek Hu, Zibang Zhang, Kazunori Hoshino, Bin Feng, Guoan Zheng

    Abstract: Traditional microbial detection methods often rely on the overall property of microbial cultures and cannot resolve individual growth events at high spatiotemporal resolution. As a result, they require bacteria to grow to confluence and then interpret the results. Here, we demonstrate the application of an integrated ptychographic sensor for lensless cytometric analysis of microbial cultures over a…

    Submitted 15 December, 2021; originally announced December 2021.

    Comments: 18 pages, 6 figures

  36. High-throughput lensless whole slide imaging via continuous height-varying modulation of tilted sensor

    Authors: Shaowei Jiang, Chengfei Guo, Patrick Hu, Derek Hu, Pengming Song, Tianbo Wang, Zichao Bian, Zibang Zhang, Guoan Zheng

    Abstract: We report a new lensless microscopy configuration by integrating the concepts of transverse translational ptychography and defocus multi-height phase retrieval. In this approach, we place a tilted image sensor under the specimen for linearly-increasing phase modulation along one lateral direction. Similar to the operation of ptychography, we laterally translate the specimen and acquire the diffrac…

    Submitted 28 September, 2021; originally announced October 2021.

  37. arXiv:2108.13249  [pdf, ps, other]

    cs.SD eess.AS

    RSKNet-MTSP: Effective and Portable Deep Architecture for Speaker Verification

    Authors: Yanfeng Wu, Chenkai Guo, Junan Zhao, Xiao Jin, Jing Xu

    Abstract: The convolutional neural network (CNN) based approaches have shown great success for speaker verification (SV) tasks, where modeling long temporal context and reducing information loss of speaker characteristics are two important challenges significantly affecting the verification performance. Previous works have introduced dilated convolution and multi-scale aggregation methods to address above c…

    Submitted 30 August, 2021; originally announced August 2021.

    Comments: submitted to Neurocomputing

  38. arXiv:2106.00610  [pdf, other]

    eess.SP cs.SD eess.AS

    Deep Learning for Depression Recognition with Audiovisual Cues: A Review

    Authors: Lang He, Mingyue Niu, Prayag Tiwari, Pekka Marttinen, Rui Su, Jiewei Jiang, Chenguang Guo, Hongyu Wang, Songtao Ding, Zhongmin Wang, Wei Dang, Xiaoying Pan

    Abstract: With the acceleration of the pace of work and life, people have to face more and more pressure, which increases the possibility of suffering from depression. However, many patients may fail to get a timely diagnosis due to the serious imbalance in the doctor-patient ratio in the world. Promisingly, physiological and psychological studies have indicated some differences in speech and facial express…

    Submitted 27 May, 2021; originally announced June 2021.

  39. arXiv:2105.09865  [pdf, ps, other]

    eess.SP

    Power-Efficient Wireless Streaming of Multi-Quality Tiled 360 VR Video in MIMO-OFDMA Systems

    Authors: Chengjun Guo, Lingzhi Zhao, Ying Cui, Zhi Liu, Derrick Wing Kwan Ng

    Abstract: In this paper, we study the optimal wireless streaming of a multi-quality tiled 360 virtual reality (VR) video from a multi-antenna server to multiple single-antenna users in a multiple-input multiple-output (MIMO)-orthogonal frequency division multiple access (OFDMA) system. In the scenario without user transcoding, we jointly optimize beamforming and subcarrier, transmission power, and rate allo…

    Submitted 13 April, 2021; originally announced May 2021.

    Comments: 15 pages, 4 figures, to appear in IEEE Trans. Wireless Commun. arXiv admin note: text overlap with arXiv:2104.06183

  40. arXiv:2103.03444  [pdf, ps, other]

    eess.SP cs.LG

    Optimization of User Selection and Bandwidth Allocation for Federated Learning in VLC/RF Systems

    Authors: Chuanhong Liu, Caili Guo, Yang Yang, Mingzhe Chen, H. Vincent Poor, Shuguang Cui

    Abstract: Limited radio frequency (RF) resources restrict the number of users that can participate in federated learning (FL) thus affecting FL convergence speed and performance. In this paper, we first introduce visible light communication (VLC) as a supplement to RF in FL and build a hybrid VLC/RF communication system, in which each indoor user can use both VLC and RF to transmit its FL model parameters.…

    Submitted 4 March, 2021; originally announced March 2021.

    Comments: WCNC2021

  41. arXiv:2102.09199  [pdf, other]

    cs.CV cs.LG eess.IV

    Minimizing false negative rate in melanoma detection and providing insight into the causes of classification

    Authors: Ellák Somfai, Benjámin Baffy, Kristian Fenech, Changlu Guo, Rita Hosszú, Dorina Korózs, Fabrizio Nunnari, Marcell Pólik, Daniel Sonntag, Attila Ulbert, András Lőrincz

    Abstract: Our goal is to bridge human and machine intelligence in melanoma detection. We develop a classification system exploiting a combination of visual pre-processing, deep learning, and ensembling for providing explanations to experts and to minimize false negative rate while maintaining high accuracy in melanoma detection. Source images are first automatically segmented using a U-net CNN. The result o…

    Submitted 9 March, 2021; v1 submitted 18 February, 2021; originally announced February 2021.

    Comments: supplementary materials included

    ACM Class: I.4.9; J.3

  42. arXiv:2102.03853  [pdf]

    physics.optics eess.IV

    Bypassing the resolution limit of diffractive zone plate optics via rotational Fourier ptychography

    Authors: Chengfei Guo, Shaowei Jiang, Pengming Song, Zichao Bian, Tianbo Wang, Pouria Hoveida, Xiaopeng Shao

    Abstract: Diffractive zone plate optics uses a thin micro-structure pattern to alter the propagation direction of the incoming light wave. It has found important applications in extreme-wavelength imaging where conventional refractive lenses do not exist. The resolution limit of zone plate optics is determined by the smallest width of the outermost zone. In order to improve the achievable resolution, signif…

    Submitted 7 February, 2021; originally announced February 2021.

  43. arXiv:2009.13379  [pdf, other]

    cs.NI eess.IV

    A Content Driven Resource Allocation Scheme for Video Transmission in Vehicular Networks

    Authors: Jiujiu Chen, Chunyan Feng, Caili Guo, Xu Zhu

    Abstract: With the growing number of computer vision applications, lots of videos are transmitted for content analysis, and the way resources are allocated can affect the performance of video content analysis. For this purpose, the traditional resource allocation schemes for video transmission in vehicular networks, such as quality-of-service (QoS) based or quality-of-experience (QoE) based schemes, are no longer optimal an…

    Submitted 28 September, 2020; originally announced September 2020.

  44. arXiv:2009.08829  [pdf, other]

    eess.IV cs.CV

    Residual Spatial Attention Network for Retinal Vessel Segmentation

    Authors: Changlu Guo, Márton Szemenyei, Yugen Yi, Wei Zhou, Haodong Bian

    Abstract: Reliable segmentation of retinal vessels can be employed as a way of monitoring and diagnosing certain diseases, such as diabetes and hypertension, as they affect the retinal vascular structure. In this work, we propose the Residual Spatial Attention Network (RSAN) for retinal vessel segmentation. RSAN employs a modified residual block structure that integrates DropBlock, which can not only be uti…

    Submitted 18 September, 2020; originally announced September 2020.

    Comments: ICONIP 2020

  45. arXiv:2008.06916  [pdf]

    eess.IV physics.med-ph

    Virtual brightfield and fluorescence staining for Fourier ptychography via unsupervised deep learning

    Authors: Ruihai Wang, Pengming Song, Shaowei Jiang, Chenggang Yan, Jiakai Zhu, Chengfei Guo, Zichao Bian, Tianbo Wang, Guoan Zheng

    Abstract: Fourier ptychographic microscopy (FPM) is a computational approach geared towards creating high-resolution and large field-of-view images without mechanical scanning. To acquire color images of histology slides, it often requires sequential acquisitions with red, green, and blue illuminations. The color reconstructions often suffer from coherent artifacts that are not presented in regular incohere…

    Submitted 16 August, 2020; originally announced August 2020.

  46. Power Efficient LED Placement Algorithm for Indoor Visible Light Communication

    Authors: Yang Yang, Zhiyu Zhu, Caili Guo, Chunyan Feng

    Abstract: This paper proposes a novel power-efficient light-emitting diode (LED) placement algorithm for indoor visible light communication (VLC). In the considered model, the LEDs can be designedly placed for high power efficiency while satisfying the indoor communication and illumination requirements. This design problem is formulated as a power minimization problem under both communication and illuminati…

    Submitted 17 June, 2020; originally announced June 2020.

  47. arXiv:2006.08610  [pdf]

    physics.med-ph eess.IV physics.optics

    Autofocusing technologies for whole slide imaging and automated microscopy

    Authors: Zichao Bian, Chengfei Guo, Shaowei Jiang, Jiakai Zhu, Ruihai Wang, Pengming Song, Zibang Zhang, Kazunori Hoshino, Guoan Zheng

    Abstract: Whole slide imaging (WSI) has moved digital pathology closer to diagnostic practice in recent years. Due to the inherent tissue topography variability, accurate autofocusing remains a critical challenge for WSI and automated microscopy systems. The traditional focus map surveying method is limited in its ability to acquire a high degree of focus points while still maintaining high throughput. Real…

    Submitted 15 August, 2020; v1 submitted 15 June, 2020; originally announced June 2020.

  48. arXiv:2006.08114  [pdf]

    physics.optics eess.IV

    Super-resolved multispectral lensless microscopy via angle-tilted, wavelength-multiplexed ptychographic modulation

    Authors: Pengming Song, Ruihai Wang, Jiakai Zhu, Tianbo Wang, Zichao Bian, Zibang Zhang, Kazunori Hoshino, Michael Murphy, Shaowei Jiang, Chengfei Guo, Guoan Zheng

    Abstract: We report an angle-tilted, wavelength-multiplexed ptychographic modulation approach for multispectral lensless on-chip microscopy. In this approach, we illuminate the specimen with lights at 5 wavelengths simultaneously. A prism is added at the illumination path for spectral dispersion. Lightwaves at different wavelengths, thus, hit the specimen at slightly different incident angles, breaking the…

    Submitted 14 June, 2020; originally announced June 2020.

  49. A Novel Received Signal Strength Assisted Perspective-three-Point Algorithm for Indoor Visible Light Positioning

    Authors: Lin Bai, Yang Yang, Chunyan Feng, Caili Guo

    Abstract: In this paper, a received signal strength assisted Perspective-three-Point positioning algorithm (R-P3P) is proposed for visible light positioning (VLP) systems. The basic idea of R-P3P is to jointly use visual and strength information to estimate the receiver position using 3 LEDs regardless of the LEDs' orientations. R-P3P first utilizes visual information captured by the camera to estimate the incide…

    Submitted 21 May, 2020; originally announced May 2020.

    Comments: arXiv admin note: substantial text overlap with arXiv:2004.06294

  50. Attenuation of Several Common Building Materials in Millimeter-Wave Frequency Bands: 28, 73 and 91 GHz

    Authors: Nozhan Hosseini, Mahfuza Khatun, Changyu Guo, Kairui Du, Ozgur Ozdemir, David W. Matolak, Ismail Guvenc, Hani Mehrpouyan

    Abstract: Future cellular systems will make use of millimeter wave (mmWave) frequency bands. Many users in these bands are located indoors, i.e., inside buildings, homes, and offices. Typical building material attenuations in these high frequency ranges are of interest for link budget calculations. In this paper, we report on a collaborative measurement campaign to find the attenuation of several typical bu…

    Submitted 26 April, 2020; originally announced April 2020.

    Comments: keywords: mm-wave; attenuation