Zum Hauptinhalt springen

Showing 1–50 of 61 results for author: Lin, M

Searching in archive eess. Search in all archives.
.
  1. arXiv:2408.03654  [pdf, other

    eess.IV cs.CV

    Unsupervised Detection of Fetal Brain Anomalies using Denoising Diffusion Models

    Authors: Markus Ditlev Sjøgren Olsen, Jakob Ambsdorf, Manxi Lin, Caroline Taksøe-Vester, Morten Bo Søndergaard Svendsen, Anders Nymark Christensen, Mads Nielsen, Martin Grønnebæk Tolsgaard, Aasa Feragen, Paraskevas Pegios

    Abstract: Congenital malformations of the brain are among the most common fetal abnormalities that impact fetal development. Previous anomaly detection methods on ultrasound images are based on supervised learning, rely on manual annotations, and risk missing underrepresented categories. In this work, we frame fetal brain anomaly detection as an unsupervised task using diffusion models. To this end, we empl… ▽ More

    Submitted 7 August, 2024; originally announced August 2024.

    Comments: Accepted at ASMUS@MICCAI 2024

  2. arXiv:2407.09404  [pdf, other

    math.OC eess.SY physics.soc-ph

    CAACS: A Carbon Aware Ant Colony System

    Authors: Marina Lin, Laura P. Schaposnik

    Abstract: In an era where sustainability is becoming increasingly crucial, we introduce a new Carbon-Aware Ant Colony System (CAACS) Algorithm that addresses the Generalized Traveling Salesman Problem (GTSP) while minimizing carbon emissions. This novel approach leverages the natural efficiency of ant colony pheromone trails to find optimal routes, balancing both environmental and economic objectives. By in… ▽ More

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: 31 figures, 23 pages

  3. arXiv:2406.02859   

    eess.AS cs.SD

    ConPCO: Preserving Phoneme Characteristics for Automatic Pronunciation Assessment Leveraging Contrastive Ordinal Regularization

    Authors: Bi-Cheng Yan, Wei-Cheng Chao, Jiun-Ting Li, Yi-Cheng Wang, Hsin-Wei Wang, Meng-Shin Lin, Berlin Chen

    Abstract: Automatic pronunciation assessment (APA) manages to evaluate the pronunciation proficiency of a second language (L2) learner in a target language. Existing efforts typically draw on regression models for proficiency score prediction, where the models are trained to estimate target values without explicitly accounting for phoneme-awareness in the feature space. In this paper, we propose a contrasti… ▽ More

    Submitted 8 June, 2024; v1 submitted 4 June, 2024; originally announced June 2024.

    Comments: This paper has been withdrawn because the authors aim to achieve better organization in writing and more detailed experimental analysis

  4. arXiv:2404.07519  [pdf, other

    eess.IV cs.AI

    LATTE: Low-Precision Approximate Attention with Head-wise Trainable Threshold for Efficient Transformer

    Authors: Jiing-Ping Wang, Ming-Guang Lin, An-Yeu, Wu

    Abstract: With the rise of Transformer models in NLP and CV domain, Multi-Head Attention has been proven to be a game-changer. However, its expensive computation poses challenges to the model throughput and efficiency, especially for the long sequence tasks. Exploiting the sparsity in attention has been proven to be an effective way to reduce computation. Nevertheless, prior works do not consider the variou… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

  5. arXiv:2404.00032  [pdf, other

    cs.HC cs.CV eess.IV

    Deployment of Deep Learning Model in Real World Clinical Setting: A Case Study in Obstetric Ultrasound

    Authors: Chun Kit Wong, Mary Ngo, Manxi Lin, Zahra Bashir, Amihai Heen, Morten Bo Søndergaard Svendsen, Martin Grønnebæk Tolsgaard, Anders Nymark Christensen, Aasa Feragen

    Abstract: Despite the rapid development of AI models in medical image analysis, their validation in real-world clinical settings remains limited. To address this, we introduce a generic framework designed for deploying image-based AI models in such settings. Using this framework, we deployed a trained model for fetal ultrasound standard plane detection, and evaluated it in real-time sessions with both novic… ▽ More

    Submitted 22 March, 2024; originally announced April 2024.

    Comments: 10 pages

  6. arXiv:2403.08700  [pdf, other

    eess.IV cs.CV cs.HC cs.LG

    Diffusion-based Iterative Counterfactual Explanations for Fetal Ultrasound Image Quality Assessment

    Authors: Paraskevas Pegios, Manxi Lin, Nina Weng, Morten Bo Søndergaard Svendsen, Zahra Bashir, Siavash Bigdeli, Anders Nymark Christensen, Martin Tolsgaard, Aasa Feragen

    Abstract: Obstetric ultrasound image quality is crucial for accurate diagnosis and monitoring of fetal health. However, producing high-quality standard planes is difficult, influenced by the sonographer's expertise and factors like the maternal BMI or the fetus dynamics. In this work, we propose using diffusion-based counterfactual explainable AI to generate realistic high-quality standard planes from low-q… ▽ More

    Submitted 13 March, 2024; originally announced March 2024.

  7. arXiv:2403.06748  [pdf, other

    eess.IV cs.CV cs.LG

    Shortcut Learning in Medical Image Segmentation

    Authors: Manxi Lin, Nina Weng, Kamil Mikolaj, Zahra Bashir, Morten Bo Søndergaard Svendsen, Martin Tolsgaard, Anders Nymark Christensen, Aasa Feragen

    Abstract: Shortcut learning is a phenomenon where machine learning models prioritize learning simple, potentially misleading cues from data that do not generalize well beyond the training set. While existing research primarily investigates this in the realm of image classification, this study extends the exploration of shortcut learning into medical image segmentation. We demonstrate that clinical annotatio… ▽ More

    Submitted 27 June, 2024; v1 submitted 11 March, 2024; originally announced March 2024.

    Comments: 11 pages, 6 figures, accepted at MICCAI 2024

  8. arXiv:2401.15111  [pdf, other

    eess.IV cs.CV cs.LG

    Improving Fairness of Automated Chest X-ray Diagnosis by Contrastive Learning

    Authors: Mingquan Lin, Tianhao Li, Zhaoyi Sun, Gregory Holste, Ying Ding, Fei Wang, George Shih, Yifan Peng

    Abstract: Purpose: Limited studies exploring concrete methods or approaches to tackle and enhance model fairness in the radiology domain. Our proposed AI model utilizes supervised contrastive learning to minimize bias in CXR diagnosis. Materials and Methods: In this retrospective study, we evaluated our proposed method on two datasets: the Medical Imaging and Data Resource Center (MIDRC) dataset with 77,8… ▽ More

    Submitted 25 January, 2024; originally announced January 2024.

    Comments: 23 pages, 5 figures

    MSC Class: arms.org

  9. arXiv:2311.03390  [pdf

    cs.CV eess.IV

    FPGA-QHAR: Throughput-Optimized for Quantized Human Action Recognition on The Edge

    Authors: Azzam Alhussain, Mingjie Lin

    Abstract: Accelerating Human Action Recognition (HAR) efficiently for real-time surveillance and robotic systems on edge chips remains a challenging research field, given its high computational and memory requirements. This paper proposed an integrated end-to-end HAR scalable HW/SW accelerator co-design based on an enhanced 8-bit quantized Two-Stream SimpleNet-PyTorch CNN architecture. Our network accelerat… ▽ More

    Submitted 4 November, 2023; originally announced November 2023.

    Comments: 5 pages, 7 Figures, 2 tables, 20th IEEE HONET 2023

  10. arXiv:2311.02927  [pdf

    eess.IV physics.bio-ph

    Auto-ICell: An Accessible and Cost-Effective Integrative Droplet Microfluidic System for Real-Time Single-Cell Morphological and Apoptotic Analysis

    Authors: Yuanyuan Wei, Meiai Lin, Shanhang Luo, Syed Muhammad Tariq Abbasi, Liwei Tan, Guangyao Cheng, Bijie Bai, Yi-Ping Ho, Scott Wu Yuan, Ho-Pui Ho

    Abstract: The Auto-ICell system, a novel, and cost-effective integrated droplet microfluidic system, is introduced for real-time analysis of single-cell morphology and apoptosis. This system integrates a 3D-printed microfluidic chip with image analysis algorithms, enabling the generation of uniform droplet reactors and immediate image analysis. The system employs a color-based image analysis algorithm in th… ▽ More

    Submitted 6 November, 2023; originally announced November 2023.

    Comments: 22 pages, 5 figures

  11. arXiv:2310.13882  [pdf

    eess.SP

    NMR Spectra Denoising with Vandermonde Constraints

    Authors: Di Guo, Runmin Xu, Jinyu Wu, Meijin Lin, Xiaofeng Du, Xiaobo Qu

    Abstract: Nuclear magnetic resonance (NMR) spectroscopy serves as an important tool to analyze chemicals and proteins in bioengineering. However, NMR signals are easily contaminated by noise during the data acquisition, which can affect subsequent quantitative analysis. Therefore, denoising NMR signals has been a long-time concern. In this work, we propose an optimization model-based iterative denoising met… ▽ More

    Submitted 20 October, 2023; originally announced October 2023.

    Comments: 10 pages, 9 figures

  12. arXiv:2310.08210  [pdf, other

    eess.SY

    CLExtract: Recovering Highly Corrupted DVB/GSE Satellite Stream with Contrastive Learning

    Authors: Minghao Lin, Minghao Cheng, Dongsheng Luo, Yueqi Chen

    Abstract: Since satellite systems are playing an increasingly important role in our civilization, their security and privacy weaknesses are more and more concerned. For example, prior work demonstrates that the communication channel between maritime VSAT and ground segment can be eavesdropped on using consumer-grade equipment. The stream decoder GSExtract developed in this prior work performs well for most… ▽ More

    Submitted 12 October, 2023; originally announced October 2023.

    Comments: SpaceSec'23, 11 pages, 14 figures

  13. arXiv:2309.07178  [pdf

    q-bio.QM cs.AI cs.LG eess.SP

    CloudBrain-NMR: An Intelligent Cloud Computing Platform for NMR Spectroscopy Processing, Reconstruction and Analysis

    Authors: Di Guo, Sijin Li, Jun Liu, Zhangren Tu, Tianyu Qiu, Jingjing Xu, Liubin Feng, Donghai Lin, Qing Hong, Meijin Lin, Yanqin Lin, Xiaobo Qu

    Abstract: Nuclear Magnetic Resonance (NMR) spectroscopy has served as a powerful analytical tool for studying molecular structure and dynamics in chemistry and biology. However, the processing of raw data acquired from NMR spectrometers and subsequent quantitative analysis involves various specialized tools, which necessitates comprehensive knowledge in programming and NMR. Particularly, the emerging deep l… ▽ More

    Submitted 12 September, 2023; originally announced September 2023.

    Comments: 11 pages, 13 figures

  14. arXiv:2309.03779  [pdf, other

    cs.LG cs.AI cs.AR cs.OS eess.SY

    CPU frequency scheduling of real-time applications on embedded devices with temporal encoding-based deep reinforcement learning

    Authors: Ti Zhou, Man Lin

    Abstract: Small devices are frequently used in IoT and smart-city applications to perform periodic dedicated tasks with soft deadlines. This work focuses on developing methods to derive efficient power-management methods for periodic tasks on small devices. We first study the limitations of the existing Linux built-in methods used in small devices. We illustrate three typical workload/system patterns that a… ▽ More

    Submitted 7 September, 2023; originally announced September 2023.

    Comments: Accepted to Journal of Systems Architecture

    Journal ref: Journal of Systems Architecture, 2023

  15. arXiv:2307.13220  [pdf

    eess.IV cs.AI physics.med-ph

    One for Multiple: Physics-informed Synthetic Data Boosts Generalizable Deep Learning for Fast MRI Reconstruction

    Authors: Zi Wang, Xiaotong Yu, Chengyan Wang, Weibo Chen, Jiazheng Wang, Ying-Hua Chu, Hongwei Sun, Rushuai Li, Peiyong Li, Fan Yang, Haiwei Han, Taishan Kang, Jianzhong Lin, Chen Yang, Shufu Chang, Zhang Shi, Sha Hua, Yan Li, Juan Hu, Liuhong Zhu, Jianjun Zhou, Meijing Lin, Jiefeng Guo, Congbo Cai, Zhong Chen , et al. (3 additional authors not shown)

    Abstract: Magnetic resonance imaging (MRI) is a widely used radiological modality renowned for its radiation-free, comprehensive insights into the human body, facilitating medical diagnoses. However, the drawback of prolonged scan times hinders its accessibility. The k-space undersampling offers a solution, yet the resultant artifacts necessitate meticulous removal during image reconstruction. Although Deep… ▽ More

    Submitted 28 February, 2024; v1 submitted 24 July, 2023; originally announced July 2023.

    Comments: 38 pages, 19 figures, 5 tables

  16. Multi-Scale U-Shape MLP for Hyperspectral Image Classification

    Authors: Moule Lin, Weipeng Jing, Donglin Di, Guangsheng Chen, Houbing Song

    Abstract: Hyperspectral images have significant applications in various domains, since they register numerous semantic and spatial information in the spectral band with spatial variability of spectral signatures. Two critical challenges in identifying pixels of the hyperspectral image are respectively representing the correlated information among the local and global, as well as the abundant parameters of t… ▽ More

    Submitted 5 July, 2023; originally announced July 2023.

    Comments: 5 pages

    Journal ref: IEEE Geoscience and Remote Sensing Letters, vol. 19, pp. 1-5, 2022, Art no. 6006105

  17. arXiv:2307.07513  [pdf, other

    cs.AI cs.CL cs.CV cs.LG eess.IV

    An empirical study of using radiology reports and images to improve ICU mortality prediction

    Authors: Mingquan Lin, Song Wang, Ying Ding, Lihui Zhao, Fei Wang, Yifan Peng

    Abstract: Background: The predictive Intensive Care Unit (ICU) scoring system plays an important role in ICU management because it predicts important outcomes, especially mortality. Many scoring systems have been developed and used in the ICU. These scoring systems are primarily based on the structured clinical data in the electronic health record (EHR), which may suffer the loss of important clinical infor… ▽ More

    Submitted 20 June, 2023; originally announced July 2023.

    Comments: 21 pages, 5 figures, 7 tables

  18. arXiv:2306.11021  [pdf, other

    eess.SP

    CloudBrain-MRS: An Intelligent Cloud Computing Platform for in vivo Magnetic Resonance Spectroscopy Preprocessing, Quantification, and Analysis

    Authors: Xiaodie Chen, Jiayu Li, Dicheng Chen, Yirong Zhou, Zhangren Tu, Meijin Lin, Taishan Kang, Jianzhong Lin, Tao Gong, Liuhong Zhu, Jianjun Zhou, Lin Ou-yang, Jiefeng Guo, Jiyang Dong, Di Guo, Xiaobo Qu

    Abstract: Magnetic resonance spectroscopy (MRS) is an important clinical imaging method for diagnosis of diseases. MRS spectrum is used to observe the signal intensity of metabolites or further infer their concentrations. Although the magnetic resonance vendors commonly provide basic functions of spectra plots and metabolite quantification, the widespread clinical research of MRS is still limited due to the… ▽ More

    Submitted 6 September, 2023; v1 submitted 19 June, 2023; originally announced June 2023.

    Comments: 11 pages, 12 figures

  19. arXiv:2306.00838  [pdf, other

    q-bio.OT eess.IV

    The Brain Tumor Segmentation (BraTS-METS) Challenge 2023: Brain Metastasis Segmentation on Pre-treatment MRI

    Authors: Ahmed W. Moawad, Anastasia Janas, Ujjwal Baid, Divya Ramakrishnan, Rachit Saluja, Nader Ashraf, Leon Jekel, Raisa Amiruddin, Maruf Adewole, Jake Albrecht, Udunna Anazodo, Sanjay Aneja, Syed Muhammad Anwar, Timothy Bergquist, Evan Calabrese, Veronica Chiang, Verena Chung, Gian Marco Marco Conte, Farouk Dako, James Eddy, Ivan Ezhov, Ariana Familiar, Keyvan Farahani, Juan Eugenio Iglesias, Zhifan Jiang , et al. (206 additional authors not shown)

    Abstract: The translation of AI-generated brain metastases (BM) segmentation into clinical practice relies heavily on diverse, high-quality annotated medical imaging datasets. The BraTS-METS 2023 challenge has gained momentum for testing and benchmarking algorithms using rigorously annotated internationally compiled real-world datasets. This study presents the results of the segmentation challenge and chara… ▽ More

    Submitted 17 June, 2024; v1 submitted 1 June, 2023; originally announced June 2023.

  20. arXiv:2305.12901  [pdf

    eess.IV

    TSPTQ-ViT: Two-scaled post-training quantization for vision transformer

    Authors: Yu-Shan Tai, Ming-Guang Lin, An-Yeu, Wu

    Abstract: Vision transformers (ViTs) have achieved remarkable performance in various computer vision tasks. However, intensive memory and computation requirements impede ViTs from running on resource-constrained edge devices. Due to the non-normally distributed values after Softmax and GeLU, post-training quantization on ViTs results in severe accuracy degradation. Moreover, conventional methods fail to add… ▽ More

    Submitted 22 May, 2023; originally announced May 2023.

  21. An Automatic Guidance and Quality Assessment System for Doppler Imaging of Umbilical Artery

    Authors: Chun Kit Wong, Manxi Lin, Alberto Raheli, Zahra Bashir, Morten Bo Søndergaard Svendsen, Martin Grønnebæk Tolsgaard, Aasa Feragen, Anders Nymark Christensen

    Abstract: Examination of the umbilical artery with Doppler ultrasonography is performed to investigate blood supply to the fetus through the umbilical cord, which is vital for the monitoring of fetal health. Such examination involves several steps that must be performed correctly: identifying suitable sites on the umbilical artery for the measurement, acquiring the blood flow curve in the form of a Doppler… ▽ More

    Submitted 6 July, 2023; v1 submitted 11 April, 2023; originally announced April 2023.

    Comments: Fetal Ultrasound, Umbilical Artery, Doppler Ultrasound

    Journal ref: ASMUS 2023. Simplifying Medical Ultrasound pp 13-22. Lecture Notes in Computer Science, vol 14337

  22. arXiv:2303.07486  [pdf, other

    eess.AS cs.LG cs.SD

    Guided Speech Enhancement Network

    Authors: Yang Yang, Shao-Fu Shih, Hakan Erdogan, Jamie Menjay Lin, Chehung Lee, Yunpeng Li, George Sung, Matthias Grundmann

    Abstract: High quality speech capture has been widely studied for both voice communication and human computer interface reasons. To improve the capture performance, we can often find multi-microphone speech enhancement techniques deployed on various devices. Multi-microphone speech enhancement problem is often decomposed into two decoupled steps: a beamformer that provides spatial filtering and a single-cha… ▽ More

    Submitted 13 March, 2023; originally announced March 2023.

    Comments: Accepted to ICASSP 2023

  23. arXiv:2302.01493  [pdf

    eess.IV cs.CV physics.med-ph

    Deep Learning (DL)-based Automatic Segmentation of the Internal Pudendal Artery (IPA) for Reduction of Erectile Dysfunction in Definitive Radiotherapy of Localized Prostate Cancer

    Authors: Anjali Balagopal, Michael Dohopolski, Young Suk Kwon, Steven Montalvo, Howard Morgan, Ti Bai, Dan Nguyen, Xiao Liang, Xinran Zhong, Mu-Han Lin, Neil Desai, Steve Jiang

    Abstract: Background and purpose: Radiation-induced erectile dysfunction (RiED) is commonly seen in prostate cancer patients. Clinical trials have been developed in multiple institutions to investigate whether dose-sparing to the internal-pudendal-arteries (IPA) will improve retention of sexual potency. The IPA is usually not considered a conventional organ-at-risk (OAR) due to segmentation difficulty. In t… ▽ More

    Submitted 2 February, 2023; originally announced February 2023.

  24. arXiv:2211.13479  [pdf

    eess.SP

    Alternating Deep Low-Rank Approach for Exponential Function Reconstruction and Its Biomedical Magnetic Resonance Applications

    Authors: Yihui Huang, Zi Wang, Xinlin Zhang, Jian Cao, Zhangren Tu, Meijin Lin, Di Guo, Xiaobo Qu

    Abstract: Undersampling can accelerate the signal acquisition but at the cost of bringing in artifacts. Removing these artifacts is a fundamental problem in signal processing and this task is also called signal reconstruction. Through modeling signals as the superimposed exponential functions, deep learning has achieved fast and high-fidelity signal reconstruction by training a mapping from the undersampled… ▽ More

    Submitted 13 August, 2024; v1 submitted 24 November, 2022; originally announced November 2022.

    Comments: 12 pages

  25. arXiv:2210.11388  [pdf

    eess.IV cs.CV

    Physics-informed Deep Diffusion MRI Reconstruction with Synthetic Data: Break Training Data Bottleneck in Artificial Intelligence

    Authors: Chen Qian, Yuncheng Gao, Mingyang Han, Zi Wang, Dan Ruan, Yu Shen, Yaping Wu, Yirong Zhou, Chengyan Wang, Boyu Jiang, Ran Tao, Zhigang Wu, Jiazheng Wang, Liuhong Zhu, Yi Guo, Taishan Kang, Jianzhong Lin, Tao Gong, Chen Yang, Guoqiang Fei, Meijin Lin, Di Guo, Jianjun Zhou, Meiyun Wang, Xiaobo Qu

    Abstract: Diffusion magnetic resonance imaging (MRI) is the only imaging modality for non-invasive movement detection of in vivo water molecules, with significant clinical and research applications. Diffusion MRI (DWI) acquired by multi-shot techniques can achieve higher resolution, better signal-to-noise ratio, and lower geometric distortion than single-shot, but suffers from inter-shot motion-induced arti… ▽ More

    Submitted 5 February, 2024; v1 submitted 20 October, 2022; originally announced October 2022.

    Comments: 23 pages, 16 figures

  26. arXiv:2210.05673  [pdf

    eess.IV cs.CV stat.AP

    Performance Deterioration of Deep Learning Models after Clinical Deployment: A Case Study with Auto-segmentation for Definitive Prostate Cancer Radiotherapy

    Authors: Biling Wang, Michael Dohopolski, Ti Bai, Junjie Wu, Raquibul Hannan, Neil Desai, Aurelie Garant, Daniel Yang, Dan Nguyen, Mu-Han Lin, Robert Timmerman, Xinlei Wang, Steve Jiang

    Abstract: We evaluated the temporal performance of a deep learning (DL) based artificial intelligence (AI) model for auto segmentation in prostate radiotherapy, seeking to correlate its efficacy with changes in clinical landscapes. Our study involved 1328 prostate cancer patients who underwent definitive radiotherapy from January 2006 to August 2022 at the University of Texas Southwestern Medical Center. We… ▽ More

    Submitted 16 November, 2023; v1 submitted 10 October, 2022; originally announced October 2022.

  27. Novel Maximum Likelihood Estimation of Clock Skew in One-Way Broadcast Time Synchronization

    Authors: Fanrong Shi, Huailiang Li, Simon X. Yang, Xianguo Tuo, Maosong Lin

    Abstract: Clock skew compensation is essential for accurate time synchronization in wireless networks. However, contemporary clock skew estimation is based on inaccurate transmission time measurement, which makes credible estimation challenging. Based on one-way broadcast synchronization, this study presents a novel maximum likelihood estimation (MLE) with an innovative implementation to minimize the clock… ▽ More

    Submitted 30 July, 2022; originally announced August 2022.

    Comments: 10 pages, Journal

    Journal ref: IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, VOL. 67, NO. 11, NOVEMBER 2020

  28. arXiv:2207.10723  [pdf

    cs.AR eess.IV

    Hardware-Efficient Template-Based Deep CNNs Accelerator Design

    Authors: Azzam Alhussain, Mingjie Lin

    Abstract: Acceleration of Convolutional Neural Network (CNN) on edge devices has recently achieved a remarkable performance in image classification and object detection applications. This paper proposes an efficient and scalable CNN-based SoC-FPGA accelerator design that takes pre-trained weights with a 16-bit fixed-point quantization and target hardware specification to generate an optimized template capab… ▽ More

    Submitted 21 July, 2022; originally announced July 2022.

    Comments: 4 pages, 4 figures, 16th IEEE International Conference on Networking, Architecture, and Storage (IEEE NAS 2022), 3-4 October 2022

  29. arXiv:2205.11115  [pdf, other

    eess.IV cs.CV

    DTU-Net: Learning Topological Similarity for Curvilinear Structure Segmentation

    Authors: Manxi Lin, Zahra Bashir, Martin Grønnebæk Tolsgaard, Anders Nymark Christensen, Aasa Feragen

    Abstract: Curvilinear structure segmentation is important in medical imaging, quantifying structures such as vessels, airways, neurons, or organ boundaries in 2D slices. Segmentation via pixel-wise classification often fails to capture the small and low-contrast curvilinear structures. Prior topological information is typically used to address this problem, often at an expensive computational cost, and some… ▽ More

    Submitted 4 March, 2023; v1 submitted 23 May, 2022; originally announced May 2022.

    Comments: 12 pages, 4 figures

  30. arXiv:2204.11669  [pdf

    eess.IV cs.AI physics.med-ph

    Deep-learning-enabled Brain Hemodynamic Mapping Using Resting-state fMRI

    Authors: Xirui Hou, Pengfei Guo, Puyang Wang, Peiying Liu, Doris D. M. Lin, Hongli Fan, Yang Li, Zhiliang Wei, Zixuan Lin, Dengrong Jiang, Jin Jin, Catherine Kelly, Jay J. Pillai, Judy Huang, Marco C. Pinho, Binu P. Thomas, Babu G. Welch, Denise C. Park, Vishal M. Patel, Argye E. Hillis, Hanzhang Lu

    Abstract: Cerebrovascular disease is a leading cause of death globally. Prevention and early intervention are known to be the most effective forms of its management. Non-invasive imaging methods hold great promises for early stratification, but at present lack the sensitivity for personalized prognosis. Resting-state functional magnetic resonance imaging (rs-fMRI), a powerful tool previously used for mappin… ▽ More

    Submitted 25 April, 2022; originally announced April 2022.

    Journal ref: npj Digital Medicine (2023) 116

  31. arXiv:2204.09599  [pdf, other

    cs.CL cs.AI eess.SP eess.SY

    Radiology Text Analysis System (RadText): Architecture and Evaluation

    Authors: Song Wang, Mingquan Lin, Ying Ding, George Shih, Zhiyong Lu, Yifan Peng

    Abstract: Analyzing radiology reports is a time-consuming and error-prone task, which raises the need for an efficient automated radiology report analysis system to alleviate the workloads of radiologists and encourage precise diagnosis. In this work, we present RadText, an open-source radiology text analysis system developed by Python. RadText offers an easy-to-use text analysis pipeline, including de-iden… ▽ More

    Submitted 19 March, 2022; originally announced April 2022.

    Comments: 9 pages, 2 figures, Accepted by 2022 IEEE 10th International Conference on Healthcare Informatics (ICHI)

  32. arXiv:2203.04295  [pdf, other

    eess.IV cs.CV

    Region Specific Optimization (RSO)-based Deep Interactive Registration

    Authors: Ti Bai, Muhan Lin, Xiao Liang, Biling Wang, Michael Dohopolski, Bin Cai, Dan Nguyen, Steve Jiang

    Abstract: Medical image registration is a fundamental and vital task which will affect the efficacy of many downstream clinical tasks. Deep learning (DL)-based deformable image registration (DIR) methods have been investigated, showing state-of-the-art performance. A test time optimization (TTO) technique was proposed to further improve the DL models' performance. Despite the substantial accuracy improvemen… ▽ More

    Submitted 7 March, 2022; originally announced March 2022.

  33. arXiv:2203.03844  [pdf, other

    eess.IV cs.CV

    Dynamic Dual Trainable Bounds for Ultra-low Precision Super-Resolution Networks

    Authors: Yunshan Zhong, Mingbao Lin, Xunchao Li, Ke Li, Yunhang Shen, Fei Chao, Yongjian Wu, Rongrong Ji

    Abstract: Light-weight super-resolution (SR) models have received considerable attention for their serviceability in mobile devices. Many efforts employ network quantization to compress SR models. However, these methods suffer from severe performance degradation when quantizing the SR models to ultra-low precision (e.g., 2-bit and 3-bit) with the low-cost layer-wise quantizer. In this paper, we identify tha… ▽ More

    Submitted 3 July, 2022; v1 submitted 7 March, 2022; originally announced March 2022.

    Comments: ECCV2022

  34. arXiv:2203.02557  [pdf, other

    cs.CV eess.IV

    UVCGAN: UNet Vision Transformer cycle-consistent GAN for unpaired image-to-image translation

    Authors: Dmitrii Torbunov, Yi Huang, Haiwang Yu, Jin Huang, Shinjae Yoo, Meifeng Lin, Brett Viren, Yihui Ren

    Abstract: Unpaired image-to-image translation has broad applications in art, design, and scientific simulations. One early breakthrough was CycleGAN that emphasizes one-to-one mappings between two unpaired image domains via generative-adversarial networks (GAN) coupled with the cycle-consistency constraint, while more recent works promote one-to-many mapping to boost diversity of the translated images. Moti… ▽ More

    Submitted 18 October, 2022; v1 submitted 4 March, 2022; originally announced March 2022.

    Comments: Accepted by WACV2023, contains 5 pages, 2 figures, 2 tables

  35. arXiv:2202.05492  [pdf, other

    eess.IV cs.CV

    Entroformer: A Transformer-based Entropy Model for Learned Image Compression

    Authors: Yichen Qian, Ming Lin, Xiuyu Sun, Zhiyu Tan, Rong Jin

    Abstract: One critical component in lossy deep image compression is the entropy model, which predicts the probability distribution of the quantized latent representation in the encoding and decoding modules. Previous works build entropy models upon convolutional neural networks which are inefficient in capturing global dependencies. In this work, we propose a novel transformer-based entropy model, termed En… ▽ More

    Submitted 14 March, 2022; v1 submitted 11 February, 2022; originally announced February 2022.

    Comments: Accepted at ICLR 2022 for poster. Camera ready version

    Journal ref: International Conference on Learning Representations (2022)

  36. arXiv:2201.06878  [pdf

    cs.LG eess.IV

    Hardware-Efficient Deconvolution-Based GAN for Edge Computing

    Authors: Azzam Alhussain, Mingjie Lin

    Abstract: Generative Adversarial Networks (GAN) are cutting-edge algorithms for generating new data samples based on the learned data distribution. However, its performance comes at a significant cost in terms of computation and memory requirements. In this paper, we proposed an HW/SW co-design approach for training quantized deconvolution GAN (QDCGAN) implemented on FPGA using a scalable streaming dataflow… ▽ More

    Submitted 18 January, 2022; originally announced January 2022.

    Comments: To be presented in the 56th Annual Conference on Information Sciences and System (CISS), 9-11 March 2022, and published in the IEEE Xplore Digital library

    Journal ref: 2022 IEEE 56th Annual Conference on Information Sciences and Systems (CISS), March 9-11, 2022. pp.1-5

  37. arXiv:2110.15790  [pdf, other

    cs.IR cs.AI cs.MM cs.SD cs.SI eess.AS

    LSTM-RPA: A Simple but Effective Long Sequence Prediction Algorithm for Music Popularity Prediction

    Authors: Kun Li, Meng Li, Yanling Li, Min Lin

    Abstract: The big data about music history contains information about time and users' behavior. Researchers could predict the trend of popular songs accurately by analyzing this data. The traditional trend prediction models can better predict the short trend than the long trend. In this paper, we proposed the improved LSTM Rolling Prediction Algorithm (LSTM-RPA), which combines LSTM historical input with cu… ▽ More

    Submitted 27 October, 2021; originally announced October 2021.

  38. arXiv:2110.02411  [pdf, other

    cs.SD cs.CL cs.LG eess.AS

    Voice Aging with Audio-Visual Style Transfer

    Authors: Justin Wilson, Sunyeong Park, Seunghye J. Wilson, Ming C. Lin

    Abstract: Face aging techniques have used generative adversarial networks (GANs) and style transfer learning to transform one's appearance to look younger/older. Identity is maintained by conditioning these generative networks on a learned vector representation of the source content. In this work, we apply a similar approach to age a speaker's voice, referred to as voice aging. We first analyze the classifi… ▽ More

    Submitted 5 October, 2021; originally announced October 2021.

  39. arXiv:2110.02405  [pdf, other

    cs.CV cs.SD eess.AS

    Echo-Reconstruction: Audio-Augmented 3D Scene Reconstruction

    Authors: Justin Wilson, Nicholas Rewkowski, Ming C. Lin, Henry Fuchs

    Abstract: Reflective and textureless surfaces such as windows, mirrors, and walls can be a challenge for object and scene reconstruction. These surfaces are often poorly reconstructed and filled with depth discontinuities and holes, making it difficult to cohesively reconstruct scenes that contain these planar discontinuities. We propose Echoreconstruction, an audio-visual method that uses the reflections o… ▽ More

    Submitted 5 October, 2021; originally announced October 2021.

  40. arXiv:2110.02404  [pdf, other

    cs.CV cs.SD eess.AS

    3D-MOV: Audio-Visual LSTM Autoencoder for 3D Reconstruction of Multiple Objects from Video

    Authors: Justin Wilson, Ming C. Lin

    Abstract: 3D object reconstructions of transparent and concave structured objects, with inferred material properties, remains an open research problem for robot navigation in unstructured environments. In this paper, we propose a multimodal single- and multi-frame neural network for 3D reconstructions using audio-visual inputs. Our trained reconstruction LSTM autoencoder 3D-MOV accepts multiple inputs to ac… ▽ More

    Submitted 5 October, 2021; originally announced October 2021.

  41. arXiv:2108.08731  [pdf

    physics.med-ph eess.IV

    Registration-Guided Deep Learning Image Segmentation for Cone Beam CT-based Online Adaptive Radiotherapy

    Authors: Lin Ma, Weicheng Chi, Howard E. Morgan, Mu-Han Lin, Mingli Chen, David Sher, Dominic Moon, Dat T. Vo, Vladimir Avkshtol, Weiguo Lu, Xuejun Gu

    Abstract: Adaptive radiotherapy (ART), especially online ART, effectively accounts for positioning errors and anatomical changes. One key component of online ART is accurately and efficiently delineating organs at risk (OARs) and targets on online images, such as CBCT, to meet the online demands of plan evaluation and adaptation. Deep learning (DL)-based automatic segmentation has gained great success in se… ▽ More

    Submitted 19 August, 2021; originally announced August 2021.

    Comments: 16 pages, 6 figures

  42. arXiv:2107.13465  [pdf

    cs.CV eess.IV

    A Proof-of-Concept Study of Artificial Intelligence Assisted Contour Revision

    Authors: Ti Bai, Anjali Balagopal, Michael Dohopolski, Howard E. Morgan, Rafe McBeth, Jun Tan, Mu-Han Lin, David J. Sher, Dan Nguyen, Steve Jiang

    Abstract: Automatic segmentation of anatomical structures is critical for many medical applications. However, the results are not always clinically acceptable and require tedious manual revision. Here, we present a novel concept called artificial intelligence assisted contour revision (AIACR) and demonstrate its feasibility. The proposed clinical workflow of AIACR is as follows given an initial contour that… ▽ More

    Submitted 28 July, 2021; originally announced July 2021.

  43. arXiv:2107.09086  [pdf, other

    astro-ph.EP astro-ph.IM cs.LG eess.IV

    DPNNet-2.0 Part I: Finding hidden planets from simulated images of protoplanetary disk gaps

    Authors: Sayantan Auddy, Ramit Dey, Min-Kai Lin, Cassandra Hall

    Abstract: The observed sub-structures, like annular gaps, in dust emissions from protoplanetary disk, are often interpreted as signatures of embedded planets. Fitting a model of planetary gaps to these observed features using customized simulations or empirical relations can reveal the characteristics of the hidden planets. However, customized fitting is often impractical owing to the increasing sample size… ▽ More

    Submitted 19 July, 2021; originally announced July 2021.

    Comments: 15 pages, 10 figures, to appear in ApJ

  44. arXiv:2105.06272  [pdf, other

    eess.SP

    Outage Constrained Robust Secure Beamforming in Cognitive Satellite-Aerial Networks

    Authors: Bai Zhao, Min Lin, Ming Cheng, Wei-Ping Zhu, Naofal Al-Dhahir

    Abstract: This paper proposes a robust beamforming scheme to enhance the physical layer security (PLS) of multicast transmission in a cognitive satellite and aerial network (CSAN) operating in the millimeter wave frequency band. Based on imperfect channel state information (CSI) of both eavesdroppers (Eves) and primary users (PUs), we maximize the minimum achievable secrecy rate (ASR) of the secondary users… ▽ More

    Submitted 13 May, 2021; originally announced May 2021.

  45. arXiv:2105.06023  [pdf, other

    cs.IT eess.SP

    Robust Beamforming for Enhancing Security in Multibeam Satellite Systems

    Authors: Jian Zhang, Min Lin, Jian Ouyang, Wei-Ping Zhu, Tomaso de Cola

    Abstract: This paper proposes a robust beamforming (BF) scheme to enhance physical layer security (PLS) of the downlink of a multibeam satellite system in the presence of either uncoordinated or coordinated eavesdroppers (Eves). Specifically, with knowing only the approximate locations of the Eves, we aim at maximizing the worst-case achievable secrecy rate (ASR) of the legitimate user (LU), subject to the… ▽ More

    Submitted 12 May, 2021; originally announced May 2021.

  46. arXiv:2105.02471  [pdf, other

    eess.SP cs.SD eess.AS math.PR math.ST

    Signal Analysis via the Stochastic Geometry of Spectrogram Level Sets

    Authors: Subhroshekhar Ghosh, Meixia Lin, Dongfang Sun

    Abstract: Spectrograms are fundamental tools in time-frequency analysis, being the squared magnitude of the so-called short time Fourier transform (STFT). Signal analysis via spectrograms has traditionally explored their peaks, i.e. their maxima. This is complemented by a recent interest in their zeros or minima, following seminal work by Flandrin and others, which exploits connections with Gaussian analyti… ▽ More

    Submitted 21 March, 2022; v1 submitted 6 May, 2021; originally announced May 2021.

    Journal ref: IEEE Transactions on Signal Processing, Vol. 70, 2022

  47. arXiv:2103.13588  [pdf, other

    eess.IV cs.CV physics.med-ph

    Artificial Intelligence in Tumor Subregion Analysis Based on Medical Imaging: A Review

    Authors: Mingquan Lin, Jacob Wynne, Yang Lei, Tonghe Wang, Walter J. Curran, Tian Liu, Xiaofeng Yang

    Abstract: Medical imaging is widely used in cancer diagnosis and treatment, and artificial intelligence (AI) has achieved tremendous success in various tasks of medical image analysis. This paper reviews AI-based tumor subregion analysis in medical imaging. We summarize the latest AI-based methods for tumor subregion analysis and their applications. Specifically, we categorize the AI-based methods by traini… ▽ More

    Submitted 24 March, 2021; originally announced March 2021.

  48. arXiv:2102.01289  [pdf, other

    eess.IV cs.CV

    Mobile-end Tone Mapping based on Integral Image and Integral Histogram

    Authors: Jie Yang, Mengchen Lin, Ziyi Liu, Ulian Shahnovich, Orly Yadid-Pecht

    Abstract: Wide dynamic range (WDR) image tone mapping is in high demand in many applications like film production, security monitoring, and photography. It is especially crucial for mobile devices because most of the images taken today are from mobile phones, hence such technology is highly demanded in the consumer market of mobile devices and is essential for a good customer experience. However, high-quali… ▽ More

    Submitted 1 February, 2021; originally announced February 2021.

  49. arXiv:2102.00348  [pdf, other

    eess.IV cs.CV

    Deep Reformulated Laplacian Tone Mapping

    Authors: Jie Yang, Ziyi Liu, Mengchen Lin, Svetlana Yanushkevich, Orly Yadid-Pecht

    Abstract: Wide dynamic range (WDR) images contain more scene details and contrast when compared to common images. However, it requires tone mapping to process the pixel values in order to display properly. The details of WDR images can diminish during the tone mapping process. In this work, we address the problem by combining a novel reformulated Laplacian pyramid and deep learning. The reformulated Laplaci… ▽ More

    Submitted 30 January, 2021; originally announced February 2021.

  50. arXiv:2012.02033  [pdf, ps, other

    cs.CV eess.IV

    SuperOCR: A Conversion from Optical Character Recognition to Image Captioning

    Authors: Baohua Sun, Michael Lin, Hao Sha, Lin Yang

    Abstract: Optical Character Recognition (OCR) has many real world applications. The existing methods normally detect where the characters are, and then recognize the character for each detected location. Thus the accuracy of characters recognition is impacted by the performance of characters detection. In this paper, we propose a method for recognizing characters without detecting the location of each chara… ▽ More

    Submitted 21 November, 2020; originally announced December 2020.

    Comments: 8 pages, 2 figures, 2 tables