Showing 1–45 of 45 results for author: Su, J

Searching in archive eess.
  1. arXiv:2408.16126  [pdf, other]

    cs.SD cs.AI cs.LG eess.AS

    Improving Generalization of Speech Separation in Real-World Scenarios: Strategies in Simulation, Optimization, and Evaluation

    Authors: Ke Chen, Jiaqi Su, Taylor Berg-Kirkpatrick, Shlomo Dubnov, Zeyu Jin

    Abstract: Achieving robust speech separation for overlapping speakers in various acoustic environments with noise and reverberation remains an open challenge. Although existing datasets are available to train separators for specific scenarios, they do not effectively generalize across diverse real-world scenarios. In this paper, we present a novel data simulation pipeline that produces diverse training data…

    Submitted 28 August, 2024; originally announced August 2024.

    Comments: In Proceedings of the 25th Annual Conference of the International Speech Communication Association, Interspeech 2024

  2. arXiv:2408.08101  [pdf]

    eess.SY

    Stochastic Real-Time Economic Dispatch for Integrated Electric and Gas Systems Considering Uncertainty Propagation and Pipeline Leakage

    Authors: eiyao Zhao, Zhengshuo Li, Jiahui Zhang, Xiang Bai, Jia Su

    Abstract: Gas-fired units (GFUs) with rapid regulation capabilities are considered an effective tool to mitigate fluctuations in the generation of renewable energy sources and have coupled electricity power systems (EPSs) and natural gas systems (NGSs) more tightly. However, this tight coupling leads to uncertainty propagation, a challenge for the real-time dispatch of such integrated electric and gas syste…

    Submitted 15 August, 2024; originally announced August 2024.

  3. arXiv:2405.09470  [pdf, other]

    cs.SD cs.CR cs.LG eess.AS

    Towards Evaluating the Robustness of Automatic Speech Recognition Systems via Audio Style Transfer

    Authors: Weifei Jin, Yuxin Cao, Junjie Su, Qi Shen, Kai Ye, Derui Wang, Jie Hao, Ziyao Liu

    Abstract: In light of the widespread application of Automatic Speech Recognition (ASR) systems, their security concerns have received much more attention than ever before, primarily due to the susceptibility of Deep Neural Networks. Previous studies have illustrated that surreptitiously crafting adversarial perturbations enables the manipulation of speech recognition systems, resulting in the production of…

    Submitted 15 May, 2024; originally announced May 2024.

    Comments: Accepted to SecTL (AsiaCCS Workshop) 2024

  4. arXiv:2405.04867  [pdf, other]

    eess.IV cs.CV

    MIPI 2024 Challenge on Demosaic for HybridEVS Camera: Methods and Results

    Authors: Yaqi Wu, Zhihao Fan, Xiaofeng Chu, Jimmy S. Ren, Xiaoming Li, Zongsheng Yue, Chongyi Li, Shangcheng Zhou, Ruicheng Feng, Yuekun Dai, Peiqing Yang, Chen Change Loy, Senyan Xu, Zhijing Sun, Jiaying Zhu, Yurui Zhu, Xueyang Fu, Zheng-Jun Zha, Jun Cao, Cheng Li, Shu Chen, Liang Ma, Shiyang Zhou, Haijin Zeng, Kai Feng, et al. (24 additional authors not shown)

    Abstract: The increasing demand for computational photography and imaging on mobile platforms has led to the widespread development and integration of advanced image sensors with novel algorithms in camera systems. However, the scarcity of high-quality data for research and the rare opportunity for in-depth exchange of views from industry and academia constrain the development of mobile intelligent photogra…

    Submitted 8 May, 2024; originally announced May 2024.

    Comments: MIPI@CVPR2024. Website: https://mipi-challenge.org/MIPI2024/

  5. In-situ process monitoring and adaptive quality enhancement in laser additive manufacturing: a critical review

    Authors: Lequn Chen, Guijun Bi, Xiling Yao, Jinlong Su, Chaolin Tan, Wenhe Feng, Michalis Benakis, Youxiang Chew, Seung Ki Moon

    Abstract: Laser Additive Manufacturing (LAM) presents unparalleled opportunities for fabricating complex, high-performance structures and components with unique material properties. Despite these advancements, achieving consistent part quality and process repeatability remains challenging. This paper provides a comprehensive review of various state-of-the-art in-situ process monitoring techniques, including…

    Submitted 21 April, 2024; originally announced April 2024.

    Comments: 107 Pages, 29 Figures. Paper Accepted At Journal of Manufacturing Systems

  6. arXiv:2401.07041  [pdf, other]

    eess.IV cs.CV

    An automated framework for brain vessel centerline extraction from CTA images

    Authors: Sijie Liu, Ruisheng Su, Jianghang Su, Jingmin Xin, Jiayi Wu, Wim van Zwam, Pieter Jan van Doormaal, Aad van der Lugt, Wiro J. Niessen, Nanning Zheng, Theo van Walsum

    Abstract: Accurate automated extraction of brain vessel centerlines from CTA images plays an important role in diagnosis and therapy of cerebrovascular diseases, such as stroke. However, this task remains challenging due to the complex cerebrovascular structure, the varying imaging quality, and vessel pathology effects. In this paper, we consider automatic lumen segmentation generation without additional an…

    Submitted 13 January, 2024; originally announced January 2024.

  7. arXiv:2312.13304  [pdf, other]

    eess.IV cs.CV

    End-to-end Rain Streak Removal with RAW Images

    Authors: GuoDong Du, HaoJian Deng, JiaHao Su, Yuan Huang

    Abstract: In this work we address the problem of rain streak removal with RAW images. The general approach is firstly processing RAW data into RGB images and removing rain streak with RGB images. Actually the original information of rain in RAW images is affected by image signal processing (ISP) pipelines including none-linear algorithms, unexpected noise, artifacts and so on. It gains more benefit to direc…

    Submitted 19 December, 2023; originally announced December 2023.

    Comments: 10 pages, 5 figures,4 tables, conference

  8. arXiv:2310.17471  [pdf, other]

    cs.IT cs.DC cs.LG cs.NI eess.SP

    Foundation Model Based Native AI Framework in 6G with Cloud-Edge-End Collaboration

    Authors: Xiang Chen, Zhiheng Guo, Xijun Wang, Howard H. Yang, Chenyuan Feng, Junshen Su, Sihui Zheng, Tony Q. S. Quek

    Abstract: Future wireless communication networks are in a position to move beyond data-centric, device-oriented connectivity and offer intelligent, immersive experiences based on task-oriented connections, especially in the context of the thriving development of pre-trained foundation models (PFM) and the evolving vision of 6G native artificial intelligence (AI). Therefore, redefining modes of collaboration…

    Submitted 26 October, 2023; originally announced October 2023.

    Comments: 8 pages, 4 figures, 1 table

  9. arXiv:2308.14763  [pdf, other]

    eess.AS cs.CL cs.SD

    VoiceBank-2023: A Multi-Speaker Mandarin Speech Corpus for Constructing Personalized TTS Systems for the Speech Impaired

    Authors: Jia-Jyu Su, Pang-Chen Liao, Yen-Ting Lin, Wu-Hao Li, Guan-Ting Liou, Cheng-Che Kao, Wei-Cheng Chen, Jen-Chieh Chiang, Wen-Yang Chang, Pin-Han Lin, Chen-Yu Chiang

    Abstract: Services of personalized TTS systems for the Mandarin-speaking speech impaired are rarely mentioned. Taiwan started the VoiceBanking project in 2020, aiming to build a complete set of services to deliver personalized Mandarin TTS systems to amyotrophic lateral sclerosis patients. This paper reports the corpus design, corpus recording, data purging and correction for the corpus, and evaluations of…

    Submitted 27 August, 2023; originally announced August 2023.

    Comments: submitted to 26th International Conference of the ORIENTAL-COCOSDA

  10. arXiv:2306.10689  [pdf, other]

    eess.IV cs.CV

    Realistic Restorer: artifact-free flow restorer(AF2R) for MRI motion artifact removal

    Authors: Jiandong Su, Kun Shang, Dong Liang

    Abstract: Motion artifact is a major challenge in magnetic resonance imaging (MRI) that severely degrades image quality, reduces examination efficiency, and makes accurate diagnosis difficult. However, previous methods often relied on implicit models for artifact correction, resulting in biases in modeling the artifact formation mechanism and characterizing the relationship between artifact information and…

    Submitted 19 June, 2023; originally announced June 2023.

  11. arXiv:2306.10520  [pdf, other]

    eess.IV cs.CV

    RetinexFlow for CT metal artifact reduction

    Authors: Jiandong Su, Ce Wang, Yinsheng Li, Kun Shang, Dong Liang

    Abstract: Metal artifacts is a major challenge in computed tomography (CT) imaging, significantly degrading image quality and making accurate diagnosis difficult. However, previous methods either require prior knowledge of the location of metal implants, or have modeling deviations with the mechanism of artifact formation, which limits the ability to obtain high-quality CT images. In this work, we formulate…

    Submitted 18 June, 2023; originally announced June 2023.

  12. arXiv:2306.03835  [pdf, other]

    eess.IV cs.CV cs.LG

    Atrial Septal Defect Detection in Children Based on Ultrasound Video Using Multiple Instances Learning

    Authors: Yiman Liu, Qiming Huang, Xiaoxiang Han, Tongtong Liang, Zhifang Zhang, Lijun Chen, Jinfeng Wang, Angelos Stefanidis, Jionglong Su, Jiangang Chen, Qingli Li, Yuqi Zhang

    Abstract: Purpose: Congenital heart defect (CHD) is the most common birth defect. Thoracic echocardiography (TTE) can provide sufficient cardiac structure information, evaluate hemodynamics and cardiac function, and is an effective method for atrial septal defect (ASD) examination. This paper aims to study a deep learning method based on cardiac ultrasound video to assist in ASD diagnosis. Materials and met…

    Submitted 6 June, 2023; originally announced June 2023.

  13. arXiv:2304.11341  [pdf, ps, other]

    cs.IT eess.SP

    Performance Analysis and Optimal Design of HARQ-IR-Aided Terahertz Communications

    Authors: Ziyang Song, Zheng Shi, Jiaji Su, Qingping Dou, Guanghua Yang, Haichuan Ding, Shaodan Ma

    Abstract: Terahertz (THz) communications are envisioned to be a promising technology for 6G thanks to its broad bandwidth. However, the large path loss, antenna misalignment, and atmospheric influence of THz communications severely deteriorate its reliability. To address this, hybrid automatic repeat request (HARQ) is recognized as an effective technique to ensure reliable THz communications. This paper del…

    Submitted 22 April, 2023; originally announced April 2023.

    Comments: Blockage, hybrid automatic repeat request (HARQ), outage probability, terahertz (THz) communications

  14. arXiv:2304.09620  [pdf]

    eess.IV cs.CV

    DCELANM-Net:Medical Image Segmentation based on Dual Channel Efficient Layer Aggregation Network with Learner

    Authors: Chengzhun Lu, Zhangrun Xia, Krzysztof Przystupa, Orest Kochan, Jun Su

    Abstract: The DCELANM-Net structure, which this article offers, is a model that ingeniously combines a Dual Channel Efficient Layer Aggregation Network (DCELAN) and a Micro Masked Autoencoder (Micro-MAE). On the one hand, for the DCELAN, the features are more effectively fitted by deepening the network structure; the deeper network can successfully learn and fuse the features, which can more accurately loca…

    Submitted 19 April, 2023; originally announced April 2023.

  15. arXiv:2304.04598  [pdf]

    cs.SD eess.AS eess.SP

    In-situ crack and keyhole pore detection in laser directed energy deposition through acoustic signal and deep learning

    Authors: Lequn Chen, Xiling Yao, Chaolin Tan, Weiyang He, Jinlong Su, Fei Weng, Youxiang Chew, Nicholas Poh Huat Ng, Seung Ki Moon

    Abstract: Cracks and keyhole pores are detrimental defects in alloys produced by laser directed energy deposition (LDED). Laser-material interaction sound may hold information about underlying complex physical events such as crack propagation and pores formation. However, due to the noisy environment and intricate signal content, acoustic-based monitoring in LDED has received little attention. This paper pr…

    Submitted 10 April, 2023; originally announced April 2023.

    Comments: 36 Pages, 16 Figures, accepted at journal Additive Manufacturing

  16. arXiv:2211.02419  [pdf, other]

    eess.IV cs.CV cs.LG

    High-Resolution Boundary Detection for Medical Image Segmentation with Piece-Wise Two-Sample T-Test Augmented Loss

    Authors: Yucong Lin, Jinhua Su, Yuhang Li, Yuhao Wei, Hanchao Yan, Saining Zhang, Jiaan Luo, Danni Ai, Hong Song, Jingfan Fan, Tianyu Fu, Deqiang Xiao, Feifei Wang, Jue Hou, Jian Yang

    Abstract: Deep learning methods have contributed substantially to the rapid advancement of medical image segmentation, the quality of which relies on the suitable design of loss functions. Popular loss functions, including the cross-entropy and dice losses, often fall short of boundary detection, thereby limiting high-resolution downstream applications such as automated diagnoses and procedures. We develope…

    Submitted 4 November, 2022; originally announced November 2022.

  17. arXiv:2210.06293  [pdf, other]

    eess.SP cs.AI cs.CV cs.LG

    Two-stream Network for ECG Signal Classification

    Authors: Xinyao Hou, Shengmei Qin, Jianbo Su

    Abstract: Electrocardiogram (ECG), a technique for medical monitoring of cardiac activity, is an important method for identifying cardiovascular disease. However, analyzing the increasing quantity of ECG data consumes a lot of medical resources. This paper explores an effective algorithm for automatic classifications of multi-classes of heartbeat types based on ECG. Most neural network based methods target…

    Submitted 5 October, 2022; originally announced October 2022.

  18. arXiv:2209.06353  [pdf, other]

    eess.IV cs.CV

    Label Refinement Network from Synthetic Error Augmentation for Medical Image Segmentation

    Authors: Shuai Chen, Antonio Garcia-Uceda, Jiahang Su, Gijs van Tulder, Lennard Wolff, Theo van Walsum, Marleen de Bruijne

    Abstract: Deep convolutional neural networks for image segmentation do not learn the label structure explicitly and may produce segmentations with an incorrect structure, e.g., with disconnected cylindrical structures in the segmentation of tree-like structures such as airways or blood vessels. In this paper, we propose a novel label refinement method to correct such errors from an initial segmentation, imp…

    Submitted 9 October, 2022; v1 submitted 13 September, 2022; originally announced September 2022.

  19. arXiv:2208.04360  [pdf, other]

    cs.LG eess.SP

    SDWPF: A Dataset for Spatial Dynamic Wind Power Forecasting Challenge at KDD Cup 2022

    Authors: Jingbo Zhou, Xinjiang Lu, Yixiong Xiao, Jiantao Su, Junfu Lyu, Yanjun Ma, Dejing Dou

    Abstract: The variability of wind power supply can present substantial challenges to incorporating wind power into a grid system. Thus, Wind Power Forecasting (WPF) has been widely recognized as one of the most critical issues in wind power integration and operation. There has been an explosion of studies on wind power forecasting problems in the past decades. Nevertheless, how to well handle the WPF proble…

    Submitted 8 August, 2022; originally announced August 2022.

  20. arXiv:2206.06701  [pdf, other]

    eess.IV cs.CV cs.LG

    CNN-based Classification Framework for Lung Tissues with Auxiliary Information

    Authors: Huafeng Hu, Ruijie Ye, Jeyarajan Thiyagalingam, Frans Coenen, Jionglong Su

    Abstract: Interstitial lung diseases are a large group of heterogeneous diseases characterized by different degrees of alveolitis and pulmonary fibrosis. Accurately diagnosing these diseases has significant guiding value for formulating treatment plans. Although previous work has produced impressive results in classifying interstitial lung diseases, there is still room for improving the accuracy of these te…

    Submitted 18 June, 2022; v1 submitted 14 June, 2022; originally announced June 2022.

  21. arXiv:2203.03635  [pdf, ps, other]

    eess.IV cs.CV

    Stepwise Feature Fusion: Local Guides Global

    Authors: Jinfeng Wang, Qiming Huang, Feilong Tang, Jia Meng, Jionglong Su, Sifan Song

    Abstract: Colonoscopy, currently the most efficient and recognized colon polyp detection technology, is necessary for early screening and prevention of colorectal cancer. However, due to the varying size and complex morphological features of colonic polyps as well as the indistinct boundary between polyps and mucosa, accurate segmentation of polyps is still challenging. Deep learning has become popular for…

    Submitted 27 June, 2022; v1 submitted 7 March, 2022; originally announced March 2022.

    Comments: 10 pages, 5 figures

  22. arXiv:2201.12605  [pdf]

    cs.RO eess.SY

    Design of Outdoor Autonomous Moble Robot

    Authors: I-Hsi Kao, Jian-An Su, Jau-Woei Perng

    Abstract: This study presents the design of a six-wheeled outdoor autonomous mobile robot. The main design goal of our robot is to increase its adaptability and flexibility when moving outdoors. This six-wheeled robot platform was equipped with some sensors, such as a global positioning system (GPS), high definition (HD) webcam, light detection and ranging (LiDAR), and rotary encoders. A personal mobile com…

    Submitted 29 January, 2022; originally announced January 2022.

    Comments: International Conference on Recent Innovations in Biotechnology, System Engineering, Applied Sciences, Space Environment & Aviation Technology

  23. arXiv:2201.09208  [pdf]

    cs.CV eess.SP

    Design of Sensor Fusion Driver Assistance System for Active Pedestrian Safety

    Authors: I-Hsi Kao, Ya-Zhu Yian, Jian-An Su, Yi-Horng Lai, Jau-Woei Perng, Tung-Li Hsieh, Yi-Shueh Tsai, Min-Shiu Hsieh

    Abstract: In this paper, we present a parallel architecture for a sensor fusion detection system that combines a camera and 1D light detection and ranging (lidar) sensor for object detection. The system contains two object detection methods, one based on an optical flow, and the other using lidar. The two sensors can effectively complement the defects of the other. The accurate longitudinal accuracy of the…

    Submitted 23 January, 2022; originally announced January 2022.

    Comments: The 14th International Conference on Automation Technology (Automation 2017), December 8-10, 2017, Kaohsiung, Taiwan

  24. arXiv:2110.09860  [pdf, other]

    eess.IV cs.CV

    Bilateral-ViT for Robust Fovea Localization

    Authors: Sifan Song, Kang Dang, Qinji Yu, Zilong Wang, Frans Coenen, Jionglong Su, Xiaowei Ding

    Abstract: The fovea is an important anatomical landmark of the retina. Detecting the location of the fovea is essential for the analysis of many retinal diseases. However, robust fovea localization remains a challenging problem, as the fovea region often appears fuzzy, and retina diseases may further obscure its appearance. This paper proposes a novel Vision Transformer (ViT) approach that integrates inform…

    Submitted 3 March, 2022; v1 submitted 19 October, 2021; originally announced October 2021.

    Comments: This work has been accepted for oral presentation by ISBI2022

  25. arXiv:2108.01553  [pdf, other]

    eess.AS cs.SD

    Amortized Neural Networks for Low-Latency Speech Recognition

    Authors: Jonathan Macoskey, Grant P. Strimel, Jinru Su, Ariya Rastrow

    Abstract: We introduce Amortized Neural Networks (AmNets), a compute cost- and latency-aware network architecture particularly well-suited for sequence modeling tasks. We apply AmNets to the Recurrent Neural Network Transducer (RNN-T) to reduce compute cost and latency for an automatic speech recognition (ASR) task. The AmNets RNN-T architecture enables the network to dynamically switch between encoder bran…

    Submitted 3 August, 2021; originally announced August 2021.

    Comments: Accepted at Interspeech 2021

  26. arXiv:2105.13302  [pdf, other]

    math.ST cs.IT cs.LG eess.SP stat.ML

    Characterizing the SLOPE Trade-off: A Variational Perspective and the Donoho-Tanner Limit

    Authors: Zhiqi Bu, Jason Klusowski, Cynthia Rush, Weijie J. Su

    Abstract: Sorted l1 regularization has been incorporated into many methods for solving high-dimensional statistical estimation problems, including the SLOPE estimator in linear regression. In this paper, we study how this relatively new regularization technique improves variable selection by characterizing the optimal SLOPE trade-off between the false discovery proportion (FDP) and true positive proportion…

    Submitted 5 June, 2022; v1 submitted 27 May, 2021; originally announced May 2021.

    Journal ref: Annals of Statistics 2022

  27. arXiv:2102.12173  [pdf]

    eess.IV

    Deep learning-based framework for cardiac function assessment in embryonic zebrafish from heart beating videos

    Authors: Amir Mohammad Naderi, Haisong Bu, Jingcheng Su, Mao-Hsiang Huang, Khuong Vo, Ramses Seferino Trigo Torres, J. -C. Chiao, Juhyun Lee, Michael P. H. Lau, Xiaolei Xu, Hung Cao

    Abstract: Zebrafish is a powerful and widely-used model system for a host of biological investigations including cardiovascular studies and genetic screening. Zebrafish are readily assessable during developmental stages; however, the current methods for quantification and monitoring of cardiac functions mostly involve tedious manual work and inconsistent estimations. In this paper, we developed and validate…

    Submitted 24 February, 2021; originally announced February 2021.

  28. arXiv:2101.10444  [pdf, ps, other]

    cs.CV eess.IV

    GnetSeg: Semantic Segmentation Model Optimized on a 224mW CNN Accelerator Chip at the Speed of 318FPS

    Authors: Baohua Sun, Weixiong Lin, Hao Sha, Jiapeng Su

    Abstract: Semantic segmentation is the task to cluster pixels on an image belonging to the same class. It is widely used in the real-world applications including autonomous driving, medical imaging analysis, industrial inspection, smartphone camera for person segmentation and so on. Accelerating the semantic segmentation models on the mobile and edge devices are practical needs for the industry. Recent year…

    Submitted 9 January, 2021; originally announced January 2021.

    Comments: 7 pages, 3 figures, and 2 tables

  29. arXiv:2101.04928  [pdf, ps, other]

    math.OC eess.SY

    Distributed Multi-Building Coordination for Demand Response

    Authors: Junyan Su, Yuning Jiang, Altug Bitlislioglu, Colin N. Jones, Boris Houska

    Abstract: This paper presents a distributed optimization algorithm tailored for solving optimal control problems arising in multi-building coordination. The buildings coordinated by a grid operator, join a demand response program to balance the voltage surge by using an energy cost defined criterion. In order to model the hierarchical structure of the building network, we formulate a distributed convex opti…

    Submitted 13 January, 2021; originally announced January 2021.

  30. Multi-Attention-Network for Semantic Segmentation of Fine Resolution Remote Sensing Images

    Authors: Rui Li, Shunyi Zheng, Chenxi Duan, Ce Zhang, Jianlin Su, P. M. Atkinson

    Abstract: Semantic segmentation of remote sensing images plays an important role in a wide range of applications including land resource management, biosphere monitoring and urban planning. Although the accuracy of semantic segmentation in remote sensing images has been increased significantly by deep convolutional neural networks, several limitations exist in standard models. First, for encoder-decoder arc…

    Submitted 23 November, 2020; v1 submitted 3 September, 2020; originally announced September 2020.

    Comments: arXiv admin note: substantial text overlap with arXiv:2007.14902

  31. arXiv:2006.15228  [pdf, other]

    eess.IV

    HypervolGAN: An efficient approach for GAN with multi-objective training function

    Authors: Jingwen Su, Hujun Yin

    Abstract: Since the advent of generative adversarial networks (GANs), various loss functions have been developed and combined to constitute the overall training objective function, in order to improve model performance or for specific learning tasks. For instance, in image enhancement or restoration, there are often several criteria to consider such as signal-noise ratio, smoothness, structures and details.…

    Submitted 26 June, 2020; originally announced June 2020.

  32. arXiv:2006.05694  [pdf, other]

    eess.AS cs.LG cs.SD

    HiFi-GAN: High-Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks

    Authors: Jiaqi Su, Zeyu Jin, Adam Finkelstein

    Abstract: Real-world audio recordings are often degraded by factors such as noise, reverberation, and equalization distortion. This paper introduces HiFi-GAN, a deep learning method to transform recorded speech to sound as though it had been recorded in a studio. We use an end-to-end feed-forward WaveNet architecture, trained with multi-scale adversarial discriminators in both the time domain and the time-f…

    Submitted 21 September, 2020; v1 submitted 10 June, 2020; originally announced June 2020.

    Comments: Accepted by INTERSPEECH 2020

  33. arXiv:2006.05637  [pdf, ps, other]

    eess.SP math.OC

    Distributed Optimization for Massive Connectivity

    Authors: Yuning Jiang, Junyan Su, Yuanming Shi, Boris Houska

    Abstract: Massive device connectivity in Internet of Thing (IoT) networks with sporadic traffic poses significant communication challenges. To overcome this challenge, the serving base station is required to detect the active devices and estimate the corresponding channel state information during each coherence block. The corresponding joint activity detection and channel estimation problem can be formulate…

    Submitted 9 June, 2020; originally announced June 2020.

  34. arXiv:2004.09754  [pdf, other]

    cs.CV cs.LG eess.IV

    The 1st Agriculture-Vision Challenge: Methods and Results

    Authors: Mang Tik Chiu, Xingqian Xu, Kai Wang, Jennifer Hobbs, Naira Hovakimyan, Thomas S. Huang, Honghui Shi, Yunchao Wei, Zilong Huang, Alexander Schwing, Robert Brunner, Ivan Dozier, Wyatt Dozier, Karen Ghandilyan, David Wilson, Hyunseong Park, Junhee Kim, Sungho Kim, Qinghui Liu, Michael C. Kampffmeyer, Robert Jenssen, Arnt B. Salberg, Alexandre Barbosa, Rodrigo Trevisan, Bingchen Zhao, et al. (17 additional authors not shown)

    Abstract: The first Agriculture-Vision Challenge aims to encourage research in developing novel and effective algorithms for agricultural pattern recognition from aerial images, especially for the semantic segmentation task associated with our challenge dataset. Around 57 participating teams from various countries compete to achieve state-of-the-art in aerial agriculture semantic segmentation. The Agricultu…

    Submitted 23 April, 2020; v1 submitted 21 April, 2020; originally announced April 2020.

    Comments: CVPR 2020 Workshop

  35. arXiv:2002.09334  [pdf]

    physics.med-ph cs.LG eess.IV

    Deep Learning System to Screen Coronavirus Disease 2019 Pneumonia

    Authors: Xiaowei Xu, Xiangao Jiang, Chunlian Ma, Peng Du, Xukun Li, Shuangzhi Lv, Liang Yu, Yanfei Chen, Junwei Su, Guanjing Lang, Yongtao Li, Hong Zhao, Kaijin Xu, Lingxiang Ruan, Wei Wu

    Abstract: We found that the real time reverse transcription-polymerase chain reaction (RT-PCR) detection of viral RNA from sputum or nasopharyngeal swab has a relatively low positive rate in the early stage to determine COVID-19 (named by the World Health Organization). The manifestations of computed tomography (CT) imaging of COVID-19 had their own characteristics, which are different from other types of v…

    Submitted 21 February, 2020; originally announced February 2020.

    Journal ref: Engineering, Volume 6, Issue 10, October 2020, Pages 1122-1129

  36. arXiv:2001.08383  [pdf, other]

    eess.IV cs.CV cs.LG

    A Multi-site Study of a Breast Density Deep Learning Model for Full-field Digital Mammography Images and Synthetic Mammography Images

    Authors: Thomas P. Matthews, Sadanand Singh, Brent Mombourquette, Jason Su, Meet P. Shah, Stefano Pedemonte, Aaron Long, David Maffit, Jenny Gurney, Rodrigo Morales Hoil, Nikita Ghare, Douglas Smith, Stephen M. Moore, Susan C. Marks, Richard L. Wahl

    Abstract: Purpose: To develop a Breast Imaging Reporting and Data System (BI-RADS) breast density deep learning (DL) model in a multi-site setting for synthetic two-dimensional mammography (SM) images derived from digital breast tomosynthesis exams using full-field digital mammography (FFDM) images and limited SM data. Materials and Methods: A DL model was trained to predict BI-RADS breast density using F…

    Submitted 2 October, 2020; v1 submitted 23 January, 2020; originally announced January 2020.

    MSC Class: 68T45 ACM Class: I.5.4; J.3; I.2.10; I.4.8

  37. arXiv:2001.08382  [pdf, other]

    cs.CV cs.LG eess.IV

    A Hypersensitive Breast Cancer Detector

    Authors: Stefano Pedemonte, Brent Mombourquette, Alexis Goh, Trevor Tsue, Aaron Long, Sadanand Singh, Thomas Paul Matthews, Meet Shah, Jason Su

    Abstract: Early detection of breast cancer through screening mammography yields a 20-35% increase in survival rate; however, there are not enough radiologists to serve the growing population of women seeking screening mammography. Although commercial computer aided detection (CADe) software has been available to radiologists for decades, it has failed to improve the interpretation of full-field digital mamm…

    Submitted 23 January, 2020; originally announced January 2020.

    Comments: SPIE Medical Imaging 2020

  38. arXiv:2001.08381  [pdf, other]

    cs.CV cs.LG eess.IV

    Adaptation of a deep learning malignancy model from full-field digital mammography to digital breast tomosynthesis

    Authors: Sadanand Singh, Thomas Paul Matthews, Meet Shah, Brent Mombourquette, Trevor Tsue, Aaron Long, Ranya Almohsen, Stefano Pedemonte, Jason Su

    Abstract: Mammography-based screening has helped reduce the breast cancer mortality rate, but has also been associated with potential harms due to low specificity, leading to unnecessary exams or procedures, and low sensitivity. Digital breast tomosynthesis (DBT) improves on conventional mammography by increasing both sensitivity and specificity and is becoming common in clinical settings. However, deep lea…

    Submitted 23 January, 2020; originally announced January 2020.

    Comments: SPIE Medical Imaging 2020

  39. arXiv:1911.09837  [pdf, other]

    cs.LG cs.CV eess.SP

    Graph Convolution Networks for Probabilistic Modeling of Driving Acceleration

    Authors: Jianyu Su, Peter A. Beling, Rui Guo, Kyungtae Han

    Abstract: The ability to model and predict ego-vehicle's surrounding traffic is crucial for autonomous pilots and intelligent driver-assistance systems. Acceleration prediction is important as one of the major components of traffic prediction. This paper proposes novel approaches to the acceleration prediction problem. By representing spatial relationships between vehicles with a graph model, we build a gen…

    Submitted 7 May, 2020; v1 submitted 21 November, 2019; originally announced November 2019.

    Comments: Accepted by ITSC 2020

  40. arXiv:1910.04099  [pdf, other]

    cs.CV cs.LG eess.IV

    Manhattan Room Layout Reconstruction from a Single 360 image: A Comparative Study of State-of-the-art Methods

    Authors: Chuhang Zou, Jheng-Wei Su, Chi-Han Peng, Alex Colburn, Qi Shan, Peter Wonka, Hung-Kuo Chu, Derek Hoiem

    Abstract: Recent approaches for predicting layouts from 360 panoramas produce excellent results. These approaches build on a common framework consisting of three steps: a pre-processing step based on edge-based alignment, prediction of layout elements, and a post-processing step by fitting a 3D layout to the layout elements. Until now, it has been difficult to compare the methods due to multiple different d…

    Submitted 25 December, 2020; v1 submitted 9 October, 2019; originally announced October 2019.

    Comments: Accepted by International Journal of Computer Vision (IJCV), 2021

  41. arXiv:1909.04142  [pdf, other]

    eess.IV cs.CV cs.LG stat.ML

    DaTscan SPECT Image Classification for Parkinson's Disease

    Authors: Justin Quan, Lin Xu, Rene Xu, Tyrael Tong, Jean Su

    Abstract: Parkinson's Disease (PD) is a neurodegenerative disease that currently does not have a cure. In order to facilitate disease management and reduce the speed of symptom progression, early diagnosis is essential. The current clinical diagnostic approach is to have radiologists perform human visual analysis of the degeneration of dopaminergic neurons in the substantia nigra region of the brain. Clini…

    Submitted 9 September, 2019; originally announced September 2019.

  42. arXiv:1907.07502  [pdf, other]

    stat.ML cs.LG eess.SP math.ST

    Algorithmic Analysis and Statistical Estimation of SLOPE via Approximate Message Passing

    Authors: Zhiqi Bu, Jason Klusowski, Cynthia Rush, Weijie Su

    Abstract: SLOPE is a relatively new convex optimization procedure for high-dimensional linear regression via the sorted l1 penalty: the larger the rank of the fitted coefficient, the larger the penalty. This non-separable penalty renders many existing techniques invalid or inconclusive in analyzing the SLOPE solution. In this paper, we develop an asymptotically exact characterization of the SLOPE solution u…

    Submitted 17 July, 2019; originally announced July 2019.

  43. arXiv:1905.11045  [pdf, other]

    eess.IV cs.CV cs.LG

    Attention Based Image Compression Post-Processing Convolutional Neural Network

    Authors: Yuyang Xue, Jiannan Su

    Abstract: Traditional image compressors, e.g., BPG and H.266, have achieved great image and video compression quality. Recently, convolutional neural networks have been widely used in image compression. We propose an attention-based convolutional neural network for low bit-rate compression to post-process the output of a traditional image compression decoder. Across the experimental results on validation s…

    Submitted 27 May, 2019; originally announced May 2019.

    Comments: 4 pages, 2 figures, CVPR Compression Workshop

  44. arXiv:1901.05049  [pdf, other]

    cs.LG cs.DC cs.SD eess.AS stat.ML

    Bonseyes AI Pipeline -- bringing AI to you. End-to-end integration of data, algorithms and deployment tools

    Authors: Miguel de Prado, Jing Su, Rabia Saeed, Lorenzo Keller, Noelia Vallez, Andrew Anderson, David Gregg, Luca Benini, Tim Llewellynn, Nabil Ouerhani, Rozenn Dahyot, Nuria Pazos

    Abstract: The next generation of embedded Information and Communication Technology (ICT) systems are collaborative systems able to perform autonomous tasks. The remarkable expansion of the embedded ICT market, together with the rise and breakthroughs of Artificial Intelligence (AI), has put the focus on the Edge as it stands as one of the keys for the next technological revolution: the seamless integration of…

    Submitted 11 June, 2020; v1 submitted 15 January, 2019; originally announced January 2019.

  45. arXiv:1812.10199  [pdf, other]

    cs.SD cs.CR eess.AS

    A Multiversion Programming Inspired Approach to Detecting Audio Adversarial Examples

    Authors: Qiang Zeng, Jianhai Su, Chenglong Fu, Golam Kayas, Lannan Luo

    Abstract: Adversarial examples (AEs) are crafted by adding human-imperceptible perturbations to inputs such that a machine-learning-based classifier incorrectly labels them. They have become a severe threat to the trustworthiness of machine learning. While AEs in the image domain have been well studied, audio AEs are less investigated. Recently, multiple techniques have been proposed to generate audio AEs, which…

    Submitted 3 December, 2019; v1 submitted 25 December, 2018; originally announced December 2018.

    Comments: 8 pages, 4 figures, AICS 2019, The AAAI-19 Workshop on Artificial Intelligence for Cyber Security (AICS), 2019

    Report number: AICS/2019/06

    Journal ref: The AAAI-19 Workshop on Artificial Intelligence for Cyber Security (AICS), 2019