Showing 1–50 of 94 results for author: Tsai, C

Searching in archive cs.
  1. arXiv:2408.11679  [pdf, other]

    cs.CV

    Exploring Robustness of Visual State Space model against Backdoor Attacks

    Authors: Cheng-Yi Lee, Cheng-Chang Tsai, Chia-Mu Yu, Chun-Shien Lu

    Abstract: Visual State Space Model (VSS) has demonstrated remarkable performance in various computer vision tasks. However, in the process of development, backdoor attacks have brought severe challenges to security. Such attacks cause an infected model to predict target labels when a specific trigger is activated, while the model behaves normally on benign samples. In this paper, we conduct systematic exper…

    Submitted 22 August, 2024; v1 submitted 21 August, 2024; originally announced August 2024.

    Comments: 11 pages, 9 figures, minor revision, under review

  2. arXiv:2407.12229  [pdf, other]

    eess.AS cs.AI eess.SP

    Laugh Now Cry Later: Controlling Time-Varying Emotional States of Flow-Matching-Based Zero-Shot Text-to-Speech

    Authors: Haibin Wu, Xiaofei Wang, Sefik Emre Eskimez, Manthan Thakker, Daniel Tompkins, Chung-Hsien Tsai, Canrun Li, Zhen Xiao, Sheng Zhao, Jinyu Li, Naoyuki Kanda

    Abstract: People change their tones of voice, often accompanied by nonverbal vocalizations (NVs) such as laughter and cries, to convey rich emotions. However, most text-to-speech (TTS) systems lack the capability to generate speech with rich emotions, including NVs. This paper introduces EmoCtrl-TTS, an emotion-controllable zero-shot TTS that can generate highly emotional speech with NVs for any speaker. Em…

    Submitted 16 July, 2024; originally announced July 2024.

    Comments: See https://aka.ms/emoctrl-tts for demo samples

  3. arXiv:2407.09059  [pdf, other]

    cs.CV

    Domain-adaptive Video Deblurring via Test-time Blurring

    Authors: Jin-Ting He, Fu-Jen Tsai, Jia-Hao Wu, Yan-Tsung Peng, Chung-Chi Tsai, Chia-Wen Lin, Yen-Yu Lin

    Abstract: Dynamic scene video deblurring aims to remove undesirable blurry artifacts captured during the exposure process. Although previous video deblurring methods have achieved impressive results, they suffer from significant performance drops due to the domain gap between training and testing videos, especially for those captured in real-world scenarios. To address this issue, we propose a domain adapta…

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: ECCV 2024

  4. arXiv:2407.00556  [pdf, other]

    cs.MM

    Revisiting Vision-Language Features Adaptation and Inconsistency for Social Media Popularity Prediction

    Authors: Chih-Chung Hsu, Chia-Ming Lee, Yu-Fan Lin, Yi-Shiuan Chou, Chih-Yu Jian, Chi-Han Tsai

    Abstract: Social media popularity (SMP) prediction is a complex task involving multi-modal data integration. While pre-trained vision-language models (VLMs) like CLIP have been widely adopted for this task, their effectiveness in capturing the unique characteristics of social media content remains unexplored. This paper critically examines the applicability of CLIP-based features in SMP prediction, focusing…

    Submitted 29 June, 2024; originally announced July 2024.

    Comments: Submission of the 7th Social Media Prediction Challenge

  5. arXiv:2406.18009  [pdf, other]

    eess.AS cs.SD

    E2 TTS: Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS

    Authors: Sefik Emre Eskimez, Xiaofei Wang, Manthan Thakker, Canrun Li, Chung-Hsien Tsai, Zhen Xiao, Hemin Yang, Zirun Zhu, Min Tang, Xu Tan, Yanqing Liu, Sheng Zhao, Naoyuki Kanda

    Abstract: This paper introduces Embarrassingly Easy Text-to-Speech (E2 TTS), a fully non-autoregressive zero-shot text-to-speech system that offers human-level naturalness and state-of-the-art speaker similarity and intelligibility. In the E2 TTS framework, the text input is converted into a character sequence with filler tokens. The flow-matching-based mel spectrogram generator is then trained based on the…

    Submitted 25 June, 2024; originally announced June 2024.

  6. arXiv:2406.07429  [pdf, other]

    cs.CR

    Making 'syscall' a Privilege not a Right

    Authors: Fangfei Yang, Anjo Vahldiek-Oberwagner, Chia-Che Tsai, Kelly Kaoudis, Nathan Dautenhahn

    Abstract: Browsers, Library OSes, and system emulators rely on sandboxes and in-process isolation to emulate system resources and securely isolate untrusted components. All access to system resources like system calls (syscall) needs to be securely mediated by the application. Otherwise system calls may allow untrusted components to evade the emulator or sandbox monitor, and hence, escape and attack the enti…

    Submitted 11 June, 2024; originally announced June 2024.

  7. arXiv:2405.01636  [pdf, other]

    cs.CV

    Explainable AI (XAI) in Image Segmentation in Medicine, Industry, and Beyond: A Survey

    Authors: Rokas Gipiškis, Chun-Wei Tsai, Olga Kurasova

    Abstract: Explainable Artificial Intelligence (XAI) has found numerous applications in computer vision. While image classification-based explainability techniques have garnered significant attention, their counterparts in semantic segmentation have been relatively neglected. Given the prevalent use of image segmentation, ranging from medical to industrial deployments, these techniques warrant a systematic look. In this…

    Submitted 2 May, 2024; originally announced May 2024.

    Comments: 35 pages, 9 figures, 2 tables

  8. arXiv:2404.13134  [pdf, other]

    cs.MM cs.CV cs.LG

    Deep Learning-based Text-in-Image Watermarking

    Authors: Bishwa Karki, Chun-Hua Tsai, Pei-Chi Huang, Xin Zhong

    Abstract: In this work, we introduce a novel deep learning-based approach to text-in-image watermarking, a method that embeds and extracts textual information within images to enhance data security and integrity. Leveraging the capabilities of deep learning, specifically through the use of Transformer-based architectures for text processing and Vision Transformers for image feature extraction, our method se…

    Submitted 19 April, 2024; originally announced April 2024.

  9. arXiv:2404.01643  [pdf, other]

    eess.IV cs.CV cs.LG

    A Closer Look at Spatial-Slice Features Learning for COVID-19 Detection

    Authors: Chih-Chung Hsu, Chia-Ming Lee, Yang Fan Chiang, Yi-Shiuan Chou, Chih-Yu Jiang, Shen-Chieh Tai, Chi-Han Tsai

    Abstract: Conventional Computed Tomography (CT) imaging recognition faces two significant challenges: (1) There is often considerable variability in the resolution and size of each CT scan, necessitating strict requirements for the input size and adaptability of models. (2) CT scans contain a large number of out-of-distribution (OOD) slices. The crucial features may only be present in specific spatial regions…

    Submitted 20 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: Camera-ready version, accepted by the DEF-AI-MIA workshop, in conjunction with CVPR 2024

  10. arXiv:2403.18270  [pdf, other]

    cs.CV eess.IV

    Image Deraining via Self-supervised Reinforcement Learning

    Authors: He-Hao Liao, Yan-Tsung Peng, Wen-Tao Chu, Ping-Chun Hsieh, Chung-Chi Tsai

    Abstract: The quality of images captured outdoors is often affected by the weather. One factor that interferes with sight is rain, which can obstruct the view of observers and computer vision applications that rely on those images. The work aims to recover rain images by removing rain streaks via Self-supervised Reinforcement Learning (RL) for image deraining (SRL-Derain). We locate rain streak pixels from…

    Submitted 27 March, 2024; originally announced March 2024.

  11. arXiv:2403.11230  [pdf, other]

    eess.IV cs.CV cs.LG

    Simple 2D Convolutional Neural Network-based Approach for COVID-19 Detection

    Authors: Chih-Chung Hsu, Chia-Ming Lee, Yang Fan Chiang, Yi-Shiuan Chou, Chih-Yu Jiang, Shen-Chieh Tai, Chi-Han Tsai

    Abstract: This study explores the use of deep learning techniques for analyzing lung Computed Tomography (CT) images. Classic deep learning approaches face challenges with varying slice counts and resolutions in CT images, a diversity arising from the utilization of assorted scanning equipment. Typically, predictions are made on single slices which are then combined for a comprehensive outcome. Yet, this me…

    Submitted 17 March, 2024; originally announced March 2024.

  12. arXiv:2402.10894  [pdf, other]

    cs.CV cs.LG

    Fusion of Diffusion Weighted MRI and Clinical Data for Predicting Functional Outcome after Acute Ischemic Stroke with Deep Contrastive Learning

    Authors: Chia-Ling Tsai, Hui-Yun Su, Shen-Feng Sung, Wei-Yang Lin, Ying-Ying Su, Tzu-Hsien Yang, Man-Lin Mai

    Abstract: Stroke is a common disabling neurological condition that affects about one-quarter of the adult population over age 25; more than half of patients still have poor outcomes, such as permanent functional dependence or even death, after the onset of acute stroke. The aim of this study is to investigate the efficacy of diffusion-weighted MRI modalities combined with structured health profile on predi…

    Submitted 16 February, 2024; originally announced February 2024.

    Comments: 12 pages, 5 figures, 5 tables

  13. arXiv:2402.07383  [pdf, other]

    eess.AS cs.CL cs.LG cs.SD

    Making Flow-Matching-Based Zero-Shot Text-to-Speech Laugh as You Like

    Authors: Naoyuki Kanda, Xiaofei Wang, Sefik Emre Eskimez, Manthan Thakker, Hemin Yang, Zirun Zhu, Min Tang, Canrun Li, Chung-Hsien Tsai, Zhen Xiao, Yufei Xia, Jinzhu Li, Yanqing Liu, Sheng Zhao, Michael Zeng

    Abstract: Laughter is one of the most expressive and natural aspects of human speech, conveying emotions, social cues, and humor. However, most text-to-speech (TTS) systems lack the ability to produce realistic and appropriate laughter sounds, limiting their applications and user experience. While there have been prior works to generate natural laughter, they fell short in terms of controlling the timing an…

    Submitted 4 March, 2024; v1 submitted 11 February, 2024; originally announced February 2024.

    Comments: See https://aka.ms/elate/ for demo samples, v2: subjective evaluation has been added

  14. arXiv:2402.02731  [pdf, other]

    cs.IT math.OC

    Computing Augustin Information via Hybrid Geodesically Convex Optimization

    Authors: Guan-Ren Wang, Chung-En Tsai, Hao-Chung Cheng, Yen-Huan Li

    Abstract: We propose a Riemannian gradient descent with the Poincaré metric to compute the order-$\alpha$ Augustin information, a widely used quantity for characterizing exponential error behaviors in information theory. We prove that the algorithm converges to the optimum at a rate of $\mathcal{O}(1 / T)$. As far as we know, this is the first algorithm with a non-asymptotic optimization error guarantee for all…

    Submitted 9 May, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

    Comments: 17 pages, 2 figures, ISIT 2024

  15. arXiv:2401.08021  [pdf, other]

    cs.HC

    All the Way There and Back: Inertial-Based, Phone-in-Pocket Indoor Wayfinding and Backtracking Apps for Blind Travelers

    Authors: Chia Hsuan Tsai, Fatemeh Elyasi, Peng Ren, Roberto Manduchi

    Abstract: We introduce two iOS apps that have been designed to support wayfinding and backtracking for blind travelers navigating in indoor building environments. Wayfinding involves determining and following a route through the building's corridors to reach a destination, and assumes that the app has access to the floor plan of the building. Backtracking one's route, on the other hand, requires no map know…

    Submitted 15 January, 2024; originally announced January 2024.

    Comments: Chia Hsuan Tsai, Fatemeh Elyasi and Peng Ren contributed equally to this research

  16. arXiv:2312.14502  [pdf, other]

    cs.CV

    ViStripformer: A Token-Efficient Transformer for Versatile Video Restoration

    Authors: Fu-Jen Tsai, Yan-Tsung Peng, Chen-Yu Chang, Chan-Yu Li, Yen-Yu Lin, Chung-Chi Tsai, Chia-Wen Lin

    Abstract: Video restoration is a low-level vision task that seeks to restore clean, sharp videos from quality-degraded frames. One would use the temporal information from adjacent frames to make video restoration successful. Recently, the success of the Transformer has raised awareness in the computer-vision community. However, its self-attention mechanism requires much memory, which is unsuitable for high-…

    Submitted 22 December, 2023; originally announced December 2023.

  17. arXiv:2312.10998  [pdf, other]

    cs.CV

    ID-Blau: Image Deblurring by Implicit Diffusion-based reBLurring AUgmentation

    Authors: Jia-Hao Wu, Fu-Jen Tsai, Yan-Tsung Peng, Chung-Chi Tsai, Chia-Wen Lin, Yen-Yu Lin

    Abstract: Image deblurring aims to remove undesired blurs from an image captured in a dynamic scene. Much research has been dedicated to improving deblurring performance through model architectural designs. However, there is little work on data augmentation for image deblurring. Since continuous motion causes blurred artifacts during image exposure, we aspire to develop a groundbreaking blur augmentation me…

    Submitted 20 May, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

    Comments: CVPR 2024

  18. arXiv:2312.09429  [pdf]

    eess.SP cs.LG

    Deep Learning-Enabled Swallowing Monitoring and Postoperative Recovery Biosensing System

    Authors: Chih-Ning Tsai, Pei-Wen Yang, Tzu-Yen Huang, Jung-Chih Chen, Hsin-Yi Tseng, Che-Wei Wu, Amrit Sarmah, Tzu-En Lin

    Abstract: This study introduces an innovative 3D printed dry electrode tailored for biosensing in postoperative recovery scenarios. Fabricated through a drop coating process, the electrode incorporates a novel 2D material.

    Submitted 24 November, 2023; originally announced December 2023.

    Comments: the abstract can't be uploaded fully

    MSC Class: NA ACM Class: A.0

  19. arXiv:2311.17717  [pdf, other]

    cs.CV cs.LG

    Receler: Reliable Concept Erasing of Text-to-Image Diffusion Models via Lightweight Erasers

    Authors: Chi-Pin Huang, Kai-Po Chang, Chung-Ting Tsai, Yung-Hsuan Lai, Fu-En Yang, Yu-Chiang Frank Wang

    Abstract: Concept erasure in text-to-image diffusion models aims to disable pre-trained diffusion models from generating images related to a target concept. To perform reliable concept erasure, the properties of robustness and locality are desirable. The former prevents the model from producing images associated with the target concept for any paraphrased or learned prompts, while the latter preserves its a…

    Submitted 18 July, 2024; v1 submitted 29 November, 2023; originally announced November 2023.

    Comments: ECCV 2024. Project page: https://jasper0314-huang.github.io/receler-concept-erasing/

  20. arXiv:2311.04427  [pdf, other]

    cs.HC

    Clonemator: Composing Spatiotemporal Clones to Create Interactive Automators in Virtual Reality

    Authors: Yi-Shuo Lin, Ching-Yi Tsai, Lung-Pan Cheng

    Abstract: Clonemator is a virtual reality (VR) system allowing users to create their avatar clones and configure them spatially and temporally, forming automators to accomplish complex tasks. In particular, clones can (1) freeze at a user's body pose as static objects, (2) synchronously mimic the user's movement, and (3) replay a sequence of the user's actions in a period of time later. Combined with tradit…

    Submitted 7 November, 2023; originally announced November 2023.

    Comments: 11 pages, 14 figures

    ACM Class: H.5.2; D.1.7

  21. arXiv:2311.02557  [pdf, other]

    math.OC cs.LG quant-ph

    Fast Minimization of Expected Logarithmic Loss via Stochastic Dual Averaging

    Authors: Chung-En Tsai, Hao-Chung Cheng, Yen-Huan Li

    Abstract: Consider the problem of minimizing an expected logarithmic loss over either the probability simplex or the set of quantum density matrices. This problem includes tasks such as solving the Poisson inverse problem, computing the maximum-likelihood estimate for quantum state tomography, and approximating positive semi-definite matrix permanents with the currently tightest approximation ratio. Althoug…

    Submitted 11 March, 2024; v1 submitted 4 November, 2023; originally announced November 2023.

    Comments: 26 pages, AISTATS 2024

  22. arXiv:2310.18526  [pdf, other]

    cs.LG cs.AI

    Sample based Explanations via Generalized Representers

    Authors: Che-Ping Tsai, Chih-Kuan Yeh, Pradeep Ravikumar

    Abstract: We propose a general class of sample based explanations of machine learning models, which we term generalized representers. To measure the effect of a training sample on a model's test prediction, generalized representers use two components: a global sample importance that quantifies the importance of the training point to the model and is invariant to test samples, and a local sample importance t…

    Submitted 27 October, 2023; originally announced October 2023.

    Comments: Accepted by NeurIPS 2023

  23. arXiv:2309.04651  [pdf]

    eess.IV cs.AI cs.CV

    Video and Synthetic MRI Pre-training of 3D Vision Architectures for Neuroimage Analysis

    Authors: Nikhil J. Dhinagar, Amit Singh, Saket Ozarkar, Ketaki Buwa, Sophia I. Thomopoulos, Conor Owens-Walton, Emily Laltoo, Yao-Liang Chen, Philip Cook, Corey McMillan, Chih-Chien Tsai, J-J Wang, Yih-Ru Wu, Paul M. Thompson

    Abstract: Transfer learning represents a recent paradigm shift in the way we build artificial intelligence (AI) systems. In contrast to training task-specific models, transfer learning involves pre-training deep learning models on a large corpus of data and minimally fine-tuning them for adaptation to specific tasks. Even so, for 3D medical imaging tasks, we do not know if it is best to pre-train models on…

    Submitted 8 September, 2023; originally announced September 2023.

  24. arXiv:2307.12301  [pdf, other]

    cs.CV cs.IR cs.LG

    Image Outlier Detection Without Training using RANSAC

    Authors: Chen-Han Tsai, Yu-Shao Peng

    Abstract: Image outlier detection (OD) is an essential tool to ensure the quality of images used in computer vision tasks. Existing algorithms often involve training a model to represent the inlier distribution, and outliers are determined by some deviation measure. Although existing methods proved effective when trained on strictly inlier samples, their performance remains questionable when undesired outli…

    Submitted 4 April, 2024; v1 submitted 23 July, 2023; originally announced July 2023.

  25. arXiv:2306.14649  [pdf, other]

    cs.NE

    CIMulator: A Comprehensive Simulation Platform for Computing-In-Memory Circuit Macros with Low Bit-Width and Real Memory Materials

    Authors: Hoang-Hiep Le, Md. Aftab Baig, Wei-Chen Hong, Cheng-Hsien Tsai, Cheng-Jui Yeh, Fu-Xiang Liang, I-Ting Huang, Wei-Tzu Tsai, Ting-Yin Cheng, Sourav De, Nan-Yow Chen, Wen-Jay Lee, Ing-Chao Lin, Da-Wei Chang, Darsen D. Lu

    Abstract: This paper presents a simulation platform, namely CIMulator, for quantifying the efficacy of various synaptic devices in neuromorphic accelerators for different neural network architectures. Nonvolatile memory devices, such as resistive random-access memory, ferroelectric field-effect transistor, and volatile static random-access memory devices, can be selected as synaptic devices. A multilayer pe…

    Submitted 26 June, 2023; originally announced June 2023.

  26. arXiv:2306.00043  [pdf, other]

    cs.AI cs.NE

    Space Net Optimization

    Authors: Chun-Wei Tsai, Yi-Cheng Yang, Tzu-Chieh Tang, Che-Wei Hsu

    Abstract: Most metaheuristic algorithms rely on a few searched solutions to guide later searches during the convergence process for a simple reason: the limited computing resource of a computer makes it impossible to retain all the searched solutions. This also reveals that each search of most metaheuristic algorithms is just like a ballpark guess. To help address this issue, we present a novel metaheuristi…

    Submitted 31 May, 2023; originally announced June 2023.

    Comments: 12 pages, 6 figures

  27. arXiv:2305.20002  [pdf, other]

    cs.LG

    Representer Point Selection for Explaining Regularized High-dimensional Models

    Authors: Che-Ping Tsai, Jiong Zhang, Eli Chien, Hsiang-Fu Yu, Cho-Jui Hsieh, Pradeep Ravikumar

    Abstract: We introduce a novel class of sample-based explanations we term high-dimensional representers, that can be used to explain the predictions of a regularized high-dimensional model in terms of importance weights for each of the training samples. Our workhorse is a novel representer theorem for general regularized high-dimensional models, which decomposes the model prediction in terms of contribution…

    Submitted 30 June, 2023; v1 submitted 31 May, 2023; originally announced May 2023.

    Comments: Accepted by ICML 2023

  28. arXiv:2305.13946  [pdf, ps, other]

    cs.LG math.OC stat.ML

    Data-Dependent Bounds for Online Portfolio Selection Without Lipschitzness and Smoothness

    Authors: Chung-En Tsai, Ying-Ting Lin, Yen-Huan Li

    Abstract: This work introduces the first small-loss and gradual-variation regret bounds for online portfolio selection, marking the first instances of data-dependent bounds for online convex optimization with non-Lipschitz, non-smooth losses. The algorithms we propose exhibit sublinear regret rates in the worst cases and achieve logarithmic regrets when the data is "easy," with per-iteration time almost lin…

    Submitted 4 November, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: 37 pages, typos fixed, NeurIPS 2023

  29. Interference-Aware Deployment for Maximizing User Satisfaction in Multi-UAV Wireless Networks

    Authors: Chuan-Chi Lai, Ang-Hsun Tsai, Chia-Wei Ting, Ko-Han Lin, Jing-Chi Ling, Chia-En Tsai

    Abstract: In this letter, we study the deployment of Unmanned Aerial Vehicle mounted Base Stations (UAV-BSs) in multi-UAV cellular networks. We model the multi-UAV deployment problem as a user satisfaction maximization problem, that is, maximizing the proportion of served ground users (GUs) that meet a given minimum data rate requirement. We propose an interference-aware deployment (IAD) algorithm for servi…

    Submitted 6 April, 2023; originally announced April 2023.

    Comments: 5 pages, 3 figures, to appear in IEEE Wireless Communications Letters

  30. arXiv:2304.02868  [pdf, other]

    cs.CL cs.AI cs.LG

    Can Large Language Models Play Text Games Well? Current State-of-the-Art and Open Questions

    Authors: Chen Feng Tsai, Xiaochen Zhou, Sierra S. Liu, Jing Li, Mo Yu, Hongyuan Mei

    Abstract: Large language models (LLMs) such as ChatGPT and GPT-4 have recently demonstrated their remarkable abilities of communicating with human users. In this technical report, we take an initiative to investigate their capacities of playing text games, in which a player has to understand the environment and respond to situations by having dialogues with the game world. Our experiments show that ChatGPT…

    Submitted 6 April, 2023; originally announced April 2023.

  31. arXiv:2303.14565  [pdf, other]

    cs.NI

    Saihu: A Common Interface of Worst-Case Delay Analysis Tools for Time-Sensitive Networks

    Authors: Chun-Tso Tsai, Seyed Mohammadhossein Tabatabaee, Stéphan Plassart, Jean-Yves Le Boudec

    Abstract: Time-sensitive networks, as in the context of IEEE-TSN and IETF-Detnet, require bounds on worst-case delays. Various network analysis tools compute such bounds; these tools are based on different methods and provide delay bounds that are all valid but may differ; furthermore, it is generally not known which tool will provide the best bound. To obtain the best possible bound, users need to implemen…

    Submitted 23 August, 2024; v1 submitted 25 March, 2023; originally announced March 2023.

  32. arXiv:2303.08490  [pdf, other]

    eess.IV cs.CV

    Strong Baseline and Bag of Tricks for COVID-19 Detection of CT Scans

    Authors: Chih-Chung Hsu, Chih-Yu Jian, Chia-Ming Lee, Chi-Han Tsai, Sheng-Chieh Dai

    Abstract: This paper investigates the application of deep learning models for lung Computed Tomography (CT) image analysis. Traditional deep learning frameworks encounter compatibility issues due to variations in slice numbers and resolutions in CT images, which stem from the use of different machines. Commonly, individual slices are predicted and subsequently merged to obtain the final result; however, thi…

    Submitted 15 March, 2023; originally announced March 2023.

    Comments: technical report. Keywords: Spatial-Slice correlation, COVID-19 classification, convolutional neural networks, computed tomography

  33. arXiv:2302.13631  [pdf]

    eess.IV cs.AI cs.CV cs.LG q-bio.QM

    Curriculum Based Multi-Task Learning for Parkinson's Disease Detection

    Authors: Nikhil J. Dhinagar, Conor Owens-Walton, Emily Laltoo, Christina P. Boyle, Yao-Liang Chen, Philip Cook, Corey McMillan, Chih-Chien Tsai, J-J Wang, Yih-Ru Wu, Ysbrand van der Werf, Paul M. Thompson

    Abstract: There is great interest in developing radiological classifiers for diagnosis, staging, and predictive modeling in progressive diseases such as Parkinson's disease (PD), a neurodegenerative disease that is difficult to detect in its early stages. Here we leverage severity-based meta-data on the stages of disease to define a curriculum for training a deep convolutional neural network (CNN). Typicall…

    Submitted 27 February, 2023; originally announced February 2023.

    Comments: Accepted for publication at the 20th IEEE International Symposium on Biomedical Imaging, ISBI 2023

  34. arXiv:2211.12880  [pdf, other]

    quant-ph cs.LG math.OC stat.ML

    Faster Stochastic First-Order Method for Maximum-Likelihood Quantum State Tomography

    Authors: Chung-En Tsai, Hao-Chung Cheng, Yen-Huan Li

    Abstract: In maximum-likelihood quantum state tomography, both the sample size and dimension grow exponentially with the number of qubits. It is therefore desirable to develop a stochastic first-order method, just like stochastic gradient descent for modern machine learning, to compute the maximum-likelihood estimate. To this end, we propose an algorithm called stochastic mirror descent with the Burg entrop…

    Submitted 23 November, 2022; originally announced November 2022.

    Comments: 11 pages, 1 figure

  35. arXiv:2210.16381  [pdf, other]

    cs.HC

    Not Another Day Zero: Design Hackathons for Community-Based Water Quality Monitoring

    Authors: Srishti Gupta, Chun-Hua Tsai, John M. Carroll

    Abstract: This study looks at water quality monitoring and management as a new form of community engagement. Through a series of 'design hackathons', a unique research method, we engaged with a hyperlocal community of citizens who are actively involved in monitoring and management of their local watershed. These design hackathons sought to understand the motivation, practices, collaboration and experi…

    Submitted 28 October, 2022; originally announced October 2022.

    Comments: 21 pages, 3 figures, 3 tables

  36. arXiv:2210.08036  [pdf, other]

    cs.CV

    Meta Transferring for Deblurring

    Authors: Po-Sheng Liu, Fu-Jen Tsai, Yan-Tsung Peng, Chung-Chi Tsai, Chia-Wen Lin, Yen-Yu Lin

    Abstract: Most previous deblurring methods were built with a generic model trained on blurred images and their sharp counterparts. However, these approaches might have sub-optimal deblurring results due to the domain gap between the training and test sets. This paper proposes a reblur-deblur meta-transferring scheme to realize test-time adaptation without using ground truth for dynamic scene deblurring. Sin…

    Submitted 14 October, 2022; originally announced October 2022.

    Comments: Accepted at BMVC 2022

  37. arXiv:2210.00997  [pdf, ps, other]

    stat.ML cs.LG math.OC q-fin.PM quant-ph

    Online Self-Concordant and Relatively Smooth Minimization, With Applications to Online Portfolio Selection and Learning Quantum States

    Authors: Chung-En Tsai, Hao-Chung Cheng, Yen-Huan Li

    Abstract: Consider an online convex optimization problem where the loss functions are self-concordant barriers, smooth relative to a convex function $h$, and possibly non-Lipschitz. We analyze the regret of online mirror descent with $h$. Then, based on the result, we prove the following in a unified manner. Denote by $T$ the time horizon and $d$ the parameter dimension. 1. For online portfolio selection, t…

    Submitted 21 September, 2023; v1 submitted 3 October, 2022; originally announced October 2022.

    Comments: 34th Int. Conf. Algorithmic Learning Theory (ALT 2023). A typo in the last equation in the proof of Lemma 10 is corrected

  38. arXiv:2208.01287  [pdf, other]

    cs.CV

    Multiview Regenerative Morphing with Dual Flows

    Authors: Chih-Jung Tsai, Cheng Sun, Hwann-Tzong Chen

    Abstract: This paper aims to address a new task of image morphing under a multiview setting, which takes two sets of multiview images as the input and generates intermediate renderings that not only exhibit smooth transitions between the two input sets but also ensure visual consistency across different views at any transition state. To achieve this goal, we propose a novel approach called Multiview Regener…

    Submitted 2 August, 2022; originally announced August 2022.

  39. An Exploration of How Training Set Composition Bias in Machine Learning Affects Identifying Rare Objects

    Authors: Sean E. Lake, Chao-Wei Tsai

    Abstract: When training a machine learning classifier on data where one of the classes is intrinsically rare, the classifier will often assign too few sources to the rare class. To address this, it is common to up-weight the examples of the rare class to ensure it isn't ignored. It is also a frequent practice to train on restricted data where the balance of source types is closer to equal for the same reaso…

    Submitted 25 July, 2022; v1 submitted 7 July, 2022; originally announced July 2022.

    Comments: 27 pages, 9 figures, 4 tables, accepted by Astronomy and Computing

  40. arXiv:2207.03050  [pdf, other]

    eess.IV cs.CV

    Multi-Task Lung Nodule Detection in Chest Radiographs with a Dual Head Network

    Authors: Chen-Han Tsai, Yu-Shao Peng

    Abstract: Lung nodules can be an alarming precursor to potential lung cancer. Missed nodule detections during chest radiograph analysis remain a common challenge among thoracic radiologists. In this work, we present a multi-task lung nodule detection algorithm for chest radiograph analysis. Unlike past approaches, our algorithm predicts a global-level label indicating nodule presence along with local-level…

    Submitted 6 July, 2022; originally announced July 2022.

    Comments: 11 pages, 3 figures, Accepted to the MICCAI Conference 2022

  41. arXiv:2207.01579  [pdf, other]

    eess.IV cs.CV cs.LG

    Spatiotemporal Feature Learning Based on Two-Step LSTM and Transformer for CT Scans

    Authors: Chih-Chung Hsu, Chi-Han Tsai, Guan-Lin Chen, Sin-Di Ma, Shen-Chieh Tai

    Abstract: Computed tomography (CT) imaging could be very practical for diagnosing various diseases. However, the nature of the CT images is even more diverse since the resolution and number of the slices of a CT scan are determined by the machine and its settings. Conventional deep learning models struggle to tackle such diverse data since the essential requirement of the deep neural network is the consiste…

    Submitted 8 July, 2022; v1 submitted 4 July, 2022; originally announced July 2022.

    Comments: draft

  42. arXiv:2206.13034  [pdf, other]

    cs.LG cs.AI

    Monitoring Shortcut Learning using Mutual Information

    Authors: Mohammed Adnan, Yani Ioannou, Chuan-Yung Tsai, Angus Galloway, H. R. Tizhoosh, Graham W. Taylor

    Abstract: The failure of deep neural networks to generalize to out-of-distribution data is a well-known problem and raises concerns about the deployment of trained networks in safety-critical domains such as healthcare, finance and autonomous vehicles. We study a particular kind of distribution shift – shortcuts or spurious correlations in the training data. Shortcut learning is often only e…

    Submitted 26 June, 2022; originally announced June 2022.

    Comments: Accepted at ICML 2022 Workshop on Spurious Correlations, Invariance, and Stability

  43. arXiv:2204.04627  [pdf, other]

    cs.CV

    Stripformer: Strip Transformer for Fast Image Deblurring

    Authors: Fu-Jen Tsai, Yan-Tsung Peng, Yen-Yu Lin, Chung-Chi Tsai, Chia-Wen Lin

    Abstract: Images taken in dynamic scenes may contain unwanted motion blur, which significantly degrades visual quality. Such blur causes short- and long-range region-specific smoothing artifacts that are often directional and non-uniform, and which are difficult to remove. Inspired by the current success of transformers on computer vision and image processing tasks, we develop Stripformer, a transformer-bas…

    Submitted 22 July, 2022; v1 submitted 10 April, 2022; originally announced April 2022.

    Comments: ECCV 2022 Oral Presentation

  44. arXiv:2203.00870  [pdf, other]

    cs.LG cs.GT

    Faith-Shap: The Faithful Shapley Interaction Index

    Authors: Che-Ping Tsai, Chih-Kuan Yeh, Pradeep Ravikumar

    Abstract: Shapley values, which were originally designed to assign attributions to individual players in coalition games, have become a commonly used approach in explainable machine learning to provide attributions to input features for black-box machine learning models. A key attraction of Shapley values is that they uniquely satisfy a very natural set of axiomatic properties. However, extending the Shaple…

    Submitted 22 March, 2023; v1 submitted 1 March, 2022; originally announced March 2022.

  45. arXiv:2202.03430  [pdf, other]

    eess.IV cs.CV

    A Topology-Attention ConvLSTM Network and Its Application to EM Images

    Authors: Jiaqi Yang, Xiaoling Hu, Chao Chen, Chialing Tsai

    Abstract: Structural accuracy of segmentation is important for fine-scale structures in biomedical images. We propose a novel Topology-Attention ConvLSTM Network (TACNet) for 3D image segmentation in order to achieve high structural accuracy for 3D segmentation tasks. Specifically, we propose a Spatial Topology-Attention (STA) module to process a 3D image as a stack of 2D image slices and adopt ConvLSTM to le…

    Submitted 6 February, 2022; originally announced February 2022.

    Comments: 12 pages, 6 figures, Accepted by MICCAI'21

  46. arXiv:2201.12602  [pdf, other]

    cs.SE cs.AI cs.LG

    DeepRNG: Towards Deep Reinforcement Learning-Assisted Generative Testing of Software

    Authors: Chuan-Yung Tsai, Graham W. Taylor

    Abstract: Although machine learning (ML) has been successful in automating various software engineering needs, software testing remains a highly challenging topic. In this paper, we aim to improve the generative testing of software by directly augmenting the random number generator (RNG) with a deep reinforcement learning (RL) agent using an efficient, automatically extractable state representation of…

    Submitted 29 January, 2022; originally announced January 2022.

    Comments: Workshop on ML for Systems, 35th Conference on Neural Information Processing Systems (NeurIPS 2021)

  47. arXiv:2112.15485  [pdf, ps, other]

    cs.SE

    REST API Fuzzing by Coverage Level Guided Blackbox Testing

    Authors: Chung-Hsuan Tsai, Shi-Chun Tsai, Shih-Kun Huang

    Abstract: With the growth of web applications, REST APIs have become the primary communication method between services. In order to ensure system reliability and security, software quality can be assured by effective testing methods. Black box fuzz testing is one of the effective methods to perform tests on a large scale. However, conventional black box fuzz testing generates random data without judging the…

    Submitted 31 December, 2021; originally announced December 2021.

  48. Attack of the Knights: A Non Uniform Cache Side-Channel Attack

    Authors: Farabi Mahmud, Sungkeun Kim, Harpreet Singh Chawla, Chia-Che Tsai, Eun Jung Kim, Abdullah Muzahid

    Abstract: For a distributed last-level cache (LLC) in a large multicore chip, the access time to one LLC bank can significantly differ from that to another due to the difference in physical distance. In this paper, we successfully demonstrated a new distance-based side-channel attack by timing the AES decryption operation and extracting part of an AES secret key on an Intel Knights Landing CPU. We introduce…

    Submitted 31 May, 2023; v1 submitted 18 December, 2021; originally announced December 2021.

    Journal ref: Annual Computer Security Applications Conference ACSAC 2023

  49. arXiv:2111.12925  [pdf, other]

    cs.CV cs.AI cs.LG eess.IV eess.SP

    ContourletNet: A Generalized Rain Removal Architecture Using Multi-Direction Hierarchical Representation

    Authors: Wei-Ting Chen, Cheng-Che Tsai, Hao-Yu Fang, I-Hsiang Chen, Jian-Jiun Ding, Sy-Yen Kuo

    Abstract: Images acquired from rainy scenes usually suffer from poor visibility, which may degrade the performance of computer vision applications. Rainy scenarios can be categorized into two classes: moderate rain and heavy rain scenes. Moderate rain scenes mainly consist of rain streaks, while heavy rain scenes contain both rain streaks and the veiling effect (similar to haze). Although existing methods h…

    Submitted 25 November, 2021; originally announced November 2021.

    Comments: This paper is accepted by BMVC 2021

  50. arXiv:2111.12170  [pdf, other]

    cs.LG cs.AI cs.CV

    Domain-Agnostic Clustering with Self-Distillation

    Authors: Mohammed Adnan, Yani A. Ioannou, Chuan-Yung Tsai, Graham W. Taylor

    Abstract: Recent advancements in self-supervised learning have reduced the gap between supervised and unsupervised representation learning. However, most self-supervised and deep clustering techniques rely heavily on data augmentation, rendering them ineffective for many learning tasks where insufficient domain knowledge exists for performing augmentation. We propose a new self-distillation based algorithm…

    Submitted 20 December, 2021; v1 submitted 23 November, 2021; originally announced November 2021.

    Comments: NeurIPS 2021 Workshop: Self-Supervised Learning - Theory and Practice