Showing 1–11 of 11 results for author: Tsai, F

Searching in archive cs.
  1. arXiv:2407.16564  [pdf, other]

    cs.SD cs.AI eess.AS

    Audio Prompt Adapter: Unleashing Music Editing Abilities for Text-to-Music with Lightweight Finetuning

    Authors: Fang-Duo Tsai, Shih-Lun Wu, Haven Kim, Bo-Yu Chen, Hao-Chung Cheng, Yi-Hsuan Yang

    Abstract: Text-to-music models allow users to generate nearly realistic musical audio with textual commands. However, editing music audios remains challenging due to the conflicting desiderata of performing fine-grained alterations on the audio while maintaining a simple user interface. To address this challenge, we propose Audio Prompt Adapter (or AP-Adapter), a lightweight addition to pretrained text-to-m…

    Submitted 24 July, 2024; v1 submitted 23 July, 2024; originally announced July 2024.

    Comments: Accepted by the 25th International Society for Music Information Retrieval (ISMIR)

  2. arXiv:2407.09059  [pdf, other]

    cs.CV

    Domain-adaptive Video Deblurring via Test-time Blurring

    Authors: Jin-Ting He, Fu-Jen Tsai, Jia-Hao Wu, Yan-Tsung Peng, Chung-Chi Tsai, Chia-Wen Lin, Yen-Yu Lin

    Abstract: Dynamic scene video deblurring aims to remove undesirable blurry artifacts captured during the exposure process. Although previous video deblurring methods have achieved impressive results, they suffer from significant performance drops due to the domain gap between training and testing videos, especially for those captured in real-world scenarios. To address this issue, we propose a domain adapta…

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: ECCV 2024

  3. arXiv:2404.09269  [pdf, other]

    cs.CV

    PANet: A Physics-guided Parametric Augmentation Net for Image Dehazing by Hazing

    Authors: Chih-Ling Chang, Fu-Jen Tsai, Zi-Ling Huang, Lin Gu, Chia-Wen Lin

    Abstract: Image dehazing faces challenges when dealing with hazy images in real-world scenarios. A huge domain gap between synthetic and real-world haze images degrades dehazing performance in practical settings. However, collecting real-world image datasets for training dehazing models is challenging since both hazy and clean pairs must be captured under the same conditions. In this paper, we propose a Phy…

    Submitted 14 April, 2024; originally announced April 2024.

  4. arXiv:2312.14502  [pdf, other]

    cs.CV

    ViStripformer: A Token-Efficient Transformer for Versatile Video Restoration

    Authors: Fu-Jen Tsai, Yan-Tsung Peng, Chen-Yu Chang, Chan-Yu Li, Yen-Yu Lin, Chung-Chi Tsai, Chia-Wen Lin

    Abstract: Video restoration is a low-level vision task that seeks to restore clean, sharp videos from quality-degraded frames. One would use the temporal information from adjacent frames to make video restoration successful. Recently, the success of the Transformer has raised awareness in the computer-vision community. However, its self-attention mechanism requires much memory, which is unsuitable for high-…

    Submitted 22 December, 2023; originally announced December 2023.

  5. arXiv:2312.10998  [pdf, other]

    cs.CV

    ID-Blau: Image Deblurring by Implicit Diffusion-based reBLurring AUgmentation

    Authors: Jia-Hao Wu, Fu-Jen Tsai, Yan-Tsung Peng, Chung-Chi Tsai, Chia-Wen Lin, Yen-Yu Lin

    Abstract: Image deblurring aims to remove undesired blurs from an image captured in a dynamic scene. Much research has been dedicated to improving deblurring performance through model architectural designs. However, there is little work on data augmentation for image deblurring. Since continuous motion causes blurred artifacts during image exposure, we aspire to develop a groundbreaking blur augmentation me…

    Submitted 20 May, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

    Comments: CVPR 2024

  6. arXiv:2304.02868  [pdf, other]

    cs.CL cs.AI cs.LG

    Can Large Language Models Play Text Games Well? Current State-of-the-Art and Open Questions

    Authors: Chen Feng Tsai, Xiaochen Zhou, Sierra S. Liu, Jing Li, Mo Yu, Hongyuan Mei

    Abstract: Large language models (LLMs) such as ChatGPT and GPT-4 have recently demonstrated their remarkable abilities of communicating with human users. In this technical report, we take an initiative to investigate their capacities of playing text games, in which a player has to understand the environment and respond to situations by having dialogues with the game world. Our experiments show that ChatGPT…

    Submitted 6 April, 2023; originally announced April 2023.

  7. arXiv:2210.08036  [pdf, other]

    cs.CV

    Meta Transferring for Deblurring

    Authors: Po-Sheng Liu, Fu-Jen Tsai, Yan-Tsung Peng, Chung-Chi Tsai, Chia-Wen Lin, Yen-Yu Lin

    Abstract: Most previous deblurring methods were built with a generic model trained on blurred images and their sharp counterparts. However, these approaches might have sub-optimal deblurring results due to the domain gap between the training and test sets. This paper proposes a reblur-deblur meta-transferring scheme to realize test-time adaptation without using ground truth for dynamic scene deblurring. Sin…

    Submitted 14 October, 2022; originally announced October 2022.

    Comments: Accepted at BMVC 2022

  8. arXiv:2204.04627  [pdf, other]

    cs.CV

    Stripformer: Strip Transformer for Fast Image Deblurring

    Authors: Fu-Jen Tsai, Yan-Tsung Peng, Yen-Yu Lin, Chung-Chi Tsai, Chia-Wen Lin

    Abstract: Images taken in dynamic scenes may contain unwanted motion blur, which significantly degrades visual quality. Such blur causes short- and long-range region-specific smoothing artifacts that are often directional and non-uniform, which is difficult to be removed. Inspired by the current success of transformers on computer vision and image processing tasks, we develop, Stripformer, a transformer-bas…

    Submitted 22 July, 2022; v1 submitted 10 April, 2022; originally announced April 2022.

    Comments: ECCV 2022 Oral Presentation

  9. BANet: Blur-aware Attention Networks for Dynamic Scene Deblurring

    Authors: Fu-Jen Tsai, Yan-Tsung Peng, Yen-Yu Lin, Chung-Chi Tsai, Chia-Wen Lin

    Abstract: Image motion blur results from a combination of object motions and camera shakes, and such blurring effect is generally directional and non-uniform. Previous research attempted to solve non-uniform blurs using self-recurrent multiscale, multi-patch, or multi-temporal architectures with self-attention to obtain decent results. However, using self-recurrent frameworks typically lead to a longer infe…

    Submitted 25 October, 2022; v1 submitted 19 January, 2021; originally announced January 2021.

    Comments: TIP 2022, Code: https://github.com/pp00704831/BANet

  10. Trust-Region Method with Deep Reinforcement Learning in Analog Design Space Exploration

    Authors: Kai-En Yang, Chia-Yu Tsai, Hung-Hao Shen, Chen-Feng Chiang, Feng-Ming Tsai, Chung-An Wang, Yiju Ting, Chia-Shun Yeh, Chin-Tang Lai

    Abstract: This paper introduces new perspectives on analog design space search. To minimize the time-to-market, this endeavor better cast as constraint satisfaction problem than global optimization defined in prior arts. We incorporate model-based agents, contrasted with model-free learning, to implement a trust-region strategy. As such, simple feed-forward networks can be trained with supervised learning,…

    Submitted 2 December, 2021; v1 submitted 29 September, 2020; originally announced September 2020.

    Comments: 6 pages, 3 figures, 5 tables

  11. arXiv:1111.0867  [pdf, ps, other]

    math.CO cs.DS

    The black-and-white coloring problem on distance hereditary graphs and strongly chordal graphs

    Authors: Ton Kloks, Sheung-Hung Poon, Feng-Ren Tsai, Yue-Li Wang

    Abstract: Given a graph G and integers b and w. The black-and-white coloring problem asks if there exist disjoint sets of vertices B and W with |B|=b and |W|=w such that no vertex in B is adjacent to any vertex in W. In this paper we show that the problem is polynomial when restricted to cographs, distance-hereditary graphs, interval graphs and strongly chordal graphs. We show that the problem is NP-complet…

    Submitted 3 November, 2011; originally announced November 2011.
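
The black-and-white coloring problem defined in the abstract of entry 11 can be illustrated with a small brute-force check. This is only a sketch of the problem statement, not the polynomial-time algorithms the paper refers to. The function name `has_black_white_coloring`, the adjacency-dictionary input format, and the example path graph are assumptions made for illustration.

```python
from itertools import combinations


def has_black_white_coloring(adj, b, w):
    """Exhaustive check (exponential time, illustration only).

    adj maps each vertex to the set of its neighbours (undirected graph).
    Returns True if disjoint vertex sets B and W exist with |B| = b and
    |W| = w such that no vertex in B is adjacent to any vertex in W.
    """
    vertices = list(adj)
    for black in combinations(vertices, b):
        # A vertex cannot be white if it is black or adjacent to a black vertex.
        blocked = set(black)
        for v in black:
            blocked |= adj[v]
        free = [v for v in vertices if v not in blocked]
        # Any w of the remaining vertices form a valid white set.
        if len(free) >= w:
            return True
    return False


# Hypothetical example: the path a - b - c - d.
path = {
    "a": {"b"},
    "b": {"a", "c"},
    "c": {"b", "d"},
    "d": {"c"},
}

print(has_black_white_coloring(path, 1, 2))  # B = {a}, W = {c, d} works
print(has_black_white_coloring(path, 2, 2))  # no valid choice on this path
```

Edges inside B or inside W are irrelevant to the definition, so it suffices to count the vertices left over once a candidate black set and its neighbourhood are excluded.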