Zum Hauptinhalt springen

Showing 1–9 of 9 results for author: Tam, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.02442  [pdf, other

    cs.CL

    Let Me Speak Freely? A Study on the Impact of Format Restrictions on Performance of Large Language Models

    Authors: Zhi Rui Tam, Cheng-Kuang Wu, Yi-Lin Tsai, Chieh-Yen Lin, Hung-yi Lee, Yun-Nung Chen

    Abstract: Structured generation, the process of producing content in standardized formats like JSON and XML, is widely utilized in real-world applications to extract key output information from large language models (LLMs). This study investigates whether such constraints on generation space impact LLMs' abilities, including reasoning and domain knowledge comprehension. Specifically, we evaluate LLMs' perfo… ▽ More

    Submitted 5 August, 2024; originally announced August 2024.

    Comments: 18 pages

  2. arXiv:2408.02426  [pdf, other

    cs.CV

    FPT+: A Parameter and Memory Efficient Transfer Learning Method for High-resolution Medical Image Classification

    Authors: Yijin Huang, Pujin Cheng, Roger Tam, Xiaoying Tang

    Abstract: The success of large-scale pre-trained models has established fine-tuning as a standard method for achieving significant improvements in downstream tasks. However, fine-tuning the entire parameter set of a pre-trained model is costly. Parameter-efficient transfer learning (PETL) has recently emerged as a cost-effective alternative for adapting pre-trained models to downstream tasks. Despite its ad… ▽ More

    Submitted 5 August, 2024; originally announced August 2024.

  3. arXiv:2407.14767  [pdf, other

    cs.CL cs.AI

    I Need Help! Evaluating LLM's Ability to Ask for Users' Support: A Case Study on Text-to-SQL Generation

    Authors: Cheng-Kuang Wu, Zhi Rui Tam, Chao-Chung Wu, Chieh-Yen Lin, Hung-yi Lee, Yun-Nung Chen

    Abstract: In this study, we explore the proactive ability of LLMs to seek user support, using text-to-SQL generation as a case study. We propose metrics to evaluate the trade-off between performance improvements and user burden, and investigate whether LLMs can determine when to request help and examine their performance with varying levels of information availability. Our experiments reveal that without ex… ▽ More

    Submitted 20 July, 2024; originally announced July 2024.

    Comments: 9 pages, 9 figures

  4. arXiv:2406.08747  [pdf, other

    cs.CL

    StreamBench: Towards Benchmarking Continuous Improvement of Language Agents

    Authors: Cheng-Kuang Wu, Zhi Rui Tam, Chieh-Yen Lin, Yun-Nung Chen, Hung-yi Lee

    Abstract: Recent works have shown that large language model (LLM) agents are able to improve themselves from experience, which is an important ability for continuous enhancement post-deployment. However, existing benchmarks primarily evaluate their innate capabilities and do not assess their ability to improve over time. To address this gap, we introduce StreamBench, a pioneering benchmark designed to evalu… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

  5. arXiv:2403.07576  [pdf, other

    cs.CV

    Fine-grained Prompt Tuning: A Parameter and Memory Efficient Transfer Learning Method for High-resolution Medical Image Classification

    Authors: Yijin Huang, Pujin Cheng, Roger Tam, Xiaoying Tang

    Abstract: Parameter-efficient transfer learning (PETL) is proposed as a cost-effective way to transfer pre-trained models to downstream tasks, avoiding the high cost of updating entire large-scale pre-trained models (LPMs). In this work, we present Fine-grained Prompt Tuning (FPT), a novel PETL method for medical image classification. FPT significantly reduces memory consumption compared to other PETL metho… ▽ More

    Submitted 2 July, 2024; v1 submitted 12 March, 2024; originally announced March 2024.

    Comments: MICCAI 2024

  6. arXiv:2304.06258  [pdf, other

    cs.CV cs.LG eess.IV

    MProtoNet: A Case-Based Interpretable Model for Brain Tumor Classification with 3D Multi-parametric Magnetic Resonance Imaging

    Authors: Yuanyuan Wei, Roger Tam, Xiaoying Tang

    Abstract: Recent applications of deep convolutional neural networks in medical imaging raise concerns about their interpretability. While most explainable deep learning applications use post hoc methods (such as GradCAM) to generate feature attribution maps, there is a new type of case-based reasoning models, namely ProtoPNet and its variants, which identify prototypes during training and compare input imag… ▽ More

    Submitted 14 April, 2023; v1 submitted 13 April, 2023; originally announced April 2023.

    Comments: 15 pages, 5 figures, 1 table; accepted for oral presentation at MIDL 2023 (https://openreview.net/forum?id=6Wbj3QCo4U4 ); camera-ready version

  7. arXiv:2210.10969  [pdf, other

    cs.CV

    SSiT: Saliency-guided Self-supervised Image Transformer for Diabetic Retinopathy Grading

    Authors: Yijin Huang, Junyan Lyu, Pujin Cheng, Roger Tam, Xiaoying Tang

    Abstract: Self-supervised Learning (SSL) has been widely applied to learn image representations through exploiting unlabeled images. However, it has not been fully explored in the medical image analysis field. In this work, Saliency-guided Self-Supervised image Transformer (SSiT) is proposed for Diabetic Retinopathy (DR) grading from fundus images. We novelly introduce saliency maps into SSL, with a goal of… ▽ More

    Submitted 12 March, 2024; v1 submitted 19 October, 2022; originally announced October 2022.

  8. arXiv:2110.14160  [pdf, other

    eess.IV cs.CV

    Identifying the key components in ResNet-50 for diabetic retinopathy grading from fundus images: a systematic investigation

    Authors: Yijin Huang, Li Lin, Pujin Cheng, Junyan Lyu, Roger Tam, Xiaoying Tang

    Abstract: Although deep learning based diabetic retinopathy (DR) classification methods typically benefit from well-designed architectures of convolutional neural networks, the training setting also has a non-negligible impact on the prediction performance. The training setting includes various interdependent components, such as objective function, data sampling strategy and data augmentation approach. To i… ▽ More

    Submitted 17 October, 2022; v1 submitted 27 October, 2021; originally announced October 2021.

  9. arXiv:1910.10035  [pdf, other

    eess.IV cs.CV

    Scanner Invariant Multiple Sclerosis Lesion Segmentation from MRI

    Authors: Shahab Aslani, Vittorio Murino, Michael Dayan, Roger Tam, Diego Sona, Ghassan Hamarneh

    Abstract: This paper presents a simple and effective generalization method for magnetic resonance imaging (MRI) segmentation when data is collected from multiple MRI scanning sites and as a consequence is affected by (site-)domain shifts. We propose to integrate a traditional encoder-decoder network with a regularization network. This added network includes an auxiliary loss term which is responsible for th… ▽ More

    Submitted 22 October, 2019; originally announced October 2019.