Zum Hauptinhalt springen

Showing 1–10 of 10 results for author: Kwan, K

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.12849  [pdf, ps, other

    cs.IR cs.CL

    Large language models are good medical coders, if provided with tools

    Authors: Keith Kwan

    Abstract: This study presents a novel two-stage Retrieve-Rank system for automated ICD-10-CM medical coding, comparing its performance against a Vanilla Large Language Model (LLM) approach. Evaluating both systems on a dataset of 100 single-term medical conditions, the Retrieve-Rank system achieved 100% accuracy in predicting correct ICD-10-CM codes, significantly outperforming the Vanilla LLM (GPT-3.5-turb… ▽ More

    Submitted 6 July, 2024; originally announced July 2024.

    Comments: 7 pages, 1 figure, 2 tables

  2. arXiv:2312.09066  [pdf, other

    cs.CV cs.AI

    CMOSE: Comprehensive Multi-Modality Online Student Engagement Dataset with High-Quality Labels

    Authors: Chi-hsuan Wu, Shih-yang Liu, Xijie Huang, Xingbo Wang, Rong Zhang, Luca Minciullo, Wong Kai Yiu, Kenny Kwan, Kwang-Ting Cheng

    Abstract: Online learning is a rapidly growing industry. However, a major doubt about online learning is whether students are as engaged as they are in face-to-face classes. An engagement recognition system can notify the instructors about the students condition and improve the learning experience. Current challenges in engagement detection involve poor label quality, extreme data imbalance, and intra-class… ▽ More

    Submitted 3 June, 2024; v1 submitted 14 December, 2023; originally announced December 2023.

    Comments: 11 pages

  3. arXiv:2310.04237  [pdf

    cs.CL

    Written and spoken corpus of real and fake social media postings about COVID-19

    Authors: Ng Bee Chin, Ng Zhi Ee Nicole, Kyla Kwan, Lee Yong Han Dylann, Liu Fang, Xu Hong

    Abstract: This study investigates the linguistic traits of fake news and real news. There are two parts to this study: text data and speech data. The text data for this study consisted of 6420 COVID-19 related tweets re-filtered from Patwa et al. (2021). After cleaning, the dataset contained 3049 tweets, with 2161 labeled as 'real' and 888 as 'fake'. The speech data for this study was collected from TikTok,… ▽ More

    Submitted 6 October, 2023; originally announced October 2023.

    Comments: 9 pages, 3 tables

  4. Reducing Ambiguities in Line-based Density Plots by Image-space Colorization

    Authors: Yumeng Xue, Patrick Paetzold, Rebecca Kehlbeck, Bin Chen, Kin Chung Kwan, Yunhai Wang, Oliver Deussen

    Abstract: Line-based density plots are used to reduce visual clutter in line charts with a multitude of individual lines. However, these traditional density plots are often perceived ambiguously, which obstructs the user's identification of underlying trends in complex datasets. Thus, we propose a novel image space coloring method for line-based density plots that enhances their interpretability. Our method… ▽ More

    Submitted 22 November, 2023; v1 submitted 16 July, 2023; originally announced July 2023.

    Comments: Published in IEEE Transactions on Visualization and Computer Graphics (Supplementary Material: https://osf.io/jm5yz/)

  5. arXiv:2212.07751  [pdf, other

    cs.CV

    Combating Uncertainty and Class Imbalance in Facial Expression Recognition

    Authors: Jiaxiang Fan, Jian Zhou, Xiaoyu Deng, Huabin Wang, Liang Tao, Hon Keung Kwan

    Abstract: Recognition of facial expression is a challenge when it comes to computer vision. The primary reasons are class imbalance due to data collection and uncertainty due to inherent noise such as fuzzy facial expressions and inconsistent labels. However, current research has focused either on the problem of class imbalance or on the problem of uncertainty, ignoring the intersection of how to address th… ▽ More

    Submitted 15 December, 2022; originally announced December 2022.

  6. arXiv:2212.07163  [pdf, other

    cs.SD eess.AS

    Multi-Scale Feature Fusion Transformer Network for End-to-End Single Channel Speech Separation

    Authors: Yinhao Xu, Jian Zhou, Liang Tao, Hon Keung Kwan

    Abstract: Recently studies on time-domain audio separation networks (TasNets) have made a great stride in speech separation. One of the most representative TasNets is a network with a dual-path segmentation approach. However, the original model called DPRNN used a fixed feature dimension and unchanged segment size throughout all layers of the network. In this paper, we propose a multi-scale feature fusion t… ▽ More

    Submitted 14 December, 2022; originally announced December 2022.

  7. arXiv:2111.01430  [pdf, other

    cs.SD eess.AS

    CycleGAN with Dual Adversarial Loss for Bone-Conducted Speech Enhancement

    Authors: Qing Pan, Teng Gao, Jian Zhou, Huabin Wang, Liang Tao, Hon Keung Kwan

    Abstract: Compared with air-conducted speech, bone-conducted speech has the unique advantage of shielding background noise. Enhancement of bone-conducted speech helps to improve its quality and intelligibility. In this paper, a novel CycleGAN with dual adversarial loss (CycleGAN-DAL) is proposed for bone-conducted speech enhancement. The proposed method uses an adversarial loss and a cycle-consistent loss s… ▽ More

    Submitted 2 November, 2021; originally announced November 2021.

  8. arXiv:2111.01342  [pdf, other

    cs.SD cs.HC eess.AS

    Attention-Guided Generative Adversarial Network for Whisper to Normal Speech Conversion

    Authors: Teng Gao, Jian Zhou, Huabin Wang, Liang Tao, Hon Keung Kwan

    Abstract: Whispered speech is a special way of pronunciation without using vocal cord vibration. A whispered speech does not contain a fundamental frequency, and its energy is about 20dB lower than that of a normal speech. Converting a whispered speech into a normal speech can improve speech quality and intelligibility. In this paper, a novel attention-guided generative adversarial network model incorporati… ▽ More

    Submitted 1 November, 2021; originally announced November 2021.

  9. arXiv:2108.07115  [pdf, other

    cs.GR cs.HC

    Autocomplete Repetitive Stroking with Image Guidance

    Authors: Yilan Chen, Kin Chung Kwan, Li-Yi Wei, Hongbo Fu

    Abstract: Image-guided drawing can compensate for the lack of skills but often requires a significant number of repetitive strokes to create textures. Existing automatic stroke synthesis methods are usually limited to predefined styles or require indirect manipulation that may break the spontaneous flow of drawing. We present a method to autocomplete repetitive short strokes during users' normal drawing pro… ▽ More

    Submitted 16 August, 2021; originally announced August 2021.

    ACM Class: I.3.8

  10. arXiv:2002.01075  [pdf, other

    cs.CV eess.IV

    Multistage Model for Robust Face Alignment Using Deep Neural Networks

    Authors: Huabin Wang, Rui Cheng, Jian Zhou, Liang Tao, Hon Keung Kwan

    Abstract: An ability to generalize unconstrained conditions such as severe occlusions and large pose variations remains a challenging goal to achieve in face alignment. In this paper, a multistage model based on deep neural networks is proposed which takes advantage of spatial transformer networks, hourglass networks and exemplar-based shape constraints. First, a spatial transformer - generative adversarial… ▽ More

    Submitted 3 February, 2020; originally announced February 2020.