Skip to main content

Showing 1–50 of 315 results for author: Choi, H

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.09184  [pdf, other

    cs.CL

    Does Incomplete Syntax Influence Korean Language Model? Focusing on Word Order and Case Markers

    Authors: Jong Myoung Kim, Young-Jun Lee, Yong-jin Han, Sangkeun Jung, Ho-Jin Choi

    Abstract: Syntactic elements, such as word order and case markers, are fundamental in natural language processing. Recent studies show that syntactic information boosts language model performance and offers clues for people to understand their learning mechanisms. Unlike languages with a fixed word order such as English, Korean allows for varied word sequences, despite its canonical structure, due to case m… ▽ More

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: COLM 2024; Code and dataset is available in https://github.com/grayapple-git/SIKO

  2. arXiv:2407.06982  [pdf, ps, other

    math.PR cs.IT

    Information-theoretic classification of the cutoff phenomenon in Markov processes

    Authors: Youjia Wang, Michael C. H. Choi

    Abstract: We investigate the cutoff phenomenon for Markov processes under information divergences such as $f$-divergences and Rényi divergences. We classify most common divergences into four types, namely $L^2$-type, $\mathrm{TV}$-type, separation-type and $\mathrm{KL}$ divergence, in which we prove that the cutoff phenomenon are equivalent and relate the cutoff time and window among members within each typ… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

    Comments: 55 pages

    MSC Class: 60J05; 60J10; 60J25; 60J27; 94A15; 94A17

  3. arXiv:2407.05734  [pdf, other

    cs.CL

    Empirical Study of Symmetrical Reasoning in Conversational Chatbots

    Authors: Daniela N. Rim, Heeyoul Choi

    Abstract: This work explores the capability of conversational chatbots powered by large language models (LLMs), to understand and characterize predicate symmetry, a cognitive linguistic function traditionally believed to be an inherent human trait. Leveraging in-context learning (ICL), a paradigm shift enabling chatbots to learn new tasks from prompts without re-training, we assess the symmetrical reasoning… ▽ More

    Submitted 8 July, 2024; originally announced July 2024.

    Comments: Accepted in Future Technology Conference (FTC) 2024

  4. arXiv:2407.05664  [pdf, other

    stat.ML cs.AI cs.LG

    How DNNs break the Curse of Dimensionality: Compositionality and Symmetry Learning

    Authors: Arthur Jacot, Seok Hoan Choi, Yuxiao Wen

    Abstract: We show that deep neural networks (DNNs) can efficiently learn any composition of functions with bounded $F_{1}$-norm, which allows DNNs to break the curse of dimensionality in ways that shallow networks cannot. More specifically, we derive a generalization bound that combines a covering number argument for compositionality, and the $F_{1}$-norm (or the related Barron norm) for large width adaptiv… ▽ More

    Submitted 8 July, 2024; originally announced July 2024.

  5. arXiv:2407.05315  [pdf, other

    eess.SP cs.LG math.AT

    Topological Persistence Guided Knowledge Distillation for Wearable Sensor Data

    Authors: Eun Som Jeon, Hongjun Choi, Ankita Shukla, Yuan Wang, Hyunglae Lee, Matthew P. Buman, Pavan Turaga

    Abstract: Deep learning methods have achieved a lot of success in various applications involving converting wearable sensor data to actionable health insights. A common application areas is activity recognition, where deep-learning methods still suffer from limitations such as sensitivity to signal quality, sensor characteristic variations, and variability between subjects. To mitigate these issues, robust… ▽ More

    Submitted 7 July, 2024; originally announced July 2024.

    Comments: Engineering Applications of Artificial Intelligence 130, 107719

    Journal ref: Engineering Applications of Artificial Intelligence, 130, 107719 (2024)

  6. arXiv:2407.03958  [pdf, other

    cs.CL cs.CV

    Stark: Social Long-Term Multi-Modal Conversation with Persona Commonsense Knowledge

    Authors: Young-Jun Lee, Dokyong Lee, Junyoung Youn, Kyeongjin Oh, Byungsoo Ko, Jonghwan Hyeon, Ho-Jin Choi

    Abstract: Humans share a wide variety of images related to their personal experiences within conversations via instant messaging tools. However, existing works focus on (1) image-sharing behavior in singular sessions, leading to limited long-term social interaction, and (2) a lack of personalized image-sharing behavior. In this work, we introduce Stark, a large-scale long-term multi-modal conversation datas… ▽ More

    Submitted 4 July, 2024; originally announced July 2024.

    Comments: Project website: https://stark-dataset.github.io

  7. arXiv:2407.00356  [pdf, other

    cs.LG cs.CV

    Enhancing Accuracy and Parameter-Efficiency of Neural Representations for Network Parameterization

    Authors: Hongjun Choi, Jayaraman J. Thiagarajan, Ruben Glatt, Shusen Liu

    Abstract: In this work, we investigate the fundamental trade-off regarding accuracy and parameter efficiency in the parameterization of neural network weights using predictor networks. We present a surprising finding that, when recovering the original model accuracy is the sole objective, it can be achieved effectively through the weight reconstruction objective alone. Additionally, we explore the underlyin… ▽ More

    Submitted 29 June, 2024; originally announced July 2024.

  8. arXiv:2406.14947  [pdf, other

    cs.RO

    LiCS: Navigation using Learned-imitation on Cluttered Space

    Authors: Joshua Julian Damanik, Jae-Won Jung, Chala Adane Deresa, Han-Lim Choi

    Abstract: In this letter, we propose a robust and fast navigation system in a narrow indoor environment for UGV (Unmanned Ground Vehicle) using 2D LiDAR and odometry. We used behavior cloning with Transformer neural network to learn the optimization-based baseline algorithm. We inject Gaussian noise during expert demonstration to increase the robustness of learned policy. We evaluate the performance of LiCS… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

    Comments: 6 pages, 4 figures. This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  9. arXiv:2406.05606  [pdf, other

    cs.CL

    GrowOVER: How Can LLMs Adapt to Growing Real-World Knowledge?

    Authors: Dayoon Ko, Jinyoung Kim, Hahyeon Choi, Gunhee Kim

    Abstract: In the real world, knowledge is constantly evolving, which can render existing knowledge-based datasets outdated. This unreliability highlights the critical need for continuous updates to ensure both accuracy and relevance in knowledge-intensive tasks. To address this, we propose GrowOVER-QA and GrowOVER-Dialogue, dynamic open-domain QA and dialogue benchmarks that undergo a continuous cycle of up… ▽ More

    Submitted 8 June, 2024; originally announced June 2024.

    Comments: ACL 2024 Main

  10. arXiv:2406.03057  [pdf, other

    cs.LG stat.ML

    BWS: Best Window Selection Based on Sample Scores for Data Pruning across Broad Ranges

    Authors: Hoyong Choi, Nohyun Ki, Hye Won Chung

    Abstract: Data subset selection aims to find a smaller yet informative subset of a large dataset that can approximate the full-dataset training, addressing challenges associated with training neural networks on large-scale datasets. However, existing methods tend to specialize in either high or low selection ratio regimes, lacking a universal approach that consistently achieves competitive performance acros… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: ICML 2024

  11. arXiv:2405.19049  [pdf, other

    quant-ph cs.NI

    Quantum Circuit Switching with One-Way Repeaters in Star Networks

    Authors: Álvaro G. Iñesta, Hyeongrak Choi, Dirk Englund, Stephanie Wehner

    Abstract: Distributing quantum states reliably among distant locations is a key challenge in the field of quantum networks. One-way quantum networks address this by using one-way communication and quantum error correction. Here, we analyze quantum circuit switching as a protocol to distribute quantum states in one-way quantum networks. In quantum circuit switching, pairs of users can request the delivery of… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: Main text: 9 pages, 5 figures. Appendices: 14 pages, 8 figures

  12. arXiv:2405.02501  [pdf, other

    cs.CL cs.AI

    PICLe: Eliciting Diverse Behaviors from Large Language Models with Persona In-Context Learning

    Authors: Hyeong Kyu Choi, Yixuan Li

    Abstract: Large Language Models (LLMs) are trained on massive text corpora, which are encoded with diverse personality traits. This triggers an interesting goal of eliciting a desired personality trait from the LLM, and probing its behavioral preferences. Accordingly, we formalize the persona elicitation task, aiming to customize LLM behaviors to align with a target persona. We present Persona In-Context Le… ▽ More

    Submitted 14 May, 2024; v1 submitted 3 May, 2024; originally announced May 2024.

    Comments: ICML 2024

  13. arXiv:2405.02347  [pdf, other

    cs.LG cs.AI cs.CL

    COPAL: Continual Pruning in Large Language Generative Models

    Authors: Srikanth Malla, Joon Hee Choi, Chiho Choi

    Abstract: Adapting pre-trained large language models to different domains in natural language processing requires two key considerations: high computational demands and model's inability to continual adaptation. To simultaneously address both issues, this paper presents COPAL (COntinual Pruning in Adaptive Language settings), an algorithm developed for pruning large language generative models under a contin… ▽ More

    Submitted 14 June, 2024; v1 submitted 2 May, 2024; originally announced May 2024.

    Comments: ICML2024

  14. arXiv:2404.16721  [pdf, other

    cs.AI cs.LG

    Distilling Privileged Information for Dubins Traveling Salesman Problems with Neighborhoods

    Authors: Min Kyu Shin, Su-Jeong Park, Seung-Keol Ryu, Heeyeon Kim, Han-Lim Choi

    Abstract: This paper presents a novel learning approach for Dubins Traveling Salesman Problems(DTSP) with Neighborhood (DTSPN) to quickly produce a tour of a non-holonomic vehicle passing through neighborhoods of given task points. The method involves two learning phases: initially, a model-free reinforcement learning approach leverages privileged information to distill knowledge from expert trajectories ge… ▽ More

    Submitted 25 April, 2024; originally announced April 2024.

    Comments: 7 pages, 4 figures, double blind under review

  15. arXiv:2404.14687  [pdf, other

    cs.MM cs.AI cs.CL cs.CV

    Pegasus-v1 Technical Report

    Authors: Raehyuk Jung, Hyojun Go, Jaehyuk Yi, Jiho Jang, Daniel Kim, Jay Suh, Aiden Lee, Cooper Han, Jae Lee, Jeff Kim, Jin-Young Kim, Junwan Kim, Kyle Park, Lucas Lee, Mars Ha, Minjoon Seo, Abraham Jo, Ed Park, Hassan Kianinejad, SJ Kim, Tony Moon, Wade Jeong, Andrei Popescu, Esther Kim, EK Yoon , et al. (19 additional authors not shown)

    Abstract: This technical report introduces Pegasus-1, a multimodal language model specialized in video content understanding and interaction through natural language. Pegasus-1 is designed to address the unique challenges posed by video data, such as interpreting spatiotemporal information, to offer nuanced video content comprehension across various lengths. This technical report overviews Pegasus-1's archi… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

  16. arXiv:2404.12589  [pdf, other

    math.PR cs.IT math.OC stat.CO

    A rate-distortion framework for MCMC algorithms: geometry and factorization of multivariate Markov chains

    Authors: Michael C. H. Choi, Youjia Wang, Geoffrey Wolfer

    Abstract: We introduce a framework rooted in a rate distortion problem for Markov chains, and show how a suite of commonly used Markov Chain Monte Carlo (MCMC) algorithms are specific instances within it, where the target stationary distribution is controlled by the distortion function. Our approach offers a unified variational view on the optimality of algorithms such as Metropolis-Hastings, Glauber dynami… ▽ More

    Submitted 18 April, 2024; originally announced April 2024.

    Comments: 63 pages, 6 figures

    MSC Class: 60F10; 60J10; 60J22; 94A15; 94A17

  17. arXiv:2404.08330  [pdf, other

    cs.CV

    Emerging Property of Masked Token for Effective Pre-training

    Authors: Hyesong Choi, Hunsang Lee, Seyoung Joung, Hyejin Park, Jiyeong Kim, Dongbo Min

    Abstract: Driven by the success of Masked Language Modeling (MLM), the realm of self-supervised learning for computer vision has been invigorated by the central role of Masked Image Modeling (MIM) in driving recent breakthroughs. Notwithstanding the achievements of MIM across various downstream tasks, its overall efficiency is occasionally hampered by the lengthy duration of the pre-training phase. This pap… ▽ More

    Submitted 12 April, 2024; originally announced April 2024.

  18. arXiv:2404.08327  [pdf, other

    cs.CV

    Salience-Based Adaptive Masking: Revisiting Token Dynamics for Enhanced Pre-training

    Authors: Hyesong Choi, Hyejin Park, Kwang Moo Yi, Sungmin Cha, Dongbo Min

    Abstract: In this paper, we introduce Saliency-Based Adaptive Masking (SBAM), a novel and cost-effective approach that significantly enhances the pre-training performance of Masked Image Modeling (MIM) approaches by prioritizing token salience. Our method provides robustness against variations in masking ratios, effectively mitigating the performance instability issues common in existing methods. This relax… ▽ More

    Submitted 12 April, 2024; originally announced April 2024.

  19. arXiv:2404.06452  [pdf, other

    cs.RO eess.SY

    PAAM: A Framework for Coordinated and Priority-Driven Accelerator Management in ROS 2

    Authors: Daniel Enright, Yecheng Xiang, Hyunjong Choi, Hyoseung Kim

    Abstract: This paper proposes a Priority-driven Accelerator Access Management (PAAM) framework for multi-process robotic applications built on top of the Robot Operating System (ROS) 2 middleware platform. The framework addresses the issue of predictable execution of time- and safety-critical callback chains that require hardware accelerators such as GPUs and TPUs. PAAM provides a standalone ROS executor th… ▽ More

    Submitted 9 April, 2024; originally announced April 2024.

    Comments: 14 Pages, 14 Figures

  20. arXiv:2404.05144  [pdf, other

    cs.CL cs.CV cs.LG

    Enhancing Clinical Efficiency through LLM: Discharge Note Generation for Cardiac Patients

    Authors: HyoJe Jung, Yunha Kim, Heejung Choi, Hyeram Seo, Minkyoung Kim, JiYe Han, Gaeun Kee, Seohyun Park, Soyoung Ko, Byeolhee Kim, Suyeon Kim, Tae Joon Jun, Young-Hak Kim

    Abstract: Medical documentation, including discharge notes, is crucial for ensuring patient care quality, continuity, and effective medical communication. However, the manual creation of these documents is not only time-consuming but also prone to inconsistencies and potential errors. The automation of this documentation process using artificial intelligence (AI) represents a promising area of innovation in… ▽ More

    Submitted 7 April, 2024; originally announced April 2024.

    Comments: 10 pages, 1 figure, 3 tables, conference

  21. arXiv:2404.01954  [pdf, other

    cs.CL cs.AI

    HyperCLOVA X Technical Report

    Authors: Kang Min Yoo, Jaegeun Han, Sookyo In, Heewon Jeon, Jisu Jeong, Jaewook Kang, Hyunwook Kim, Kyung-Min Kim, Munhyong Kim, Sungju Kim, Donghyun Kwak, Hanock Kwak, Se Jung Kwon, Bado Lee, Dongsoo Lee, Gichang Lee, Jooho Lee, Baeseong Park, Seongjin Shin, Joonsang Yu, Seolki Baek, Sumin Byeon, Eungsup Cho, Dooseok Choe, Jeesung Han , et al. (371 additional authors not shown)

    Abstract: We introduce HyperCLOVA X, a family of large language models (LLMs) tailored to the Korean language and culture, along with competitive capabilities in English, math, and coding. HyperCLOVA X was trained on a balanced mix of Korean, English, and code data, followed by instruction-tuning with high-quality human-annotated datasets while abiding by strict safety guidelines reflecting our commitment t… ▽ More

    Submitted 13 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: 44 pages; updated authors list and fixed author names

  22. Advancing AI with Integrity: Ethical Challenges and Solutions in Neural Machine Translation

    Authors: Richard Kimera, Yun-Seon Kim, Heeyoul Choi

    Abstract: This paper addresses the ethical challenges of Artificial Intelligence in Neural Machine Translation (NMT) systems, emphasizing the imperative for developers to ensure fairness and cultural sensitivity. We investigate the ethical competence of AI models in NMT, examining the Ethical considerations at each stage of NMT development, including data handling, privacy, data ownership, and consent. We i… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

    Comments: 11 pages

  23. arXiv:2403.19763  [pdf, other

    cs.SD cs.HC cs.MM eess.AS

    Creating Aesthetic Sonifications on the Web with SIREN

    Authors: Tristan Peng, Hongchan Choi, Jonathan Berger

    Abstract: SIREN is a flexible, extensible, and customizable web-based general-purpose interface for auditory data display (sonification). Designed as a digital audio workstation for sonification, synthesizers written in JavaScript using the Web Audio API facilitate intuitive mapping of data to auditory parameters for a wide range of purposes. This paper explores the breadth of sound synthesis techniques s… ▽ More

    Submitted 28 March, 2024; originally announced March 2024.

    Comments: 7 pages, 1 figure, 5 listings, submitted to the Web Audio Conference 2024

  24. arXiv:2403.12498  [pdf, ps, other

    cs.IT eess.SP

    WMMSE-Based Rate Maximization for RIS-Assisted MU-MIMO Systems

    Authors: Hyuckjin Choi, A. Lee Swindlehurst, Junil Choi

    Abstract: Reconfigurable intelligent surface (RIS) technology, given its ability to favorably modify wireless communication environments, will play a pivotal role in the evolution of future communication systems. This paper proposes rate maximization techniques for both single-user and multiuser MIMO systems, based on the well-known weighted minimum mean square error (WMMSE) criterion. Using a suitable weig… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

  25. arXiv:2403.09270  [pdf, ps, other

    cs.IT eess.SP

    A Deep Reinforcement Learning Approach for Autonomous Reconfigurable Intelligent Surfaces

    Authors: Hyuckjin Choi, Ly V. Nguyen, Junil Choi, A. Lee Swindlehurst

    Abstract: A reconfigurable intelligent surface (RIS) is a prospective wireless technology that enhances wireless channel quality. An RIS is often equipped with passive array of elements and provides cost and power-efficient solutions for coverage extension of wireless communication systems. Without any radio frequency (RF) chains or computing resources, however, the RIS requires control information to be se… ▽ More

    Submitted 19 March, 2024; v1 submitted 14 March, 2024; originally announced March 2024.

  26. arXiv:2403.08149  [pdf, other

    cs.RO

    On the Feasibility of EEG-based Motor Intention Detection for Real-Time Robot Assistive Control

    Authors: Ho Jin Choi, Satyajeet Das, Shaoting Peng, Ruzena Bajcsy, Nadia Figueroa

    Abstract: This paper explores the feasibility of employing EEG-based intention detection for real-time robot assistive control. We focus on predicting and distinguishing motor intentions of left/right arm movements by presenting: i) an offline data collection and training pipeline, used to train a classifier for left/right motion intention prediction, and ii) an online real-time prediction pipeline leveragi… ▽ More

    Submitted 12 March, 2024; originally announced March 2024.

  27. arXiv:2403.06122  [pdf, other

    cs.CV

    Style Blind Domain Generalized Semantic Segmentation via Covariance Alignment and Semantic Consistence Contrastive Learning

    Authors: Woo-Jin Ahn, Geun-Yeong Yang, Hyun-Duck Choi, Myo-Taeg Lim

    Abstract: Deep learning models for semantic segmentation often experience performance degradation when deployed to unseen target domains unidentified during the training phase. This is mainly due to variations in image texture (\ie style) from different data sources. To tackle this challenge, existing domain generalized semantic segmentation (DGSS) methods attempt to remove style variations from the feature… ▽ More

    Submitted 10 March, 2024; originally announced March 2024.

    Comments: CVPR 2024

  28. arXiv:2403.05139  [pdf, other

    cs.CV

    Improving Diffusion Models for Virtual Try-on

    Authors: Yisol Choi, Sangkyung Kwak, Kyungmin Lee, Hyungwon Choi, Jinwoo Shin

    Abstract: This paper considers image-based virtual try-on, which renders an image of a person wearing a curated garment, given a pair of images depicting the person and the garment, respectively. Previous works adapt existing exemplar-based inpainting diffusion models for virtual try-on to improve the naturalness of the generated visuals compared to other methods (e.g., GAN-based), but they fail to preserve… ▽ More

    Submitted 19 March, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  29. NeuPIMs: NPU-PIM Heterogeneous Acceleration for Batched LLM Inferencing

    Authors: Guseul Heo, Sangyeop Lee, Jaehong Cho, Hyunmin Choi, Sanghyeon Lee, Hyungkyu Ham, Gwangsun Kim, Divya Mahajan, Jongse Park

    Abstract: Modern transformer-based Large Language Models (LLMs) are constructed with a series of decoder blocks. Each block comprises three key components: (1) QKV generation, (2) multi-head attention, and (3) feed-forward networks. In batched processing, QKV generation and feed-forward networks involve compute-intensive matrix-matrix multiplications (GEMM), while multi-head attention requires bandwidth-hea… ▽ More

    Submitted 29 March, 2024; v1 submitted 1 March, 2024; originally announced March 2024.

    Comments: 16 pages, 15 figures

    Journal ref: ASPLOS 2024

  30. arXiv:2402.18930  [pdf, other

    eess.IV cs.CV

    Variable-Rate Learned Image Compression with Multi-Objective Optimization and Quantization-Reconstruction Offsets

    Authors: Fatih Kamisli, Fabien Racape, Hyomin Choi

    Abstract: Achieving successful variable bitrate compression with computationally simple algorithms from a single end-to-end learned image or video compression model remains a challenge. Many approaches have been proposed, including conditional auto-encoders, channel-adaptive gains for the latent tensor or uniformly quantizing all elements of the latent tensor. This paper follows the traditional approach to… ▽ More

    Submitted 29 February, 2024; originally announced February 2024.

    Comments: Accepted as a paper at DCC 2024

  31. arXiv:2402.18362  [pdf, other

    cs.CV cs.AI

    Objective and Interpretable Breast Cosmesis Evaluation with Attention Guided Denoising Diffusion Anomaly Detection Model

    Authors: Sangjoon Park, Yong Bae Kim, Jee Suk Chang, Seo Hee Choi, Hyungjin Chung, Ik Jae Lee, Hwa Kyung Byun

    Abstract: As advancements in the field of breast cancer treatment continue to progress, the assessment of post-surgical cosmetic outcomes has gained increasing significance due to its substantial impact on patients' quality of life. However, evaluating breast cosmesis presents challenges due to the inherently subjective nature of expert labeling. In this study, we present a novel automated approach, Attenti… ▽ More

    Submitted 28 February, 2024; originally announced February 2024.

  32. arXiv:2402.09564  [pdf, other

    cs.RO

    Tactile-Informed Action Primitives Mitigate Jamming in Dense Clutter

    Authors: Dane Brouwer, Joshua Citron, Hojung Choi, Marion Lepert, Michael Lin, Jeannette Bohg, Mark Cutkosky

    Abstract: It is difficult for robots to retrieve objects in densely cluttered lateral access scenes with movable objects as jamming against adjacent objects and walls can inhibit progress. We propose the use of two action primitives -- burrowing and excavating -- that can fluidize the scene to un-jam obstacles and enable continued progress. Even when these primitives are implemented in an open loop manner a… ▽ More

    Submitted 14 February, 2024; originally announced February 2024.

    Comments: Preprint of paper accepted to IEEE ICRA 2024

  33. arXiv:2402.03060  [pdf, other

    cs.CR

    UniHENN: Designing More Versatile Homomorphic Encryption-based CNNs without im2col

    Authors: Hyunmin Choi, Jihun Kim, Seungho Kim, Seonhye Park, Jeongyong Park, Wonbin Choi, Hyoungshick Kim

    Abstract: Homomorphic encryption enables computations on encrypted data without decryption, which is crucial for privacy-preserving cloud services. However, deploying convolutional neural networks (CNNs) with homomorphic encryption encounters significant challenges, particularly in converting input data into a two-dimensional matrix for convolution, typically achieved using the im2col technique. While effic… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

  34. arXiv:2401.14421  [pdf, other

    cs.LG cs.MA eess.SY stat.ML

    Multi-Agent Based Transfer Learning for Data-Driven Air Traffic Applications

    Authors: Chuhao Deng, Hong-Cheol Choi, Hyunsang Park, Inseok Hwang

    Abstract: Research in developing data-driven models for Air Traffic Management (ATM) has gained a tremendous interest in recent years. However, data-driven models are known to have long training time and require large datasets to achieve good performance. To address the two issues, this paper proposes a Multi-Agent Bidirectional Encoder Representations from Transformers (MA-BERT) model that fully considers… ▽ More

    Submitted 23 January, 2024; originally announced January 2024.

    Comments: 12 pages, 8 figures, submitted for IEEE Transactions on Intelligent Transportation System

  35. Enhanced Labeling Technique for Reddit Text and Fine-Tuned Longformer Models for Classifying Depression Severity in English and Luganda

    Authors: Richard Kimera, Daniela N. Rim, Joseph Kirabira, Ubong Godwin Udomah, Heeyoul Choi

    Abstract: Depression is a global burden and one of the most challenging mental health conditions to control. Experts can detect its severity early using the Beck Depression Inventory (BDI) questionnaire, administer appropriate medication to patients, and impede its progression. Due to the fear of potential stigmatization, many patients turn to social media platforms like Reddit for advice and assistance at… ▽ More

    Submitted 25 January, 2024; originally announced January 2024.

    Comments: In IEEE Proceedings of the 14th International Conference on ICT Convergence (ICTC), Jeju, Korea, October 2023

  36. Enhancing Wind Speed and Wind Power Forecasting Using Shape-Wise Feature Engineering: A Novel Approach for Improved Accuracy and Robustness

    Authors: Mulomba Mukendi Christian, Yun Seon Kim, Hyebong Choi, Jaeyoung Lee, SongHee You

    Abstract: Accurate prediction of wind speed and power is vital for enhancing the efficiency of wind energy systems. Numerous solutions have been implemented to date, demonstrating their potential to improve forecasting. Among these, deep learning is perceived as a revolutionary approach in the field. However, despite their effectiveness, the noise present in the collected data remains a significant challeng… ▽ More

    Submitted 16 January, 2024; originally announced January 2024.

    Journal ref: International Journal of Advanced Culture Technology Vol.11 No.4 393-405 (2023)

  37. arXiv:2401.05007  [pdf

    cs.LG

    Temporal Analysis of World Disaster Risk:A Machine Learning Approach to Cluster Dynamics

    Authors: Christian Mulomba Mukendi, Hyebong Choi

    Abstract: he evaluation of the impact of actions undertaken is essential in management. This paper assesses the impact of efforts considered to mitigate risk and create safe environments on a global scale. We measure this impact by looking at the probability of improvement over a specific short period of time. Using the World Risk Index, we conduct a temporal analysis of global disaster risk dynamics from 2… ▽ More

    Submitted 10 January, 2024; originally announced January 2024.

    Comments: This is the conference proceeding of the ICTC 2023, to be published in IEEE conference proceedings

    Report number: 979-8-3503-1327-7/23/$31.00 \c{opyright}2023 IEEE

  38. arXiv:2401.04369  [pdf

    cs.LG

    Air Quality Forecasting Using Machine Learning: A Global perspective with Relevance to Low-Resource Settings

    Authors: Mulomba Mukendi Christian, Hyebong Choi

    Abstract: Air pollution stands as the fourth leading cause of death globally. While extensive research has been conducted in this domain, most approaches rely on large datasets when it comes to prediction. This limits their applicability in low-resource settings though more vulnerable. This study addresses this gap by proposing a novel machine learning approach for accurate air quality prediction using two… ▽ More

    Submitted 9 January, 2024; originally announced January 2024.

    Comments: 16 pages. This is a conference proceeding Presented at: SIBR 2024 (Seoul) Conference on Interdisciplinary Business and Economics Research, 5th-6th January 2024, Seoul, South Korea

  39. Enhancing Acute Kidney Injury Prediction through Integration of Drug Features in Intensive Care Units

    Authors: Gabriel D. M. Manalu, Mulomba Mukendi Christian, Songhee You, Hyebong Choi

    Abstract: The relationship between acute kidney injury (AKI) prediction and nephrotoxic drugs, or drugs that adversely affect kidney function, is one that has yet to be explored in the critical care setting. One contributing factor to this gap in research is the limited investigation of drug modalities in the intensive care unit (ICU) context, due to the challenges of processing prescription data into the c… ▽ More

    Submitted 9 January, 2024; originally announced January 2024.

    Comments: 9 pages, 2 tables

    Journal ref: International Journal of Advanced Smart Convergence Vol.12 No.4 434- 442 (2023)

  40. arXiv:2401.03846  [pdf, other

    cs.CV cs.LG

    UFO: Unidentified Foreground Object Detection in 3D Point Cloud

    Authors: Hyunjun Choi, Hawook Jeong, Jin Young Choi

    Abstract: In this paper, we raise a new issue on Unidentified Foreground Object (UFO) detection in 3D point clouds, which is a crucial technology in autonomous driving in the wild. UFO detection is challenging in that existing 3D object detectors encounter extremely hard challenges in both 3D localization and Out-of-Distribution (OOD) detection. To tackle these challenges, we suggest a new UFO detection fra… ▽ More

    Submitted 8 January, 2024; originally announced January 2024.

    Comments: Under review

  41. Blind-Touch: Homomorphic Encryption-Based Distributed Neural Network Inference for Privacy-Preserving Fingerprint Authentication

    Authors: Hyunmin Choi, Simon Woo, Hyoungshick Kim

    Abstract: Fingerprint authentication is a popular security mechanism for smartphones and laptops. However, its adoption in web and cloud environments has been limited due to privacy concerns over storing and processing biometric data on servers. This paper introduces Blind-Touch, a novel machine learning-based fingerprint authentication system leveraging homomorphic encryption to address these privacy conce… ▽ More

    Submitted 1 April, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

    Comments: The 38th Annual AAAI Conference on Artificial Intelligence (AAAI) 2024

  42. arXiv:2312.09252  [pdf, other

    cs.CV

    FineControlNet: Fine-level Text Control for Image Generation with Spatially Aligned Text Control Injection

    Authors: Hongsuk Choi, Isaac Kasahara, Selim Engin, Moritz Graule, Nikhil Chavan-Dafle, Volkan Isler

    Abstract: Recently introduced ControlNet has the ability to steer the text-driven image generation process with geometric input such as human 2D pose, or edge features. While ControlNet provides control over the geometric form of the instances in the generated image, it lacks the capability to dictate the visual appearance of each instance. We present FineControlNet to provide fine control over each instanc… ▽ More

    Submitted 14 December, 2023; originally announced December 2023.

    Comments: Hongsuk Choi and Isaac Kasahara have eqaul contributions. 19 pages, 15 figures, 3 tables

  43. arXiv:2312.05548  [pdf, other

    eess.IV cs.CV cs.LG

    A Unified Multi-Phase CT Synthesis and Classification Framework for Kidney Cancer Diagnosis with Incomplete Data

    Authors: Kwang-Hyun Uhm, Seung-Won Jung, Moon Hyung Choi, Sung-Hoo Hong, Sung-Jea Ko

    Abstract: Multi-phase CT is widely adopted for the diagnosis of kidney cancer due to the complementary information among phases. However, the complete set of multi-phase CT is often not available in practical clinical applications. In recent years, there have been some studies to generate the missing modality image from the available data. Nevertheless, the generated images are not guaranteed to be effectiv… ▽ More

    Submitted 9 December, 2023; originally announced December 2023.

    Comments: This article has been accepted for publication in IEEE Journal of Biomedical and Health Informatics

    Journal ref: JBHI, 2022

  44. arXiv:2312.05334  [pdf, other

    eess.IV cs.CV

    ProsDectNet: Bridging the Gap in Prostate Cancer Detection via Transrectal B-mode Ultrasound Imaging

    Authors: Sulaiman Vesal, Indrani Bhattacharya, Hassan Jahanandish, Xinran Li, Zachary Kornberg, Steve Ran Zhou, Elijah Richard Sommer, Moon Hyung Choi, Richard E. Fan, Geoffrey A. Sonn, Mirabela Rusu

    Abstract: Interpreting traditional B-mode ultrasound images can be challenging due to image artifacts (e.g., shadowing, speckle), leading to low sensitivity and limited diagnostic accuracy. While Magnetic Resonance Imaging (MRI) has been proposed as a solution, it is expensive and not widely available. Furthermore, most biopsies are guided by Transrectal Ultrasound (TRUS) alone and can miss up to 52% cancer… ▽ More

    Submitted 8 December, 2023; originally announced December 2023.

    Comments: Accepted in NeurIPS 2023 (Medical Imaging meets NeurIPS Workshop)

  45. arXiv:2312.04863  [pdf, ps, other

    cs.IT math.PR stat.CO

    Information divergences of Markov chains and their applications

    Authors: Youjia Wang, Michael C. H. Choi

    Abstract: In this paper, we first introduce and define several new information divergences in the space of transition matrices of finite Markov chains which measure the discrepancy between two Markov chains. These divergences offer natural generalizations of classical information-theoretic divergences, such as the $f$-divergences and the Rényi divergence between probability measures, to the context of finit… ▽ More

    Submitted 8 December, 2023; originally announced December 2023.

    Comments: 36 pages

    MSC Class: 60J10; 60J20; 94A15; 94A17

  46. arXiv:2312.03003  [pdf, other

    cs.HC cs.AI cs.CL

    Explore, Select, Derive, and Recall: Augmenting LLM with Human-like Memory for Mobile Task Automation

    Authors: Sunjae Lee, Junyoung Choi, Jungjae Lee, Munim Hasan Wasi, Hojun Choi, Steven Y. Ko, Sangeun Oh, Insik Shin

    Abstract: The advent of large language models (LLMs) has opened up new opportunities in the field of mobile task automation. Their superior language understanding and reasoning capabilities allow users to automate complex and repetitive tasks. However, due to the inherent unreliability and high operational cost of LLMs, their practical applicability is quite limited. To address these issues, this paper intr… ▽ More

    Submitted 16 March, 2024; v1 submitted 4 December, 2023; originally announced December 2023.

  47. arXiv:2311.17952  [pdf, other

    cs.CV

    Synchronizing Vision and Language: Bidirectional Token-Masking AutoEncoder for Referring Image Segmentation

    Authors: Minhyeok Lee, Dogyoon Lee, Jungho Lee, Suhwan Cho, Heeseung Choi, Ig-Jae Kim, Sangyoun Lee

    Abstract: Referring Image Segmentation (RIS) aims to segment target objects expressed in natural language within a scene at the pixel level. Various recent RIS models have achieved state-of-the-art performance by generating contextual tokens to model multimodal features from pretrained encoders and effectively fusing them using transformer-based cross-modal attention. While these methods match language feat… ▽ More

    Submitted 29 November, 2023; originally announced November 2023.

  48. arXiv:2311.12454  [pdf, other

    cs.SD cs.AI cs.MM eess.AS

    HierSpeech++: Bridging the Gap between Semantic and Acoustic Representation of Speech by Hierarchical Variational Inference for Zero-shot Speech Synthesis

    Authors: Sang-Hoon Lee, Ha-Yeong Choi, Seung-Bin Kim, Seong-Whan Lee

    Abstract: Large language models (LLM)-based speech synthesis has been widely adopted in zero-shot speech synthesis. However, they require a large-scale data and possess the same limitations as previous autoregressive speech models, including slow inference speed and lack of robustness. This paper proposes HierSpeech++, a fast and strong zero-shot speech synthesizer for text-to-speech (TTS) and voice convers… ▽ More

    Submitted 27 November, 2023; v1 submitted 21 November, 2023; originally announced November 2023.

    Comments: 16 pages, 9 figures, 12 tables

  49. arXiv:2311.10915  [pdf, other

    cs.RO

    Path Planning in 3D with Motion Primitives for Wind Energy-Harvesting Fixed-Wing Aircraft

    Authors: Seung-Keol Ryu, Michael Moncton, Han-Lim Choi, Eric Frew

    Abstract: In this work, a set of motion primitives is defined for use in an energy-aware motion planning problem. The motion primitives are defined as sequences of control inputs to a simplified four-DOF dynamics model and are used to replace the traditional continuous control space used in many sampling-based motion planners. The primitives are implemented in a Stable Sparse Rapidly Exploring Random Tree (… ▽ More

    Submitted 17 November, 2023; originally announced November 2023.

    Comments: 4 pages

  50. arXiv:2311.04693  [pdf, other

    eess.AS cs.AI cs.SD eess.SP

    Diff-HierVC: Diffusion-based Hierarchical Voice Conversion with Robust Pitch Generation and Masked Prior for Zero-shot Speaker Adaptation

    Authors: Ha-Yeong Choi, Sang-Hoon Lee, Seong-Whan Lee

    Abstract: Although voice conversion (VC) systems have shown a remarkable ability to transfer voice style, existing methods still have an inaccurate pitch and low speaker adaptation quality. To address these challenges, we introduce Diff-HierVC, a hierarchical VC system based on two diffusion models. We first introduce DiffPitch, which can effectively generate F0 with the target voice style. Subsequently, th… ▽ More

    Submitted 8 November, 2023; originally announced November 2023.

    Comments: INTERSPEECH 2023 (Oral)