Zum Hauptinhalt springen

Showing 1–50 of 178 results for author: Byun

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.10923  [pdf, other

    cs.CL cs.AI

    LBC: Language-Based-Classifier for Out-Of-Variable Generalization

    Authors: Kangjun Noh, Baekryun Seong, Hoyoon Byun, Youngjun Choi, Sungjin Song, Kyungwoo Song

    Abstract: Large Language Models (LLMs) have great success in natural language processing tasks such as response generation. However, their use in tabular data has been limited due to their inferior performance compared to traditional machine learning models (TMLs) such as XGBoost. We find that the pre-trained knowledge of LLMs enables them to interpret new variables that appear in a test without additional… ▽ More

    Submitted 23 August, 2024; v1 submitted 20 August, 2024; originally announced August 2024.

    Comments: 16 pages, 7 figures, 4 tables

  2. arXiv:2408.10107  [pdf, other

    cs.LG cs.AI stat.ML

    Perturb-and-Compare Approach for Detecting Out-of-Distribution Samples in Constrained Access Environments

    Authors: Heeyoung Lee, Hoyoon Byun, Changdae Oh, JinYeong Bak, Kyungwoo Song

    Abstract: Accessing machine learning models through remote APIs has been gaining prevalence following the recent trend of scaling up model parameters for increased performance. Even though these models exhibit remarkable ability, detecting out-of-distribution (OOD) samples remains a crucial safety concern for end users as these samples may induce unreliable outputs from the model. In this work, we propose a… ▽ More

    Submitted 19 August, 2024; originally announced August 2024.

    Comments: Accepted to European Conference on Artificial Intelligence (ECAI) 2024

  3. arXiv:2408.07279  [pdf

    cs.AR cs.HC

    Interactive and Automatic Generation of Primitive Custom Circuit Layout Using LLMs

    Authors: Geunyoung You, Youjin Byun, Sojin Lim, Jaeduk Han

    Abstract: In this study, we investigate the use of Large Language Models (LLMs) for the interactive and automated production of customs circuit layouts described in natural language. Our proposed layout automation process leverages a template-and-grid-based layout generation framework to create process-portable layout generators tailored for various custom circuits, including standard cells and high-speed m… ▽ More

    Submitted 26 July, 2024; originally announced August 2024.

    Comments: 6 pages, 11 figures

  4. arXiv:2408.05074  [pdf

    cs.CL cs.AI

    RT-Surv: Improving Mortality Prediction After Radiotherapy with Large Language Model Structuring of Large-Scale Unstructured Electronic Health Records

    Authors: Sangjoon Park, Chan Woo Wee, Seo Hee Choi, Kyung Hwan Kim, Jee Suk Chang, Hong In Yoon, Ik Jae Lee, Yong Bae Kim, Jaeho Cho, Ki Chang Keum, Chang Geol Lee, Hwa Kyung Byun, Woong Sub Koom

    Abstract: Accurate patient selection is critical in radiotherapy (RT) to prevent ineffective treatments. Traditional survival prediction models, relying on structured data, often lack precision. This study explores the potential of large language models (LLMs) to structure unstructured electronic health record (EHR) data, thereby improving survival prediction accuracy through comprehensive clinical informat… ▽ More

    Submitted 16 August, 2024; v1 submitted 9 August, 2024; originally announced August 2024.

    Comments: 23 pages, 2 tables, 4 figures

  5. arXiv:2407.03557  [pdf, other

    cs.LG stat.ML

    Decision-Focused Evaluation of Worst-Case Distribution Shift

    Authors: Kevin Ren, Yewon Byun, Bryan Wilder

    Abstract: Distribution shift is a key challenge for predictive models in practice, creating the need to identify potentially harmful shifts in advance of deployment. Existing work typically defines these worst-case shifts as ones that most degrade the individual-level accuracy of the model. However, when models are used to make a downstream population-level decision like the allocation of a scarce resource,… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: UAI 2024

  6. arXiv:2407.01481  [pdf, other

    cs.DC cs.PF

    LLload: Simplifying Real-Time Job Monitoring for HPC Users

    Authors: Chansup Byun, Julia Mullen, Albert Reuther, William Arcand, William Bergeron, David Bestor, Daniel Burrill, Vijay Gadepally, Michael Houle, Matthew Hubbell, Hayden Jananthan, Michael Jones, Peter Michaleas, Guillermo Morales, Andrew Prout, Antonio Rosa, Charles Yee, Jeremy Kepner, Lauren Milechin

    Abstract: One of the more complex tasks for researchers using HPC systems is performance monitoring and tuning of their applications. Developing a practice of continuous performance improvement, both for speed-up and efficient use of resources is essential to the long term success of both the HPC practitioner and the research project. Profiling tools provide a nice view of the performance of an application… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  7. arXiv:2407.00087  [pdf, other

    cs.AI cs.CL cs.LG

    ARES: Alternating Reinforcement Learning and Supervised Fine-Tuning for Enhanced Multi-Modal Chain-of-Thought Reasoning Through Diverse AI Feedback

    Authors: Ju-Seung Byun, Jiyun Chun, Jihyung Kil, Andrew Perrault

    Abstract: Large Multimodal Models (LMMs) excel at comprehending human instructions and demonstrate remarkable results across a broad spectrum of tasks. Reinforcement Learning from Human Feedback (RLHF) and AI Feedback (RLAIF) further refine LLMs by aligning them with specific preferences. These methods primarily use ranking-based feedback for entire generations. With advanced AI models (Teacher), such as GP… ▽ More

    Submitted 25 June, 2024; originally announced July 2024.

  8. arXiv:2406.17876  [pdf, other

    cs.CV cs.AI cs.CL cs.LG cs.RO

    ET tu, CLIP? Addressing Common Object Errors for Unseen Environments

    Authors: Ye Won Byun, Cathy Jiao, Shahriar Noroozizadeh, Jimin Sun, Rosa Vitiello

    Abstract: We introduce a simple method that employs pre-trained CLIP encoders to enhance model generalization in the ALFRED task. In contrast to previous literature where CLIP replaces the visual encoder, we suggest using CLIP as an additional module through an auxiliary object detection objective. We validate our method on the recently proposed Episodic Transformer architecture and demonstrate that incorpo… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

    Journal ref: Conference on Computer Vision and Pattern Recognition (CVPR 2022) - Embodied AI Workshop

  9. arXiv:2406.17477  [pdf, other

    cs.DC cs.LG

    Towards Federated Low-Rank Adaptation with Rank-Heterogeneous Communication

    Authors: Yuji Byun, Jaeho Lee

    Abstract: Low-rank adaptation (LoRA) is an attractive alternative of adapting full weights for the federated fine-tuning of large pretrained models, which can significantly reduce the memory and communication burden. In principle, federated LoRA can provide an effective mean to allocate different resources to each client by tuning ranks for each client, which can be useful in achieving a better communicatio… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

  10. arXiv:2406.15664  [pdf, other

    stat.ML cs.LG

    Flat Posterior Does Matter For Bayesian Transfer Learning

    Authors: Sungjun Lim, Jeyoon Yeom, Sooyon Kim, Hoyoon Byun, Jinho Kang, Yohan Jung, Jiyoung Jung, Kyungwoo Song

    Abstract: The large-scale pre-trained neural network has achieved notable success in enhancing performance for downstream tasks. Another promising approach for generalization is Bayesian Neural Network (BNN), which integrates Bayesian methods into neural network architectures, offering advantages such as Bayesian Model averaging (BMA) and uncertainty quantification. Despite these benefits, transfer learning… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

  11. arXiv:2406.13502  [pdf, other

    cs.CL cs.SD eess.AS

    ManWav: The First Manchu ASR Model

    Authors: Jean Seo, Minha Kang, Sungjoo Byun, Sangah Lee

    Abstract: This study addresses the widening gap in Automatic Speech Recognition (ASR) research between high resource and extremely low resource languages, with a particular focus on Manchu, a critically endangered language. Manchu exemplifies the challenges faced by marginalized linguistic communities in accessing state-of-the-art technologies. In a pioneering effort, we introduce the first-ever Manchu ASR… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: ACL2024/Field Matters

  12. arXiv:2406.09188  [pdf, ps, other

    cs.CV cs.IR

    Reducing Task Discrepancy of Text Encoders for Zero-Shot Composed Image Retrieval

    Authors: Jaeseok Byun, Seokhyeon Jeong, Wonjae Kim, Sanghyuk Chun, Taesup Moon

    Abstract: Composed Image Retrieval (CIR) aims to retrieve a target image based on a reference image and conditioning text, enabling controllable searches. Due to the expensive dataset construction cost for CIR triplets, a zero-shot (ZS) CIR setting has been actively studied to eliminate the need for human-collected triplet datasets. The mainstream of ZS-CIR employs an efficient projection module that projec… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: 17 pages

  13. arXiv:2405.20042  [pdf, other

    cs.LG

    CycleFormer : TSP Solver Based on Language Modeling

    Authors: Jieun Yook, Junpyo Seo, Joon Huh, Han Joon Byun, Byung-ro Mooon

    Abstract: We propose a new transformer model for the Traveling Salesman Problem (TSP) called CycleFormer. We identified distinctive characteristics that need to be considered when applying a conventional transformer model to TSP and aimed to fully incorporate these elements into the TSP-specific transformer. Unlike the token sets in typical language models, which are limited and static, the token (node) set… ▽ More

    Submitted 31 May, 2024; v1 submitted 30 May, 2024; originally announced May 2024.

  14. arXiv:2405.18623  [pdf

    cs.HC

    I See You: Teacher Analytics with GPT-4 Vision-Powered Observational Assessment

    Authors: Unggi Lee, Yeil Jeong, Junbo Koh, Gyuri Byun, Yunseo Lee, Hyunwoong Lee, Seunmin Eun, Jewoong Moon, Cheolil Lim, Hyeoncheol Kim

    Abstract: This preliminary study explores the integration of GPT-4 Vision (GPT-4V) technology into teacher analytics, focusing on its applicability in observational assessment to enhance reflective teaching practice. This research is grounded in developing a Video-based Automatic Assessment System (VidAAS) empowered by GPT-4V. Our approach aims to revolutionize teachers' assessment of students' practices by… ▽ More

    Submitted 30 May, 2024; v1 submitted 28 May, 2024; originally announced May 2024.

    Comments: 27 pages, 5 figures, 4 tables

  15. arXiv:2405.17618  [pdf, other

    cs.LG cs.AI

    Symmetric Reinforcement Learning Loss for Robust Learning on Diverse Tasks and Model Scales

    Authors: Ju-Seung Byun, Andrew Perrault

    Abstract: Reinforcement learning (RL) training is inherently unstable due to factors such as moving targets and high gradient variance. Reinforcement Learning from Human Feedback (RLHF) and Reinforcement Learning from AI Feedback (RLAIF) can introduce additional difficulty. Differing preferences can complicate the alignment process, and prediction errors in a trained reward model can become more severe as t… ▽ More

    Submitted 29 May, 2024; v1 submitted 27 May, 2024; originally announced May 2024.

  16. arXiv:2405.14632  [pdf, other

    cs.LG cs.AI

    Reinforcement Learning for Fine-tuning Text-to-speech Diffusion Models

    Authors: Jingyi Chen, Ju-Seung Byun, Micha Elsner, Andrew Perrault

    Abstract: Recent advancements in generative models have sparked significant interest within the machine learning community. Particularly, diffusion models have demonstrated remarkable capabilities in synthesizing images and speech. Studies such as those by Lee et al. [19], Black et al. [4], Wang et al. [36], and Fan et al. [8] illustrate that Reinforcement Learning with Human Feedback (RLHF) can enhance dif… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

  17. arXiv:2405.01361  [pdf, other

    cs.RO

    Haptic-Based Bilateral Teleoperation of Aerial Manipulator for Extracting Wedged Object with Compensation of Human Reaction Time

    Authors: Jeonghyun Byun, Dohyun Eom, H. Jin Kim

    Abstract: Bilateral teleoperation of an aerial manipulator facilitates the execution of industrial missions thanks to the combination of the aerial platform's maneuverability and the ability to conduct complex tasks with human supervision. Heretofore, research on such operations has focused on flying without any physical interaction or exerting a pushing force on a contact surface that does not involve abru… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.

    Comments: to be presented in 2024 IEEE International Conference on Unmanned Aircraft Systems (ICUAS), Chania, Crete, Greece, 2024

  18. arXiv:2404.11310  [pdf, other

    cs.RO

    Autonomous aerial perching and unperching using omnidirectional tiltrotor and switching controller

    Authors: Dongjae Lee, Sunwoo Hwang, Jeonghyun Byun, Seung Jae Lee, H. Jin Kim

    Abstract: Aerial unperching of multirotors has received little attention as opposed to perching that has been investigated to elongate operation time. This study presents a new aerial robot capable of both perching and unperching autonomously on/from a ferromagnetic surface during flight, and a switching controller to avoid rotor saturation and mitigate overshoot during transition between free-flight and pe… ▽ More

    Submitted 17 April, 2024; originally announced April 2024.

    Comments: 7 pages, 10 figures, 2024 IEEE International Conference on Robotics and Automation (ICRA) accepted

  19. arXiv:2404.01954  [pdf, other

    cs.CL cs.AI

    HyperCLOVA X Technical Report

    Authors: Kang Min Yoo, Jaegeun Han, Sookyo In, Heewon Jeon, Jisu Jeong, Jaewook Kang, Hyunwook Kim, Kyung-Min Kim, Munhyong Kim, Sungju Kim, Donghyun Kwak, Hanock Kwak, Se Jung Kwon, Bado Lee, Dongsoo Lee, Gichang Lee, Jooho Lee, Baeseong Park, Seongjin Shin, Joonsang Yu, Seolki Baek, Sumin Byeon, Eungsup Cho, Dooseok Choe, Jeesung Han , et al. (371 additional authors not shown)

    Abstract: We introduce HyperCLOVA X, a family of large language models (LLMs) tailored to the Korean language and culture, along with competitive capabilities in English, math, and coding. HyperCLOVA X was trained on a balanced mix of Korean, English, and code data, followed by instruction-tuning with high-quality human-annotated datasets while abiding by strict safety guidelines reflecting our commitment t… ▽ More

    Submitted 13 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: 44 pages; updated authors list and fixed author names

  20. arXiv:2403.16447  [pdf, ps, other

    cs.CL

    A Study on How Attention Scores in the BERT Model are Aware of Lexical Categories in Syntactic and Semantic Tasks on the GLUE Benchmark

    Authors: Dongjun Jang, Sungjoo Byun, Hyopil Shin

    Abstract: This study examines whether the attention scores between tokens in the BERT model significantly vary based on lexical categories during the fine-tuning process for downstream tasks. Drawing inspiration from the notion that in human language processing, syntactic and semantic information is parsed differently, we categorize tokens in sentences according to their lexical categories and focus on chan… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

  21. arXiv:2403.16444  [pdf, other

    cs.CL

    KIT-19: A Comprehensive Korean Instruction Toolkit on 19 Tasks for Fine-Tuning Korean Large Language Models

    Authors: Dongjun Jang, Sungjoo Byun, Hyemi Jo, Hyopil Shin

    Abstract: Instruction Tuning on Large Language Models is an essential process for model to function well and achieve high performance in specific tasks. Accordingly, in mainstream languages such as English, instruction-based datasets are being constructed and made publicly available. In the case of Korean, publicly available models and datasets all rely on using the output of ChatGPT or translating datasets… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

  22. arXiv:2403.16158  [pdf, other

    cs.CL

    Korean Bio-Medical Corpus (KBMC) for Medical Named Entity Recognition

    Authors: Sungjoo Byun, Jiseung Hong, Sumin Park, Dongjun Jang, Jean Seo, Minseok Kim, Chaeyoung Oh, Hyopil Shin

    Abstract: Named Entity Recognition (NER) plays a pivotal role in medical Natural Language Processing (NLP). Yet, there has not been an open-source medical NER dataset specifically for the Korean language. To address this, we utilized ChatGPT to assist in constructing the KBMC (Korean Bio-Medical Corpus), which we are now presenting to the public. With the KBMC dataset, we noticed an impressive 20% increase… ▽ More

    Submitted 24 March, 2024; originally announced March 2024.

    Journal ref: LREC-COLING 2024

  23. arXiv:2403.14713  [pdf, other

    cs.LG cs.CY stat.ME stat.ML

    Auditing Fairness under Unobserved Confounding

    Authors: Yewon Byun, Dylan Sam, Michael Oberst, Zachary C. Lipton, Bryan Wilder

    Abstract: The presence of inequity is a fundamental problem in the outcomes of decision-making systems, especially when human lives are at stake. Yet, estimating notions of unfairness or inequity is difficult, particularly if they rely on hard-to-measure concepts such as risk. Such measurements of risk can be accurately obtained when no unobserved confounders have jointly influenced past decisions and outco… ▽ More

    Submitted 24 April, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

    Comments: AISTATS 2024

  24. arXiv:2403.10041  [pdf, other

    cs.RO cs.AI

    Towards Embedding Dynamic Personas in Interactive Robots: Masquerading Animated Social Kinematics (MASK)

    Authors: Jeongeun Park, Taemoon Jeong, Hyeonseong Kim, Taehyun Byun, Seungyoon Shin, Keunjun Choi, Jaewoon Kwon, Taeyoon Lee, Matthew Pan, Sungjoon Choi

    Abstract: This paper presents the design and development of an innovative interactive robotic system to enhance audience engagement using character-like personas. Built upon the foundations of persona-driven dialog agents, this work extends the agent application to the physical realm, employing robots to provide a more immersive and interactive experience. The proposed system, named the Masquerading Animate… ▽ More

    Submitted 15 March, 2024; originally announced March 2024.

    Comments: 4 pages, 3 figures

  25. arXiv:2403.02639  [pdf, other

    cs.CV cs.LG

    False Positive Sampling-based Data Augmentation for Enhanced 3D Object Detection Accuracy

    Authors: Jiyong Oh, Junhaeng Lee, Woongchan Byun, Minsang Kong, Sang Hun Lee

    Abstract: Recent studies have focused on enhancing the performance of 3D object detection models. Among various approaches, ground-truth sampling has been proposed as an augmentation technique to address the challenges posed by limited ground-truth data. However, an inherent issue with ground-truth sampling is its tendency to increase false positives. Therefore, this study aims to overcome the limitations o… ▽ More

    Submitted 19 May, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

  26. arXiv:2402.18362  [pdf, other

    cs.CV cs.AI

    Objective and Interpretable Breast Cosmesis Evaluation with Attention Guided Denoising Diffusion Anomaly Detection Model

    Authors: Sangjoon Park, Yong Bae Kim, Jee Suk Chang, Seo Hee Choi, Hyungjin Chung, Ik Jae Lee, Hwa Kyung Byun

    Abstract: As advancements in the field of breast cancer treatment continue to progress, the assessment of post-surgical cosmetic outcomes has gained increasing significance due to its substantial impact on patients' quality of life. However, evaluating breast cosmesis presents challenges due to the inherently subjective nature of expert labeling. In this study, we present a novel automated approach, Attenti… ▽ More

    Submitted 28 February, 2024; originally announced February 2024.

  27. arXiv:2402.15046  [pdf, other

    cs.CL

    CARBD-Ko: A Contextually Annotated Review Benchmark Dataset for Aspect-Level Sentiment Classification in Korean

    Authors: Dongjun Jang, Jean Seo, Sungjoo Byun, Taekyoung Kim, Minseok Kim, Hyopil Shin

    Abstract: This paper explores the challenges posed by aspect-based sentiment classification (ABSC) within pretrained language models (PLMs), with a particular focus on contextualization and hallucination issues. In order to tackle these challenges, we introduce CARBD-Ko (a Contextually Annotated Review Benchmark Dataset for Aspect-Based Sentiment Classification in Korean), a benchmark dataset that incorpora… ▽ More

    Submitted 22 February, 2024; originally announced February 2024.

  28. arXiv:2402.13410  [pdf, other

    cs.LG stat.ML

    Bayesian Neural Networks with Domain Knowledge Priors

    Authors: Dylan Sam, Rattana Pukdee, Daniel P. Jeong, Yewon Byun, J. Zico Kolter

    Abstract: Bayesian neural networks (BNNs) have recently gained popularity due to their ability to quantify model uncertainty. However, specifying a prior for BNNs that captures relevant domain knowledge is often extremely challenging. In this work, we propose a framework for integrating general forms of domain knowledge (i.e., any knowledge that can be represented by a loss function) into a BNN prior throug… ▽ More

    Submitted 20 February, 2024; originally announced February 2024.

    Comments: 17 pages, 4 figures

  29. arXiv:2402.09965  [pdf, ps, other

    cs.LG cs.AI

    Hierarchy Representation of Data in Machine Learnings

    Authors: Han Yegang, Park Minjun, Byun Duwon, Park Inkyu

    Abstract: When there are models with clear-cut judgment results for several data points, it is possible that most models exhibit a relationship where if they correctly judge one target, they also correctly judge another target. Conversely, if most models incorrectly judge one target, they may also incorrectly judge another target. We propose a method for visualizing this hierarchy among targets. This inform… ▽ More

    Submitted 29 November, 2023; originally announced February 2024.

  30. arXiv:2402.06264  [pdf

    cs.AI cs.CL cs.SI

    LLaVA-Docent: Instruction Tuning with Multimodal Large Language Model to Support Art Appreciation Education

    Authors: Unggi Lee, Minji Jeon, Yunseo Lee, Gyuri Byun, Yoorim Son, Jaeyoon Shin, Hongkyu Ko, Hyeoncheol Kim

    Abstract: Art appreciation is vital in nurturing critical thinking and emotional intelligence among learners. However, traditional art appreciation education has often been hindered by limited access to art resources, especially for disadvantaged students, and an imbalanced emphasis on STEM subjects in mainstream education. In response to these challenges, recent technological advancements have paved the wa… ▽ More

    Submitted 26 August, 2024; v1 submitted 9 February, 2024; originally announced February 2024.

    Comments: 37 pages, 4 figures, 10 tables

  31. arXiv:2402.05448  [pdf, other

    cs.CV cs.AI cs.GR cs.LG cs.MM

    Minecraft-ify: Minecraft Style Image Generation with Text-guided Image Editing for In-Game Application

    Authors: Bumsoo Kim, Sanghyun Byun, Yonghoon Jung, Wonseop Shin, Sareer UI Amin, Sanghyun Seo

    Abstract: In this paper, we first present the character texture generation system \textit{Minecraft-ify}, specified to Minecraft video game toward in-game application. Ours can generate face-focused image for texture mapping tailored to 3D virtual character having cube manifold. While existing projects or works only generate texture, proposed system can inverse the user-provided real image, or generate aver… ▽ More

    Submitted 3 March, 2024; v1 submitted 8 February, 2024; originally announced February 2024.

    Comments: 2 pages, 2 figures. Accepted as Spotlight to NeurIPS 2023 Workshop on Machine Learning for Creativity and Design

  32. arXiv:2312.06112  [pdf, other

    cs.CV cs.AI

    MAFA: Managing False Negatives for Vision-Language Pre-training

    Authors: Jaeseok Byun, Dohoon Kim, Taesup Moon

    Abstract: We consider a critical issue of false negatives in Vision-Language Pre-training (VLP), a challenge that arises from the inherent many-to-many correspondence of image-text pairs in large-scale web-crawled datasets. The presence of false negatives can impede achieving optimal performance and even lead to a significant performance drop. To address this challenge, we propose MAFA (MAnaging FAlse negat… ▽ More

    Submitted 12 June, 2024; v1 submitted 10 December, 2023; originally announced December 2023.

    Comments: CVPR 2024 camera ready version

  33. arXiv:2312.00083  [pdf, other

    cs.CV cs.LG

    BAM-DETR: Boundary-Aligned Moment Detection Transformer for Temporal Sentence Grounding in Videos

    Authors: Pilhyeon Lee, Hyeran Byun

    Abstract: Temporal sentence grounding aims to localize moments relevant to a language description. Recently, DETR-like approaches achieved notable progress by predicting the center and length of a target moment. However, they suffer from the issue of center misalignment raised by the inherent ambiguity of moment centers, leading to inaccurate predictions. To remedy this problem, we propose a novel boundary-… ▽ More

    Submitted 18 July, 2024; v1 submitted 30 November, 2023; originally announced December 2023.

    Comments: Accepted by ECCV 2024

  34. arXiv:2311.18215  [pdf, other

    cs.CL

    Automatic Construction of a Korean Toxic Instruction Dataset for Ethical Tuning of Large Language Models

    Authors: Sungjoo Byun, Dongjun Jang, Hyemi Jo, Hyopil Shin

    Abstract: Caution: this paper may include material that could be offensive or distressing. The advent of Large Language Models (LLMs) necessitates the development of training approaches that mitigate the generation of unethical language and aptly manage toxic user queries. Given the challenges related to human labor and the scarcity of data, we present KoTox, comprising 39K unethical instruction-output pa… ▽ More

    Submitted 29 November, 2023; originally announced November 2023.

    Comments: NeurIPS 2023 Workshop on Instruction Tuning and Instruction Following

  35. arXiv:2311.17492  [pdf, other

    cs.CL

    Mergen: The First Manchu-Korean Machine Translation Model Trained on Augmented Data

    Authors: Jean Seo, Sungjoo Byun, Minha Kang, Sangah Lee

    Abstract: The Manchu language, with its roots in the historical Manchurian region of Northeast China, is now facing a critical threat of extinction, as there are very few speakers left. In our efforts to safeguard the Manchu language, we introduce Mergen, the first-ever attempt at a Manchu-Korean Machine Translation (MT) model. To develop this model, we utilize valuable resources such as the Manwen Laodang(… ▽ More

    Submitted 12 January, 2024; v1 submitted 29 November, 2023; originally announced November 2023.

    Comments: emnlp2023/mrl2023

  36. arXiv:2311.15876  [pdf, other

    cs.CV cs.AI cs.LG

    End-to-End Breast Cancer Radiotherapy Planning via LMMs with Consistency Embedding

    Authors: Kwanyoung Kim, Yujin Oh, Sangjoon Park, Hwa Kyung Byun, Joongyo Lee, Jin Sung Kim, Yong Bae Kim, Jong Chul Ye

    Abstract: Recent advances in AI foundation models have significant potential for lightening the clinical workload by mimicking the comprehensive and multi-faceted approaches used by medical professionals. In the field of radiation oncology, the integration of multiple modalities holds great importance, so the opportunity of foundational model is abundant. Inspired by this, here we present RO-LMM, a multi-pu… ▽ More

    Submitted 1 July, 2024; v1 submitted 27 November, 2023; originally announced November 2023.

    Comments: 10 pages, 4 figures, 11 tables

  37. arXiv:2311.13784  [pdf, other

    cs.CL

    DaG LLM ver 1.0: Pioneering Instruction-Tuned Language Modeling for Korean NLP

    Authors: Dongjun Jang, Sangah Lee, Sungjoo Byun, Jinwoong Kim, Jean Seo, Minseok Kim, Soyeon Kim, Chaeyoung Oh, Jaeyoon Kim, Hyemi Jo, Hyopil Shin

    Abstract: This paper presents the DaG LLM (David and Goliath Large Language Model), a language model specialized for Korean and fine-tuned through Instruction Tuning across 41 tasks within 13 distinct categories.

    Submitted 22 November, 2023; originally announced November 2023.

  38. arXiv:2311.01908  [pdf, other

    eess.IV cs.CV

    LLM-driven Multimodal Target Volume Contouring in Radiation Oncology

    Authors: Yujin Oh, Sangjoon Park, Hwa Kyung Byun, Yeona Cho, Ik Jae Lee, Jin Sung Kim, Jong Chul Ye

    Abstract: Target volume contouring for radiation therapy is considered significantly more challenging than the normal organ segmentation tasks as it necessitates the utilization of both image and text-based clinical information. Inspired by the recent advancement of large language models (LLMs) that can facilitate the integration of the textural information and images, here we present a novel LLM-driven mul… ▽ More

    Submitted 15 April, 2024; v1 submitted 3 November, 2023; originally announced November 2023.

  39. arXiv:2310.02588  [pdf, other

    cs.CV cs.LG

    ViT-ReciproCAM: Gradient and Attention-Free Visual Explanations for Vision Transformer

    Authors: Seok-Yong Byun, Wonju Lee

    Abstract: This paper presents a novel approach to address the challenges of understanding the prediction process and debugging prediction errors in Vision Transformers (ViT), which have demonstrated superior performance in various computer vision tasks such as image classification and object detection. While several visual explainability techniques, such as CAM, Grad-CAM, Score-CAM, and Recipro-CAM, have be… ▽ More

    Submitted 4 October, 2023; originally announced October 2023.

  40. arXiv:2310.01405  [pdf, other

    cs.LG cs.AI cs.CL cs.CV cs.CY

    Representation Engineering: A Top-Down Approach to AI Transparency

    Authors: Andy Zou, Long Phan, Sarah Chen, James Campbell, Phillip Guo, Richard Ren, Alexander Pan, Xuwang Yin, Mantas Mazeika, Ann-Kathrin Dombrowski, Shashwat Goel, Nathaniel Li, Michael J. Byun, Zifan Wang, Alex Mallen, Steven Basart, Sanmi Koyejo, Dawn Song, Matt Fredrikson, J. Zico Kolter, Dan Hendrycks

    Abstract: In this paper, we identify and characterize the emerging area of representation engineering (RepE), an approach to enhancing the transparency of AI systems that draws on insights from cognitive neuroscience. RepE places population-level representations, rather than neurons or circuits, at the center of analysis, equipping us with novel methods for monitoring and manipulating high-level cognitive p… ▽ More

    Submitted 10 October, 2023; v1 submitted 2 October, 2023; originally announced October 2023.

    Comments: Code is available at https://github.com/andyzoujm/representation-engineering

  41. arXiv:2310.00522  [pdf, other

    cs.SI

    Mapping of Internet "Coastlines" via Large Scale Anonymized Network Source Correlations

    Authors: Hayden Jananthan, Jeremy Kepner, Michael Jones, William Arcand, David Bestor, William Bergeron, Chansup Byun, Timothy Davis, Vijay Gadepally, Daniel Grant, Michael Houle, Matthew Hubbell, Anna Klein, Lauren Milechin, Guillermo Morales, Andrew Morris, Julie Mullen, Ritesh Patel, Alex Pentland, Sandeep Pisharody, Andrew Prout, Albert Reuther, Antonio Rosa, Siddharth Samsi, Tyler Trigg , et al. (3 additional authors not shown)

    Abstract: Expanding the scientific tools available to protect computer networks can be aided by a deeper understanding of the underlying statistical distributions of network traffic and their potential geometric interpretations. Analyses of large scale network observations provide a unique window into studying those underlying statistics. Newly developed GraphBLAS hypersparse matrices and D4M associative ar… ▽ More

    Submitted 30 September, 2023; originally announced October 2023.

    Comments: 9 pages, 7 figures, IEEE HPEC 2023 (accepted)

  42. arXiv:2309.14117  [pdf, other

    cs.CV cs.AI

    Small Objects Matters in Weakly-supervised Semantic Segmentation

    Authors: Cheolhyun Mun, Sanghuk Lee, Youngjung Uh, Junsuk Choe, Hyeran Byun

    Abstract: Weakly-supervised semantic segmentation (WSSS) performs pixel-wise classification given only image-level labels for training. Despite the difficulty of this task, the research community has achieved promising results over the last five years. Still, current WSSS literature misses the detailed sense of how well the methods perform on different sizes of objects. Thus we propose a novel evaluation me… ▽ More

    Submitted 25 September, 2023; originally announced September 2023.

    Comments: Accepted to WACV 2024

  43. pPython Performance Study

    Authors: Chansup Byun, William Arcand, David Bestor, Bill Bergeron, Vijay Gadepally, Michael Houle, Matthew Hubbell, Hayden Jananthan, Michael Jones, Anna Klein, Peter Michaleas, Lauren Milechin, Guillermo Morales, Julie Mullen, Andrew Prout, Albert Reuther, Antonio Rosa, Siddharth Samsi, Charles Yee, Jeremy Kepner

    Abstract: pPython seeks to provide a parallel capability that provides good speed-up without sacrificing the ease of programming in Python by implementing partitioned global array semantics (PGAS) on top of a simple file-based messaging library (PythonMPI) in pure Python. pPython follows a SPMD (single program multiple data) model of computation. pPython runs on a single-node (e.g., a laptop) running Window… ▽ More

    Submitted 7 September, 2023; originally announced September 2023.

    Comments: arXiv admin note: substantial text overlap with arXiv:2208.14908

  44. arXiv:2309.03364  [pdf, other

    cs.SD eess.AS

    Highly Controllable Diffusion-based Any-to-Any Voice Conversion Model with Frame-level Prosody Feature

    Authors: Kyungguen Byun, Sunkuk Moon, Erik Visser

    Abstract: We propose a highly controllable voice manipulation system that can perform any-to-any voice conversion (VC) and prosody modulation simultaneously. State-of-the-art VC systems can transfer sentence-level characteristics such as speaker, emotion, and speaking style. However, manipulating the frame-level prosody, such as pitch, energy and speaking rate, still remains challenging. Our proposed model… ▽ More

    Submitted 6 September, 2023; originally announced September 2023.

    Comments: 5 pages, 3 figures, submitted to ICASSP 2024

  45. arXiv:2309.02730  [pdf, other

    eess.AS cs.AI cs.SD

    Stylebook: Content-Dependent Speaking Style Modeling for Any-to-Any Voice Conversion using Only Speech Data

    Authors: Hyungseob Lim, Kyungguen Byun, Sunkuk Moon, Erik Visser

    Abstract: While many recent any-to-any voice conversion models succeed in transferring some target speech's style information to the converted speech, they still lack the ability to faithfully reproduce the speaking style of the target speaker. In this work, we propose a novel method to extract rich style information from target utterances and to efficiently transfer it to source speech content without requ… ▽ More

    Submitted 14 December, 2023; v1 submitted 6 September, 2023; originally announced September 2023.

    Comments: 5 pages, 2 figures, 2 tables

  46. Deployment of Real-Time Network Traffic Analysis using GraphBLAS Hypersparse Matrices and D4M Associative Arrays

    Authors: Michael Jones, Jeremy Kepner, Andrew Prout, Timothy Davis, William Arcand, David Bestor, William Bergeron, Chansup Byun, Vijay Gadepally, Micheal Houle, Matthew Hubbell, Hayden Jananthan, Anna Klein, Lauren Milechin, Guillermo Morales, Julie Mullen, Ritesh Patel, Sandeep Pisharody, Albert Reuther, Antonio Rosa, Siddharth Samsi, Charles Yee, Peter Michaleas

    Abstract: Matrix/array analysis of networks can provide significant insight into their behavior and aid in their operation and protection. Prior work has demonstrated the analytic, performance, and compression capabilities of GraphBLAS (graphblas.org) hypersparse matrices and D4M (d4m.mit.edu) associative arrays (a mathematical superset of matrices). Obtaining the benefits of these capabilities requires int… ▽ More

    Submitted 8 December, 2023; v1 submitted 4 September, 2023; originally announced September 2023.

    Comments: Accepted to IEEE HPEC, 8 pages, 8 figures, 1 table, 69 references. arXiv admin note: text overlap with arXiv:2203.13934. text overlap with arXiv:2309.01806

  47. Focusing and Calibration of Large Scale Network Sensors using GraphBLAS Anonymized Hypersparse Matrices

    Authors: Jeremy Kepner, Michael Jones, Phil Dykstra, Chansup Byun, Timothy Davis, Hayden Jananthan, William Arcand, David Bestor, William Bergeron, Vijay Gadepally, Micheal Houle, Matthew Hubbell, Anna Klein, Lauren Milechin, Guillermo Morales, Julie Mullen, Ritesh Patel, Alex Pentland, Sandeep Pisharody, Andrew Prout, Albert Reuther, Antonio Rosa, Siddharth Samsi, Tyler Trigg, Charles Yee , et al. (1 additional authors not shown)

    Abstract: Defending community-owned cyber space requires community-based efforts. Large-scale network observations that uphold the highest regard for privacy are key to protecting our shared cyberspace. Deployment of the necessary network sensors requires careful sensor placement, focusing, and calibration with significant volumes of network observations. This paper demonstrates novel focusing and calibrati… ▽ More

    Submitted 4 September, 2023; originally announced September 2023.

    Comments: Accepted to IEEE HPEC, 9 pages, 12 figures, 1 table, 63 references, 2 appendices

  48. arXiv:2309.00208  [pdf, other

    cs.CL cs.AI

    Large Language Models for Semantic Monitoring of Corporate Disclosures: A Case Study on Korea's Top 50 KOSPI Companies

    Authors: Junwon Sung, Woojin Heo, Yunkyung Byun, Youngsam Kim

    Abstract: In the rapidly advancing domain of artificial intelligence, state-of-the-art language models such as OpenAI's GPT-3.5-turbo and GPT-4 offer unprecedented opportunities for automating complex tasks. This research paper delves into the capabilities of these models for semantically analyzing corporate disclosures in the Korean context, specifically for timely disclosure. The study focuses on the top… ▽ More

    Submitted 31 August, 2023; originally announced September 2023.

  49. arXiv:2308.10554  [pdf, other

    cs.CV

    Improving Diversity in Zero-Shot GAN Adaptation with Semantic Variations

    Authors: Seogkyu Jeon, Bei Liu, Pilhyeon Lee, Kibeom Hong, Jianlong Fu, Hyeran Byun

    Abstract: Training deep generative models usually requires a large amount of data. To alleviate the data collection cost, the task of zero-shot GAN adaptation aims to reuse well-trained generators to synthesize images of an unseen target domain without any further training samples. Due to the data absence, the textual description of the target domain and the vision-language models, e.g., CLIP, are utilized… ▽ More

    Submitted 21 August, 2023; originally announced August 2023.

    Comments: Accepted to ICCV 2023 (poster)

  50. arXiv:2307.09724  [pdf, other

    cs.CV

    AesPA-Net: Aesthetic Pattern-Aware Style Transfer Networks

    Authors: Kibeom Hong, Seogkyu Jeon, Junsoo Lee, Namhyuk Ahn, Kunhee Kim, Pilhyeon Lee, Daesik Kim, Youngjung Uh, Hyeran Byun

    Abstract: To deliver the artistic expression of the target style, recent studies exploit the attention mechanism owing to its ability to map the local patches of the style image to the corresponding patches of the content image. However, because of the low semantic correspondence between arbitrary content and artworks, the attention module repeatedly abuses specific local patches from the style image, resul… ▽ More

    Submitted 8 August, 2023; v1 submitted 18 July, 2023; originally announced July 2023.

    Comments: Accepted by ICCV 2023. Code is available at this https://github.com/Kibeom-Hong/AesPA-Net