Zum Hauptinhalt springen

Showing 1–19 of 19 results for author: Nam, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.16574  [pdf, other

    cs.CL

    TLCR: Token-Level Continuous Reward for Fine-grained Reinforcement Learning from Human Feedback

    Authors: Eunseop Yoon, Hee Suk Yoon, SooHwan Eom, Gunsoo Han, Daniel Wontae Nam, Daejin Jo, Kyoung-Woon On, Mark A. Hasegawa-Johnson, Sungwoong Kim, Chang D. Yoo

    Abstract: Reinforcement Learning from Human Feedback (RLHF) leverages human preference data to train language models to align more closely with human essence. These human preference data, however, are labeled at the sequence level, creating a mismatch between sequence-level preference labels and tokens, which are autoregressively generated from the language model. Although several recent approaches have tri… ▽ More

    Submitted 23 July, 2024; originally announced July 2024.

    Comments: ACL2024 Findings

  2. arXiv:2407.00305  [pdf, other

    cs.HC

    Student-AI Interaction: A Case Study of CS1 students

    Authors: Matin Amoozadeh, Daye Nam, Daniel Prol, Ali Alfageeh, James Prather, Michael Hilton, Sruti Srinivasa Ragavan, Mohammad Amin Alipour

    Abstract: The new capabilities of generative artificial intelligence tools Generative AI, such as ChatGPT, allow users to interact with the system in intuitive ways, such as simple conversations, and receive (mostly) good-quality answers. These systems can support students' learning objectives by providing accessible explanations and examples even with vague queries. At the same time, they can encourage und… ▽ More

    Submitted 29 June, 2024; originally announced July 2024.

  3. arXiv:2404.04656  [pdf, other

    cs.LG cs.AI cs.CL

    Binary Classifier Optimization for Large Language Model Alignment

    Authors: Seungjae Jung, Gunsoo Han, Daniel Wontae Nam, Kyoung-Woon On

    Abstract: Aligning Large Language Models (LLMs) to human preferences through preference optimization has been crucial but labor-intensive, necessitating for each prompt a comparison of both a chosen and a rejected text completion by evaluators. Recently, Kahneman-Tversky Optimization (KTO) has demonstrated that LLMs can be aligned using merely binary "thumbs-up" or "thumbs-down" signals on each prompt-compl… ▽ More

    Submitted 6 April, 2024; originally announced April 2024.

    Comments: 18 pages, 9 figures

  4. arXiv:2401.07059  [pdf

    cs.CY

    Classifying Proposals of Decentralized Autonomous Organizations Using Large Language Models

    Authors: Christian Ziegler, Marcos Miranda, Guangye Cao, Gustav Arentoft, Doo Wan Nam

    Abstract: Our study demonstrates the effective use of Large Language Models (LLMs) for automating the classification of complex datasets. We specifically target proposals of Decentralized Autonomous Organizations (DAOs), as the clas-sification of this data requires the understanding of context and, therefore, depends on human expertise, leading to high costs associated with the task. The study applies an it… ▽ More

    Submitted 3 July, 2024; v1 submitted 13 January, 2024; originally announced January 2024.

    Report number: Dawo/2024/01 ACM Class: H.0

  5. arXiv:2310.10817  [pdf, other

    cs.SE cs.HC

    Understanding Documentation Use Through Log Analysis: An Exploratory Case Study of Four Cloud Services

    Authors: Daye Nam, Andrew Macvean, Brad Myers, Bogdan Vasilescu

    Abstract: Almost no modern software system is written from scratch, and developers are required to effectively learn to use third-party libraries or software services. Thus, many practitioners and researchers have looked for ways to create effective documentation that supports developers' learning. However, few efforts have focused on how people actually use the documentation. In this paper, we report on an… ▽ More

    Submitted 29 February, 2024; v1 submitted 16 October, 2023; originally announced October 2023.

  6. arXiv:2310.06404  [pdf, other

    cs.CL cs.AI cs.LG

    Hexa: Self-Improving for Knowledge-Grounded Dialogue System

    Authors: Daejin Jo, Daniel Wontae Nam, Gunsoo Han, Kyoung-Woon On, Taehwan Kwon, Seungeun Rho, Sungwoong Kim

    Abstract: A common practice in knowledge-grounded dialogue generation is to explicitly utilize intermediate steps (e.g., web-search, memory retrieval) with modular approaches. However, data for such steps are often inaccessible compared to those of dialogue responses as they are unobservable in an ordinary dialogue. To fill in the absence of these data, we develop a self-improving method to improve the gene… ▽ More

    Submitted 2 April, 2024; v1 submitted 10 October, 2023; originally announced October 2023.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  7. arXiv:2310.04631  [pdf, other

    cs.HC

    Trust in Generative AI among students: An Exploratory Study

    Authors: Matin Amoozadeh, David Daniels, Daye Nam, Aayush Kumar, Stella Chen, Michael Hilton, Sruti Srinivasa Ragavan, Mohammad Amin Alipour

    Abstract: Generative artificial systems (GenAI) have experienced exponential growth in the past couple of years. These systems offer exciting capabilities, such as generating programs, that students can well utilize for their learning. Among many dimensions that might affect the effective adoption of GenAI, in this paper, we investigate students' \textit{trust}. Trust in GenAI influences the extent to which… ▽ More

    Submitted 1 February, 2024; v1 submitted 6 October, 2023; originally announced October 2023.

    Comments: Accepted at SIGCSE 2024

  8. arXiv:2307.08177  [pdf, other

    cs.SE cs.AI cs.HC

    Using an LLM to Help With Code Understanding

    Authors: Daye Nam, Andrew Macvean, Vincent Hellendoorn, Bogdan Vasilescu, Brad Myers

    Abstract: Understanding code is challenging, especially when working in new and complex development environments. Code comments and documentation can help, but are typically scarce or hard to navigate. Large language models (LLMs) are revolutionizing the process of writing code. Can they do the same for helping understand it? In this study, we provide a first investigation of an LLM-based conversational UI… ▽ More

    Submitted 16 January, 2024; v1 submitted 16 July, 2023; originally announced July 2023.

  9. arXiv:2305.13973  [pdf, other

    cs.CL

    Effortless Integration of Memory Management into Open-Domain Conversation Systems

    Authors: Eunbi Choi, Kyoung-Woon On, Gunsoo Han, Sungwoong Kim, Daniel Wontae Nam, Daejin Jo, Seung Eun Rho, Taehwan Kwon, Minjoon Seo

    Abstract: Open-domain conversation systems integrate multiple conversation skills into a single system through a modular approach. One of the limitations of the system, however, is the absence of management capability for external memory. In this paper, we propose a simple method to improve BlenderBot3 by integrating memory management ability into it. Since no training data exists for this purpose, we propo… ▽ More

    Submitted 23 May, 2023; originally announced May 2023.

  10. arXiv:2305.00630  [pdf, other

    cs.CV

    TRACE: Table Reconstruction Aligned to Corner and Edges

    Authors: Youngmin Baek, Daehyun Nam, Jaeheung Surh, Seung Shin, Seonghyeon Kim

    Abstract: A table is an object that captures structured and informative content within a document, and recognizing a table in an image is challenging due to the complexity and variety of table layouts. Many previous works typically adopt a two-stage approach; (1) Table detection(TD) localizes the table region in an image and (2) Table Structure Recognition(TSR) identifies row- and column-wise adjacency rela… ▽ More

    Submitted 30 April, 2023; originally announced May 2023.

    Comments: 18 pages, 7 figures, Accepted by ICDAR 2023

  11. arXiv:2301.11403  [pdf, other

    cs.SI cs.CL cs.LG

    Detecting Pump&Dump Stock Market Manipulation from Online Forums

    Authors: D. Nam, D. B. Skillicorn

    Abstract: The intersection of social media, low-cost trading platforms, and naive investors has created an ideal situation for information-based market manipulations, especially pump&dumps. Manipulators accumulate small-cap stocks, disseminate false information on social media to inflate their price, and sell at the peak. We collect a dataset of stocks whose price and volume profiles have the characteristic… ▽ More

    Submitted 26 January, 2023; originally announced January 2023.

  12. arXiv:2210.05409  [pdf, other

    cs.LG cs.AI

    LECO: Learnable Episodic Count for Task-Specific Intrinsic Reward

    Authors: Daejin Jo, Sungwoong Kim, Daniel Wontae Nam, Taehwan Kwon, Seungeun Rho, Jongmin Kim, Donghoon Lee

    Abstract: Episodic count has been widely used to design a simple yet effective intrinsic motivation for reinforcement learning with a sparse reward. However, the use of episodic count in a high-dimensional state space as well as over a long episode time requires a thorough state compression and fast hashing, which hinders rigorous exploitation of it in such hard and complex exploration environments. Moreove… ▽ More

    Submitted 11 October, 2022; originally announced October 2022.

    Comments: Accepted to NeurIPS 2022

  13. arXiv:2201.03758  [pdf, other

    cs.SE

    Predictive Synthesis of API-Centric Code

    Authors: Daye Nam, Baishakhi Ray, Seohyun Kim, Xianshan Qu, Satish Chandra

    Abstract: Today's programmers, especially data science practitioners, make heavy use of data-processing libraries (APIs) such as PyTorch, Tensorflow, NumPy, Pandas, and the like. Program synthesizers can provide significant coding assistance to this community of users; however program synthesis also can be slow due to enormous search spaces. In this work, we examine ways in which machine learning can be use… ▽ More

    Submitted 17 May, 2022; v1 submitted 10 January, 2022; originally announced January 2022.

  14. arXiv:2112.00152  [pdf, ps, other

    math.PR cs.DM math-ph

    One-step replica symmetry breaking of random regular NAE-SAT II

    Authors: Danny Nam, Allan Sly, Youngtak Sohn

    Abstract: Continuing our earlier work in \cite{nss20a}, we study the random regular k-NAE-SAT model in the condensation regime. In \cite{nss20a}, the 1RSB properties of the model were established with positive probability. In this paper, we improve the result to probability arbitrarily close to one. To do so, we introduce a new framework which is the synthesis of two approaches: the small subgraph condition… ▽ More

    Submitted 17 December, 2023; v1 submitted 30 November, 2021; originally announced December 2021.

    Comments: 57 pages, 1 figure. Accepted to Communications in Mathematical Physics. arXiv admin note: text overlap with arXiv:2011.14270

    MSC Class: 60G15; 60K35; 82B44; 82D30

  15. arXiv:2108.04539  [pdf, other

    cs.CL

    BROS: A Pre-trained Language Model Focusing on Text and Layout for Better Key Information Extraction from Documents

    Authors: Teakgyu Hong, Donghyun Kim, Mingi Ji, Wonseok Hwang, Daehyun Nam, Sungrae Park

    Abstract: Key information extraction (KIE) from document images requires understanding the contextual and spatial semantics of texts in two-dimensional (2D) space. Many recent studies try to solve the task by developing pre-trained language models focusing on combining visual features from document images with texts and their layout. On the other hand, this paper tackles the problem by going back to the bas… ▽ More

    Submitted 5 April, 2022; v1 submitted 10 August, 2021; originally announced August 2021.

    Comments: AAAI 2022 - Main Technical Track

  16. arXiv:2105.11366  [pdf, other

    cs.LG

    GMAC: A Distributional Perspective on Actor-Critic Framework

    Authors: Daniel Wontae Nam, Younghoon Kim, Chan Y. Park

    Abstract: In this paper, we devise a distributional framework on actor-critic as a solution to distributional instability, action type restriction, and conflation between samples and statistics. We propose a new method that minimizes the Cramér distance with the multi-step Bellman target distribution generated from a novel Sample-Replacement algorithm denoted SR($λ$), which learns the correct value distribu… ▽ More

    Submitted 15 July, 2021; v1 submitted 24 May, 2021; originally announced May 2021.

    Journal ref: Proceedings of the 38th International Conference on Machine Learning, PMLR 139:7927-7936, 2021

  17. arXiv:2007.09629  [pdf, other

    cs.CV

    Character Region Attention For Text Spotting

    Authors: Youngmin Baek, Seung Shin, Jeonghun Baek, Sungrae Park, Junyeop Lee, Daehyun Nam, Hwalsuk Lee

    Abstract: A scene text spotter is composed of text detection and recognition modules. Many studies have been conducted to unify these modules into an end-to-end trainable model to achieve better performance. A typical architecture places detection and recognition modules into separate branches, and a RoI pooling is commonly used to let the branches share a visual feature. However, there still exists a chanc… ▽ More

    Submitted 19 July, 2020; originally announced July 2020.

    Comments: 17 pages, 9 figures, Accepted by ECCV 2020

  18. arXiv:2006.06244  [pdf, other

    cs.CV

    CLEval: Character-Level Evaluation for Text Detection and Recognition Tasks

    Authors: Youngmin Baek, Daehyun Nam, Sungrae Park, Junyeop Lee, Seung Shin, Jeonghun Baek, Chae Young Lee, Hwalsuk Lee

    Abstract: Despite the recent success of text detection and recognition methods, existing evaluation metrics fail to provide a fair and reliable comparison among those methods. In addition, there exists no end-to-end evaluation metric that takes characteristics of OCR tasks into account. Previous end-to-end metric contains cascaded errors from the binary scoring process applied in both detection and recognit… ▽ More

    Submitted 11 June, 2020; originally announced June 2020.

    Comments: 12 pages, 8 figures

  19. 3D Display Calibration by Visual Pattern Analysis

    Authors: Hyoseok Hwang, Hyun Sung Chang, Dongkyung Nam, In So Kweon

    Abstract: Nearly all 3D displays need calibration for correct rendering. More often than not, the optical elements in a 3D display are misaligned from the designed parameter setting. As a result, 3D magic does not perform well as intended. The observed images tend to get distorted. In this paper, we propose a novel display calibration method to fix the situation. In our method, a pattern image is displayed… ▽ More

    Submitted 22 June, 2016; originally announced June 2016.

    Comments: 13 pages, 10 figures.submitted to IEEE Transactions on Image Processing