Zum Hauptinhalt springen

Showing 1–6 of 6 results for author: Kim, D J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.01585  [pdf, other

    cs.SE cs.AI

    OpenLogParser: Unsupervised Parsing with Open-Source Large Language Models

    Authors: Zeyang Ma, Dong Jae Kim, Tse-Hsun Chen

    Abstract: Log parsing is a critical step that transforms unstructured log data into structured formats, facilitating subsequent log-based analysis. Traditional syntax-based log parsers are efficient and effective, but they often experience decreased accuracy when processing logs that deviate from the predefined rules. Recently, large language models (LLM) based log parsers have shown superior parsing accura… ▽ More

    Submitted 2 August, 2024; originally announced August 2024.

  2. arXiv:2406.12806  [pdf, other

    cs.SE cs.AI

    Identifying Performance-Sensitive Configurations in Software Systems through Code Analysis with LLM Agents

    Authors: Zehao Wang, Dong Jae Kim, Tse-Hsun Chen

    Abstract: Configuration settings are essential for tailoring software behavior to meet specific performance requirements. However, incorrect configurations are widespread, and identifying those that impact system performance is challenging due to the vast number and complexity of possible settings. In this work, we present PerfSense, a lightweight framework that leverages Large Language Models (LLMs) to eff… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

  3. LLMParser: An Exploratory Study on Using Large Language Models for Log Parsing

    Authors: Zeyang Ma, An Ran Chen, Dong Jae Kim, Tse-Hsun Chen, Shaowei Wang

    Abstract: Logs are important in modern software development with runtime information. Log parsing is the first step in many log-based analyses, that involve extracting structured information from unstructured log data. Traditional log parsers face challenges in accurately parsing logs due to the diversity of log formats, which directly impacts the performance of downstream log-analysis tasks. In this paper,… ▽ More

    Submitted 27 April, 2024; originally announced April 2024.

  4. arXiv:2404.04496  [pdf, other

    cs.SE

    Towards Better Graph Neural Network-based Fault Localization Through Enhanced Code Representation

    Authors: Md Nakhla Rafi, Dong Jae Kim, An Ran Chen, Tse-Hsun Chen, Shaowei Wang

    Abstract: Automatic software fault localization plays an important role in software quality assurance by pinpointing faulty locations for easier debugging. Coverage-based fault localization, a widely used technique, employs statistics on coverage spectra to rank code based on suspiciousness scores. However, the rigidity of statistical approaches calls for learning-based techniques. Amongst all, Grace, a gra… ▽ More

    Submitted 30 April, 2024; v1 submitted 6 April, 2024; originally announced April 2024.

  5. arXiv:2403.15852  [pdf, other

    cs.SE cs.AI

    When LLM-based Code Generation Meets the Software Development Process

    Authors: Feng Lin, Dong Jae Kim, Tse-Husn, Chen

    Abstract: Software process models play a pivotal role in fostering collaboration and communication within software teams, enabling them to tackle intricate development tasks effectively. This paper introduces LCG, a code generation framework inspired by established software engineering practices. LCG leverages multiple Large Language Model (LLM) agents to emulate various software process models, namely LCGW… ▽ More

    Submitted 23 March, 2024; originally announced March 2024.

  6. arXiv:2210.09394  [pdf

    cs.AI cs.LG

    Review Learning: Alleviating Catastrophic Forgetting with Generative Replay without Generator

    Authors: Jaesung Yoo, Sunghyuk Choi, Ye Seul Yang, Suhyeon Kim, Jieun Choi, Dongkyeong Lim, Yaeji Lim, Hyung Joon Joo, Dae Jung Kim, Rae Woong Park, Hyeong-Jin Yoon, Kwangsoo Kim

    Abstract: When a deep learning model is sequentially trained on different datasets, it forgets the knowledge acquired from previous data, a phenomenon known as catastrophic forgetting. It deteriorates performance of the deep learning model on diverse datasets, which is critical in privacy-preserving deep learning (PPDL) applications based on transfer learning (TL). To overcome this, we propose review learni… ▽ More

    Submitted 17 October, 2022; originally announced October 2022.