Skip to main content

Showing 1–24 of 24 results for author: Park, J C

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.03627  [pdf, other

    cs.CL

    DSLR: Document Refinement with Sentence-Level Re-ranking and Reconstruction to Enhance Retrieval-Augmented Generation

    Authors: Taeho Hwang, Soyeong Jeong, Sukmin Cho, SeungYoon Han, Jong C. Park

    Abstract: Recent advancements in Large Language Models (LLMs) have significantly improved their performance across various Natural Language Processing (NLP) tasks. However, LLMs still struggle with generating non-factual responses due to limitations in their parametric memory. Retrieval-Augmented Generation (RAG) systems address this issue by incorporating external knowledge with a retrieval module. Despite… ▽ More

    Submitted 7 July, 2024; v1 submitted 4 July, 2024; originally announced July 2024.

    Journal ref: KnowledgeNLP@ACL 2024

  2. arXiv:2407.02854  [pdf, other

    cs.CL cs.CV

    Universal Gloss-level Representation for Gloss-free Sign Language Translation and Production

    Authors: Eui Jun Hwang, Sukmin Cho, Huije Lee, Youngwoo Yoon, Jong C. Park

    Abstract: Sign language, essential for the deaf and hard-of-hearing, presents unique challenges in translation and production due to its multimodal nature and the inherent ambiguity in mapping sign language motion to spoken language words. Previous methods often rely on gloss annotations, requiring time-intensive labor and specialized expertise in sign language. Gloss-free methods have emerged to address th… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: 14 pages, 5 figures

  3. arXiv:2406.16013  [pdf, other

    cs.CL cs.AI cs.IR

    Database-Augmented Query Representation for Information Retrieval

    Authors: Soyeong Jeong, Jinheon Baek, Sukmin Cho, Sung Ju Hwang, Jong C. Park

    Abstract: Information retrieval models that aim to search for the documents relevant to the given query have shown many successes, which have been applied to diverse tasks. However, the query provided by the user is oftentimes very short, which challenges the retrievers to correctly fetch relevant documents. To tackle this, existing studies have proposed expanding the query with a couple of additional (user… ▽ More

    Submitted 23 June, 2024; originally announced June 2024.

  4. arXiv:2406.09719  [pdf, other

    cs.CL cs.AI

    Self-Knowledge Distillation for Learning Ambiguity

    Authors: Hancheol Park, Soyeong Jeong, Sukmin Cho, Jong C. Park

    Abstract: Recent language models have shown remarkable performance on natural language understanding (NLU) tasks. However, they are often sub-optimal when faced with ambiguous samples that can be interpreted in multiple ways, over-confidently predicting a single label without consideration for its correctness. To address this issue, we propose a novel self-knowledge distillation method that enables models t… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: 9 pages, 5 figures

  5. arXiv:2406.04064  [pdf, other

    cs.CL cs.AI cs.CY

    Ask LLMs Directly, "What shapes your bias?": Measuring Social Bias in Large Language Models

    Authors: Jisu Shin, Hoyun Song, Huije Lee, Soyeong Jeong, Jong C. Park

    Abstract: Social bias is shaped by the accumulation of social perceptions towards targets across various demographic identities. To fully understand such social bias in large language models (LLMs), it is essential to consider the composite of social perceptions from diverse perspectives among identities. Previous studies have either evaluated biases in LLMs by indirectly assessing the presence of sentiment… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: Findings of ACL 2024

  6. arXiv:2404.13948  [pdf, other

    cs.CL

    Typos that Broke the RAG's Back: Genetic Attack on RAG Pipeline by Simulating Documents in the Wild via Low-level Perturbations

    Authors: Sukmin Cho, Soyeong Jeong, Jeongyeon Seo, Taeho Hwang, Jong C. Park

    Abstract: The robustness of recent Large Language Models (LLMs) has become increasingly crucial as their applicability expands across various domains and real-world applications. Retrieval-Augmented Generation (RAG) is a promising solution for addressing the limitations of LLMs, yet existing studies on the robustness of RAG often overlook the interconnected relationships between RAG components or the potent… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

    Comments: Under Review

  7. arXiv:2403.14403  [pdf, other

    cs.CL cs.AI

    Adaptive-RAG: Learning to Adapt Retrieval-Augmented Large Language Models through Question Complexity

    Authors: Soyeong Jeong, Jinheon Baek, Sukmin Cho, Sung Ju Hwang, Jong C. Park

    Abstract: Retrieval-Augmented Large Language Models (LLMs), which incorporate the non-parametric knowledge from external knowledge bases into LLMs, have emerged as a promising approach to enhancing response accuracy in several tasks, such as Question-Answering (QA). However, even though there are various approaches dealing with queries of different complexities, they either handle simple queries with unnece… ▽ More

    Submitted 28 March, 2024; v1 submitted 21 March, 2024; originally announced March 2024.

    Comments: NAACL 2024

  8. arXiv:2310.17490  [pdf, other

    cs.CL cs.AI

    Improving Zero-shot Reader by Reducing Distractions from Irrelevant Documents in Open-Domain Question Answering

    Authors: Sukmin Cho, Jeongyeon Seo, Soyeong Jeong, Jong C. Park

    Abstract: Large language models (LLMs) enable zero-shot approaches in open-domain question answering (ODQA), yet with limited advancements as the reader is compared to the retriever. This study aims at the feasibility of a zero-shot reader that addresses the challenges of computational cost and the need for labeled data. We find that LLMs are distracted due to irrelevant documents in the retrieved set and t… ▽ More

    Submitted 14 November, 2023; v1 submitted 26 October, 2023; originally announced October 2023.

    Comments: Findings of EMNLP 2023 Camera Ready

  9. arXiv:2310.13307  [pdf, other

    cs.CL cs.LG

    Test-Time Self-Adaptive Small Language Models for Question Answering

    Authors: Soyeong Jeong, Jinheon Baek, Sukmin Cho, Sung Ju Hwang, Jong C. Park

    Abstract: Recent instruction-finetuned large language models (LMs) have achieved notable performances in various tasks, such as question-answering (QA). However, despite their ability to memorize a vast amount of general knowledge across diverse tasks, they might be suboptimal on specific tasks due to their limited capacity to transfer and adapt knowledge to target tasks. Moreover, further finetuning LMs wi… ▽ More

    Submitted 20 October, 2023; originally announced October 2023.

    Comments: EMNLP Findings 2023

  10. arXiv:2310.12836  [pdf, other

    cs.CL cs.LG

    Knowledge-Augmented Language Model Verification

    Authors: Jinheon Baek, Soyeong Jeong, Minki Kang, Jong C. Park, Sung Ju Hwang

    Abstract: Recent Language Models (LMs) have shown impressive capabilities in generating texts with the knowledge internalized in parameters. Yet, LMs often generate the factually incorrect responses to the given queries, since their knowledge may be inaccurate, incomplete, and outdated. To address this problem, previous works propose to augment LMs with the knowledge retrieved from an external knowledge sou… ▽ More

    Submitted 19 October, 2023; originally announced October 2023.

    Comments: EMNLP 2023

  11. arXiv:2309.12179  [pdf, other

    cs.CV

    Autoregressive Sign Language Production: A Gloss-Free Approach with Discrete Representations

    Authors: Eui Jun Hwang, Huije Lee, Jong C. Park

    Abstract: Gloss-free Sign Language Production (SLP) offers a direct translation of spoken language sentences into sign language, bypassing the need for gloss intermediaries. This paper presents the Sign language Vector Quantization Network, a novel approach to SLP that leverages Vector Quantization to derive discrete representations from sign pose sequences. Our method, rooted in both manual and non-manual… ▽ More

    Submitted 8 June, 2024; v1 submitted 21 September, 2023; originally announced September 2023.

    Comments: 5 pages, 3 figures, 6 tables

  12. arXiv:2306.07061  [pdf, other

    cs.CL cs.AI cs.LG

    Deep Model Compression Also Helps Models Capture Ambiguity

    Authors: Hancheol Park, Jong C. Park

    Abstract: Natural language understanding (NLU) tasks face a non-trivial amount of ambiguous samples where veracity of their labels is debatable among annotators. NLU models should thus account for such ambiguity, but they approximate the human opinion distributions quite poorly and tend to produce over-confident predictions. To address this problem, we must consider how to exactly capture the degree of rela… ▽ More

    Submitted 12 June, 2023; originally announced June 2023.

    Comments: ACL 2023

  13. arXiv:2306.04293  [pdf, other

    cs.CL cs.IR cs.LG

    Phrase Retrieval for Open-Domain Conversational Question Answering with Conversational Dependency Modeling via Contrastive Learning

    Authors: Soyeong Jeong, Jinheon Baek, Sung Ju Hwang, Jong C. Park

    Abstract: Open-Domain Conversational Question Answering (ODConvQA) aims at answering questions through a multi-turn conversation based on a retriever-reader pipeline, which retrieves passages and then predicts answers with them. However, such a pipeline approach not only makes the reader vulnerable to the errors propagated from the retriever, but also demands additional effort to develop both the retriever… ▽ More

    Submitted 7 June, 2023; originally announced June 2023.

    Comments: Findings of ACL 2023

  14. arXiv:2306.02955  [pdf, other

    cs.CL cs.LG

    A Simple and Flexible Modeling for Mental Disorder Detection by Learning from Clinical Questionnaires

    Authors: Hoyun Song, Jisu Shin, Huije Lee, Jong C. Park

    Abstract: Social media is one of the most highly sought resources for analyzing characteristics of the language by its users. In particular, many researchers utilized various linguistic features of mental health problems from social media. However, existing approaches to detecting mental disorders face critical challenges, such as the scarcity of high-quality data or the trade-off between addressing the com… ▽ More

    Submitted 5 June, 2023; originally announced June 2023.

    Comments: ACL 2023, 15 pages, 11 tables, 4 figures

  15. arXiv:2305.13729  [pdf, other

    cs.IR cs.AI cs.CL

    Discrete Prompt Optimization via Constrained Generation for Zero-shot Re-ranker

    Authors: Sukmin Cho, Soyeong Jeong, Jeongyeon Seo, Jong C. Park

    Abstract: Re-rankers, which order retrieved documents with respect to the relevance score on the given query, have gained attention for the information retrieval (IR) task. Rather than fine-tuning the pre-trained language model (PLM), the large-scale language model (LLM) is utilized as a zero-shot re-ranker with excellent results. While LLM is highly dependent on the prompts, the impact and the optimization… ▽ More

    Submitted 23 May, 2023; originally announced May 2023.

    Comments: Findings of ACL 2023 Camera Ready

  16. arXiv:2302.05137  [pdf, other

    cs.CL cs.AI cs.IR

    Realistic Conversational Question Answering with Answer Selection based on Calibrated Confidence and Uncertainty Measurement

    Authors: Soyeong Jeong, Jinheon Baek, Sung Ju Hwang, Jong C. Park

    Abstract: Conversational Question Answering (ConvQA) models aim at answering a question with its relevant paragraph and previous question-answer pairs that occurred during conversation multiple times. To apply such models to a real-world scenario, some existing work uses predicted answers, instead of unavailable ground-truth answers, as the conversation history for inference. However, since these models usu… ▽ More

    Submitted 10 February, 2023; originally announced February 2023.

    Comments: EACL 2023

  17. arXiv:2208.06183  [pdf, other

    cs.LG cs.CL

    Non-Autoregressive Sign Language Production via Knowledge Distillation

    Authors: Eui Jun Hwang, Jung Ho Kim, Suk Min Cho, Jong C. Park

    Abstract: Sign Language Production (SLP) aims to translate expressions in spoken language into corresponding ones in sign language, such as skeleton-based sign poses or videos. Existing SLP models are either AutoRegressive (AR) or Non-Autoregressive (NAR). However, AR-SLP models suffer from regression to the mean and error propagation during decoding. NSLP-G, a NAR-based model, resolves these issues to some… ▽ More

    Submitted 12 August, 2022; originally announced August 2022.

    Comments: 10 pages, 4 figures, 3 tables, submitted to ECCV2023

  18. arXiv:2208.00176  [pdf, other

    cs.CL

    ELF22: A Context-based Counter Trolling Dataset to Combat Internet Trolls

    Authors: Huije Lee, Young Ju NA, Hoyun Song, Jisu Shin, Jong C. Park

    Abstract: Online trolls increase social costs and cause psychological damage to individuals. With the proliferation of automated accounts making use of bots for trolling, it is difficult for targeted individual users to handle the situation both quantitatively and qualitatively. To address this issue, we focus on automating the method to counter trolls, as counter responses to combat trolls encourage commun… ▽ More

    Submitted 7 September, 2022; v1 submitted 30 July, 2022; originally announced August 2022.

    Comments: Accepted for LREC 2022

  19. arXiv:2203.07735  [pdf, other

    cs.IR cs.AI cs.LG

    Augmenting Document Representations for Dense Retrieval with Interpolation and Perturbation

    Authors: Soyeong Jeong, Jinheon Baek, Sukmin Cho, Sung Ju Hwang, Jong C. Park

    Abstract: Dense retrieval models, which aim at retrieving the most relevant document for an input query on a dense representation space, have gained considerable attention for their remarkable success. Yet, dense models require a vast amount of labeled training data for notable performance, whereas it is often challenging to acquire query-document pairs annotated by humans. To tackle this problem, we propos… ▽ More

    Submitted 16 March, 2022; v1 submitted 15 March, 2022; originally announced March 2022.

    Comments: ACL 2022

  20. arXiv:2202.03978  [pdf

    cs.CV physics.med-ph

    Segmentation by Test-Time Optimization (TTO) for CBCT-based Adaptive Radiation Therapy

    Authors: Xiao Liang, Jaehee Chun, Howard Morgan, Ti Bai, Dan Nguyen, Justin C. Park, Steve Jiang

    Abstract: Online adaptive radiotherapy (ART) requires accurate and efficient auto-segmentation of target volumes and organs-at-risk (OARs) in mostly cone-beam computed tomography (CBCT) images. Propagating expert-drawn contours from the pre-treatment planning CT (pCT) through traditional or deep learning (DL) based deformable image registration (DIR) can achieve improved results in many situations. Typical… ▽ More

    Submitted 8 February, 2022; originally announced February 2022.

  21. arXiv:2108.08016  [pdf, ps, other

    cs.IT eess.SP

    Low-Complexity Algorithm for Outage Optimal Resource Allocation in Energy Harvesting-Based UAV Identification Networks

    Authors: Jae Cheol Park, Kyu-Min Kang, Junil Choi

    Abstract: We study an unmanned aerial vehicle (UAV) identification network equipped with an energy harvesting (EH) technique. In the network, the UAVs harvest energy through radio frequency (RF) signals transmitted from ground control stations (GCSs) and then transmit their identification information to the ground receiver station (GRS). Specifically, we first derive a closed-form expression of the outage p… ▽ More

    Submitted 21 August, 2021; v1 submitted 18 August, 2021; originally announced August 2021.

    Comments: 5 pages, 4 figures, accepted to IEEE Communications Letters, Aug. 2021

  22. arXiv:2105.00666  [pdf, other

    cs.IR

    Unsupervised Document Expansion for Information Retrieval with Stochastic Text Generation

    Authors: Soyeong Jeong, Jinheon Baek, ChaeHun Park, Jong C. Park

    Abstract: One of the challenges in information retrieval (IR) is the vocabulary mismatch problem, which happens when the terms between queries and documents are lexically different but semantically similar. While recent work has proposed to expand the queries or documents by enriching their representations with additional relevant terms to address this challenge, they usually require a large volume of query… ▽ More

    Submitted 14 October, 2021; v1 submitted 3 May, 2021; originally announced May 2021.

    Comments: SDP@NAACL2021

  23. arXiv:2104.11401  [pdf

    cs.LG cs.CV eess.IV

    Intentional Deep Overfit Learning (IDOL): A Novel Deep Learning Strategy for Adaptive Radiation Therapy

    Authors: Jaehee Chun, Justin C. Park, Sven Olberg, You Zhang, Dan Nguyen, Jing Wang, Jin Sung Kim, Steve Jiang

    Abstract: In this study, we propose a tailored DL framework for patient-specific performance that leverages the behavior of a model intentionally overfitted to a patient-specific training dataset augmented from the prior information available in an ART workflow - an approach we term Intentional Deep Overfit Learning (IDOL). Implementing the IDOL framework in any task in radiotherapy consists of two training… ▽ More

    Submitted 22 April, 2021; originally announced April 2021.

  24. arXiv:cmp-lg/9505027  [pdf, ps

    cs.CL

    Quantifier Scope and Constituency

    Authors: Jong C. Park

    Abstract: Traditional approaches to quantifier scope typically need stipulation to exclude readings that are unavailable to human understanders. This paper shows that quantifier scope phenomena can be precisely characterized by a semantic representation constrained by surface constituency, if the distinction between referential and quantificational NPs is properly observed. A CCG implementation is describ… ▽ More

    Submitted 11 May, 1995; originally announced May 1995.

    Comments: 8 pages, compressed and uuencoded postscript file, ACL-95