Skip to main content

Showing 1–50 of 122 results for author: Jeong, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.09779  [pdf, other

    cs.CV cs.AI

    Layout-and-Retouch: A Dual-stage Framework for Improving Diversity in Personalized Image Generation

    Authors: Kangyeol Kim, Wooseok Seo, Sehyun Nam, Bodam Kim, Suhyeon Jeong, Wonwoo Cho, Jaegul Choo, Youngjae Yu

    Abstract: Personalized text-to-image (P-T2I) generation aims to create new, text-guided images featuring the personalized subject with a few reference images. However, balancing the trade-off relationship between prompt fidelity and identity preservation remains a critical challenge. To address the issue, we propose a novel P-T2I method called Layout-and-Retouch, consisting of two stages: 1) layout generati… ▽ More

    Submitted 13 July, 2024; originally announced July 2024.

  2. arXiv:2407.03627  [pdf, other

    cs.CL

    DSLR: Document Refinement with Sentence-Level Re-ranking and Reconstruction to Enhance Retrieval-Augmented Generation

    Authors: Taeho Hwang, Soyeong Jeong, Sukmin Cho, SeungYoon Han, Jong C. Park

    Abstract: Recent advancements in Large Language Models (LLMs) have significantly improved their performance across various Natural Language Processing (NLP) tasks. However, LLMs still struggle with generating non-factual responses due to limitations in their parametric memory. Retrieval-Augmented Generation (RAG) systems address this issue by incorporating external knowledge with a retrieval module. Despite… ▽ More

    Submitted 7 July, 2024; v1 submitted 4 July, 2024; originally announced July 2024.

    Journal ref: KnowledgeNLP@ACL 2024

  3. arXiv:2407.02957  [pdf, other

    cs.RO

    Past, Present, and Future: A Survey of The Evolution of Affective Robotics For Well-being

    Authors: Micol Spitale, Minja Axelsson, Sooyeon Jeong, Paige Tuttosı, Caitlin A. Stamatis, Guy Laban, Angelica Lim, Hatice Gune

    Abstract: Recent research in affective robots has recognized their potential in supporting human well-being. Due to rapidly developing affective and artificial intelligence technologies, this field of research has undergone explosive expansion and advancement in recent years. In order to develop a deeper understanding of recent advancements, we present a systematic review of the past 10 years of research in… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

  4. Concept Lens: Visually Analyzing the Consistency of Semantic Manipulation in GANs

    Authors: Sangwon Jeong, Mingwei Li, Matthew Berger, Shusen Liu

    Abstract: As applications of generative AI become mainstream, it is important to understand what generative models are capable of producing, and the extent to which one can predictably control their outputs. In this paper, we propose a visualization design, named Concept Lens, for jointly navigating the data distribution of a generative model, and concept manipulations supported by the model. Our work is fo… ▽ More

    Submitted 28 June, 2024; originally announced June 2024.

    Journal ref: 2023 IEEE Visualization and Visual Analytics (VIS), Melbourne, Australia, 2023, pp. 221-225

  5. arXiv:2406.16013  [pdf, other

    cs.CL cs.AI cs.IR

    Database-Augmented Query Representation for Information Retrieval

    Authors: Soyeong Jeong, Jinheon Baek, Sukmin Cho, Sung Ju Hwang, Jong C. Park

    Abstract: Information retrieval models that aim to search for the documents relevant to the given query have shown many successes, which have been applied to diverse tasks. However, the query provided by the user is oftentimes very short, which challenges the retrievers to correctly fetch relevant documents. To tackle this, existing studies have proposed expanding the query with a couple of additional (user… ▽ More

    Submitted 23 June, 2024; originally announced June 2024.

  6. arXiv:2406.15634  [pdf, other

    cs.GR

    Text-based Transfer Function Design for Semantic Volume Rendering

    Authors: Sangwon Jeong, Jixian Li, Christopher Johnson, Shusen Liu, Matthew Berger

    Abstract: Transfer function design is crucial in volume rendering, as it directly influences the visual representation and interpretation of volumetric data. However, creating effective transfer functions that align with users' visual objectives is often challenging due to the complex parameter space and the semantic gap between transfer function values and features of interest within the volume. In this wo… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

  7. arXiv:2406.09719  [pdf, other

    cs.CL cs.AI

    Self-Knowledge Distillation for Learning Ambiguity

    Authors: Hancheol Park, Soyeong Jeong, Sukmin Cho, Jong C. Park

    Abstract: Recent language models have shown remarkable performance on natural language understanding (NLU) tasks. However, they are often sub-optimal when faced with ambiguous samples that can be interpreted in multiple ways, over-confidently predicting a single label without consideration for its correctness. To address this issue, we propose a novel self-knowledge distillation method that enables models t… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: 9 pages, 5 figures

  8. arXiv:2406.09188  [pdf, ps, other

    cs.CV cs.IR

    Reducing Task Discrepancy of Text Encoders for Zero-Shot Composed Image Retrieval

    Authors: Jaeseok Byun, Seokhyeon Jeong, Wonjae Kim, Sanghyuk Chun, Taesup Moon

    Abstract: Composed Image Retrieval (CIR) aims to retrieve a target image based on a reference image and conditioning text, enabling controllable searches. Due to the expensive dataset construction cost for CIR triplets, a zero-shot (ZS) CIR setting has been actively studied to eliminate the need for human-collected triplet datasets. The mainstream of ZS-CIR employs an efficient projection module that projec… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: 17 pages

  9. arXiv:2406.05967  [pdf, other

    cs.CV cs.AI cs.CL cs.LG

    CVQA: Culturally-diverse Multilingual Visual Question Answering Benchmark

    Authors: David Romero, Chenyang Lyu, Haryo Akbarianto Wibowo, Teresa Lynn, Injy Hamed, Aditya Nanda Kishore, Aishik Mandal, Alina Dragonetti, Artem Abzaliev, Atnafu Lambebo Tonja, Bontu Fufa Balcha, Chenxi Whitehouse, Christian Salamea, Dan John Velasco, David Ifeoluwa Adelani, David Le Meur, Emilio Villa-Cueva, Fajri Koto, Fauzan Farooqui, Frederico Belcavello, Ganzorig Batnasan, Gisela Vallejo, Grainne Caulfield, Guido Ivetta, Haiyue Song , et al. (50 additional authors not shown)

    Abstract: Visual Question Answering (VQA) is an important task in multimodal AI, and it is often used to test the ability of vision-language models to understand and reason on knowledge present in both visual and textual data. However, most of the current VQA models use datasets that are primarily focused on English and a few major world languages, with images that are typically Western-centric. While recen… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

  10. arXiv:2406.04064  [pdf, other

    cs.CL cs.AI cs.CY

    Ask LLMs Directly, "What shapes your bias?": Measuring Social Bias in Large Language Models

    Authors: Jisu Shin, Hoyun Song, Huije Lee, Soyeong Jeong, Jong C. Park

    Abstract: Social bias is shaped by the accumulation of social perceptions towards targets across various demographic identities. To fully understand such social bias in large language models (LLMs), it is essential to consider the composite of social perceptions from diverse perspectives among identities. Previous studies have either evaluated biases in LLMs by indirectly assessing the presence of sentiment… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: Findings of ACL 2024

  11. arXiv:2405.16132  [pdf, other

    quant-ph cs.GR

    Efficient Quantum Circuit Encoding of Object Information in 2D Ray Casting

    Authors: Seungjae Lee, Suhui Jeong, Jiwon Seo

    Abstract: Quantum computing holds the potential to solve problems that are practically unsolvable by classical computers due to its ability to significantly reduce time complexity. We aim to harness this potential to enhance ray casting, a pivotal technique in computer graphics for simplifying the rendering of 3D objects. To perform ray casting in a quantum computer, we need to encode the defining parameter… ▽ More

    Submitted 25 May, 2024; originally announced May 2024.

    Comments: Submitted to ITC-CSCC 2024

  12. arXiv:2405.13081  [pdf

    cs.HC cs.AI

    Children's Mental Models of Generative Visual and Text Based AI Models

    Authors: Eliza Kosoy, Soojin Jeong, Anoop Sinha, Alison Gopnik, Tanya Kraljic

    Abstract: In this work we investigate how children ages 5-12 perceive, understand, and use generative AI models such as a text-based LLMs ChatGPT and a visual-based model DALL-E. Generative AI is newly being used widely since chatGPT. Children are also building mental models of generative AI. Those haven't been studied before and it is also the case that the children's models are dynamic as they use the too… ▽ More

    Submitted 21 May, 2024; originally announced May 2024.

    Comments: 7 pages, 6 figures

  13. arXiv:2404.19336  [pdf

    cs.AI cs.PL

    Improving LLM Classification of Logical Errors by Integrating Error Relationship into Prompts

    Authors: Yanggyu Lee, Suchae Jeong, Jihie Kim

    Abstract: LLMs trained in the understanding of programming syntax are now providing effective assistance to developers and are being used in programming education such as in generation of coding problem examples or providing code explanations. A key aspect of programming education is understanding and dealing with error message. However, 'logical errors' in which the program operates against the programmer'… ▽ More

    Submitted 1 May, 2024; v1 submitted 30 April, 2024; originally announced April 2024.

    Comments: Accepted in ITS 2024

  14. arXiv:2404.19007  [pdf, other

    cs.CL cs.AI cs.CY

    How Did We Get Here? Summarizing Conversation Dynamics

    Authors: Yilun Hua, Nicholas Chernogor, Yuzhe Gu, Seoyeon Julie Jeong, Miranda Luo, Cristian Danescu-Niculescu-Mizil

    Abstract: Throughout a conversation, the way participants interact with each other is in constant flux: their tones may change, they may resort to different strategies to convey their points, or they might alter their interaction patterns. An understanding of these dynamics can complement that of the actual facts and opinions discussed, offering a more holistic view of the trajectory of the conversation: ho… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

    Comments: To appear in the Proceedings of NAACL 2024. Data available in ConvoKit https://convokit.cornell.edu/

  15. arXiv:2404.18315  [pdf, ps, other

    cs.IT

    Design and Optimization of Reconfigurable Intelligent Surfaces Using the PEEC Method

    Authors: Giuseppe Pettanice, Marco Di Renzo, Roberto Valentini, Sumin Jeong, Piergiuseppe Di Marco, Fortunato Santucci, Daniele Romano, Giulio Antonini

    Abstract: The design and optimization of Reconfigurable Intelligent Surfaces (RISs) are key challenges for future wireless communication systems. RISs are devices that can manipulate electromagnetic (EM) waves in a programmable way, thus enhancing the performance and efficiency of wireless links. To achieve this goal, it is essential to have reliable EM models that can capture the behavior of RISs in differ… ▽ More

    Submitted 28 April, 2024; originally announced April 2024.

  16. arXiv:2404.18310  [pdf, ps, other

    cs.IT

    Multiport Network Modeling for Reconfigurable Intelligent Surfaces: Numerical Validation with a Full-Wave PEEC Simulator

    Authors: Giuseppe Pettanice, Marco Di Renzo, Sumin Jeong, Roberto Valentini, Piergiuseppe Di Marco, Fortunato Santucci, Daniele Romano, Giulio Antonini

    Abstract: Reconfigurable Intelligent Surface (RIS) modeling and optimization are a crucial steps in developing the next generation of wireless communications. To this aim, the availability of accurate electromagnetic (EM) models is of paramount important for the design of RIS-assisted communication links. In this work, we validate a widely-used analytical multiport network for RISs by means of a well-establ… ▽ More

    Submitted 28 April, 2024; originally announced April 2024.

  17. arXiv:2404.13948  [pdf, other

    cs.CL

    Typos that Broke the RAG's Back: Genetic Attack on RAG Pipeline by Simulating Documents in the Wild via Low-level Perturbations

    Authors: Sukmin Cho, Soyeong Jeong, Jeongyeon Seo, Taeho Hwang, Jong C. Park

    Abstract: The robustness of recent Large Language Models (LLMs) has become increasingly crucial as their applicability expands across various domains and real-world applications. Retrieval-Augmented Generation (RAG) is a promising solution for addressing the limitations of LLMs, yet existing studies on the robustness of RAG often overlook the interconnected relationships between RAG components or the potent… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

    Comments: Under Review

  18. arXiv:2404.11025  [pdf, other

    cs.CV

    NeuroHash: A Hyperdimensional Neuro-Symbolic Framework for Spatially-Aware Image Hashing and Retrieval

    Authors: Sanggeon Yun, Ryozo Masukawa, SungHeon Jeong, Mohsen Imani

    Abstract: Customizable image retrieval from large datasets remains a critical challenge, particularly when preserving spatial relationships within images. Traditional hashing methods, primarily based on deep learning, often fail to capture spatial information adequately and lack transparency. In this paper, we introduce NeuroHash, a novel neuro-symbolic framework leveraging Hyperdimensional Computing (HDC)… ▽ More

    Submitted 22 May, 2024; v1 submitted 16 April, 2024; originally announced April 2024.

  19. Hyperbolic Heterogeneous Graph Attention Networks

    Authors: Jongmin Park, Seunghoon Han, Soohwan Jeong, Sungsu Lim

    Abstract: Most previous heterogeneous graph embedding models represent elements in a heterogeneous graph as vector representations in a low-dimensional Euclidean space. However, because heterogeneous graphs inherently possess complex structures, such as hierarchical or power-law structures, distortions can occur when representing them in Euclidean space. To overcome this limitation, we propose Hyperbolic He… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: Accepted in ACM THE WEB CONFERENCE 2024 short paper track

  20. arXiv:2403.19200  [pdf, other

    cs.IT eess.SP

    Cell-Free MIMO Perceptive Mobile Networks: Cloud vs. Edge Processing

    Authors: Seongah Jeong, Jinkyu Kang, Osvaldo Simeone, Shlomo Shamai

    Abstract: Perceptive mobile networks implement sensing and communication by reusing existing cellular infrastructure. Cell-free multiple-input multiple-output, thanks to the cooperation among distributed access points, supports the deployment of multistatic radar sensing, while providing high spectral efficiency for data communication services. To this end, the distributed access points communicate over fro… ▽ More

    Submitted 28 March, 2024; originally announced March 2024.

    Comments: 30 pages, 11 figures

  21. arXiv:2403.15049  [pdf, other

    cs.CV cs.AI

    Continual Vision-and-Language Navigation

    Authors: Seongjun Jeong, Gi-Cheon Kang, Seongho Choi, Joochan Kim, Byoung-Tak Zhang

    Abstract: Vision-and-Language Navigation (VLN) agents navigate to a destination using natural language instructions and the visual information they observe. Existing methods for training VLN agents presuppose fixed datasets, leading to a significant limitation: the introduction of new environments necessitates retraining with previously encountered environments to preserve their knowledge. This makes it dif… ▽ More

    Submitted 22 March, 2024; originally announced March 2024.

  22. arXiv:2403.14403  [pdf, other

    cs.CL cs.AI

    Adaptive-RAG: Learning to Adapt Retrieval-Augmented Large Language Models through Question Complexity

    Authors: Soyeong Jeong, Jinheon Baek, Sukmin Cho, Sung Ju Hwang, Jong C. Park

    Abstract: Retrieval-Augmented Large Language Models (LLMs), which incorporate the non-parametric knowledge from external knowledge bases into LLMs, have emerged as a promising approach to enhancing response accuracy in several tasks, such as Question-Answering (QA). However, even though there are various approaches dealing with queries of different complexities, they either handle simple queries with unnece… ▽ More

    Submitted 28 March, 2024; v1 submitted 21 March, 2024; originally announced March 2024.

    Comments: NAACL 2024

  23. arXiv:2402.12222  [pdf, other

    cs.CR cs.CL cs.LG cs.SE

    CovRL: Fuzzing JavaScript Engines with Coverage-Guided Reinforcement Learning for LLM-based Mutation

    Authors: Jueon Eom, Seyeon Jeong, Taekyoung Kwon

    Abstract: Fuzzing is an effective bug-finding technique but it struggles with complex systems like JavaScript engines that demand precise grammatical input. Recently, researchers have adopted language models for context-aware mutation in fuzzing to address this problem. However, existing techniques are limited in utilizing coverage guidance for fuzzing, which is rather performed in a black-box manner. This… ▽ More

    Submitted 19 February, 2024; originally announced February 2024.

    Comments: 14 pages, 4 figures, 9 tables, 2 listings

    ACM Class: D.4.6; I.2.5; D.2.4

  24. arXiv:2402.07370  [pdf, other

    cs.CV cs.AI

    SelfSwapper: Self-Supervised Face Swapping via Shape Agnostic Masked AutoEncoder

    Authors: Jaeseong Lee, Junha Hyung, Sohyun Jeong, Jaegul Choo

    Abstract: Face swapping has gained significant attention for its varied applications. The majority of previous face swapping approaches have relied on the seesaw game training scheme, which often leads to the instability of the model training and results in undesired samples with blended identities due to the target identity leakage problem. This paper introduces the Shape Agnostic Masked AutoEncoder (SAMAE… ▽ More

    Submitted 11 February, 2024; originally announced February 2024.

  25. arXiv:2402.02043  [pdf, other

    cs.LG cs.AI cs.NI

    A Plug-in Tiny AI Module for Intelligent and Selective Sensor Data Transmission

    Authors: Wenjun Huang, Arghavan Rezvani, Hanning Chen, Yang Ni, Sanggeon Yun, Sungheon Jeong, Mohsen Imani

    Abstract: Applications in the Internet of Things (IoT) utilize machine learning to analyze sensor-generated data. However, a major challenge lies in the lack of targeted intelligence in current sensing systems, leading to vast data generation and increased computational and communication costs. To address this challenge, we propose a novel sensing module to equip sensing frameworks with intelligent data tra… ▽ More

    Submitted 3 February, 2024; originally announced February 2024.

    Comments: 14 pages, 6 figures

  26. arXiv:2401.17675  [pdf, ps, other

    stat.ML cs.DS cs.LG

    Convergence analysis of t-SNE as a gradient flow for point cloud on a manifold

    Authors: Seonghyeon Jeong, Hau-Tieng Wu

    Abstract: We present a theoretical foundation regarding the boundedness of the t-SNE algorithm. t-SNE employs gradient descent iteration with Kullback-Leibler (KL) divergence as the objective function, aiming to identify a set of points that closely resemble the original data points in a high-dimensional space, minimizing KL divergence. Investigating t-SNE properties such as perplexity and affinity under a… ▽ More

    Submitted 31 January, 2024; originally announced January 2024.

    MSC Class: 90C26; 90C30 ACM Class: F.2.2; F.2.0; G.4

  27. arXiv:2401.15893  [pdf, other

    cs.CV

    Arbitrary-Scale Downscaling of Tidal Current Data Using Implicit Continuous Representation

    Authors: Dongheon Lee, Seungmyong Jeong, Youngmin Ro

    Abstract: Numerical models have long been used to understand geoscientific phenomena, including tidal currents, crucial for renewable energy production and coastal engineering. However, their computational cost hinders generating data of varying resolutions. As an alternative, deep learning-based downscaling methods have gained traction due to their faster inference speeds. But most of them are limited to o… ▽ More

    Submitted 30 January, 2024; v1 submitted 29 January, 2024; originally announced January 2024.

  28. arXiv:2401.12481  [pdf, other

    cs.IT

    AIRS-assisted Vehicular Networks with Rate-Splitting SWIPT Receivers: Joint Trajectory and Communication Design

    Authors: Gyoungyoon Nam, Seokhyun Lee, Seongah Jeong

    Abstract: In this correspondence, we propose to use an intelligent reflective surface (IRS) installed on unmanned aerial vehicle (UAV), referred to as aerial IRS (AIRS), for vehicular networks, where simultaneous wireless information and power transfer (SWIPT) receivers to concurrently allow information decoding (ID) and energy harvesting (EH) are equipped at the battery-limited vehicles. For efficiently su… ▽ More

    Submitted 22 January, 2024; originally announced January 2024.

    Comments: 6 pages, 5 figures

  29. arXiv:2401.08851  [pdf

    cs.LG cs.CL cs.SD eess.AS q-bio.NC

    Using i-vectors for subject-independent cross-session EEG transfer learning

    Authors: Jonathan Lasko, Jeff Ma, Mike Nicoletti, Jonathan Sussman-Fort, Sooyoung Jeong, William Hartmann

    Abstract: Cognitive load classification is the task of automatically determining an individual's utilization of working memory resources during performance of a task based on physiologic measures such as electroencephalography (EEG). In this paper, we follow a cross-disciplinary approach, where tools and methodologies from speech processing are used to tackle this problem. The corpus we use was released pub… ▽ More

    Submitted 16 January, 2024; originally announced January 2024.

    Comments: 11 pages

  30. arXiv:2401.02710  [pdf, other

    cs.CE cs.AI

    Synergistic Formulaic Alpha Generation for Quantitative Trading based on Reinforcement Learning

    Authors: Hong-Gi Shin, Sukhyun Jeong, Eui-Yeon Kim, Sungho Hong, Young-Jin Cho, Yong-Hoon Choi

    Abstract: Mining of formulaic alpha factors refers to the process of discovering and developing specific factors or indicators (referred to as alpha factors) for quantitative trading in stock market. To efficiently discover alpha factors in vast search space, reinforcement learning (RL) is commonly employed. This paper proposes a method to enhance existing alpha factor mining approaches by expanding a searc… ▽ More

    Submitted 7 July, 2024; v1 submitted 5 January, 2024; originally announced January 2024.

    Comments: Accepted by ICOIN 2024

  31. arXiv:2401.00773  [pdf, other

    cs.LG cs.AI stat.ML

    Unsupervised Outlier Detection using Random Subspace and Subsampling Ensembles of Dirichlet Process Mixtures

    Authors: Dongwook Kim, Juyeon Park, Hee Cheol Chung, Seonghyun Jeong

    Abstract: Probabilistic mixture models are acknowledged as a valuable tool for unsupervised outlier detection owing to their interpretability and intuitive grounding in statistical principles. Within this framework, Dirichlet process mixture models emerge as a compelling alternative to conventional finite mixture models for both clustering and outlier detection tasks. However, despite their evident advantag… ▽ More

    Submitted 13 January, 2024; v1 submitted 1 January, 2024; originally announced January 2024.

  32. arXiv:2312.16233  [pdf, other

    cs.CL

    Chatbot is Not All You Need: Information-rich Prompting for More Realistic Responses

    Authors: Seokhoon Jeong, Assentay Makhmud

    Abstract: Recent Large Language Models (LLMs) have shown remarkable capabilities in mimicking fictional characters or real humans in conversational settings. However, the realism and consistency of these responses can be further enhanced by providing richer information of the agent being mimicked. In this paper, we propose a novel approach to generate more realistic and consistent responses from LLMs, lever… ▽ More

    Submitted 24 December, 2023; originally announced December 2023.

  33. arXiv:2311.18291  [pdf, other

    cs.CV

    TLDR: Text Based Last-layer Retraining for Debiasing Image Classifiers

    Authors: Juhyeon Park, Seokhyeon Jeong, Taesup Moon

    Abstract: A classifier may depend on incidental features stemming from a strong correlation between the feature and the classification target in the training dataset. Recently, Last Layer Retraining (LLR) with group-balanced datasets is known to be efficient in mitigating the spurious correlation of classifiers. However, the acquisition of group-balanced datasets is costly, which hinders the applicability o… ▽ More

    Submitted 30 November, 2023; originally announced November 2023.

    Comments: 19 pages, Under Review

  34. arXiv:2311.11470  [pdf, ps, other

    cs.CV

    1st Place in ICCV 2023 Workshop Challenge Track 1 on Resource Efficient Deep Learning for Computer Vision: Budgeted Model Training Challenge

    Authors: Youngjun Kwak, Seonghun Jeong, Yunseung Lee, Changick Kim

    Abstract: The budgeted model training challenge aims to train an efficient classification model under resource limitations. To tackle this task in ImageNet-100, we describe a simple yet effective resource-aware backbone search framework composed of profile and instantiation phases. In addition, we employ multi-resolution ensembles to boost inference accuracy on limited resources. The profile phase obeys tim… ▽ More

    Submitted 9 August, 2023; originally announced November 2023.

    Comments: ICCV 2023 Workshop Challenge Track 1 on RCV

  35. arXiv:2310.19264  [pdf, other

    cs.MM cs.SD eess.AS

    Sound of Story: Multi-modal Storytelling with Audio

    Authors: Jaeyeon Bae, Seokhoon Jeong, Seokun Kang, Namgi Han, Jae-Yon Lee, Hyounghun Kim, Taehwan Kim

    Abstract: Storytelling is multi-modal in the real world. When one tells a story, one may use all of the visualizations and sounds along with the story itself. However, prior studies on storytelling datasets and tasks have paid little attention to sound even though sound also conveys meaningful semantics of the story. Therefore, we propose to extend story understanding and telling areas by establishing a new… ▽ More

    Submitted 30 October, 2023; originally announced October 2023.

    Comments: Findings of EMNLP 2023, project: https://github.com/Sosdatasets/SoS_Dataset/

  36. arXiv:2310.17490  [pdf, other

    cs.CL cs.AI

    Improving Zero-shot Reader by Reducing Distractions from Irrelevant Documents in Open-Domain Question Answering

    Authors: Sukmin Cho, Jeongyeon Seo, Soyeong Jeong, Jong C. Park

    Abstract: Large language models (LLMs) enable zero-shot approaches in open-domain question answering (ODQA), yet with limited advancements as the reader is compared to the retriever. This study aims at the feasibility of a zero-shot reader that addresses the challenges of computational cost and the need for labeled data. We find that LLMs are distracted due to irrelevant documents in the retrieved set and t… ▽ More

    Submitted 14 November, 2023; v1 submitted 26 October, 2023; originally announced October 2023.

    Comments: Findings of EMNLP 2023 Camera Ready

  37. arXiv:2310.13307  [pdf, other

    cs.CL cs.LG

    Test-Time Self-Adaptive Small Language Models for Question Answering

    Authors: Soyeong Jeong, Jinheon Baek, Sukmin Cho, Sung Ju Hwang, Jong C. Park

    Abstract: Recent instruction-finetuned large language models (LMs) have achieved notable performances in various tasks, such as question-answering (QA). However, despite their ability to memorize a vast amount of general knowledge across diverse tasks, they might be suboptimal on specific tasks due to their limited capacity to transfer and adapt knowledge to target tasks. Moreover, further finetuning LMs wi… ▽ More

    Submitted 20 October, 2023; originally announced October 2023.

    Comments: EMNLP Findings 2023

  38. arXiv:2310.12836  [pdf, other

    cs.CL cs.LG

    Knowledge-Augmented Language Model Verification

    Authors: Jinheon Baek, Soyeong Jeong, Minki Kang, Jong C. Park, Sung Ju Hwang

    Abstract: Recent Language Models (LMs) have shown impressive capabilities in generating texts with the knowledge internalized in parameters. Yet, LMs often generate the factually incorrect responses to the given queries, since their knowledge may be inaccurate, incomplete, and outdated. To address this problem, previous works propose to augment LMs with the knowledge retrieved from an external knowledge sou… ▽ More

    Submitted 19 October, 2023; originally announced October 2023.

    Comments: EMNLP 2023

  39. arXiv:2310.08897  [pdf, other

    eess.IV cs.CV cs.LG

    Self supervised convolutional kernel based handcrafted feature harmonization: Enhanced left ventricle hypertension disease phenotyping on echocardiography

    Authors: Jina Lee, Youngtaek Hong, Dawun Jeong, Yeonggul Jang, Jaeik Jeon, Sihyeon Jeong, Taekgeun Jung, Yeonyee E. Yoon, Inki Moon, Seung-Ah Lee, Hyuk-Jae Chang

    Abstract: Radiomics, a medical imaging technique, extracts quantitative handcrafted features from images to predict diseases. Harmonization in those features ensures consistent feature extraction across various imaging devices and protocols. Methods for harmonization include standardized imaging protocols, statistical adjustments, and evaluating feature robustness. Myocardial diseases such as Left Ventricul… ▽ More

    Submitted 22 November, 2023; v1 submitted 13 October, 2023; originally announced October 2023.

    Comments: 11 pages, 7 figures

  40. arXiv:2310.03353  [pdf, other

    cs.AI cs.LG

    Deep Geometric Learning with Monotonicity Constraints for Alzheimer's Disease Progression

    Authors: Seungwoo Jeong, Wonsik Jung, Junghyo Sohn, Heung-Il Suk

    Abstract: Alzheimer's disease (AD) is a devastating neurodegenerative condition that precedes progressive and irreversible dementia; thus, predicting its progression over time is vital for clinical diagnosis and treatment. Numerous studies have implemented structural magnetic resonance imaging (MRI) to model AD progression, focusing on three integral aspects: (i) temporal variability, (ii) incomplete observ… ▽ More

    Submitted 5 October, 2023; originally announced October 2023.

  41. arXiv:2308.06053  [pdf, other

    cs.LG cs.AI cs.AR

    Cost-effective On-device Continual Learning over Memory Hierarchy with Miro

    Authors: Xinyue Ma, Suyeon Jeong, Minjia Zhang, Di Wang, Jonghyun Choi, Myeongjae Jeon

    Abstract: Continual learning (CL) trains NN models incrementally from a continuous stream of tasks. To remember previously learned knowledge, prior studies store old samples over a memory hierarchy and replay them when new tasks arrive. Edge devices that adopt CL to preserve data privacy are typically energy-sensitive and thus require high model accuracy while not compromising energy efficiency, i.e., cost-… ▽ More

    Submitted 5 December, 2023; v1 submitted 11 August, 2023; originally announced August 2023.

    Comments: This paper is published in the 29th Annual International Conference on Mobile Computing and Networking (ACM MobiCom '23)

  42. End-to-End Learnable Multi-Scale Feature Compression for VCM

    Authors: Yeongwoong Kim, Hyewon Jeong, Janghyun Yu, Younhee Kim, Jooyoung Lee, Se Yoon Jeong, Hui Yong Kim

    Abstract: The proliferation of deep learning-based machine vision applications has given rise to a new type of compression, so called video coding for machine (VCM). VCM differs from traditional video coding in that it is optimized for machine vision performance instead of human visual quality. In the feature compression track of MPEG-VCM, multi-scale features extracted from images are subject to compressio… ▽ More

    Submitted 8 August, 2023; v1 submitted 29 June, 2023; originally announced June 2023.

    Comments: 13 pages, accepted by IEEE Transactions on Circuits and Systems for Video Technology

  43. arXiv:2306.09480  [pdf, ps, other

    cs.IT eess.SP

    Optimization of RIS-Aided MIMO -- A Mutually Coupled Loaded Wire Dipole Model

    Authors: H. El Hassani, X. Qian, S. Jeong, N. S. Perović, M. Di Renzo, P. Mursia, V. Sciancalepore, X. Costa-Pérez

    Abstract: We consider a reconfigurable intelligent surface (RIS) assisted multiple-input multiple-output (MIMO) system in the presence of scattering objects. The MIMO transmitter and receiver, the RIS, and the scattering objects are modeled as mutually coupled thin wires connected to load impedances. We introduce a novel numerical algorithm for optimizing the tunable loads connected to the RIS, which does n… ▽ More

    Submitted 18 September, 2023; v1 submitted 15 June, 2023; originally announced June 2023.

  44. arXiv:2306.04293  [pdf, other

    cs.CL cs.IR cs.LG

    Phrase Retrieval for Open-Domain Conversational Question Answering with Conversational Dependency Modeling via Contrastive Learning

    Authors: Soyeong Jeong, Jinheon Baek, Sung Ju Hwang, Jong C. Park

    Abstract: Open-Domain Conversational Question Answering (ODConvQA) aims at answering questions through a multi-turn conversation based on a retriever-reader pipeline, which retrieves passages and then predicts answers with them. However, such a pipeline approach not only makes the reader vulnerable to the errors propagated from the retriever, but also demands additional effort to develop both the retriever… ▽ More

    Submitted 7 June, 2023; originally announced June 2023.

    Comments: Findings of ACL 2023

  45. Finite Element Modeling of Pneumatic Bending Actuators for Inflated-Beam Robots

    Authors: Cosima du Pasquier, Sehui Jeong, Allison M. Okamura

    Abstract: Inflated-beam soft robots, such as tip-everting vine robots, can control curvature by contracting one beam side via pneumatic actuation. This work develops a general finite element modeling approach to characterize their bending. The model is validated across four pneumatic actuator types (series, compression, embedded, and fabric pneumatic artificial muscles), and can be extended to other designs… ▽ More

    Submitted 29 September, 2023; v1 submitted 6 June, 2023; originally announced June 2023.

  46. Robust Imaging Sonar-based Place Recognition and Localization in Underwater Environments

    Authors: Hogyun Kim, Gilhwan Kang, Seokhwan Jeong, Seungjun Ma, Younggun Cho

    Abstract: Place recognition using SOund Navigation and Ranging (SONAR) images is an important task for simultaneous localization and mapping(SLAM) in underwater environments. This paper proposes a robust and efficient imaging SONAR based place recognition, SONAR context, and loop closure method. Unlike previous methods, our approach encodes geometric information based on the characteristics of raw SONAR mea… ▽ More

    Submitted 24 May, 2023; originally announced May 2023.

    Comments: 7 pages, 8 figures

  47. arXiv:2305.13779  [pdf, other

    cs.AR eess.SP

    Transceiver Design and Performance Analysis for LR-FHSS-based Direct-to-Satellite IoT

    Authors: Sooyeob Jung, Seongah Jeong, Jinkyu Kang, Joon Gyu Ryu, Joonhyuk Kang

    Abstract: This paper presents a novel transceiver design aimed at enabling Direct-to-Satellite Internet of Things (DtS-IoT) systems based on long range-frequency hopping spread spectrum (LR-FHSS). Our focus lies in developing an accurate transmission method through the analysis of the frame structure and key parameters outlined in Long Range Wide-Area Network (LoRaWAN) [1]. To address the Doppler effect in… ▽ More

    Submitted 25 May, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: 5 pages, 6 figures

    Report number: CL2023-1147

  48. arXiv:2305.13729  [pdf, other

    cs.IR cs.AI cs.CL

    Discrete Prompt Optimization via Constrained Generation for Zero-shot Re-ranker

    Authors: Sukmin Cho, Soyeong Jeong, Jeongyeon Seo, Jong C. Park

    Abstract: Re-rankers, which order retrieved documents with respect to the relevance score on the given query, have gained attention for the information retrieval (IR) task. Rather than fine-tuning the pre-trained language model (PLM), the large-scale language model (LLM) is utilized as a zero-shot re-ranker with excellent results. While LLM is highly dependent on the prompts, the impact and the optimization… ▽ More

    Submitted 23 May, 2023; originally announced May 2023.

    Comments: Findings of ACL 2023 Camera Ready

  49. arXiv:2304.00779  [pdf, other

    cs.CV cs.AI

    Probabilistic Prompt Learning for Dense Prediction

    Authors: Hyeongjun Kwon, Taeyong Song, Somi Jeong, Jin Kim, Jinhyun Jang, Kwanghoon Sohn

    Abstract: Recent progress in deterministic prompt learning has become a promising alternative to various downstream vision tasks, enabling models to learn powerful visual representations with the help of pre-trained vision-language models. However, this approach results in limited performance for dense prediction tasks that require handling more complex and diverse objects, since a single and deterministic… ▽ More

    Submitted 3 April, 2023; originally announced April 2023.

    Comments: accepted to CVPR 2023

  50. X-CANIDS: Signal-Aware Explainable Intrusion Detection System for Controller Area Network-Based In-Vehicle Network

    Authors: Seonghoon Jeong, Sangho Lee, Hwejae Lee, Huy Kang Kim

    Abstract: Controller Area Network (CAN) is an essential networking protocol that connects multiple electronic control units (ECUs) in a vehicle. However, CAN-based in-vehicle networks (IVNs) face security risks owing to the CAN mechanisms. An adversary can sabotage a vehicle by leveraging the security risks if they can access the CAN bus. Thus, recent actions and cybersecurity regulations (e.g., UNR 155) re… ▽ More

    Submitted 14 March, 2024; v1 submitted 21 March, 2023; originally announced March 2023.

    Comments: This is the Accepted version of an article for publication in IEEE TVT

    Journal ref: IEEE Transactions on Vehicular Technology, Vol. 73, No. 3, pp. 3230-3246, Mar. 2024