Zum Hauptinhalt springen

Showing 1–50 of 847 results for author: Hwang, S

.
  1. arXiv:2408.12211  [pdf, other

    cs.CV

    Computer-Aided Fall Recognition Using a Three-Stream Spatial-Temporal GCN Model with Adaptive Feature Aggregation

    Authors: Jungpil Shin, Abu Saleh Musa Miah, Rei Egawa1, Koki Hirooka, Md. Al Mehedi Hasan, Yoichi Tomioka, Yong Seok Hwang

    Abstract: The prevention of falls is paramount in modern healthcare, particularly for the elderly, as falls can lead to severe injuries or even fatalities. Additionally, the growing incidence of falls among the elderly, coupled with the urgent need to prevent suicide attempts resulting from medication overdose, underscores the critical importance of accurate and efficient fall detection methods. In this sce… ▽ More

    Submitted 22 August, 2024; originally announced August 2024.

  2. arXiv:2408.11217  [pdf, other

    cond-mat.quant-gas physics.atom-ph quant-ph

    Beyond skyrmion spin texture from quantum Kelvin-Helmholtz instability

    Authors: SeungJung Huh, Wooyoung Yun, Gabin Yun, Samgyu Hwang, Kiryang Kwon, Junhyeok Hur, Seungho Lee, Hiromitsu Takeuchi, Se Kwon Kim, Jae-yoon Choi

    Abstract: Topology profoundly influences diverse fields of science, providing a powerful framework for classifying phases of matter and predicting nontrivial excitations, such as solitons, vortices, and skyrmions. These topological defects are typically characterized by integer numbers, called topological charges, representing the winding number in their order parameter field. The classification and predict… ▽ More

    Submitted 20 August, 2024; originally announced August 2024.

    Comments: 13 pages, 5 main figures and 7 supplemental figures

  3. arXiv:2408.07557  [pdf

    cond-mat.mtrl-sci cond-mat.mes-hall

    Band-selective simulation of photoelectron intensity and converging Berry phase in trilayer graphene

    Authors: Hayoon Im, Sue Hyeon Hwang, Minhee Kang, Kyoo Kim, Haeyong Kang, Choongyu Hwang

    Abstract: Berry phase is one of the key elements to understand quantum-mechanical phenomena such as the Aharonov-Bohm effect and the unconventional Hall effect in graphene. The Berry phase in monolayer and bilayer graphene has been manifested by the anisotropic distribution of photoelectron intensity along a closed loop in the momentum space as well as its rotation by a characteristic angle upon rotating li… ▽ More

    Submitted 14 August, 2024; originally announced August 2024.

    Journal ref: Appl. Sci. Converg. Technol. 33, 91 (2024)

  4. arXiv:2408.06010  [pdf, other

    cs.CV

    DEEPTalk: Dynamic Emotion Embedding for Probabilistic Speech-Driven 3D Face Animation

    Authors: Jisoo Kim, Jungbin Cho, Joonho Park, Soonmin Hwang, Da Eun Kim, Geon Kim, Youngjae Yu

    Abstract: Speech-driven 3D facial animation has garnered lots of attention thanks to its broad range of applications. Despite recent advancements in achieving realistic lip motion, current methods fail to capture the nuanced emotional undertones conveyed through speech and produce monotonous facial motion. These limitations result in blunt and repetitive facial animations, reducing user engagement and hinde… ▽ More

    Submitted 12 August, 2024; originally announced August 2024.

    Comments: First two authors contributed equally

  5. arXiv:2408.05930  [pdf, ps, other

    cond-mat.mtrl-sci

    Evolution of the Fermi surface of 1T-VSe$_2$ across a structural phase transition

    Authors: Turgut Yilmaz, Xiao Tong, Jerzy T. Sadowski, Sooyeon Hwang, Kenneth Evans-Lutterodt, Kim Kisslinger, Elio Vescovo

    Abstract: The electronic origin of the structural transition in 1T-VSe$_2$ is re-evaluated through an extensive angle-resolved photoemission spectroscopy experiment. The components of the band structure, missing in previous reports, are revealed. Earlier observations, shown to be temperature independent and therefore not correlated with the phase transition, are explained in terms of the increased complexit… ▽ More

    Submitted 12 August, 2024; originally announced August 2024.

    Comments: 7 pages, 4 figures

  6. arXiv:2408.05917  [pdf

    cs.CE cs.AI cs.LG

    Inverse design of Non-parameterized Ventilated Acoustic Resonator via Variational Autoencoder with Acoustic Response-encoded Latent Space

    Authors: Min Woo Cho, Seok Hyeon Hwang, Jun-Young Jang, Jin Yeong Song, Sun-kwang Hwang, Kyoung Je Cha, Dong Yong Park, Kyungjun Song, Sang Min Park

    Abstract: Ventilated acoustic resonator(VAR), a type of acoustic metamaterial, emerge as an alternative for sound attenuation in environments that require ventilation, owing to its excellent low-frequency attenuation performance and flexible shape adaptability. However, due to the non-linear acoustic responses of VARs, the VAR designs are generally obtained within a limited parametrized design space, and th… ▽ More

    Submitted 12 August, 2024; originally announced August 2024.

  7. arXiv:2408.04261  [pdf, other

    cs.CV cs.AI cs.CR

    Unveiling Hidden Visual Information: A Reconstruction Attack Against Adversarial Visual Information Hiding

    Authors: Jonggyu Jang, Hyeonsu Lyu, Seongjin Hwang, Hyun Jong Yang

    Abstract: This paper investigates the security vulnerabilities of adversarial-example-based image encryption by executing data reconstruction (DR) attacks on encrypted images. A representative image encryption method is the adversarial visual information hiding (AVIH), which uses type-I adversarial example training to protect gallery datasets used in image recognition tasks. In the AVIH method, the type-I a… ▽ More

    Submitted 8 August, 2024; originally announced August 2024.

    Comments: 12 pages

  8. arXiv:2408.00994  [pdf, other

    cs.SE cs.AI cs.CL

    ArchCode: Incorporating Software Requirements in Code Generation with Large Language Models

    Authors: Hojae Han, Jaejin Kim, Jaeseok Yoo, Youngwon Lee, Seung-won Hwang

    Abstract: This paper aims to extend the code generation capability of large language models (LLMs) to automatically manage comprehensive software requirements from given textual descriptions. Such requirements include both functional (i.e. achieving expected behavior for inputs) and non-functional (e.g., time/space performance, robustness, maintainability) requirements. However, textual descriptions can eit… ▽ More

    Submitted 1 August, 2024; originally announced August 2024.

    Comments: Accepted by ACL 2024 main conference

  9. arXiv:2407.20806  [pdf, other

    cs.AI cs.LG

    ARCLE: The Abstraction and Reasoning Corpus Learning Environment for Reinforcement Learning

    Authors: Hosung Lee, Sejin Kim, Seungpil Lee, Sanha Hwang, Jihwan Lee, Byung-Jun Lee, Sundong Kim

    Abstract: This paper introduces ARCLE, an environment designed to facilitate reinforcement learning research on the Abstraction and Reasoning Corpus (ARC). Addressing this inductive reasoning benchmark with reinforcement learning presents these challenges: a vast action space, a hard-to-reach goal, and a variety of tasks. We demonstrate that an agent with proximal policy optimization can learn individual ta… ▽ More

    Submitted 30 July, 2024; originally announced July 2024.

    Comments: Accepted by CoLLAs 2024, Project page: https://github.com/confeitoHS/arcle

  10. arXiv:2407.18602  [pdf, other

    astro-ph.GA astro-ph.CO

    Testing Lyman Alpha Emitters and Lyman-Break Galaxies as Tracers of Large-Scale Structures at High Redshifts

    Authors: Sang Hyeok Im, Ho Seong Hwang, Jaehong Park, Jaehyun Lee, Hyunmi Song, Stephen Appleby, Yohan Dubois, C. Gareth Few, Brad K. Gibson, Juhan Kim, Yonghwi Kim, Changbom Park, Christophe Pichon, Jihye Shin, Owain N. Snaith, Maria Celeste Artale, Eric Gawiser, Lucia Guaita, Woong-Seob Jeong, Kyoung-Soo Lee, Nelson Padilla, Vandana Ramakrishnan, Paulina Troncoso, Yujin Yang

    Abstract: We test whether Lyman alpha emitters (LAEs) and Lyman-break galaxies (LBGs) can be good tracers of high-z large-scale structures, using the Horizon Run 5 cosmological hydrodynamical simulation. We identify LAEs using the Lyα emission line luminosity and its equivalent width, and LBGs using the broad-band magnitudes at z~2.4, 3.1, and 4.5. We first compare the spatial distributions of LAEs, LBGs, a… ▽ More

    Submitted 26 July, 2024; originally announced July 2024.

    Comments: 20 pages, 15 figures, 2 tables, accepted for publication in ApJ

  11. arXiv:2407.17843  [pdf, other

    cs.CV cs.AI

    DragText: Rethinking Text Embedding in Point-based Image Editing

    Authors: Gayoon Choi, Taejin Jeong, Sujung Hong, Jaehoon Joo, Seong Jae Hwang

    Abstract: Point-based image editing enables accurate and flexible control through content dragging. However, the role of text embedding in the editing process has not been thoroughly investigated. A significant aspect that remains unexplored is the interaction between text and image embeddings. In this study, we show that during the progressive editing of an input image in a diffusion model, the text embedd… ▽ More

    Submitted 25 July, 2024; originally announced July 2024.

    Comments: 22 pages, 18 figures

  12. arXiv:2407.13864  [pdf, other

    astro-ph.GA

    Chandra Survey in the AKARI North Ecliptic Pole Deep Field Optical/Infrared Identifications of X-ray Sources

    Authors: T. Miyaji, B. A. Bravo-Navarro, J. Díaz Tello, M. Krumpe, M. Herrera-Endoqui, H. Ikeda, T. Takagi, N. Oi, A. Shogaki, S. Matsuura, H. Kim, M. A. Malkan, H. S. Hwang, T. Kim, T. Ishigaki, H. Hanami, S. J. Kim, Y. Ohyama, T. Goto, H. Matsuhara

    Abstract: We present a catalog of optical and infrared identifications (ID) of X-ray sources in the AKARI North Ecliptic Pole (NEP) Deep field detected with Chandra covering $\sim 0.34\,{\rm deg^{2}}$ with 0.5-2 keV flux limits ranging $\sim 2 \mathrm{-} 20\times 10^{-16}\,{\rm erg\,s^{-1}\,cm^{-2}}$. The optical/near-infrared counterparts of the X-ray sources are taken from our Hyper Suprime Cam (HSC)/Suba… ▽ More

    Submitted 22 July, 2024; v1 submitted 18 July, 2024; originally announced July 2024.

    Comments: 16 pages, 9 figures, Three electronic (fits) tables are included in src. Accepted to Astronomy and Astrophysics

  13. arXiv:2407.12325  [pdf, other

    cs.IR

    Optimizing Query Generation for Enhanced Document Retrieval in RAG

    Authors: Hamin Koo, Minseon Kim, Sung Ju Hwang

    Abstract: Large Language Models (LLMs) excel in various language tasks but they often generate incorrect information, a phenomenon known as "hallucinations". Retrieval-Augmented Generation (RAG) aims to mitigate this by using document retrieval for accurate responses. However, RAG still faces hallucinations due to vague queries. This study aims to improve RAG by optimizing query generation with a query-docu… ▽ More

    Submitted 17 July, 2024; originally announced July 2024.

  14. arXiv:2407.11348  [pdf, other

    cs.CV

    Flatfish Disease Detection Based on Part Segmentation Approach and Disease Image Generation

    Authors: Seo-Bin Hwang, Han-Young Kim, Chae-Yeon Heo, Hie-Yong Jung, Sung-Ju Jung, Yeong-Jun Cho

    Abstract: The flatfish is a major farmed species consumed globally in large quantities. However, due to the densely populated farming environment, flatfish are susceptible to injuries and diseases, making early disease detection crucial. Traditionally, diseases were detected through visual inspection, but observing large numbers of fish is challenging. Automated approaches based on deep learning technologie… ▽ More

    Submitted 15 July, 2024; originally announced July 2024.

    Comments: 16 page, 13 figures, 4 tables

  15. arXiv:2407.10164  [pdf, other

    cs.CV

    LabelDistill: Label-guided Cross-modal Knowledge Distillation for Camera-based 3D Object Detection

    Authors: Sanmin Kim, Youngseok Kim, Sihwan Hwang, Hyeonjun Jeong, Dongsuk Kum

    Abstract: Recent advancements in camera-based 3D object detection have introduced cross-modal knowledge distillation to bridge the performance gap with LiDAR 3D detectors, leveraging the precise geometric information in LiDAR point clouds. However, existing cross-modal knowledge distillation methods tend to overlook the inherent imperfections of LiDAR, such as the ambiguity of measurements on distant or occ… ▽ More

    Submitted 14 July, 2024; originally announced July 2024.

    Comments: ECCV 2024

  16. arXiv:2407.09941  [pdf, other

    cs.LG cs.AI

    Hydra: Bidirectional State Space Models Through Generalized Matrix Mixers

    Authors: Sukjun Hwang, Aakash Lahoti, Tri Dao, Albert Gu

    Abstract: A wide array of sequence models are built on a framework modeled after Transformers, comprising alternating sequence mixer and channel mixer layers. This paper studies a unifying matrix mixer view of sequence mixers that can be conceptualized as a linear map on the input sequence. This framework encompasses a broad range of well-known sequence models, including the self-attention of Transformers a… ▽ More

    Submitted 13 July, 2024; originally announced July 2024.

  17. arXiv:2407.07517  [pdf, other

    eess.IV cs.CV

    Parameter Efficient Fine Tuning for Multi-scanner PET to PET Reconstruction

    Authors: Yumin Kim, Gayoon Choi, Seong Jae Hwang

    Abstract: Reducing scan time in Positron Emission Tomography (PET) imaging while maintaining high-quality images is crucial for minimizing patient discomfort and radiation exposure. Due to the limited size of datasets and distribution discrepancy across scanners in medical imaging, fine-tuning in a parameter-efficient and effective manner is on the rise. Motivated by the potential of Parameter-Efficient Fin… ▽ More

    Submitted 10 July, 2024; originally announced July 2024.

  18. arXiv:2407.06716  [pdf, other

    cs.IR

    Analyzing the Effectiveness of Listwise Reranking with Positional Invariance on Temporal Generalizability

    Authors: Soyoung Yoon, Jongyoon Kim, Seung-won Hwang

    Abstract: Benchmarking the performance of information retrieval (IR) methods are mostly conducted within a fixed set of documents (static corpora). However, in real-world web search engine environments, the document set is continuously updated and expanded. Addressing these discrepancies and measuring the temporal persistence of IR systems is crucial. By investigating the LongEval benchmark, specifically de… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

    Comments: Accepted at CLEF 2024 LongEval track

  19. arXiv:2407.05059  [pdf, other

    eess.IV cs.CV

    Slice-Consistent 3D Volumetric Brain CT-to-MRI Translation with 2D Brownian Bridge Diffusion Model

    Authors: Kyobin Choo, Youngjun Jun, Mijin Yun, Seong Jae Hwang

    Abstract: In neuroimaging, generally, brain CT is more cost-effective and accessible imaging option compared to MRI. Nevertheless, CT exhibits inferior soft-tissue contrast and higher noise levels, yielding less precise structural clarity. In response, leveraging more readily available CT to construct its counterpart MRI, namely, medical image-to-image translation (I2I), serves as a promising solution. Part… ▽ More

    Submitted 6 July, 2024; originally announced July 2024.

    Comments: 13 pages, 7 figures, Early accepted at Medical Image Computing and Computer Assisted Intervention (MICCAI) 2024

    ACM Class: I.4.5; I.4.9; J.3

  20. arXiv:2407.03280  [pdf, other

    cs.IT

    Cooperative Multi-Agent Deep Reinforcement Learning Methods for UAV-aided Mobile Edge Computing Networks

    Authors: Mintae Kim, Hoon Lee, Sangwon Hwang, Merouane Debbah, Inkyu Lee

    Abstract: This paper presents a cooperative multi-agent deep reinforcement learning (MADRL) approach for unmmaned aerial vehicle (UAV)-aided mobile edge computing (MEC) networks. An UAV with computing capability can provide task offlaoding services to ground internet-of-things devices (IDs). With partial observation of the entire network state, the UAV and the IDs individually determine their MEC strategies… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: 13 pages, 6 figures

  21. arXiv:2407.02945  [pdf, other

    cs.CV

    VEGS: View Extrapolation of Urban Scenes in 3D Gaussian Splatting using Learned Priors

    Authors: Sungwon Hwang, Min-Jung Kim, Taewoong Kang, Jayeon Kang, Jaegul Choo

    Abstract: Neural rendering-based urban scene reconstruction methods commonly rely on images collected from driving vehicles with cameras facing and moving forward. Although these methods can successfully synthesize from views similar to training camera trajectory, directing the novel view outside the training camera distribution does not guarantee on-par performance. In this paper, we tackle the Extrapolate… ▽ More

    Submitted 13 July, 2024; v1 submitted 3 July, 2024; originally announced July 2024.

    Comments: The first two authors contributed equally. Project Page: https://vegs3d.github.io/

  22. arXiv:2407.00972  [pdf, other

    cs.CV

    FALCON: Frequency Adjoint Link with CONtinuous Density Mask for Fast Single Image Dehazing

    Authors: Donghyun Kim, Seil Kang, Seong Jae Hwang

    Abstract: Image dehazing, addressing atmospheric interference like fog and haze, remains a pervasive challenge crucial for robust vision applications such as surveillance and remote sensing under adverse visibility. While various methodologies have evolved from early works predicting transmission matrix and atmospheric light features to deep learning and dehazing networks, they innately prioritize dehazing… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  23. arXiv:2407.00256  [pdf, other

    cs.AI cs.CL cs.LG stat.ML

    One Prompt is not Enough: Automated Construction of a Mixture-of-Expert Prompts

    Authors: Ruochen Wang, Sohyun An, Minhao Cheng, Tianyi Zhou, Sung Ju Hwang, Cho-Jui Hsieh

    Abstract: Large Language Models (LLMs) exhibit strong generalization capabilities to novel tasks when prompted with language instructions and in-context demos. Since this ability sensitively depends on the quality of prompts, various methods have been explored to automate the instruction design. While these methods demonstrated promising results, they also restricted the searched prompt to one instruction.… ▽ More

    Submitted 28 June, 2024; originally announced July 2024.

    Comments: ICML 2024. code available at https://github.com/ruocwang/mixture-of-prompts

    MSC Class: 68T01

    Journal ref: Proceedings of the 41st International Conference on Machine Learning (ICML), Vienna, Austria, 2024

  24. arXiv:2406.17808  [pdf, other

    cs.CL cs.AI cs.LG

    Training-Free Exponential Extension of Sliding Window Context with Cascading KV Cache

    Authors: Jeffrey Willette, Heejun Lee, Youngwan Lee, Myeongjae Jeon, Sung Ju Hwang

    Abstract: The context window within a transformer provides a form of active memory for the current task, which can be useful for few-shot learning and conditional generation, both which depend heavily on previous context tokens. However, as the context length grows, the computational cost increases quadratically. Recent works have shown that saving a few initial tokens along with a fixed-sized sliding windo… ▽ More

    Submitted 23 June, 2024; originally announced June 2024.

  25. arXiv:2406.16013  [pdf, other

    cs.CL cs.AI cs.IR

    Database-Augmented Query Representation for Information Retrieval

    Authors: Soyeong Jeong, Jinheon Baek, Sukmin Cho, Sung Ju Hwang, Jong C. Park

    Abstract: Information retrieval models that aim to search for the documents relevant to the given query have shown many successes, which have been applied to diverse tasks. However, the query provided by the user is oftentimes very short, which challenges the retrievers to correctly fetch relevant documents. To tackle this, existing studies have proposed expanding the query with a couple of additional (user… ▽ More

    Submitted 23 June, 2024; originally announced June 2024.

  26. Phase-controlled heat modulation with Aharonov-Bohm interferometers

    Authors: Sun-Yong Hwang, Björn Sothmann, Rosa López

    Abstract: A heat modulator is proposed based on a voltage-biased Aharonov-Bohm interferometer. Once an electrical bias is applied, Peltier effects give rise to a flow of heat that can be modulated by a magnetic flux. We determine the corresponding temperature changes using a simple thermal model. Our calculations demonstrate that the modulated temperature difference can be as large as 80 mK at base temperat… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: 8 pages, 4 figures

    Journal ref: Phys. Rev. Research 6, 013215 (2024)

  27. arXiv:2406.11672  [pdf, other

    cs.CV

    Effective Rank Analysis and Regularization for Enhanced 3D Gaussian Splatting

    Authors: Junha Hyung, Susung Hong, Sungwon Hwang, Jaeseong Lee, Jaegul Choo, Jin-Hwa Kim

    Abstract: 3D reconstruction from multi-view images is one of the fundamental challenges in computer vision and graphics. Recently, 3D Gaussian Splatting (3DGS) has emerged as a promising technique capable of real-time rendering with high-quality 3D reconstruction. This method utilizes 3D Gaussian representation and tile-based splatting techniques, bypassing the expensive neural field querying. Despite its p… ▽ More

    Submitted 18 June, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

    Comments: project page: https://junhahyung.github.io/erankgs.github.io

  28. Expanding the Design Space of Computer Vision-based Interactive Systems for Group Dance Practice

    Authors: Soohwan Lee, Seoyeong Hwang, Ian Oakley, Kyungho Lee

    Abstract: Group dance, a sub-genre characterized by intricate motions made by a cohort of performers in tight synchronization, has a longstanding and culturally significant history and, in modern forms such as cheerleading, a broad base of current adherents. However, despite its popularity, learning group dance routines remains challenging. Based on the prior success of interactive systems to support indivi… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: 20 pages, 10 figures, 1 table, to be published in the proceedings of the ACM Designing Interactive Systems Conference, 2024, (DIS '24)

    Journal ref: ACM Designing Interactive Systems Conference, 2024, (DIS '24)

  29. arXiv:2406.11125  [pdf, other

    cs.HC

    Conversational Agents as Catalysts for Critical Thinking: Challenging Design Fixation in Group Design

    Authors: Soohwan Lee, Seoyeong Hwang, Kyungho Lee

    Abstract: This paper investigates the potential of LLM-based conversational agents (CAs) to enhance critical reflection and mitigate design fixation in group design work. By challenging AI-generated recommendations and prevailing group opinions, these agents address issues such as groupthink and promote a more dynamic and inclusive design process. Key design considerations include optimizing intervention ti… ▽ More

    Submitted 16 June, 2024; originally announced June 2024.

    Comments: 7 pages, 2 figures, DIS2024 Workshop on 'Death of Design Researcher'

  30. arXiv:2406.10996  [pdf, other

    cs.CL

    THEANINE: Revisiting Memory Management in Long-term Conversations with Timeline-augmented Response Generation

    Authors: Seo Hyun Kim, Kai Tzu-iunn Ong, Taeyoon Kwon, Namyoung Kim, Keummin Ka, SeongHyeon Bae, Yohan Jo, Seung-won Hwang, Dongha Lee, Jinyoung Yeo

    Abstract: Large language models (LLMs) are capable of processing lengthy dialogue histories during prolonged interaction with users without additional memory modules; however, their responses tend to overlook or incorrectly recall information from the past. In this paper, we revisit memory-augmented response generation in the era of LLMs. While prior work focuses on getting rid of outdated memories, we argu… ▽ More

    Submitted 16 June, 2024; originally announced June 2024.

    Comments: Under Review

  31. arXiv:2406.10995  [pdf, other

    cs.CV cs.LG

    Concept-skill Transferability-based Data Selection for Large Vision-Language Models

    Authors: Jaewoo Lee, Boyang Li, Sung Ju Hwang

    Abstract: Instruction tuning, or supervised finetuning on extensive task-specific data, is necessary for Large Vision-Language Models (LVLMs) to generalize well across a broad range of vision-language (VL) tasks. However, training on large VL datasets can become prohibitively expensive. In this work, we introduce COINCIDE, an effective and scalable data selection technique that uses a small model as a refer… ▽ More

    Submitted 16 June, 2024; originally announced June 2024.

    Comments: Preprint

  32. arXiv:2406.09827  [pdf, other

    cs.CL cs.CV cs.DC cs.LG

    HiP Attention: Sparse Sub-Quadratic Attention with Hierarchical Attention Pruning

    Authors: Heejun Lee, Geon Park, Youngwan Lee, Jina Kim, Wonyoung Jeong, Myeongjae Jeon, Sung Ju Hwang

    Abstract: In modern large language models (LLMs), increasing sequence lengths is a crucial challenge for enhancing their comprehension and coherence in handling complex tasks such as multi-modal question answering. However, handling long context sequences with LLMs is prohibitively costly due to the conventional attention mechanism's quadratic time and space complexity, and the context window size is limite… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: 26 pages, 15 figures

  33. arXiv:2406.07736  [pdf, other

    cs.CL

    MultiPragEval: Multilingual Pragmatic Evaluation of Large Language Models

    Authors: Dojun Park, Jiwoo Lee, Seohyun Park, Hyeyun Jeong, Youngeun Koo, Soonha Hwang, Seonwoo Park, Sungeun Lee

    Abstract: As the capabilities of LLMs expand, it becomes increasingly important to evaluate them beyond basic knowledge assessment, focusing on higher-level language understanding. This study introduces MultiPragEval, a robust test suite designed for the multilingual pragmatic evaluation of LLMs across English, German, Korean, and Chinese. Comprising 1200 question units categorized according to Grice's Coop… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: 8 pages, under review

  34. arXiv:2406.06748  [pdf, other

    cs.RO cs.MA

    Starling Formation-Flying Optical Experiment: Initial Operations and Flight Results

    Authors: Justin Kruger, Soon S. Hwang, Simone D'Amico

    Abstract: This paper presents initial flight results for distributed optical angles-only navigation of a swarm of small spacecraft, conducted during the Starling Formation-Flying Optical Experiment (StarFOX). StarFOX is a core payload of the NASA Starling mission, which consists of four CubeSats launched in 2023. Prior angles-only flight demonstrations have only featured one observer and target and have rel… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: Accepted to the 38th Small Satellite Conference

  35. arXiv:2406.04630  [pdf, other

    cs.CL

    Low-Resource Cross-Lingual Summarization through Few-Shot Learning with Large Language Models

    Authors: Gyutae Park, Seojin Hwang, Hwanhee Lee

    Abstract: Cross-lingual summarization (XLS) aims to generate a summary in a target language different from the source language document. While large language models (LLMs) have shown promising zero-shot XLS performance, their few-shot capabilities on this task remain unexplored, especially for low-resource languages with limited parallel data. In this paper, we investigate the few-shot XLS performance of va… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: 7 pages,3 figures

  36. SMCL: Saliency Masked Contrastive Learning for Long-tailed Recognition

    Authors: Sanglee Park, Seung-won Hwang, Jungmin So

    Abstract: Real-world data often follow a long-tailed distribution with a high imbalance in the number of samples between classes. The problem with training from imbalanced data is that some background features, common to all classes, can be unobserved in classes with scarce samples. As a result, this background correlates to biased predictions into ``major" classes. In this paper, we propose saliency masked… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: accepted at ICASSP 2023

  37. arXiv:2405.20729  [pdf, other

    cs.CV

    Extreme Point Supervised Instance Segmentation

    Authors: Hyeonjun Lee, Sehyun Hwang, Suha Kwak

    Abstract: This paper introduces a novel approach to learning instance segmentation using extreme points, i.e., the topmost, leftmost, bottommost, and rightmost points, of each object. These points are readily available in the modern bounding box annotation process while offering strong clues for precise segmentation, and thus allows to improve performance at the same annotation cost with box-supervised meth… ▽ More

    Submitted 3 June, 2024; v1 submitted 31 May, 2024; originally announced May 2024.

    Comments: Accepted to CVPR 2024

  38. SCUBA-2 Ultra Deep Imaging EAO Survey (STUDIES). V. Confusion-limited Submillimeter Galaxy Number Counts at 450 $μ$m and Data Release for the COSMOS Field

    Authors: Zhen-Kai Gao, Chen-Fatt Lim, Wei-Hao Wang, Chian-Chou Chen, Ian Smail, Scott C. Chapman, Xian Zhong Zheng, Hyunjin Shim, Tadayuki Kodama, Yiping Ao, Siou-Yu Chang, David L. Clements, James S. Dunlop, Luis C. Ho, Yun-Hsin Hsu, Chorng-Yuan Hwang, Ho Seong Hwang, M. P. Koprowski, Douglas Scott, Stephen Serjeant, Yoshiki Toba, Sheona A. Urquhart

    Abstract: We present confusion-limited SCUBA-2 450-$μ$m observations in the COSMOS-CANDELS region as part of the JCMT Large Program, SCUBA-2 Ultra Deep Imaging EAO Survey (STUDIES). Our maps at 450 and 850 $μ$m cover an area of 450 arcmin$^2$. We achieved instrumental noise levels of $σ_{\mathrm{450}}=$ 0.59 mJy beam$^{-1}$ and $σ_{\mathrm{850}}=$ 0.09 mJy beam$^{-1}$ in the deepest area of each map. The co… ▽ More

    Submitted 31 May, 2024; originally announced May 2024.

    Comments: 29 pages, 14 figures, accepted for publication in ApJ

  39. arXiv:2405.18540  [pdf, other

    cs.CL cs.CR cs.LG

    Learning diverse attacks on large language models for robust red-teaming and safety tuning

    Authors: Seanie Lee, Minsu Kim, Lynn Cherif, David Dobre, Juho Lee, Sung Ju Hwang, Kenji Kawaguchi, Gauthier Gidel, Yoshua Bengio, Nikolay Malkin, Moksh Jain

    Abstract: Red-teaming, or identifying prompts that elicit harmful responses, is a critical step in ensuring the safe and responsible deployment of large language models (LLMs). Developing effective protection against many modes of attack prompts requires discovering diverse attacks. Automated red-teaming typically uses reinforcement learning to fine-tune an attacker language model to generate prompts that e… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

  40. arXiv:2405.18042  [pdf, other

    cs.CV cs.LG

    Visualizing the loss landscape of Self-supervised Vision Transformer

    Authors: Youngwan Lee, Jeffrey Ryan Willette, Jonghee Kim, Sung Ju Hwang

    Abstract: The Masked autoencoder (MAE) has drawn attention as a representative self-supervised approach for masked image modeling with vision transformers. However, even though MAE shows better generalization capability than fully supervised training from scratch, the reason why has not been explored. In another line of work, the Reconstruction Consistent Masked Auto Encoder (RC-MAE), has been proposed whic… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: NeurIPS 2023 Workshop: Self-Supervised Learning - Theory and Practice

  41. arXiv:2405.17938  [pdf, other

    cs.LG

    RC-Mixup: A Data Augmentation Strategy against Noisy Data for Regression Tasks

    Authors: Seong-Hyeon Hwang, Minsu Kim, Steven Euijong Whang

    Abstract: We study the problem of robust data augmentation for regression tasks in the presence of noisy data. Data augmentation is essential for generalizing deep learning models, but most of the techniques like the popular Mixup are primarily designed for classification tasks on image data. Recently, there are also Mixup techniques that are specialized to regression tasks like C-Mixup. In comparison to Mi… ▽ More

    Submitted 15 August, 2024; v1 submitted 28 May, 2024; originally announced May 2024.

    Comments: Accepted to KDD 2024

  42. arXiv:2405.17918  [pdf, other

    cs.LG cs.AI

    Cost-Sensitive Multi-Fidelity Bayesian Optimization with Transfer of Learning Curve Extrapolation

    Authors: Dong Bok Lee, Aoxuan Silvia Zhang, Byungjoo Kim, Junhyeon Park, Juho Lee, Sung Ju Hwang, Hae Beom Lee

    Abstract: In this paper, we address the problem of cost-sensitive multi-fidelity Bayesian Optimization (BO) for efficient hyperparameter optimization (HPO). Specifically, we assume a scenario where users want to early-stop the BO when the performance improvement is not satisfactory with respect to the required computational cost. Motivated by this scenario, we introduce utility, which is a function predefin… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

  43. arXiv:2405.17373  [pdf

    cond-mat.mes-hall cond-mat.mtrl-sci

    Probing the Relationship between Defects and Enhanced Mobility in MoS2 Monolayers Grown by Mo Foil

    Authors: Sudipta Majumder, Vaibhav Walve, Rahul Chand, Gokul M. A., Sooyeon Hwang, G. V. Pavan Kumar, Aparna Deshpande, Atikur Rahman

    Abstract: Atomic vacancies, such as chalcogen vacancies in 2D TMDs, are important in changing the host material's electronic structure and transport properties. We present a straightforward one-step method for growing monolayer MoS2 utilizing oxidized Molybdenum (Mo) foil using CVD and delve into the transport properties of as-grown samples. Devices fabricated from these MoS2 sheets exhibit excellent electr… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

  44. arXiv:2405.16567  [pdf, other

    cs.AI cs.CR

    Automatic Jailbreaking of the Text-to-Image Generative AI Systems

    Authors: Minseon Kim, Hyomin Lee, Boqing Gong, Huishuai Zhang, Sung Ju Hwang

    Abstract: Recent AI systems have shown extremely powerful performance, even surpassing human performance, on various tasks such as information retrieval, language generation, and image generation based on large language models (LLMs). At the same time, there are diverse safety risks that can cause the generation of malicious contents by circumventing the alignment in LLMs, which are often referred to as jai… ▽ More

    Submitted 28 May, 2024; v1 submitted 26 May, 2024; originally announced May 2024.

    Comments: Under review

  45. arXiv:2405.11807  [pdf, other

    cs.HC cs.RO eess.SY

    Dual-sided Peltier Elements for Rapid Thermal Feedback in Wearables

    Authors: Seongjun Kang, Gwangbin Kim, Seokhyun Hwang, Jeongju Park, Ahmed Elsharkawy, SeungJun Kim

    Abstract: This paper introduces a motor-driven Peltier device designed to deliver immediate thermal sensations within extended reality (XR) environments. The system incorporates eight motor-driven Peltier elements, facilitating swift transitions between warm and cool sensations by rotating preheated or cooled elements to opposite sides. A multi-layer structure, comprising aluminum and silicone layers, ensur… ▽ More

    Submitted 20 May, 2024; originally announced May 2024.

    Comments: 3 pages, 4 figures, ICRA Wearable Workshop 2024 - 1st Workshop on Advancing Wearable Devices and Applications through Novel Design, Sensing, Actuation, and AI

  46. arXiv:2405.11162  [pdf, other

    cs.CL

    LG AI Research & KAIST at EHRSQL 2024: Self-Training Large Language Models with Pseudo-Labeled Unanswerable Questions for a Reliable Text-to-SQL System on EHRs

    Authors: Yongrae Jo, Seongyun Lee, Minju Seo, Sung Ju Hwang, Moontae Lee

    Abstract: Text-to-SQL models are pivotal for making Electronic Health Records (EHRs) accessible to healthcare professionals without SQL knowledge. With the advancements in large language models, these systems have become more adept at translating complex questions into SQL queries. Nonetheless, the critical need for reliability in healthcare necessitates these models to accurately identify unanswerable ques… ▽ More

    Submitted 17 May, 2024; originally announced May 2024.

    Comments: NAACL 2024 Clinical NLP Workshop

  47. arXiv:2405.00115  [pdf

    astro-ph.GA

    Direct Evidence of a Major Merger in the Perseus Cluster

    Authors: Kim HyeongHan, M. James Jee, Wonki Lee, John ZuHone, Irina Zhuravleva, Wooseok Kang, Ho Seong Hwang

    Abstract: Although the Perseus cluster has often been regarded as an archetypical relaxed galaxy cluster, several lines of evidence including ancient, large-scale cold fronts, asymmetric plasma morphology, filamentary galaxy distribution, etc., provide a conflicting view of its dynamical state, suggesting that the cluster might have experienced a major merger. However, the absence of a clear merging compani… ▽ More

    Submitted 8 May, 2024; v1 submitted 30 April, 2024; originally announced May 2024.

    Comments: The current version is a submitted manuscript

  48. arXiv:2404.12250  [pdf, other

    cond-mat.mes-hall cond-mat.mtrl-sci

    Effects of Reduced Interlayer Interactions on the K-point Excitons of MoS$_2$ Nanoscrolls

    Authors: Sagnik Chatterjee, Tamaghna Chowdhury, Pablo Díaz Núñez, Nicholas Kay, Manisha Rajput, Sooyeon Hwang, Ivan Timokhin, Artem Mishchenko, Atikur Rahman

    Abstract: Transition metal dichalcogenide (TMD) nanoscrolls (NS) exhibit significant photoluminescence (PL) signals despite their multilayer structure, which cannot be explained by the strained multilayer description of NS. Here, we investigate the interlayer interactions in NS to address this discrepancy. The reduction of interlayer interactions in NS is attributed to two factors: (1) the symmetry-broken m… ▽ More

    Submitted 18 April, 2024; originally announced April 2024.

    Comments: S.C. and T.C. have contributed equally to this work

  49. arXiv:2404.11310  [pdf, other

    cs.RO

    Autonomous aerial perching and unperching using omnidirectional tiltrotor and switching controller

    Authors: Dongjae Lee, Sunwoo Hwang, Jeonghyun Byun, Seung Jae Lee, H. Jin Kim

    Abstract: Aerial unperching of multirotors has received little attention as opposed to perching that has been investigated to elongate operation time. This study presents a new aerial robot capable of both perching and unperching autonomously on/from a ferromagnetic surface during flight, and a switching controller to avoid rotor saturation and mitigate overshoot during transition between free-flight and pe… ▽ More

    Submitted 17 April, 2024; originally announced April 2024.

    Comments: 7 pages, 10 figures, 2024 IEEE International Conference on Robotics and Automation (ICRA) accepted

  50. arXiv:2404.07738  [pdf, other

    cs.CL cs.AI cs.LG

    ResearchAgent: Iterative Research Idea Generation over Scientific Literature with Large Language Models

    Authors: Jinheon Baek, Sujay Kumar Jauhar, Silviu Cucerzan, Sung Ju Hwang

    Abstract: Scientific Research, vital for improving human life, is hindered by its inherent complexity, slow pace, and the need for specialized experts. To enhance its productivity, we propose a ResearchAgent, a large language model-powered research idea writing agent, which automatically generates problems, methods, and experiment designs while iteratively refining them based on scientific literature. Speci… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.