Skip to main content

Showing 1–50 of 115 results for author: Jang, H

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.13524  [pdf, other

    cs.CV cs.AI

    Enhancing Source-Free Domain Adaptive Object Detection with Low-confidence Pseudo Label Distillation

    Authors: Ilhoon Yoon, Hyeongjun Kwon, Jin Kim, Junyoung Park, Hyunsung Jang, Kwanghoon Sohn

    Abstract: Source-Free domain adaptive Object Detection (SFOD) is a promising strategy for deploying trained detectors to new, unlabeled domains without accessing source data, addressing significant concerns around data privacy and efficiency. Most SFOD methods leverage a Mean-Teacher (MT) self-training paradigm relying heavily on High-confidence Pseudo Labels (HPL). However, these HPL often overlook small i… ▽ More

    Submitted 18 July, 2024; originally announced July 2024.

    Comments: ECCV 2024

  2. arXiv:2407.12331  [pdf, other

    cs.CV cs.AI

    I2AM: Interpreting Image-to-Image Latent Diffusion Models via Attribution Maps

    Authors: Junseo Park, Hyeryung Jang

    Abstract: Large-scale diffusion models have made significant advancements in the field of image generation, especially through the use of cross-attention mechanisms that guide image formation based on textual descriptions. While the analysis of text-guided cross-attention in diffusion models has been extensively studied in recent years, its application in image-to-image diffusion models remains underexplore… ▽ More

    Submitted 17 July, 2024; originally announced July 2024.

    Comments: 9 pages, 9 figures

  3. arXiv:2407.06506  [pdf

    cs.CY

    Information Seeking and Communication among International Students on Reddit

    Authors: Chaeeun Han, Sangpil Youm, Sou Hyun Jang

    Abstract: This study examines the impact of the COVID-19 pandemic on information-seeking behaviors among international students, with a focus on the r/f1visa subreddit. Our study indicates a considerable rise in the number of users posting more than one question during the pandemic. Those asking recurring questions demonstrate more active involvement in communication, suggesting a continuous pursuit of know… ▽ More

    Submitted 8 July, 2024; originally announced July 2024.

    Comments: 10th International Conference on Computational Social Science IC2S2, July 17-20, 2024, Philadelphia, USA

  4. arXiv:2406.12258  [pdf, other

    cs.CV

    Advancing Cross-Domain Generalizability in Face Anti-Spoofing: Insights, Design, and Metrics

    Authors: Hyojin Kim, Jiyoon Lee, Yonghyun Jeong, Haneol Jang, YoungJoon Yoo

    Abstract: This paper presents a novel perspective for enhancing anti-spoofing performance in zero-shot data domain generalization. Unlike traditional image classification tasks, face anti-spoofing datasets display unique generalization characteristics, necessitating novel zero-shot data domain generalization. One step forward to the previous frame-wise spoofing prediction, we introduce a nuanced metric calc… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: 10 pages with 4 figures, Accepted by CVPRW 2024

  5. arXiv:2406.07398  [pdf, other

    cs.CV cs.AI cs.LG cs.RO

    Visual Representation Learning with Stochastic Frame Prediction

    Authors: Huiwon Jang, Dongyoung Kim, Junsu Kim, Jinwoo Shin, Pieter Abbeel, Younggyo Seo

    Abstract: Self-supervised learning of image representations by predicting future frames is a promising direction but still remains a challenge. This is because of the under-determined nature of frame prediction; multiple potential futures can arise from a single current frame. To tackle this challenge, in this paper, we revisit the idea of stochastic video generation that learns to capture uncertainty in fr… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: International Conference on Machine Learning (ICML) 2024

  6. arXiv:2406.07034  [pdf, other

    cs.AI cs.CL

    Improving Multi-hop Logical Reasoning in Knowledge Graphs with Context-Aware Query Representation Learning

    Authors: Jeonghoon Kim, Heesoo Jung, Hyeju Jang, Hogun Park

    Abstract: Multi-hop logical reasoning on knowledge graphs is a pivotal task in natural language processing, with numerous approaches aiming to answer First-Order Logic (FOL) queries. Recent geometry (e.g., box, cone) and probability (e.g., beta distribution)-based methodologies have effectively addressed complex FOL queries. However, a common challenge across these methods lies in determining accurate geome… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: Accepted to ACL 2024 Findings

  7. arXiv:2405.18808  [pdf, other

    cs.CV

    BRACTIVE: A Brain Activation Approach to Human Visual Brain Learning

    Authors: Xuan-Bac Nguyen, Hojin Jang, Xin Li, Samee U. Khan, Pawan Sinha, Khoa Luu

    Abstract: The human brain is a highly efficient processing unit, and understanding how it works can inspire new algorithms and architectures in machine learning. In this work, we introduce a novel framework named Brain Activation Network (BRACTIVE), a transformer-based approach to studying the human visual brain. The main objective of BRACTIVE is to align the visual features of subjects with corresponding b… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

  8. arXiv:2405.18606  [pdf, other

    cs.CV cs.IT

    Track Initialization and Re-Identification for~3D Multi-View Multi-Object Tracking

    Authors: Linh Van Ma, Tran Thien Dat Nguyen, Ba-Ngu Vo, Hyunsung Jang, Moongu Jeon

    Abstract: We propose a 3D multi-object tracking (MOT) solution using only 2D detections from monocular cameras, which automatically initiates/terminates tracks as well as resolves track appearance-reappearance and occlusions. Moreover, this approach does not require detector retraining when cameras are reconfigured but only the camera matrices of reconfigured cameras need to be updated. Our approach is base… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

  9. arXiv:2405.18093  [pdf, other

    cs.DC cs.LG

    Pipette: Automatic Fine-grained Large Language Model Training Configurator for Real-World Clusters

    Authors: Jinkyu Yim, Jaeyong Song, Yerim Choi, Jaebeen Lee, Jaewon Jung, Hongsun Jang, Jinho Lee

    Abstract: Training large language models (LLMs) is known to be challenging because of the huge computational and memory capacity requirements. To address these issues, it is common to use a cluster of GPUs with 3D parallelism, which splits a model along the data batch, pipeline stage, and intra-layer tensor dimensions. However, the use of 3D parallelism produces the additional challenge of finding the optim… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: published at DATE 2024

  10. arXiv:2405.16012  [pdf, other

    cs.LG

    Pessimistic Backward Policy for GFlowNets

    Authors: Hyosoon Jang, Yunhui Jang, Minsu Kim, Jinkyoo Park, Sungsoo Ahn

    Abstract: This paper studies Generative Flow Networks (GFlowNets), which learn to sample objects proportionally to a given reward function through the trajectory of state transitions. In this work, we observe that GFlowNets tend to under-exploit the high-reward objects due to training on insufficient number of trajectories, which may lead to a large gap between the estimated flow and the (known) reward valu… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

  11. arXiv:2405.03183  [pdf, other

    cs.DC cs.CR math.NA

    Impact of EIP-4844 on Ethereum: Consensus Security, Ethereum Usage, Rollup Transaction Dynamics, and Blob Gas Fee Markets

    Authors: Seongwan Park, Bosul Mun, Seungyun Lee, Woojin Jeong, Jaewook Lee, Hyeonsang Eom, Huisu Jang

    Abstract: On March 13, 2024, Ethereum implemented EIP-4844, designed to enhance its role as a data availability layer. While this upgrade reduces data posting costs for rollups, it also raises concerns about its impact on the consensus layer due to increased propagation sizes. Moreover, the broader effects on the overall Ethereum ecosystem remain largely unexplored. In this paper, we conduct an empirical an… ▽ More

    Submitted 6 May, 2024; originally announced May 2024.

  12. arXiv:2404.06948  [pdf, other

    cs.CL cs.AI

    MetaCheckGPT -- A Multi-task Hallucination Detector Using LLM Uncertainty and Meta-models

    Authors: Rahul Mehta, Andrew Hoblitzell, Jack O'Keefe, Hyeju Jang, Vasudeva Varma

    Abstract: Hallucinations in large language models (LLMs) have recently become a significant problem. A recent effort in this direction is a shared task at Semeval 2024 Task 6, SHROOM, a Shared-task on Hallucinations and Related Observable Overgeneration Mistakes. This paper describes our winning solution ranked 1st and 2nd in the 2 sub-tasks of model agnostic and model aware tracks respectively. We propose… ▽ More

    Submitted 11 April, 2024; v1 submitted 10 April, 2024; originally announced April 2024.

    Comments: Entry for SemEval-2024 Shared Task 6: SHROOM, a Shared-task on Hallucinations and Related Observable Overgeneration Mistakes

    MSC Class: 68T07; 68T50 ACM Class: I.2.7

  13. arXiv:2404.06357  [pdf, other

    cs.CL

    Generalizable Sarcasm Detection Is Just Around The Corner, Of Course!

    Authors: Hyewon Jang, Diego Frassinelli

    Abstract: We tested the robustness of sarcasm detection models by examining their behavior when fine-tuned on four sarcasm datasets containing varying characteristics of sarcasm: label source (authors vs. third-party), domain (social media/online vs. offline conversations/dialogues), style (aggressive vs. humorous mocking). We tested their prediction performance on the same dataset (intra-dataset) and acros… ▽ More

    Submitted 10 April, 2024; v1 submitted 9 April, 2024; originally announced April 2024.

  14. arXiv:2404.05256  [pdf, other

    cs.CV cs.AI

    StyleForge: Enhancing Text-to-Image Synthesis for Any Artistic Styles with Dual Binding

    Authors: Junseo Park, Beomseok Ko, Hyeryung Jang

    Abstract: Recent advancements in text-to-image models, such as Stable Diffusion, have showcased their ability to create visual images from natural language prompts. However, existing methods like DreamBooth struggle with capturing arbitrary art styles due to the abstract and multifaceted nature of stylistic attributes. We introduce Single-StyleForge, a novel approach for personalized text-to-image synthesis… ▽ More

    Submitted 17 July, 2024; v1 submitted 8 April, 2024; originally announced April 2024.

    Comments: 20 pages, 12 figuers

  15. arXiv:2404.03899  [pdf, other

    cs.HC

    Effects of Multisensory Feedback on the Perception and Performance of Virtual Reality Hand-Retargeted Interaction

    Authors: Hyunyoung Jang, Jinwook Kim, Jeongmi Lee

    Abstract: Retargeting methods that modify the visual representation of real movements have been widely used to expand the interaction space and create engaging virtual reality experiences. For optimal user experience and performance, it is essential to specify the perception of retargeting and utilize the appropriate range of modification parameters. However, previous studies mostly concentrated on whether… ▽ More

    Submitted 5 April, 2024; originally announced April 2024.

    Comments: 17 pages, 8 figures, 2 tables

  16. arXiv:2404.01954  [pdf, other

    cs.CL cs.AI

    HyperCLOVA X Technical Report

    Authors: Kang Min Yoo, Jaegeun Han, Sookyo In, Heewon Jeon, Jisu Jeong, Jaewook Kang, Hyunwook Kim, Kyung-Min Kim, Munhyong Kim, Sungju Kim, Donghyun Kwak, Hanock Kwak, Se Jung Kwon, Bado Lee, Dongsoo Lee, Gichang Lee, Jooho Lee, Baeseong Park, Seongjin Shin, Joonsang Yu, Seolki Baek, Sumin Byeon, Eungsup Cho, Dooseok Choe, Jeesung Han , et al. (371 additional authors not shown)

    Abstract: We introduce HyperCLOVA X, a family of large language models (LLMs) tailored to the Korean language and culture, along with competitive capabilities in English, math, and coding. HyperCLOVA X was trained on a balanced mix of Korean, English, and code data, followed by instruction-tuning with high-quality human-annotated datasets while abiding by strict safety guidelines reflecting our commitment t… ▽ More

    Submitted 13 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: 44 pages; updated authors list and fixed author names

  17. arXiv:2404.00830  [pdf, other

    cs.RO

    2D Ego-Motion with Yaw Estimation using Only mmWave Radars via Two-Way weighted ICP

    Authors: Hojune Kim, Hyesu Jang, Ayoung Kim

    Abstract: The interest in single-chip mmWave Radar is driven by their compact form factor, cost-effectiveness, and robustness under harsh environmental conditions. Despite its promising attributes, the principal limitation of mmWave radar lies in its capacity for autonomous yaw rate estimation. Conventional solutions have often resorted to integrating inertial measurement unit (IMU) or deploying multiple ra… ▽ More

    Submitted 31 March, 2024; originally announced April 2024.

  18. arXiv:2404.00678  [pdf, other

    cs.CV cs.GR

    OmniSDF: Scene Reconstruction using Omnidirectional Signed Distance Functions and Adaptive Binoctrees

    Authors: Hakyeong Kim, Andreas Meuleman, Hyeonjoong Jang, James Tompkin, Min H. Kim

    Abstract: We present a method to reconstruct indoor and outdoor static scene geometry and appearance from an omnidirectional video moving in a small circular sweep. This setting is challenging because of the small baseline and large depth ranges, making it difficult to find ray crossings. To better constrain the optimization, we estimate geometry as a signed distance field within a spherical binoctree data… ▽ More

    Submitted 31 March, 2024; originally announced April 2024.

    Journal ref: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2024

  19. arXiv:2404.00676  [pdf, other

    cs.CV cs.GR

    OmniLocalRF: Omnidirectional Local Radiance Fields from Dynamic Videos

    Authors: Dongyoung Choi, Hyeonjoong Jang, Min H. Kim

    Abstract: Omnidirectional cameras are extensively used in various applications to provide a wide field of vision. However, they face a challenge in synthesizing novel views due to the inevitable presence of dynamic objects, including the photographer, in their wide field of view. In this paper, we introduce a new approach called Omnidirectional Local Radiance Fields (OmniLocalRF) that can render static-only… ▽ More

    Submitted 31 March, 2024; originally announced April 2024.

    Journal ref: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2024

  20. Imaging radar and LiDAR image translation for 3-DOF extrinsic calibration

    Authors: Sangwoo Jung, Hyesu Jang, Minwoo Jung, Ayoung Kim, Myung-Hwan Jeon

    Abstract: The integration of sensor data is crucial in the field of robotics to take full advantage of the various sensors employed. One critical aspect of this integration is determining the extrinsic calibration parameters, such as the relative transformation, between each sensor. The use of data fusion between complementary sensors, such as radar and LiDAR, can provide significant benefits, particularly… ▽ More

    Submitted 27 March, 2024; originally announced March 2024.

  21. arXiv:2403.06668  [pdf, other

    cs.LG cs.CV

    PeerAiD: Improving Adversarial Distillation from a Specialized Peer Tutor

    Authors: Jaewon Jung, Hongsun Jang, Jaeyong Song, Jinho Lee

    Abstract: Adversarial robustness of the neural network is a significant concern when it is applied to security-critical domains. In this situation, adversarial distillation is a promising option which aims to distill the robustness of the teacher network to improve the robustness of a small student network. Previous works pretrain the teacher network to make it robust against the adversarial examples aimed… ▽ More

    Submitted 17 May, 2024; v1 submitted 11 March, 2024; originally announced March 2024.

    Comments: Accepted to CVPR 2024

  22. arXiv:2403.06664  [pdf, other

    cs.AR cs.LG

    Smart-Infinity: Fast Large Language Model Training using Near-Storage Processing on a Real System

    Authors: Hongsun Jang, Jaeyong Song, Jaewon Jung, Jaeyoung Park, Youngsok Kim, Jinho Lee

    Abstract: The recent huge advance of Large Language Models (LLMs) is mainly driven by the increase in the number of parameters. This has led to substantial memory capacity requirements, necessitating the use of dozens of GPUs just to meet the capacity. One popular solution to this is storage-offloaded training, which uses host memory and storage as an extended memory hierarchy. However, this obviously comes… ▽ More

    Submitted 11 March, 2024; originally announced March 2024.

    Comments: Published at HPCA 2024 (Best Paper Award Honorable Mention)

  23. LodeStar: Maritime Radar Descriptor for Semi-Direct Radar Odometry

    Authors: Hyesu Jang, Minwoo Jung, Myung-Hwan Jeon, Ayoung Kim

    Abstract: Maritime radars are prevalently adopted to capture the vessel's omnidirectional data as imagery. Nevertheless, inherent challenges persist with marine radars, including limited frequency, suboptimal resolution, and indeterminate detections. Additionally, the scarcity of discernible landmarks in the vast marine expanses remains a challenge, resulting in consecutive scenes that often lack matching f… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

    Comments: IEEE Robotics and Automation Letter

    Journal ref: IEEE Robotics and Automation Letter, 9-2 (2024) 1684-1691

  24. arXiv:2402.14334  [pdf, other

    cs.CL

    INSTRUCTIR: A Benchmark for Instruction Following of Information Retrieval Models

    Authors: Hanseok Oh, Hyunji Lee, Seonghyeon Ye, Haebin Shin, Hansol Jang, Changwook Jun, Minjoon Seo

    Abstract: Despite the critical need to align search targets with users' intention, retrievers often only prioritize query information without delving into the users' intended search context. Enhancing the capability of retrievers to understand intentions and preferences of users, akin to language model instructions, has the potential to yield more aligned search targets. Prior studies restrict the applicati… ▽ More

    Submitted 22 February, 2024; originally announced February 2024.

  25. arXiv:2401.12532  [pdf, other

    cs.LG cs.AI

    DAFA: Distance-Aware Fair Adversarial Training

    Authors: Hyungyu Lee, Saehyung Lee, Hyemi Jang, Junsung Park, Ho Bae, Sungroh Yoon

    Abstract: The disparity in accuracy between classes in standard training is amplified during adversarial training, a phenomenon termed the robust fairness problem. Existing methodologies aimed to enhance robust fairness by sacrificing the model's performance on easier classes in order to improve its performance on harder ones. However, we observe that under adversarial attacks, the majority of the model's p… ▽ More

    Submitted 23 January, 2024; originally announced January 2024.

    Comments: Accepted to ICLR 2024

  26. arXiv:2401.06443  [pdf, other

    cs.CL

    BOK-VQA: Bilingual outside Knowledge-Based Visual Question Answering via Graph Representation Pretraining

    Authors: Minjun Kim, Seungwoo Song, Youhan Lee, Haneol Jang, Kyungtae Lim

    Abstract: The current research direction in generative models, such as the recently developed GPT4, aims to find relevant knowledge information for multimodal and multilingual inputs to provide answers. Under these research circumstances, the demand for multilingual evaluation of visual question answering (VQA) tasks, a representative task of multimodal systems, has increased. Accordingly, we propose a bili… ▽ More

    Submitted 15 March, 2024; v1 submitted 12 January, 2024; originally announced January 2024.

  27. arXiv:2311.06837  [pdf, other

    cs.LG cs.DC

    GraNNDis: Efficient Unified Distributed Training Framework for Deep GNNs on Large Clusters

    Authors: Jaeyong Song, Hongsun Jang, Jaewon Jung, Youngsok Kim, Jinho Lee

    Abstract: Graph neural networks (GNNs) are one of the most rapidly growing fields within deep learning. According to the growth in the dataset and the model size used for GNNs, an important problem is that it becomes nearly impossible to keep the whole network on GPU memory. Among numerous attempts, distributed training is one popular approach to address the problem. However, due to the nature of GNNs, exis… ▽ More

    Submitted 12 November, 2023; originally announced November 2023.

  28. arXiv:2311.03938  [pdf, other

    cs.CV

    Analysis of NaN Divergence in Training Monocular Depth Estimation Model

    Authors: Bum Jun Kim, Hyeonah Jang, Sang Woo Kim

    Abstract: The latest advances in deep learning have facilitated the development of highly accurate monocular depth estimation models. However, when training a monocular depth estimation network, practitioners and researchers have observed not a number (NaN) loss, which disrupts gradient descent optimization. Although several practitioners have reported the stochastic and mysterious occurrence of NaN loss th… ▽ More

    Submitted 7 November, 2023; originally announced November 2023.

    Comments: 10 pages, 3 figures

  29. arXiv:2310.16318  [pdf, other

    cs.LG cs.AI

    Modality-Agnostic Self-Supervised Learning with Meta-Learned Masked Auto-Encoder

    Authors: Huiwon Jang, Jihoon Tack, Daewon Choi, Jongheon Jeong, Jinwoo Shin

    Abstract: Despite its practical importance across a wide range of modalities, recent advances in self-supervised learning (SSL) have been primarily focused on a few well-curated domains, e.g., vision and language, often relying on their domain-specific knowledge. For example, Masked Auto-Encoder (MAE) has become one of the popular architectures in these domains, but less has explored its potential in other… ▽ More

    Submitted 24 October, 2023; originally announced October 2023.

    Comments: Accepted to NeurIPS 2023. The first two authors contributed equally

  30. arXiv:2310.10088  [pdf, other

    eess.IV cs.CV cs.LG

    PUCA: Patch-Unshuffle and Channel Attention for Enhanced Self-Supervised Image Denoising

    Authors: Hyemi Jang, Junsung Park, Dahuin Jung, Jaihyun Lew, Ho Bae, Sungroh Yoon

    Abstract: Although supervised image denoising networks have shown remarkable performance on synthesized noisy images, they often fail in practice due to the difference between real and synthesized noise. Since clean-noisy image pairs from the real world are extremely costly to gather, self-supervised learning, which utilizes noisy input itself as a target, has been studied. To prevent a self-supervised deno… ▽ More

    Submitted 16 October, 2023; originally announced October 2023.

    Comments: Accepted to NeurIPS 2023

  31. arXiv:2310.04846  [pdf, other

    cs.RO

    Soft finger rotational stability for precision grasps

    Authors: Hun Jang, Valentyn Petrichenko, Joonbum Bae, Kevin Haninger

    Abstract: Soft robotic fingers can safely grasp fragile or variable form objects, but their force capacity is limited, especially with less contact area: precision grasps and when objects are smaller or not spherical. Current research is improving force capacity through mechanical design by increasing contact area or stiffness, typically without models which explain soft finger force limitations. To address… ▽ More

    Submitted 24 March, 2024; v1 submitted 7 October, 2023; originally announced October 2023.

    Comments: Submitted IROS24

  32. arXiv:2310.03301  [pdf, other

    cs.LG

    Learning Energy Decompositions for Partial Inference of GFlowNets

    Authors: Hyosoon Jang, Minsu Kim, Sungsoo Ahn

    Abstract: This paper studies generative flow networks (GFlowNets) to sample objects from the Boltzmann energy distribution via a sequence of actions. In particular, we focus on improving GFlowNet with partial inference: training flow functions with the evaluation of the intermediate states or transitions. To this end, the recently developed forward-looking GFlowNet reparameterizes the flow functions based o… ▽ More

    Submitted 5 October, 2023; originally announced October 2023.

  33. arXiv:2309.10339  [pdf, other

    cs.CL

    KoBigBird-large: Transformation of Transformer for Korean Language Understanding

    Authors: Kisu Yang, Yoonna Jang, Taewoo Lee, Jinwoo Seong, Hyungjin Lee, Hwanseok Jang, Heuiseok Lim

    Abstract: This work presents KoBigBird-large, a large size of Korean BigBird that achieves state-of-the-art performance and allows long sequence processing for Korean language understanding. Without further pretraining, we only transform the architecture and extend the positional encoding with our proposed Tapered Absolute Positional Encoding Representations (TAPER). In experiments, KoBigBird-large shows st… ▽ More

    Submitted 19 September, 2023; originally announced September 2023.

    Comments: Accepted at IJCNLP-AACL 2023

  34. arXiv:2309.02713  [pdf, other

    cs.CV cs.AI

    SlAction: Non-intrusive, Lightweight Obstructive Sleep Apnea Detection using Infrared Video

    Authors: You Rim Choi, Gyeongseon Eo, Wonhyuck Youn, Hyojin Lee, Haemin Jang, Dongyoon Kim, Hyunwoo Shin, Hyung-Sin Kim

    Abstract: Obstructive sleep apnea (OSA) is a prevalent sleep disorder affecting approximately one billion people world-wide. The current gold standard for diagnosing OSA, Polysomnography (PSG), involves an overnight hospital stay with multiple attached sensors, leading to potential inaccuracies due to the first-night effect. To address this, we present SlAction, a non-intrusive OSA detection system for dail… ▽ More

    Submitted 6 September, 2023; originally announced September 2023.

    Comments: Accepted to ICCV CVAMD 2023, poster

  35. arXiv:2308.13989  [pdf, other

    cs.CV

    LDL: Line Distance Functions for Panoramic Localization

    Authors: Junho Kim, Changwoon Choi, Hojun Jang, Young Min Kim

    Abstract: We introduce LDL, a fast and robust algorithm that localizes a panorama to a 3D map using line segments. LDL focuses on the sparse structural information of lines in the scene, which is robust to illumination changes and can potentially enable efficient computation. While previous line-based localization approaches tend to sacrifice accuracy or computation time, our method effectively observes the… ▽ More

    Submitted 26 August, 2023; originally announced August 2023.

    Comments: Accepted to ICCV 2023

  36. arXiv:2308.00558  [pdf, other

    cs.NE

    Gradient Scaling on Deep Spiking Neural Networks with Spike-Dependent Local Information

    Authors: Seongsik Park, Jeonghee Jo, Jongkil Park, Yeonjoo Jeong, Jaewook Kim, Suyoun Lee, Joon Young Kwak, Inho Kim, Jong-Keuk Park, Kyeong Seok Lee, Gye Weon Hwang, Hyun Jae Jang

    Abstract: Deep spiking neural networks (SNNs) are promising neural networks for their model capacity from deep neural network architecture and energy efficiency from SNNs' operations. To train deep SNNs, recently, spatio-temporal backpropagation (STBP) with surrogate gradient was proposed. Although deep SNNs have been successfully trained with STBP, they cannot fully utilize spike information. In this work,… ▽ More

    Submitted 1 August, 2023; originally announced August 2023.

    Comments: ICML-23 Localized Learning Workshop: Decentralized Model Updates via Non-Global Objectives

  37. arXiv:2307.14179  [pdf, other

    cs.CV

    Resolution-Aware Design of Atrous Rates for Semantic Segmentation Networks

    Authors: Bum Jun Kim, Hyeyeon Choi, Hyeonah Jang, Sang Woo Kim

    Abstract: DeepLab is a widely used deep neural network for semantic segmentation, whose success is attributed to its parallel architecture called atrous spatial pyramid pooling (ASPP). ASPP uses multiple atrous convolutions with different atrous rates to extract both local and global information. However, fixed values of atrous rates are used for the ASPP module, which restricts the size of its field of vie… ▽ More

    Submitted 26 July, 2023; originally announced July 2023.

    Comments: 18 pages, 12 figures

  38. arXiv:2307.04321  [pdf, other

    cs.RO

    RaPlace: Place Recognition for Imaging Radar using Radon Transform and Mutable Threshold

    Authors: Hyesu Jang, Minwoo Jung, Ayoung Kim

    Abstract: Due to the robustness in sensing, radar has been highlighted, overcoming harsh weather conditions such as fog and heavy snow. In this paper, we present a novel radar-only place recognition that measures the similarity score by utilizing Radon-transformed sinogram images and cross-correlation in frequency domain. Doing so achieves rigid transform invariance during place recognition, while ignoring… ▽ More

    Submitted 9 July, 2023; originally announced July 2023.

  39. Public Attitudes Toward ChatGPT on Twitter: Sentiments, Topics, and Occupations

    Authors: Ratanond Koonchanok, Yanling Pan, Hyeju Jang

    Abstract: ChatGPT sets a new record with the fastest-growing user base, as a chatbot powered by a large language model (LLM). While it demonstrates state-of-the-art capabilities in a variety of language-generation tasks, it also raises widespread public concerns regarding its societal impact. In this paper, we investigated public attitudes towards ChatGPT by applying natural language processing techniques s… ▽ More

    Submitted 13 February, 2024; v1 submitted 22 June, 2023; originally announced June 2023.

  40. arXiv:2305.05952  [pdf, other

    cs.CR

    Unraveling the MEV Enigma: ABI-Free Detection Model using Graph Neural Networks

    Authors: Seongwan Park, Woojin Jeong, Yunyoung Lee, Bumho Son, Huisu Jang, Jaewook Lee

    Abstract: The detection of Maximal Extractable Value (MEV) in blockchain is crucial for enhancing blockchain security, as it enables the evaluation of potential consensus layer risks, the effectiveness of anti-centralization solutions, and the assessment of user exploitation. However, existing MEV detection methods face limitations due to their low recall rate, reliance on pre-registered Application Binary… ▽ More

    Submitted 10 May, 2023; originally announced May 2023.

  41. arXiv:2305.04722  [pdf, other

    cs.CV

    Understanding Gaussian Attention Bias of Vision Transformers Using Effective Receptive Fields

    Authors: Bum Jun Kim, Hyeyeon Choi, Hyeonah Jang, Sang Woo Kim

    Abstract: Vision transformers (ViTs) that model an image as a sequence of partitioned patches have shown notable performance in diverse vision tasks. Because partitioning patches eliminates the image structure, to reflect the order of patches, ViTs utilize an explicit component called positional embedding. However, we claim that the use of positional embedding does not simply guarantee the order-awareness o… ▽ More

    Submitted 8 May, 2023; originally announced May 2023.

    Comments: 11 pages, 7 Figures

  42. arXiv:2305.01135  [pdf, other

    cs.RO

    A New Wave in Robotics: Survey on Recent mmWave Radar Applications in Robotics

    Authors: Kyle Harlow, Hyesu Jang, Timothy D. Barfoot, Ayoung Kim, Christoffer Heckman

    Abstract: We survey the current state of millimeterwave (mmWave) radar applications in robotics with a focus on unique capabilities, and discuss future opportunities based on the state of the art. Frequency Modulated Continuous Wave (FMCW) mmWave radars operating in the 76--81GHz range are an appealing alternative to lidars, cameras and other sensors operating in the near visual spectrum. Radar has been mad… ▽ More

    Submitted 19 January, 2024; v1 submitted 1 May, 2023; originally announced May 2023.

    Comments: 20 Pages, 7 Figures, 2 Tables, TRO Submission pending

    ACM Class: I.2.9

  43. arXiv:2304.01900  [pdf, other

    cs.CV cs.AI

    PODIA-3D: Domain Adaptation of 3D Generative Model Across Large Domain Gap Using Pose-Preserved Text-to-Image Diffusion

    Authors: Gwanghyun Kim, Ji Ha Jang, Se Young Chun

    Abstract: Recently, significant advancements have been made in 3D generative models, however training these models across diverse domains is challenging and requires an huge amount of training data and knowledge of pose distribution. Text-guided domain adaptation methods have allowed the generator to be adapted to the target domains using text prompts, thereby obviating the need for assembling numerous data… ▽ More

    Submitted 4 April, 2023; originally announced April 2023.

    Comments: Project page: https://gwang-kim.github.io/podia_3d/

  44. arXiv:2303.11563  [pdf, other

    cs.LG cs.AI q-bio.QM

    Dynamic Healthcare Embeddings for Improving Patient Care

    Authors: Hankyu Jang, Sulyun Lee, D. M. Hasibul Hasan, Philip M. Polgreen, Sriram V. Pemmaraju, Bijaya Adhikari

    Abstract: As hospitals move towards automating and integrating their computing systems, more fine-grained hospital operations data are becoming available. These data include hospital architectural drawings, logs of interactions between patients and healthcare professionals, prescription data, procedures data, and data on patient admission, discharge, and transfers. This has opened up many fascinating avenue… ▽ More

    Submitted 20 March, 2023; originally announced March 2023.

    Comments: To be published in IEEE/ACM ASONAM 2022

  45. arXiv:2303.00996  [pdf, other

    cs.LG cs.AI cs.CV

    Unsupervised Meta-Learning via Few-shot Pseudo-supervised Contrastive Learning

    Authors: Huiwon Jang, Hankook Lee, Jinwoo Shin

    Abstract: Unsupervised meta-learning aims to learn generalizable knowledge across a distribution of tasks constructed from unlabeled data. Here, the main challenge is how to construct diverse tasks for meta-learning without label information; recent works have proposed to create, e.g., pseudo-labeling via pretrained representations or creating synthetic samples via generative models. However, such a task co… ▽ More

    Submitted 2 March, 2023; originally announced March 2023.

    Comments: Accepted to ICLR 2023 (Spotlight). The first two authors contributed equally. The code is available at https://github.com/alinlab/PsCo

  46. arXiv:2302.10506  [pdf, other

    cs.LG

    Diffusion Probabilistic Models for Structured Node Classification

    Authors: Hyosoon Jang, Seonghyun Park, Sangwoo Mo, Sungsoo Ahn

    Abstract: This paper studies structured node classification on graphs, where the predictions should consider dependencies between the node labels. In particular, we focus on solving the problem for partially labeled graphs where it is essential to incorporate the information in the known label for predicting the unknown labels. To address this issue, we propose a novel framework leveraging the diffusion pro… ▽ More

    Submitted 18 June, 2023; v1 submitted 21 February, 2023; originally announced February 2023.

  47. arXiv:2302.08741  [pdf, other

    cs.CV cs.AI cs.LG

    New Insights for the Stability-Plasticity Dilemma in Online Continual Learning

    Authors: Dahuin Jung, Dongjin Lee, Sunwon Hong, Hyemi Jang, Ho Bae, Sungroh Yoon

    Abstract: The aim of continual learning is to learn new tasks continuously (i.e., plasticity) without forgetting previously learned knowledge from old tasks (i.e., stability). In the scenario of online continual learning, wherein data comes strictly in a streaming manner, the plasticity of online continual learning is more vulnerable than offline continual learning because the training signal that can be ob… ▽ More

    Submitted 17 February, 2023; originally announced February 2023.

    Comments: Accepted to ICLR2023

  48. arXiv:2302.06112  [pdf, other

    cs.LG cs.CV

    How to Use Dropout Correctly on Residual Networks with Batch Normalization

    Authors: Bum Jun Kim, Hyeyeon Choi, Hyeonah Jang, Donggeon Lee, Sang Woo Kim

    Abstract: For the stable optimization of deep neural networks, regularization methods such as dropout and batch normalization have been used in various tasks. Nevertheless, the correct position to apply dropout has rarely been discussed, and different positions have been employed depending on the practitioners. In this study, we investigate the correct position to apply dropout. We demonstrate that for a re… ▽ More

    Submitted 13 February, 2023; originally announced February 2023.

    Comments: 10 pages, 4 figures

  49. arXiv:2302.03193  [pdf, other

    cs.LG cs.CV

    On the Ideal Number of Groups for Isometric Gradient Propagation

    Authors: Bum Jun Kim, Hyeyeon Choi, Hyeonah Jang, Sang Woo Kim

    Abstract: Recently, various normalization layers have been proposed to stabilize the training of deep neural networks. Among them, group normalization is a generalization of layer normalization and instance normalization by allowing a degree of freedom in the number of groups it uses. However, to determine the optimal number of groups, trial-and-error-based hyperparameter tuning is required, and such experi… ▽ More

    Submitted 6 February, 2023; originally announced February 2023.

    Comments: 10 pages, 2 figures

  50. arXiv:2301.12443  [pdf, other

    cs.LG cs.DC

    Pipe-BD: Pipelined Parallel Blockwise Distillation

    Authors: Hongsun Jang, Jaewon Jung, Jaeyong Song, Joonsang Yu, Youngsok Kim, Jinho Lee

    Abstract: Training large deep neural network models is highly challenging due to their tremendous computational and memory requirements. Blockwise distillation provides one promising method towards faster convergence by splitting a large model into multiple smaller models. In state-of-the-art blockwise distillation methods, training is performed block-by-block in a data-parallel manner using multiple GPUs.… ▽ More

    Submitted 29 January, 2023; originally announced January 2023.

    Comments: To appear at DATE'23