Zum Hauptinhalt springen

Showing 1–50 of 236 results for author: Choe, Y

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.15591  [pdf, other

    cs.LG

    VFLIP: A Backdoor Defense for Vertical Federated Learning via Identification and Purification

    Authors: Yungi Cho, Woorim Han, Miseon Yu, Younghan Lee, Ho Bae, Yunheung Paek

    Abstract: Vertical Federated Learning (VFL) focuses on handling vertically partitioned data over FL participants. Recent studies have discovered a significant vulnerability in VFL to backdoor attacks which specifically target the distinct characteristics of VFL. Therefore, these attacks may neutralize existing defense mechanisms designed primarily for Horizontal Federated Learning (HFL) and deep neural netw… ▽ More

    Submitted 28 August, 2024; v1 submitted 28 August, 2024; originally announced August 2024.

    Comments: Accepted by 29th European Symposium on Research in Computer Security (ESORICS 2024)

  2. arXiv:2408.11841  [pdf, other

    cs.CY cs.AI cs.CL

    Could ChatGPT get an Engineering Degree? Evaluating Higher Education Vulnerability to AI Assistants

    Authors: Beatriz Borges, Negar Foroutan, Deniz Bayazit, Anna Sotnikova, Syrielle Montariol, Tanya Nazaretzky, Mohammadreza Banaei, Alireza Sakhaeirad, Philippe Servant, Seyed Parsa Neshaei, Jibril Frej, Angelika Romanou, Gail Weiss, Sepideh Mamooler, Zeming Chen, Simin Fan, Silin Gao, Mete Ismayilzada, Debjit Paul, Alexandre Schöpfer, Andrej Janchevski, Anja Tiede, Clarence Linden, Emanuele Troiani, Francesco Salvi , et al. (65 additional authors not shown)

    Abstract: AI assistants are being increasingly used by students enrolled in higher education institutions. While these tools provide opportunities for improved teaching and education, they also pose significant challenges for assessment and learning outcomes. We conceptualize these challenges through the lens of vulnerability, the potential for university assessments and learning outcomes to be impacted by… ▽ More

    Submitted 7 August, 2024; originally announced August 2024.

    Comments: 20 pages, 8 figures

  3. arXiv:2408.07576  [pdf, other

    cs.CV cs.AI

    MetaSeg: MetaFormer-based Global Contexts-aware Network for Efficient Semantic Segmentation

    Authors: Beoungwoo Kang, Seunghun Moon, Yubin Cho, Hyunwoo Yu, Suk-Ju Kang

    Abstract: Beyond the Transformer, it is important to explore how to exploit the capacity of the MetaFormer, an architecture that is fundamental to the performance improvements of the Transformer. Previous studies have exploited it only for the backbone network. Unlike previous studies, we explore the capacity of the Metaformer architecture more extensively in the semantic segmentation task. We propose a pow… ▽ More

    Submitted 14 August, 2024; v1 submitted 14 August, 2024; originally announced August 2024.

    Comments: Accepted by WACV 2024

  4. Cross-aware Early Fusion with Stage-divided Vision and Language Transformer Encoders for Referring Image Segmentation

    Authors: Yubin Cho, Hyunwoo Yu, Suk-ju Kang

    Abstract: Referring segmentation aims to segment a target object related to a natural language expression. Key challenges of this task are understanding the meaning of complex and ambiguous language expressions and determining the relevant regions in the image with multiple objects by referring to the expression. Recent models have focused on the early fusion with the language features at the intermediate s… ▽ More

    Submitted 14 August, 2024; originally announced August 2024.

    Comments: Published in IEEE Transactions on Multimedia (TMM)

  5. Narrowing your FOV with SOLiD: Spatially Organized and Lightweight Global Descriptor for FOV-constrained LiDAR Place Recognition

    Authors: Hogyun Kim, Jiwon Choi, Taehu Sim, Giseop Kim, Younggun Cho

    Abstract: We often encounter limited FOV situations due to various factors such as sensor fusion or sensor mount in real-world robot navigation. However, the limited FOV interrupts the generation of descriptions and impacts place recognition adversely. Therefore, we suffer from correcting accumulated drift errors in a consistent map using LiDAR-based place recognition with limited FOV. Thus, in this paper,… ▽ More

    Submitted 26 August, 2024; v1 submitted 14 August, 2024; originally announced August 2024.

    Comments: IEEE Robotics and Automation Letters (2024)

  6. arXiv:2408.06673  [pdf

    cs.CL

    Pragmatic inference of scalar implicature by LLMs

    Authors: Ye-eun Cho, Seong mook Kim

    Abstract: This study investigates how Large Language Models (LLMs), particularly BERT (Devlin et al., 2019) and GPT-2 (Radford et al., 2019), engage in pragmatic inference of scalar implicature, such as some. Two sets of experiments were conducted using cosine similarity and next sentence/token prediction as experimental methods. The results in experiment 1 showed that, both models interpret some as pragmat… ▽ More

    Submitted 13 August, 2024; originally announced August 2024.

    Comments: This research was presented at the Association for Computational Linguistics conference, held on August 11-16

  7. arXiv:2408.05558  [pdf, other

    cs.CV

    Object Re-identification via Spatial-temporal Fusion Networks and Causal Identity Matching

    Authors: Hye-Geun Kim, Yong-Hyuk Moon, Yeong-Jun Cho

    Abstract: Object re-identification (ReID) in large camera networks faces numerous challenges. First, the similar appearances of objects degrade ReID performance, a challenge that needs to be addressed by existing appearance-based ReID methods. Second, most ReID studies are performed in laboratory settings and do not consider real-world scenarios. To overcome these challenges, we introduce a novel ReID frame… ▽ More

    Submitted 22 August, 2024; v1 submitted 10 August, 2024; originally announced August 2024.

  8. arXiv:2407.17261  [pdf, other

    cs.CV

    Embedding-Free Transformer with Inference Spatial Reduction for Efficient Semantic Segmentation

    Authors: Hyunwoo Yu, Yubin Cho, Beoungwoo Kang, Seunghun Moon, Kyeongbo Kong, Suk-Ju Kang

    Abstract: We present an Encoder-Decoder Attention Transformer, EDAFormer, which consists of the Embedding-Free Transformer (EFT) encoder and the all-attention decoder leveraging our Embedding-Free Attention (EFA) structure. The proposed EFA is a novel global context modeling mechanism that focuses on functioning the global non-linearity, not the specific roles of the query, key and value. For the decoder, w… ▽ More

    Submitted 24 July, 2024; originally announced July 2024.

    Comments: Accepted by ECCV 2024

  9. arXiv:2407.11348  [pdf, other

    cs.CV

    Flatfish Disease Detection Based on Part Segmentation Approach and Disease Image Generation

    Authors: Seo-Bin Hwang, Han-Young Kim, Chae-Yeon Heo, Hie-Yong Jung, Sung-Ju Jung, Yeong-Jun Cho

    Abstract: The flatfish is a major farmed species consumed globally in large quantities. However, due to the densely populated farming environment, flatfish are susceptible to injuries and diseases, making early disease detection crucial. Traditionally, diseases were detected through visual inspection, but observing large numbers of fish is challenging. Automated approaches based on deep learning technologie… ▽ More

    Submitted 15 July, 2024; originally announced July 2024.

    Comments: 16 page, 13 figures, 4 tables

  10. arXiv:2407.10413  [pdf, other

    cs.CV cs.AI

    Melon Fruit Detection and Quality Assessment Using Generative AI-Based Image Data Augmentation

    Authors: Seungri Yoon, Yunseong Cho, Tae In Ahn

    Abstract: Monitoring and managing the growth and quality of fruits are very important tasks. To effectively train deep learning models like YOLO for real-time fruit detection, high-quality image datasets are essential. However, such datasets are often lacking in agriculture. Generative AI models can help create high-quality images. In this study, we used MidJourney and Firefly tools to generate images of me… ▽ More

    Submitted 14 July, 2024; originally announced July 2024.

    Comments: 8 pages, 7 figures

  11. arXiv:2407.01158  [pdf, other

    cs.CL

    Learning to Explore and Select for Coverage-Conditioned Retrieval-Augmented Generation

    Authors: Takyoung Kim, Kyungjae Lee, Young Rok Jang, Ji Yong Cho, Gangwoo Kim, Minseok Cho, Moontae Lee

    Abstract: Interactions with billion-scale large language models typically yield long-form responses due to their extensive parametric capacities, along with retrieval-augmented features. While detailed responses provide insightful viewpoint of a specific subject, they frequently generate redundant and less engaging content that does not meet user interests. In this work, we focus on the role of query outlin… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: Work in progress. Resources are available at https://github.com/youngerous/qtree

  12. arXiv:2406.16994  [pdf, other

    eess.SP cs.AI

    Quantum Multi-Agent Reinforcement Learning for Cooperative Mobile Access in Space-Air-Ground Integrated Networks

    Authors: Gyu Seon Kim, Yeryeong Cho, Jaehyun Chung, Soohyun Park, Soyi Jung, Zhu Han, Joongheon Kim

    Abstract: Achieving global space-air-ground integrated network (SAGIN) access only with CubeSats presents significant challenges such as the access sustainability limitations in specific regions (e.g., polar regions) and the energy efficiency limitations in CubeSats. To tackle these problems, high-altitude long-endurance unmanned aerial vehicles (HALE-UAVs) can complement these CubeSat shortcomings for prov… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: 17 pages, 22 figures

  13. arXiv:2406.10590  [pdf, other

    cs.HC

    LLM-Mediated Domain-Specific Voice Agents: The Case of TextileBot

    Authors: Shu Zhong, Elia Gatti, James Hardwick, Miriam Ribul, Youngjun Cho, Marianna Obrist

    Abstract: Developing domain-specific conversational agents (CAs) has been challenged by the need for extensive domain-focused data. Recent advancements in Large Language Models (LLMs) make them a viable option as a knowledge backbone. LLMs behaviour can be enhanced through prompting, instructing them to perform downstream tasks in a zero-shot fashion (i.e. without training). To this end, we incorporated str… ▽ More

    Submitted 15 June, 2024; originally announced June 2024.

  14. arXiv:2406.06587  [pdf, other

    cs.CL cs.AI cs.HC

    Exploring Human-AI Perception Alignment in Sensory Experiences: Do LLMs Understand Textile Hand?

    Authors: Shu Zhong, Elia Gatti, Youngjun Cho, Marianna Obrist

    Abstract: Aligning large language models (LLMs) behaviour with human intent is critical for future AI. An important yet often overlooked aspect of this alignment is the perceptual alignment. Perceptual modalities like touch are more multifaceted and nuanced compared to other sensory modalities such as vision. This work investigates how well LLMs align with human touch experiences using the "textile hand" ta… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

  15. arXiv:2406.05761  [pdf, other

    cs.CL

    The BiGGen Bench: A Principled Benchmark for Fine-grained Evaluation of Language Models with Language Models

    Authors: Seungone Kim, Juyoung Suk, Ji Yong Cho, Shayne Longpre, Chaeeun Kim, Dongkeun Yoon, Guijin Son, Yejin Cho, Sheikh Shafayat, Jinheon Baek, Sue Hyun Park, Hyeonbin Hwang, Jinkyung Jo, Hyowon Cho, Haebin Shin, Seongyun Lee, Hanseok Oh, Noah Lee, Namgyu Ho, Se June Joo, Miyoung Ko, Yoonjoo Lee, Hyungjoo Chae, Jamin Shin, Joel Jang , et al. (7 additional authors not shown)

    Abstract: As language models (LMs) become capable of handling a wide range of tasks, their evaluation is becoming as challenging as their development. Most generation benchmarks currently assess LMs using abstract evaluation criteria like helpfulness and harmlessness, which often lack the flexibility and granularity of human assessment. Additionally, these benchmarks tend to focus disproportionately on spec… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

    Comments: Work in Progress

  16. arXiv:2406.01506  [pdf, other

    cs.CL cs.AI cs.LG stat.ML

    The Geometry of Categorical and Hierarchical Concepts in Large Language Models

    Authors: Kiho Park, Yo Joong Choe, Yibo Jiang, Victor Veitch

    Abstract: Understanding how semantic meaning is encoded in the representation spaces of large language models is a fundamental problem in interpretability. In this paper, we study the two foundational questions in this area. First, how are categorical concepts, such as {'mammal', 'bird', 'reptile', 'fish'}, represented? Second, how are hierarchical relations between concepts encoded? For example, how is the… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: Code is available at https://github.com/KihoPark/LLM_Categorical_Hierarchical_Representations

  17. arXiv:2405.18832  [pdf, other

    cs.LG cs.AI cs.AR

    MoNDE: Mixture of Near-Data Experts for Large-Scale Sparse Models

    Authors: Taehyun Kim, Kwanseok Choi, Youngmock Cho, Jaehoon Cho, Hyuk-Jae Lee, Jaewoong Sim

    Abstract: Mixture-of-Experts (MoE) large language models (LLM) have memory requirements that often exceed the GPU memory capacity, requiring costly parameter movement from secondary memories to the GPU for expert computation. In this work, we present Mixture of Near-Data Experts (MoNDE), a near-data computing solution that efficiently enables MoE LLM inference. MoNDE reduces the volume of MoE parameter move… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: Accepted to DAC 2024

  18. arXiv:2405.18732  [pdf, other

    physics.geo-ph cs.AI cs.LG physics.app-ph

    Gemini & Physical World: Large Language Models Can Estimate the Intensity of Earthquake Shaking from Multi-Modal Social Media Posts

    Authors: S. Mostafa Mousavi, Marc Stogaitis, Tajinder Gadh, Richard M Allen, Alexei Barski, Robert Bosch, Patrick Robertson, Nivetha Thiruverahan, Youngmin Cho, Aman Raj

    Abstract: This paper presents a novel approach to extract scientifically valuable information about Earth's physical phenomena from unconventional sources, such as multi-modal social media posts. Employing a state-of-the-art large language model (LLM), Gemini 1.5 Pro (Reid et al. 2024), we estimate earthquake ground shaking intensity from these unstructured posts. The model's output, in the form of Modified… ▽ More

    Submitted 14 June, 2024; v1 submitted 28 May, 2024; originally announced May 2024.

  19. arXiv:2405.18148  [pdf, other

    cs.CV cs.AI

    Learning to Detour: Shortcut Mitigating Augmentation for Weakly Supervised Semantic Segmentation

    Authors: JuneHyoung Kwon, Eunju Lee, Yunsung Cho, YoungBin Kim

    Abstract: Weakly supervised semantic segmentation (WSSS) employing weak forms of labels has been actively studied to alleviate the annotation cost of acquiring pixel-level labels. However, classifiers trained on biased datasets tend to exploit shortcut features and make predictions based on spurious correlations between certain backgrounds and objects, leading to a poor generalization performance. In this p… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: Accepted to WACV 2024

  20. arXiv:2405.16424  [pdf, other

    cs.HC cs.AI cs.LG

    Improving Health Professionals' Onboarding with AI and XAI for Trustworthy Human-AI Collaborative Decision Making

    Authors: Min Hun Lee, Silvana Xin Yi Choo, Shamala D/O Thilarajah

    Abstract: With advanced AI/ML, there has been growing research on explainable AI (XAI) and studies on how humans interact with AI and XAI for effective human-AI collaborative decision-making. However, we still have a lack of understanding of how AI systems and XAI should be first presented to users without technical backgrounds. In this paper, we present the findings of semi-structured interviews with healt… ▽ More

    Submitted 26 May, 2024; originally announced May 2024.

  21. Salience-guided Ground Factor for Robust Localization of Delivery Robots in Complex Urban Environments

    Authors: Jooyong Park, Jungwoo Lee, Euncheol Choi, Younggun Cho

    Abstract: In urban environments for delivery robots, particularly in areas such as campuses and towns, many custom features defy standard road semantic categorizations. Addressing this challenge, our paper introduces a method leveraging Salient Object Detection (SOD) to extract these unique features, employing them as pivotal factors for enhanced robot loop closure and localization. Traditional geometric fe… ▽ More

    Submitted 20 May, 2024; originally announced May 2024.

    Comments: 8 pages, 9 figures, 2024 IEEE International Conference on Robotics and Automation (ICRA 2024)

  22. arXiv:2405.08142  [pdf

    cs.CL cs.CY

    Discursive objection strategies in online comments: Developing a classification schema and validating its training

    Authors: Ashley L. Shea, Aspen K. B. Omapang, Ji Yong Cho, Miryam Y. Ginsparg, Natalie Bazarova, Winice Hui, René F. Kizilcec, Chau Tong, Drew Margolin

    Abstract: Most Americans agree that misinformation, hate speech and harassment are harmful and inadequately curbed on social media through current moderation practices. In this paper, we aim to understand the discursive strategies employed by people in response to harmful speech in news comments. We conducted a content analysis of more than 6500 comment replies to trending news videos on YouTube and Twitter… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

    Comments: This paper was accepted and presented at the 73rd Annual International Communication Association International Conference, May 2023

    ACM Class: I.2.7, J.4

  23. arXiv:2405.04359  [pdf, other

    cs.RO

    A Personalizable Controller for the Walking Assistive omNi-Directional Exo-Robot (WANDER)

    Authors: A. Fortuna, M. Lorenzini, M. Leonori, JM. Gandarias, P. Balatti, Y. Cho, E. De Momi, A. Ajoudani

    Abstract: Preserving and encouraging mobility in the elderly and adults with chronic conditions is of paramount importance. However, existing walking aids are either inadequate to provide sufficient support to users' stability or too bulky and poorly maneuverable to be used outside hospital environments. In addition, they all lack adaptability to individual requirements. To address these challenges, this pa… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

    Comments: 6 pages, 4 figures, IEEE International Conference on Robotics and Automation (2024)

  24. arXiv:2405.03929  [pdf, other

    cs.AI physics.ao-ph

    Unicorn: U-Net for Sea Ice Forecasting with Convolutional Neural Ordinary Differential Equations

    Authors: Jaesung Park, Sungchul Hong, Yoonseo Cho, Jong-June Jeon

    Abstract: Sea ice at the North Pole is vital to global climate dynamics. However, accurately forecasting sea ice poses a significant challenge due to the intricate interaction among multiple variables. Leveraging the capability to integrate multiple inputs and powerful performances seamlessly, many studies have turned to neural networks for sea ice forecasting. This paper introduces a novel deep architectur… ▽ More

    Submitted 6 May, 2024; originally announced May 2024.

  25. arXiv:2404.18395  [pdf, other

    cs.RO

    Mesh-based Photorealistic and Real-time 3D Mapping for Robust Visual Perception of Autonomous Underwater Vehicle

    Authors: Jungwoo Lee, Younggun Cho

    Abstract: This paper proposes a photorealistic real-time dense 3D mapping system that utilizes a learning-based image enhancement method and mesh-based map representation. Due to the characteristics of the underwater environment, where problems such as hazing and low contrast occur, it is hard to apply conventional simultaneous localization and mapping (SLAM) methods. Furthermore, for sensitive tasks like i… ▽ More

    Submitted 28 April, 2024; originally announced April 2024.

    Comments: 7 pages, 15 figures, IEEE ICRA Workshop on Field Robotics 2024

  26. arXiv:2404.08611  [pdf, other

    cs.CV cs.AI physics.med-ph

    Automatic Quantification of Serial PET/CT Images for Pediatric Hodgkin Lymphoma Patients Using a Longitudinally-Aware Segmentation Network

    Authors: Xin Tie, Muheon Shin, Changhee Lee, Scott B. Perlman, Zachary Huemann, Amy J. Weisman, Sharon M. Castellino, Kara M. Kelly, Kathleen M. McCarten, Adina L. Alazraki, Junjie Hu, Steve Y. Cho, Tyler J. Bradshaw

    Abstract: $\textbf{Purpose}$: Automatic quantification of longitudinal changes in PET scans for lymphoma patients has proven challenging, as residual disease in interim-therapy scans is often subtle and difficult to detect. Our goal was to develop a longitudinally-aware segmentation network (LAS-Net) that can quantify serial PET/CT images for pediatric Hodgkin lymphoma patients. $\textbf{Materials and Metho… ▽ More

    Submitted 12 April, 2024; originally announced April 2024.

    Comments: 6 figures, 4 tables in the main text

  27. arXiv:2404.05417  [pdf, other

    cs.HC cs.AI cs.CY

    Indexing Analytics to Instances: How Integrating a Dashboard can Support Design Education

    Authors: Ajit Jain, Andruid Kerne, Nic Lupfer, Gabriel Britain, Aaron Perrine, Yoonsuck Choe, John Keyser, Ruihong Huang, Jinsil Seo, Annie Sungkajun, Robert Lightfoot, Timothy McGuire

    Abstract: We investigate how to use AI-based analytics to support design education. The analytics at hand measure multiscale design, that is, students' use of space and scale to visually and conceptually organize their design work. With the goal of making the analytics intelligible to instructors, we developed a research artifact integrating a design analytics dashboard with design instances, and the design… ▽ More

    Submitted 8 April, 2024; originally announced April 2024.

    Comments: 22 pages, 4 figures, Submitted to ACM DIS

    ACM Class: H.5.2

  28. arXiv:2404.04241  [pdf, other

    cs.RO

    Modeling Kinematic Uncertainty of Tendon-Driven Continuum Robots via Mixture Density Networks

    Authors: Jordan Thompson, Brian Y. Cho, Daniel S. Brown, Alan Kuntz

    Abstract: Tendon-driven continuum robot kinematic models are frequently computationally expensive, inaccurate due to unmodeled effects, or both. In particular, unmodeled effects produce uncertainties that arise during the robot's operation that lead to variability in the resulting geometry. We propose a novel solution to these issues through the development of a Gaussian mixture kinematic model. We train a… ▽ More

    Submitted 5 April, 2024; originally announced April 2024.

  29. arXiv:2404.03816  [pdf, other

    cs.RO

    Accounting for Hysteresis in the Forward Kinematics of Nonlinearly-Routed Tendon-Driven Continuum Robots via a Learned Deep Decoder Network

    Authors: Brian Y. Cho, Daniel S. Esser, Jordan Thompson, Bao Thach, Robert J. Webster III, Alan Kuntz

    Abstract: Tendon-driven continuum robots have been gaining popularity in medical applications due to their ability to curve around complex anatomical structures, potentially reducing the invasiveness of surgery. However, accurate modeling is required to plan and control the movements of these flexible robots. Physics-based models have limitations due to unmodeled effects, leading to mismatches between model… ▽ More

    Submitted 4 April, 2024; originally announced April 2024.

    Comments: 8 pages, 9 figures, Submitted to IEEE Robotics and Automation Letters

  30. arXiv:2404.01954  [pdf, other

    cs.CL cs.AI

    HyperCLOVA X Technical Report

    Authors: Kang Min Yoo, Jaegeun Han, Sookyo In, Heewon Jeon, Jisu Jeong, Jaewook Kang, Hyunwook Kim, Kyung-Min Kim, Munhyong Kim, Sungju Kim, Donghyun Kwak, Hanock Kwak, Se Jung Kwon, Bado Lee, Dongsoo Lee, Gichang Lee, Jooho Lee, Baeseong Park, Seongjin Shin, Joonsang Yu, Seolki Baek, Sumin Byeon, Eungsup Cho, Dooseok Choe, Jeesung Han , et al. (371 additional authors not shown)

    Abstract: We introduce HyperCLOVA X, a family of large language models (LLMs) tailored to the Korean language and culture, along with competitive capabilities in English, math, and coding. HyperCLOVA X was trained on a balanced mix of Korean, English, and code data, followed by instruction-tuning with high-quality human-annotated datasets while abiding by strict safety guidelines reflecting our commitment t… ▽ More

    Submitted 13 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: 44 pages; updated authors list and fixed author names

  31. arXiv:2404.00670  [pdf, other

    cs.CV q-bio.QM stat.AP

    Statistical Analysis by Semiparametric Additive Regression and LSTM-FCN Based Hierarchical Classification for Computer Vision Quantification of Parkinsonian Bradykinesia

    Authors: Youngseo Cho, In Hee Kwak, Dohyeon Kim, Jinhee Na, Hanjoo Sung, Jeongjae Lee, Young Eun Kim, Hyeo-il Ma

    Abstract: Bradykinesia, characterized by involuntary slowing or decrement of movement, is a fundamental symptom of Parkinson's Disease (PD) and is vital for its clinical diagnosis. Despite various methodologies explored to quantify bradykinesia, computer vision-based approaches have shown promising results. However, these methods often fall short in adequately addressing key bradykinesia characteristics in… ▽ More

    Submitted 31 March, 2024; originally announced April 2024.

  32. arXiv:2403.14176  [pdf, other

    cs.RO

    ReFeree: Radar-based efficient global descriptor using a Feature and Free space for Place Recognition

    Authors: Byunghee Choi, Hogyun Kim, Younggun Cho

    Abstract: Radar is highlighted for robust sensing capabilities in adverse weather conditions (e.g. dense fog, heavy rain, or snowfall). In addition, Radar can cover wide areas and penetrate small particles. Despite these advantages, Radar-based place recognition remains in the early stages compared to other sensors due to its unique characteristics such as low resolution, and significant noise. In this pape… ▽ More

    Submitted 17 July, 2024; v1 submitted 21 March, 2024; originally announced March 2024.

    Comments: 5 pages, 4 figures

  33. arXiv:2403.10760  [pdf, other

    cs.RO cs.AI cs.LG

    CORN: Contact-based Object Representation for Nonprehensile Manipulation of General Unseen Objects

    Authors: Yoonyoung Cho, Junhyek Han, Yoontae Cho, Beomjoon Kim

    Abstract: Nonprehensile manipulation is essential for manipulating objects that are too thin, large, or otherwise ungraspable in the wild. To sidestep the difficulty of contact modeling in conventional modeling-based approaches, reinforcement learning (RL) has recently emerged as a promising alternative. However, previous RL approaches either lack the ability to generalize over diverse object shapes, or use… ▽ More

    Submitted 15 March, 2024; originally announced March 2024.

    Comments: ICLR 2024

  34. arXiv:2403.06342  [pdf, other

    math.NA cs.LG

    Separable Physics-informed Neural Networks for Solving the BGK Model of the Boltzmann Equation

    Authors: Jaemin Oh, Seung Yeon Cho, Seok-Bae Yun, Eunbyung Park, Youngjoon Hong

    Abstract: In this study, we introduce a method based on Separable Physics-Informed Neural Networks (SPINNs) for effectively solving the BGK model of the Boltzmann equation. While the mesh-free nature of PINNs offers significant advantages in handling high-dimensional partial differential equations (PDEs), challenges arise when applying quadrature rules for accurate integral evaluation in the BGK operator, w… ▽ More

    Submitted 10 March, 2024; originally announced March 2024.

    MSC Class: 68T20; 35R09

  35. arXiv:2403.05861  [pdf, ps, other

    cs.DC

    DeepVM: Integrating Spot and On-Demand VMs for Cost-Efficient Deep Learning Clusters in the Cloud

    Authors: Yoochan Kim, Kihyun Kim, Yonghyeon Cho, Jinwoo Kim, Awais Khan, Ki-Dong Kang, Baik-Song An, Myung-Hoon Cha, Hong-Yeon Kim, Youngjae Kim

    Abstract: Distributed Deep Learning (DDL), as a paradigm, dictates the use of GPU-based clusters as the optimal infrastructure for training large-scale Deep Neural Networks (DNNs). However, the high cost of such resources makes them inaccessible to many users. Public cloud services, particularly Spot Virtual Machines (VMs), offer a cost-effective alternative, but their unpredictable availability poses a sig… ▽ More

    Submitted 14 March, 2024; v1 submitted 9 March, 2024; originally announced March 2024.

    Comments: 14 pages, 8 figures

  36. arXiv:2403.02870  [pdf, other

    cs.AI cs.CR cs.LG

    Precise Extraction of Deep Learning Models via Side-Channel Attacks on Edge/Endpoint Devices

    Authors: Younghan Lee, Sohee Jun, Yungi Cho, Woorim Han, Hyungon Moon, Yunheung Paek

    Abstract: With growing popularity, deep learning (DL) models are becoming larger-scale, and only the companies with vast training datasets and immense computing power can manage their business serving such large models. Most of those DL models are proprietary to the companies who thus strive to keep their private models safe from the model extraction attack (MEA), whose aim is to steal the model by training… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

    Comments: Accepted by 27th European Symposium on Research in Computer Security (ESORICS 2022)

  37. arXiv:2403.02846  [pdf, other

    cs.LG cs.AI cs.CR cs.DC

    FLGuard: Byzantine-Robust Federated Learning via Ensemble of Contrastive Models

    Authors: Younghan Lee, Yungi Cho, Woorim Han, Ho Bae, Yunheung Paek

    Abstract: Federated Learning (FL) thrives in training a global model with numerous clients by only sharing the parameters of their local models trained with their private training datasets. Therefore, without revealing the private dataset, the clients can obtain a deep learning (DL) model with high performance. However, recent research proposed poisoning attacks that cause a catastrophic loss in the accurac… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

    Comments: Accepted by 28th European Symposium on Research in Computer Security (ESORICS 2023)

  38. arXiv:2402.11477  [pdf, other

    cs.CY

    Studying Differential Mental Health Expressions in India

    Authors: Khushi Shelat, Sunny Rai, Devansh R Jain, Kishen Sivabalan, Young Min Cho, Maitreyi Redkar, Samindara Sawant, Sharath Chandra Guntuku

    Abstract: Psychosocial stressors and the symptomatology of mental disorders vary across cultures. However, current understandings of mental health expressions on social media are predominantly derived from studies in WEIRD (Western, Educated, Industrialized, Rich, and Democratic) contexts. In this paper, we analyze mental health posts on Reddit made by individuals in India, to identify variations in online… ▽ More

    Submitted 16 June, 2024; v1 submitted 18 February, 2024; originally announced February 2024.

  39. arXiv:2402.09698  [pdf, other

    stat.ME cs.LG math.PR math.ST stat.ML

    Combining Evidence Across Filtrations Using Adjusters

    Authors: Yo Joong Choe, Aaditya Ramdas

    Abstract: In anytime-valid sequential inference, it is known that any admissible procedure must be based on e-processes, which are composite generalizations of test martingales that quantify the accumulated evidence against a composite null hypothesis at any arbitrary stopping time. This paper studies methods for combining e-processes constructed using different information sets (filtrations) for the same n… ▽ More

    Submitted 28 May, 2024; v1 submitted 14 February, 2024; originally announced February 2024.

    Comments: Substantially revised with new results in Sections 5 and 6. Code is available at https://github.com/yjchoe/CombiningEvidenceAcrossFiltrations

  40. arXiv:2402.08966  [pdf, other

    cs.CV cs.CL

    Pretraining Vision-Language Model for Difference Visual Question Answering in Longitudinal Chest X-rays

    Authors: Yeongjae Cho, Taehee Kim, Heejun Shin, Sungzoon Cho, Dongmyung Shin

    Abstract: Difference visual question answering (diff-VQA) is a challenging task that requires answering complex questions based on differences between a pair of images. This task is particularly important in reading chest X-ray images because radiologists often compare multiple images of the same patient taken at different times to track disease progression and changes in its severity in their clinical prac… ▽ More

    Submitted 17 June, 2024; v1 submitted 14 February, 2024; originally announced February 2024.

  41. arXiv:2401.16437  [pdf, other

    physics.ao-ph cs.LG

    A Benchmark Dataset for Tornado Detection and Prediction using Full-Resolution Polarimetric Weather Radar Data

    Authors: Mark S. Veillette, James M. Kurdzo, Phillip M. Stepanian, John Y. N. Cho, Siddharth Samsi, Joseph McDonald

    Abstract: Weather radar is the primary tool used by forecasters to detect and warn for tornadoes in near-real time. In order to assist forecasters in warning the public, several algorithms have been developed to automatically detect tornadic signatures in weather radar observations. Recently, Machine Learning (ML) algorithms, which learn directly from large amounts of labeled data, have been shown to be hig… ▽ More

    Submitted 26 January, 2024; originally announced January 2024.

    Comments: 37 pages, 15 Figures, 2 Tables

  42. arXiv:2401.13087  [pdf, other

    cs.CV stat.AP

    Open-source data pipeline for street-view images: a case study on community mobility during COVID-19 pandemic

    Authors: Matthew Martell, Nick Terry, Ribhu Sengupta, Chris Salazar, Nicole A. Errett, Scott B. Miles, Joseph Wartman, Youngjun Choe

    Abstract: Street View Images (SVI) are a common source of valuable data for researchers. Researchers have used SVI data for estimating pedestrian volumes, demographic surveillance, and to better understand built and natural environments in cityscapes. However, the most common source of publicly available SVI data is Google Street View. Google Street View images are collected infrequently, making temporal an… ▽ More

    Submitted 23 January, 2024; originally announced January 2024.

    Comments: 16 pages, 4 figures, two tables. Martell and Terry are equally contributing first authors

  43. arXiv:2401.06799  [pdf, other

    cs.CL cs.LG

    Make Prompts Adaptable: Bayesian Modeling for Vision-Language Prompt Learning with Data-Dependent Prior

    Authors: Youngjae Cho, HeeSun Bae, Seungjae Shin, Yeo Dong Youn, Weonyoung Joo, Il-Chul Moon

    Abstract: Recent Vision-Language Pretrained (VLP) models have become the backbone for many downstream tasks, but they are utilized as frozen model without learning. Prompt learning is a method to improve the pre-trained VLP model by adding a learnable context vector to the inputs of the text encoder. In a few-shot learning scenario of the downstream task, MLE training can lead the context vector to over-fit… ▽ More

    Submitted 9 January, 2024; originally announced January 2024.

    Comments: Accepted to AAAI-2024

  44. arXiv:2401.06432  [pdf, other

    cs.LG cs.DC

    Heterogeneous LoRA for Federated Fine-tuning of On-Device Foundation Models

    Authors: Yae Jee Cho, Luyang Liu, Zheng Xu, Aldi Fahrezi, Gauri Joshi

    Abstract: Foundation models (FMs) adapt well to specific domains or tasks with fine-tuning, and federated learning (FL) enables the potential for privacy-preserving fine-tuning of the FMs with on-device local data. For federated fine-tuning of FMs, we consider the FMs with small to medium parameter sizes of single digit billion at maximum, referred to as on-device FMs (ODFMs) that can be deployed on devices… ▽ More

    Submitted 20 February, 2024; v1 submitted 12 January, 2024; originally announced January 2024.

  45. arXiv:2401.06400  [pdf, other

    cs.CL cs.CV

    Generalizing Visual Question Answering from Synthetic to Human-Written Questions via a Chain of QA with a Large Language Model

    Authors: Taehee Kim, Yeongjae Cho, Heejun Shin, Yohan Jo, Dongmyung Shin

    Abstract: Visual question answering (VQA) is a task where an image is given, and a series of questions are asked about the image. To build an efficient VQA algorithm, a large amount of QA data is required which is very expensive. Generating synthetic QA pairs based on templates is a practical way to obtain data. However, VQA models trained on those data do not perform well on complex, human-written question… ▽ More

    Submitted 22 August, 2024; v1 submitted 12 January, 2024; originally announced January 2024.

  46. arXiv:2401.05254  [pdf, other

    cs.CY cs.CL

    Language-based Valence and Arousal Expressions between the United States and China: a Cross-Cultural Examination

    Authors: Young-Min Cho, Dandan Pang, Stuti Thapa, Garrick Sherman, Lyle Ungar, Louis Tay, Sharath Chandra Guntuku

    Abstract: Although affective expressions of individuals have been extensively studied using social media, research has primarily focused on the Western context. There are substantial differences among cultures that contribute to their affective expressions. This paper examines the differences between Twitter (X) in the United States and Sina Weibo posts in China on two primary dimensions of affect - valence… ▽ More

    Submitted 30 July, 2024; v1 submitted 10 January, 2024; originally announced January 2024.

    Comments: preview

  47. arXiv:2401.04139  [pdf

    cs.LG

    CCNETS: A Novel Brain-Inspired Approach for Enhanced Pattern Recognition in Imbalanced Datasets

    Authors: Hanbeot Park, Yunjeong Cho, Hoon-Hee Kim

    Abstract: This study introduces CCNETS (Causal Learning with Causal Cooperative Nets), a novel generative model-based classifier designed to tackle the challenge of generating data for imbalanced datasets in pattern recognition. CCNETS is uniquely crafted to emulate brain-like information processing and comprises three main components: Explainer, Producer, and Reasoner. Each component is designed to mimic s… ▽ More

    Submitted 25 January, 2024; v1 submitted 7 January, 2024; originally announced January 2024.

    Comments: 31 pages, authors (3) is Corresponding Author

  48. arXiv:2401.02710  [pdf, other

    cs.CE cs.AI

    Synergistic Formulaic Alpha Generation for Quantitative Trading based on Reinforcement Learning

    Authors: Hong-Gi Shin, Sukhyun Jeong, Eui-Yeon Kim, Sungho Hong, Young-Jin Cho, Yong-Hoon Choi

    Abstract: Mining of formulaic alpha factors refers to the process of discovering and developing specific factors or indicators (referred to as alpha factors) for quantitative trading in stock market. To efficiently discover alpha factors in vast search space, reinforcement learning (RL) is commonly employed. This paper proposes a method to enhance existing alpha factor mining approaches by expanding a searc… ▽ More

    Submitted 7 July, 2024; v1 submitted 5 January, 2024; originally announced January 2024.

    Comments: Accepted by ICOIN 2024

  49. arXiv:2312.12488  [pdf, other

    cs.LG cs.CR cs.CV

    Foreseeing Reconstruction Quality of Gradient Inversion: An Optimization Perspective

    Authors: HyeongGwon Hong, Yooshin Cho, Hanbyel Cho, Jaesung Ahn, Junmo Kim

    Abstract: Gradient inversion attacks can leak data privacy when clients share weight updates with the server in federated learning (FL). Existing studies mainly use L2 or cosine distance as the loss function for gradient matching in the attack. Our empirical investigation shows that the vulnerability ranking varies with the loss function used. Gradient norm, which is commonly used as a vulnerability proxy f… ▽ More

    Submitted 19 December, 2023; originally announced December 2023.

    Comments: To appear in AAAI 2024

  50. arXiv:2312.04005  [pdf, other

    cs.CV cs.AI

    KOALA: Empirical Lessons Toward Memory-Efficient and Fast Diffusion Models for Text-to-Image Synthesis

    Authors: Youngwan Lee, Kwanyong Park, Yoorhim Cho, Yong-Ju Lee, Sung Ju Hwang

    Abstract: As text-to-image (T2I) synthesis models increase in size, they demand higher inference costs due to the need for more expensive GPUs with larger memory, which makes it challenging to reproduce these models in addition to the restricted access to training datasets. Our study aims to reduce these inference costs and explores how far the generative capabilities of T2I models can be extended using onl… ▽ More

    Submitted 28 May, 2024; v1 submitted 6 December, 2023; originally announced December 2023.

    Comments: Project page: https://youngwanlee.github.io/KOALA/