Showing 1–44 of 44 results for author: Hwang, I

Searching in archive cs.
  1. arXiv:2408.11155  [pdf, other]

    cs.RO

    Range-based Multi-Robot Integrity Monitoring Against Cyberattacks and Faults: An Anchor-Free Approach

    Authors: Vishnu Vijay, Kartik A. Pant, Minhyun Cho, Yifan Guo, James M. Goppert, Inseok Hwang

    Abstract: Coordination of multi-robot systems (MRSs) relies on efficient sensing and reliable communication among the robots. However, the sensors and communication channels of these robots are often vulnerable to cyberattacks and faults, which can disrupt their individual behavior and the overall objective of the MRS. In this work, we present a multi-robot integrity monitoring framework that utilizes inter…

    Submitted 20 August, 2024; originally announced August 2024.

    Comments: 8 pages, 7 figures

  2. arXiv:2408.09802  [pdf, other]

    cs.SD cs.CV eess.AS

    Hear Your Face: Face-based voice conversion with F0 estimation

    Authors: Jaejun Lee, Yoori Oh, Injune Hwang, Kyogu Lee

    Abstract: This paper delves into the emerging field of face-based voice conversion, leveraging the unique relationship between an individual's facial features and their vocal characteristics. We present a novel face-based voice conversion framework that particularly utilizes the average fundamental frequency of the target speaker, derived solely from their facial images. Through extensive analysis, our fram…

    Submitted 19 August, 2024; originally announced August 2024.

    Comments: Interspeech 2024

  3. arXiv:2407.09342  [pdf, other]

    cs.RO

    MIXED-SENSE: A Mixed Reality Sensor Emulation Framework for Test and Evaluation of UAVs Against False Data Injection Attacks

    Authors: Kartik A. Pant, Li-Yu Lin, Jaehyeok Kim, Worawis Sribunma, James M. Goppert, Inseok Hwang

    Abstract: We present a high-fidelity Mixed Reality sensor emulation framework for testing and evaluating the resilience of Unmanned Aerial Vehicles (UAVs) against false data injection (FDI) attacks. The proposed approach can be utilized to assess the impact of FDI attacks, benchmark attack detector performance, and validate the effectiveness of mitigation/reconfiguration strategies in single-UAV and UAV swa…

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: 6 pages, 5 figures, IROS 2024

  4. arXiv:2406.09117  [pdf, other]

    cs.CV cs.AI

    PC-LoRA: Low-Rank Adaptation for Progressive Model Compression with Knowledge Distillation

    Authors: Injoon Hwang, Haewon Park, Youngwan Lee, Jooyoung Yang, SunJae Maeng

    Abstract: Low-rank adaption (LoRA) is a prominent method that adds a small number of learnable parameters to the frozen pre-trained weights for parameter-efficient fine-tuning. Prompted by the question, "Can we make its representation enough with LoRA weights solely at the final phase of finetuning without the pre-trained weights?" In this work, we introduce Progressive Compression LoRA (PC-LoRA), which u…

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: Accepted at T4V@CVPR

  5. arXiv:2406.03234  [pdf, other]

    cs.LG cs.AI

    Fine-Grained Causal Dynamics Learning with Quantization for Improving Robustness in Reinforcement Learning

    Authors: Inwoo Hwang, Yunhyeok Kwak, Suhyung Choi, Byoung-Tak Zhang, Sanghack Lee

    Abstract: Causal dynamics learning has recently emerged as a promising approach to enhancing robustness in reinforcement learning (RL). Typically, the goal is to build a dynamics model that makes predictions based on the causal relationships among the entities. Despite the fact that causal connections often manifest only under certain contexts, existing approaches overlook such fine-grained relationships an…

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: ICML 2024

  6. arXiv:2406.00614  [pdf, other]

    cs.LG cs.AI

    Efficient Monte Carlo Tree Search via On-the-Fly State-Conditioned Action Abstraction

    Authors: Yunhyeok Kwak, Inwoo Hwang, Dooyoung Kim, Sanghack Lee, Byoung-Tak Zhang

    Abstract: Monte Carlo Tree Search (MCTS) has showcased its efficacy across a broad spectrum of decision-making problems. However, its performance often degrades under vast combinatorial action space, especially where an action is composed of multiple sub-actions. In this work, we propose an action abstraction based on the compositional structure between a state and sub-actions for improving the efficiency o…

    Submitted 2 June, 2024; originally announced June 2024.

    Comments: UAI 2024 (Oral). The first two authors contributed equally

  7. arXiv:2405.07220  [pdf, other]

    cs.LG cs.AI stat.ML

    On Discovery of Local Independence over Continuous Variables via Neural Contextual Decomposition

    Authors: Inwoo Hwang, Yunhyeok Kwak, Yeon-Ji Song, Byoung-Tak Zhang, Sanghack Lee

    Abstract: Conditional independence provides a way to understand causal relationships among the variables of interest. An underlying system may exhibit more fine-grained causal relationships especially between a variable and its parents, which will be called the local independence relationships. One of the most widely studied local relationships is Context-Specific Independence (CSI), which holds in a specif…

    Submitted 12 May, 2024; originally announced May 2024.

    Comments: Conference on Causal Learning and Reasoning (CLeaR), 2023

  8. arXiv:2404.14647  [pdf, other]

    cs.RO eess.SY

    Human Behavior Modeling via Identification of Task Objective and Variability

    Authors: Sooyung Byeon, Dawei Sun, Inseok Hwang

    Abstract: Human behavior modeling is important for the design and implementation of human-automation interactive control systems. In this context, human behavior refers to a human's control input to systems. We propose a novel method for human behavior modeling that uses human demonstrations for a given task to infer the unknown task objective and the variability. The task objective represents the human's i…

    Submitted 22 April, 2024; originally announced April 2024.

    Comments: 10 pages

  9. arXiv:2404.01805  [pdf, other]

    cs.LG

    Improved Text Emotion Prediction Using Combined Valence and Arousal Ordinal Classification

    Authors: Michael Mitsios, Georgios Vamvoukakis, Georgia Maniati, Nikolaos Ellinas, Georgios Dimitriou, Konstantinos Markopoulos, Panos Kakoulidis, Alexandra Vioni, Myrsini Christidou, Junkwang Oh, Gunu Jho, Inchul Hwang, Georgios Vardaxoglou, Aimilios Chalamandaris, Pirros Tsiakoulis, Spyros Raptis

    Abstract: Emotion detection in textual data has received growing interest in recent years, as it is pivotal for developing empathetic human-computer interaction systems. This paper introduces a method for categorizing emotions from text, which acknowledges and differentiates between the diversified similarities and distinctions of various emotions. Initially, we establish a baseline by training a transforme…

    Submitted 2 April, 2024; originally announced April 2024.

  10. arXiv:2404.00856  [pdf, other]

    cs.SD cs.AI eess.AS

    Removing Speaker Information from Speech Representation using Variable-Length Soft Pooling

    Authors: Injune Hwang, Kyogu Lee

    Abstract: Recently, there have been efforts to encode the linguistic information of speech using a self-supervised framework for speech synthesis. However, predicting representations from surrounding representations can inadvertently entangle speaker information in the speech representation. This paper aims to remove speaker information by exploiting the structured nature of speech, composed of discrete uni…

    Submitted 31 March, 2024; originally announced April 2024.

  11. arXiv:2402.01520  [pdf, ps, other]

    cs.SD cs.LG eess.AS

    Low-Resource Cross-Domain Singing Voice Synthesis via Reduced Self-Supervised Speech Representations

    Authors: Panos Kakoulidis, Nikolaos Ellinas, Georgios Vamvoukakis, Myrsini Christidou, Alexandra Vioni, Georgia Maniati, Junkwang Oh, Gunu Jho, Inchul Hwang, Pirros Tsiakoulis, Aimilios Chalamandaris

    Abstract: In this paper, we propose a singing voice synthesis model, Karaoker-SSL, that is trained only on text and speech data as a typical multi-speaker acoustic model. It is a low-resource pipeline that does not utilize any singing data end-to-end, since its vocoder is also trained on speech data. Karaoker-SSL is conditioned by self-supervised speech representations in an unsupervised manner. We preproce…

    Submitted 2 February, 2024; originally announced February 2024.

    Comments: Accepted to IEEE ICASSP SASB 2024

  12. arXiv:2402.01298  [pdf, other]

    eess.AS cs.AI cs.SD

    Learning Semantic Information from Raw Audio Signal Using Both Contextual and Phonetic Representations

    Authors: Jaeyeon Kim, Injune Hwang, Kyogu Lee

    Abstract: We propose a framework to learn semantics from raw audio signals using two types of representations, encoding contextual and phonetic information respectively. Specifically, we introduce a speech-to-unit processing pipeline that captures two types of representations with different time resolutions. For the language model, we adopt a dual-channel architecture to incorporate both types of representa…

    Submitted 2 February, 2024; originally announced February 2024.

    Comments: Accepted to ICASSP 2024

  13. arXiv:2401.14421  [pdf, other]

    cs.LG cs.MA eess.SY stat.ML

    Multi-Agent Based Transfer Learning for Data-Driven Air Traffic Applications

    Authors: Chuhao Deng, Hong-Cheol Choi, Hyunsang Park, Inseok Hwang

    Abstract: Research in developing data-driven models for Air Traffic Management (ATM) has gained a tremendous interest in recent years. However, data-driven models are known to have long training time and require large datasets to achieve good performance. To address the two issues, this paper proposes a Multi-Agent Bidirectional Encoder Representations from Transformers (MA-BERT) model that fully considers…

    Submitted 23 January, 2024; originally announced January 2024.

    Comments: 12 pages, 8 figures, submitted for IEEE Transactions on Intelligent Transportation System

  14. arXiv:2310.16191  [pdf, other]

    cs.CR

    Can Virtual Reality Protect Users from Keystroke Inference Attacks?

    Authors: Zhuolin Yang, Zain Sarwar, Iris Hwang, Ronik Bhaskar, Ben Y. Zhao, Haitao Zheng

    Abstract: Virtual Reality (VR) has gained popularity by providing immersive and interactive experiences without geographical limitations. It also provides a sense of personal privacy through physical separation. In this paper, we show that despite assumptions of enhanced privacy, VR is unable to shield its users from side-channel attacks that steal private information. Ironically, this vulnerability arises…

    Submitted 24 October, 2023; originally announced October 2023.

    Comments: Accepted by USENIX 2024

  15. arXiv:2310.05299  [pdf]

    eess.IV cs.CV cs.LG

    Image Compression and Decompression Framework Based on Latent Diffusion Model for Breast Mammography

    Authors: InChan Hwang, MinJae Woo

    Abstract: This research presents a novel framework for the compression and decompression of medical images utilizing the Latent Diffusion Model (LDM). The LDM represents advancement over the denoising diffusion probabilistic model (DDPM) with a potential to yield superior image quality while requiring fewer computational resources in the image decompression process. A possible application of LDM and Torchvi…

    Submitted 8 October, 2023; originally announced October 2023.

    Comments: 6 pages IEEE conference

  16. arXiv:2308.16880  [pdf, other]

    cs.CV

    Text2Scene: Text-driven Indoor Scene Stylization with Part-aware Details

    Authors: Inwoo Hwang, Hyeonwoo Kim, Young Min Kim

    Abstract: We propose Text2Scene, a method to automatically create realistic textures for virtual scenes composed of multiple objects. Guided by a reference image and text descriptions, our pipeline adds detailed texture on labeled 3D geometries in the room such that the generated colors respect the hierarchical structure or semantic parts that are often composed of similar materials. Instead of applying fla…

    Submitted 31 August, 2023; originally announced August 2023.

    Comments: Accepted to CVPR 2023

  17. arXiv:2305.04422  [pdf]

    eess.IV cs.CV cs.CY cs.LG

    Multivariate Analysis on Performance Gaps of Artificial Intelligence Models in Screening Mammography

    Authors: Linglin Zhang, Beatrice Brown-Mulry, Vineela Nalla, InChan Hwang, Judy Wawira Gichoya, Aimilia Gastounioti, Imon Banerjee, Laleh Seyyed-Kalantari, MinJae Woo, Hari Trivedi

    Abstract: Although deep learning models for abnormality classification can perform well in screening mammography, the demographic, imaging, and clinical characteristics associated with increased risk of model failure remain unclear. This retrospective study uses the Emory BrEast Imaging Dataset (EMBED) containing mammograms from 115931 patients imaged at Emory Healthcare between 2013-2020, with BI-RADS asses…

    Submitted 19 October, 2023; v1 submitted 7 May, 2023; originally announced May 2023.

    Comments: 29 pages, 6 tables, 7 figures, 2 supplemental tables

  18. arXiv:2304.08204  [pdf, other]

    cs.CV

    Learning Geometry-aware Representations by Sketching

    Authors: Hyundo Lee, Inwoo Hwang, Hyunsung Go, Won-Seok Choi, Kibeom Kim, Byoung-Tak Zhang

    Abstract: Understanding geometric concepts, such as distance and shape, is essential for understanding the real world and also for many vision tasks. To incorporate such information into a visual representation of a scene, we propose learning to represent the scene by sketching, inspired by human behavior. Our method, coined Learning by Sketching (LBS), learns to convert an image into a set of colored strok…

    Submitted 17 April, 2023; originally announced April 2023.

    Comments: CVPR 2023

  19. arXiv:2302.00671  [pdf, other]

    cs.LG cs.AI cs.RO

    Efficient Multi-Task Reinforcement Learning via Selective Behavior Sharing

    Authors: Grace Zhang, Ayush Jain, Injune Hwang, Shao-Hua Sun, Joseph J. Lim

    Abstract: The ability to leverage shared behaviors between tasks is critical for sample-efficient multi-task reinforcement learning (MTRL). While prior methods have primarily explored parameter and data sharing, direct behavior-sharing has been limited to task families requiring similar behaviors. Our goal is to extend the efficacy of behavior-sharing to more general task families that could require a mix o…

    Submitted 1 February, 2023; originally announced February 2023.

  20. arXiv:2212.04018  [pdf, other]

    cs.RO

    An Open-Source Gazebo Plugin for GNSS Multipath Signal Emulation in Virtual Urban Canyons

    Authors: Kartik Anand Pant, Zhanpeng Yang, James M Goppert, Inseok Hwang

    Abstract: One of the major errors affecting GNSS signals in urban canyons is GNSS multipath error. In this work, we develop a Gazebo plugin which utilizes a ray tracing technique to account for multipath effects in a virtual urban canyon environment using virtual satellites. This software plugin balances accuracy and computational complexity to run the simulation in real-time for both software-in-the-loop (…

    Submitted 7 December, 2022; originally announced December 2022.

    Comments: 13 pages, 8 figures

  21. arXiv:2211.02291  [pdf, other]

    cs.CV cs.AI cs.LG

    SelecMix: Debiased Learning by Contradicting-pair Sampling

    Authors: Inwoo Hwang, Sangjun Lee, Yunhyeok Kwak, Seong Joon Oh, Damien Teney, Jin-Hwa Kim, Byoung-Tak Zhang

    Abstract: Neural networks trained with ERM (empirical risk minimization) sometimes learn unintended decision rules, in particular when their training data is biased, i.e., when training labels are strongly correlated with undesirable features. To prevent a network from learning such features, recent methods augment training data such that examples displaying spurious correlations (i.e., bias-aligned example…

    Submitted 4 November, 2022; originally announced November 2022.

    Comments: NeurIPS 2022

  22. arXiv:2211.01327  [pdf, other]

    cs.SD cs.CL cs.LG eess.AS

    Predicting phoneme-level prosody latents using AR and flow-based Prior Networks for expressive speech synthesis

    Authors: Konstantinos Klapsas, Karolos Nikitaras, Nikolaos Ellinas, June Sig Sung, Inchul Hwang, Spyros Raptis, Aimilios Chalamandaris, Pirros Tsiakoulis

    Abstract: A large part of the expressive speech synthesis literature focuses on learning prosodic representations of the speech signal which are then modeled by a prior distribution during inference. In this paper, we compare different prior architectures at the task of predicting phoneme level prosodic representations extracted with an unsupervised FVAE model. We use both subjective and objective metrics t…

    Submitted 2 November, 2022; originally announced November 2022.

    Comments: Submitted to ICASSP 2023

  23. arXiv:2211.00523  [pdf, other]

    cs.SD cs.CL cs.LG eess.AS

    Learning utterance-level representations through token-level acoustic latents prediction for Expressive Speech Synthesis

    Authors: Karolos Nikitaras, Konstantinos Klapsas, Nikolaos Ellinas, Georgia Maniati, June Sig Sung, Inchul Hwang, Spyros Raptis, Aimilios Chalamandaris, Pirros Tsiakoulis

    Abstract: This paper proposes an Expressive Speech Synthesis model that utilizes token-level latent prosodic variables in order to capture and control utterance-level attributes, such as character acting voice and speaking style. Current works aim to explicitly factorize such fine-grained and utterance-level speech attributes into different representations extracted by modules that operate in the correspond…

    Submitted 1 November, 2022; originally announced November 2022.

    Comments: Submitted to ICASSP 2023

  24. arXiv:2211.00375  [pdf, other]

    cs.SD cs.CL cs.LG eess.AS

    Generating Multilingual Gender-Ambiguous Text-to-Speech Voices

    Authors: Konstantinos Markopoulos, Georgia Maniati, Georgios Vamvoukakis, Nikolaos Ellinas, Georgios Vardaxoglou, Panos Kakoulidis, Junkwang Oh, Gunu Jho, Inchul Hwang, Aimilios Chalamandaris, Pirros Tsiakoulis, Spyros Raptis

    Abstract: The gender of any voice user interface is a key element of its perceived identity. Recently, there has been increasing interest in interfaces where the gender is ambiguous rather than clearly identifying as female or male. This work addresses the task of generating novel gender-ambiguous TTS voices in a multi-speaker, multilingual setting. This is accomplished by efficiently sampling from a latent…

    Submitted 11 June, 2023; v1 submitted 1 November, 2022; originally announced November 2022.

    Comments: Accepted to INTERSPEECH 2023

  25. arXiv:2211.00342  [pdf, other]

    cs.SD cs.CL cs.LG eess.AS

    Investigating Content-Aware Neural Text-To-Speech MOS Prediction Using Prosodic and Linguistic Features

    Authors: Alexandra Vioni, Georgia Maniati, Nikolaos Ellinas, June Sig Sung, Inchul Hwang, Aimilios Chalamandaris, Pirros Tsiakoulis

    Abstract: Current state-of-the-art methods for automatic synthetic speech evaluation are based on MOS prediction neural models. Such MOS prediction models include MOSNet and LDNet that use spectral features as input, and SSL-MOS that relies on a pretrained self-supervised learning model that directly uses the speech signal as input. In modern high-quality neural TTS systems, prosodic appropriateness with re…

    Submitted 7 May, 2023; v1 submitted 1 November, 2022; originally announced November 2022.

    Comments: Proceedings of ICASSP 2023

  26. arXiv:2210.17264

    cs.SD cs.CL cs.LG eess.AS

    Cross-lingual Text-To-Speech with Flow-based Voice Conversion for Improved Pronunciation

    Authors: Nikolaos Ellinas, Georgios Vamvoukakis, Konstantinos Markopoulos, Georgia Maniati, Panos Kakoulidis, June Sig Sung, Inchul Hwang, Spyros Raptis, Aimilios Chalamandaris, Pirros Tsiakoulis

    Abstract: This paper presents a method for end-to-end cross-lingual text-to-speech (TTS) which aims to preserve the target language's pronunciation regardless of the original speaker's language. The model used is based on a non-attentive Tacotron architecture, where the decoder has been replaced with a normalizing flow network conditioned on the speaker identity, allowing both TTS and voice conversion (VC)…

    Submitted 27 February, 2024; v1 submitted 31 October, 2022; originally announced October 2022.

    Comments: Fundamental changes to the model described and experimental procedure

  27. arXiv:2208.04832  [pdf, other]

    cs.AI cs.LG cs.NE

    On the Importance of Critical Period in Multi-stage Reinforcement Learning

    Authors: Junseok Park, Inwoo Hwang, Min Whoo Lee, Hyunseok Oh, Minsu Lee, Youngki Lee, Byoung-Tak Zhang

    Abstract: The initial years of an infant's life are known as the critical period, during which the overall development of learning performance is significantly impacted due to neural plasticity. In recent studies, an AI agent, with a deep neural network mimicking mechanisms of actual neurons, exhibited a learning period similar to human's critical period. Especially during this initial period, the appropria…

    Submitted 9 August, 2022; originally announced August 2022.

    Comments: Accepted by the ICML Complex Feedback in Online Learning Workshop (Open Problems) 2022

  28. Sparse Ellipsometry: Portable Acquisition of Polarimetric SVBRDF and Shape with Unstructured Flash Photography

    Authors: Inseung Hwang, Daniel S. Jeon, Adolfo Muñoz, Diego Gutierrez, Xin Tong, Min H. Kim

    Abstract: Ellipsometry techniques allow to measure polarization information of materials, requiring precise rotations of optical components with different configurations of lights and sensors. This results in cumbersome capture devices, carefully calibrated in lab conditions, and in very long acquisition times, usually in the order of a few days per object. Recent techniques allow to capture polarimetric sp…

    Submitted 8 February, 2023; v1 submitted 9 July, 2022; originally announced July 2022.

    Journal ref: ACM Transactions on Graphics 41, 4, Article 133 (July 2022)

  29. arXiv:2206.12455  [pdf, other]

    cs.CV

    Ev-NeRF: Event Based Neural Radiance Field

    Authors: Inwoo Hwang, Junho Kim, Young Min Kim

    Abstract: We present Ev-NeRF, a Neural Radiance Field derived from event data. While event cameras can measure subtle brightness changes in high frame rates, the measurements in low lighting or extreme motion suffer from significant domain discrepancy with complex noise. As a result, the performance of event-based vision tasks does not transfer to challenging environments, where the event cameras are expect…

    Submitted 5 March, 2023; v1 submitted 24 June, 2022; originally announced June 2022.

    Comments: Accepted to WACV 2023

  30. arXiv:2203.12247  [pdf, other]

    cs.CV

    Ev-TTA: Test-Time Adaptation for Event-Based Object Recognition

    Authors: Junho Kim, Inwoo Hwang, Young Min Kim

    Abstract: We introduce Ev-TTA, a simple, effective test-time adaptation algorithm for event-based object recognition. While event cameras are proposed to provide measurements of scenes with fast motions or drastic illumination changes, many existing event-based recognition algorithms suffer from performance deterioration under extreme conditions due to significant domain shifts. Ev-TTA mitigates the severe…

    Submitted 28 March, 2022; v1 submitted 23 March, 2022; originally announced March 2022.

    Comments: Accepted to CVPR 2022

  31. arXiv:2012.14681  [pdf, other]

    cs.CL cs.AI

    Faster Re-translation Using Non-Autoregressive Model For Simultaneous Neural Machine Translation

    Authors: Hyojung Han, Sathish Indurthi, Mohd Abbas Zaidi, Nikhil Kumar Lakumarapu, Beomseok Lee, Sangha Kim, Chanwoo Kim, Inchul Hwang

    Abstract: Recently, simultaneous translation has gathered a lot of attention since it enables compelling applications such as subtitle translation for a live event or real-time video-call translation. Some of these translation applications allow editing of partial translation giving rise to re-translation approaches. The current re-translation approaches are based on autoregressive sequence generation model…

    Submitted 1 June, 2021; v1 submitted 29 December, 2020; originally announced December 2020.

  32. arXiv:2012.01227  [pdf, other]

    cs.LG cs.AI

    Message Passing Adaptive Resonance Theory for Online Active Semi-supervised Learning

    Authors: Taehyeong Kim, Injune Hwang, Hyundo Lee, Hyunseo Kim, Won-Seok Choi, Joseph J. Lim, Byoung-Tak Zhang

    Abstract: Active learning is widely used to reduce labeling effort and training time by repeatedly querying only the most beneficial samples from unlabeled data. In real-world problems where data cannot be stored indefinitely due to limited storage or privacy issues, the query selection and the model update should be performed as soon as a new data sample is observed. Various online active learning methods…

    Submitted 10 July, 2021; v1 submitted 2 December, 2020; originally announced December 2020.

    Comments: Accepted to ICML 2021

  33. arXiv:2010.10740  [pdf, other]

    cs.LG cs.RO eess.SY

    Safety Verification of Model Based Reinforcement Learning Controllers

    Authors: Akshita Gupta, Inseok Hwang

    Abstract: Model-based reinforcement learning (RL) has emerged as a promising tool for developing controllers for real world systems (e.g., robotics, autonomous driving, etc.). However, real systems often have constraints imposed on their state space which must be satisfied to ensure the safety of the system and its environment. Developing a verification tool for RL algorithms is challenging because the non-…

    Submitted 20 October, 2020; originally announced October 2020.

  34. arXiv:2006.07446  [pdf, other]

    cs.LG cs.AI

    Safety-guaranteed Reinforcement Learning based on Multi-class Support Vector Machine

    Authors: Kwangyeon Kim, Akshita Gupta, Hong-Cheol Choi, Inseok Hwang

    Abstract: Several works have addressed the problem of incorporating constraints in the reinforcement learning (RL) framework, however majority of them can only guarantee the satisfaction of soft constraints. In this work, we address the problem of satisfying hard state constraints in a model-free RL setting with the deterministic system dynamics. The proposed algorithm is developed for the discrete state an…

    Submitted 12 June, 2020; originally announced June 2020.

  35. Label Propagation Adaptive Resonance Theory for Semi-supervised Continuous Learning

    Authors: Taehyeong Kim, Injune Hwang, Gi-Cheon Kang, Won-Seok Choi, Hyunseo Kim, Byoung-Tak Zhang

    Abstract: Semi-supervised learning and continuous learning are fundamental paradigms for human-level intelligence. To deal with real-world problems where labels are rarely given and the opportunity to access the same data is limited, it is necessary to apply these two paradigms in a joined fashion. In this paper, we propose Label Propagation Adaptive Resonance Theory (LPART) for semi-supervised continuous l…

    Submitted 16 April, 2020; originally announced May 2020.

    Comments: 5 pages, 2 figures, 1 table, accepted in ICASSP 2020

  36. arXiv:2001.04893  [pdf, ps, other]

    cs.LG cs.CV

    SimEx: Express Prediction of Inter-dataset Similarity by a Fleet of Autoencoders

    Authors: Inseok Hwang, Jinho Lee, Frank Liu, Minsik Cho

    Abstract: Knowing the similarity between sets of data has a number of positive implications in training an effective model, such as assisting an informed selection out of known datasets favorable to model transfer or data augmentation problems with an unknown dataset. Common practices to estimate the similarity between data include comparing in the original sample space, comparing in the embedding space fro…

    Submitted 14 January, 2020; originally announced January 2020.

    Comments: 12 pages

  37. arXiv:1912.13366  [pdf, other]

    cs.LG cs.AI stat.ML

    Fast and Accurate Transferability Measurement for Heterogeneous Multivariate Data

    Authors: Seungcheol Park, Huiwen Xu, Taehun Kim, Inhwan Hwang, Kyung-Jun Kim, U Kang

    Abstract: Given a set of heterogeneous source datasets with their classifiers, how can we quickly find the most useful source dataset for a specific target task? We address the problem of measuring transferability between source and target datasets, where the source and the target have different feature spaces and distributions. We propose Transmeter, a fast and accurate method to estimate the transferabili…

    Submitted 29 January, 2021; v1 submitted 23 December, 2019; originally announced December 2019.

  38. Ensemble-Based Deep Reinforcement Learning for Chatbots

    Authors: Heriberto Cuayáhuitl, Donghyeon Lee, Seonghan Ryu, Yongjin Cho, Sungja Choi, Satish Indurthi, Seunghak Yu, Hyungtak Choi, Inchul Hwang, Jihie Kim

    Abstract: Trainable chatbots that exhibit fluent and human-like conversations remain a big challenge in artificial intelligence. Deep Reinforcement Learning (DRL) is promising for addressing this challenge, but its successful application remains an open question. This article describes a novel ensemble-based approach applied to value-based DRL chatbots, which use finite action sets as a form of meaning repr…

    Submitted 27 August, 2019; originally announced August 2019.

    Comments: arXiv admin note: text overlap with arXiv:1908.10331

  39. arXiv:1908.10331  [pdf, other]

    cs.AI cs.CL cs.LG cs.NE

    Deep Reinforcement Learning for Chatbots Using Clustered Actions and Human-Likeness Rewards

    Authors: Heriberto Cuayáhuitl, Donghyeon Lee, Seonghan Ryu, Sungja Choi, Inchul Hwang, Jihie Kim

    Abstract: Training chatbots using the reinforcement learning paradigm is challenging due to high-dimensional states, infinite action spaces and the difficulty in specifying the reward function. We address such problems using clustered actions instead of infinite actions, and a simple but promising reward function based on human-likeness scores derived from human-human dialogue data. We train Deep Reinforcem…

    Submitted 27 August, 2019; originally announced August 2019.

    Comments: In International Joint Conference of Neural Networks (IJCNN), 2019

  40. Game Data Mining Competition on Churn Prediction and Survival Analysis using Commercial Game Log Data

    Authors: EunJo Lee, Yoonjae Jang, DuMim Yoon, JiHoon Jeon, Seong-il Yang, Sang-Kwang Lee, Dae-Wook Kim, Pei Pei Chen, Anna Guitart, Paul Bertens, África Periáñez, Fabian Hadiji, Marc Müller, Youngjun Joo, Jiyeon Lee, Inchon Hwang, Kyung-Joong Kim

    Abstract: Game companies avoid sharing their game data with external researchers. Only a few research groups have been granted limited access to game data so far. The reluctance of these companies to make data publicly available limits the wide use and development of data mining techniques and artificial intelligence research specific to the game industry. In this work, we developed and implemented an inter…

    Submitted 18 December, 2018; v1 submitted 6 February, 2018; originally announced February 2018.

    Journal ref: IEEE Transactions on Games, 2018

  41. arXiv:1708.07977  [pdf, other]

    cs.CV

    Synthesising Wider Field Images from Narrow-Field Retinal Video Acquired Using a Low-Cost Direct Ophthalmoscope (Arclight) Attached to a Smartphone

    Authors: Keylor Daniel Chaves Viquez, Ognjen Arandjelovic, Andrew Blaikie, In Ae Hwang

    Abstract: Access to low cost retinal imaging devices in low and middle income countries is limited, compromising progress in preventing needless blindness. The Arclight is a recently developed low-cost solar powered direct ophthalmoscope which can be attached to the camera of a smartphone to acquire retinal images and video. However, the acquired data is inherently limited by the optics of direct ophthalmos…

    Submitted 26 August, 2017; originally announced August 2017.

    Comments: International Conference on Computer Vision Workshop on BioImage Computing, 2017

  42. Co-salient Object Detection Based on Deep Saliency Networks and Seed Propagation over an Integrated Graph

    Authors: Dong-ju Jeong, Insung Hwang, Nam Ik Cho

    Abstract: This paper presents a co-salient object detection method to find common salient regions in a set of images. We utilize deep saliency networks to transfer co-saliency prior knowledge and better capture high-level semantic information, and the resulting initial co-saliency maps are enhanced by seed propagation steps over an integrated graph. The deep saliency networks are trained in a supervised man…

    Submitted 29 June, 2017; originally announced June 2017.

    Comments: 13 pages, 10 figures, 3 tables

  43. arXiv:1701.06190  [pdf, other]

    cs.CV

    A New Convolutional Network-in-Network Structure and Its Applications in Skin Detection, Semantic Segmentation, and Artifact Reduction

    Authors: Yoonsik Kim, Insung Hwang, Nam Ik Cho

    Abstract: The inception network has been shown to provide good performance on image classification problems, but there are not much evidences that it is also effective for the image restoration or pixel-wise labeling problems. For image restoration problems, the pooling is generally not used because the decimated features are not helpful for the reconstruction of an image as the output. Moreover, most deep…

    Submitted 22 January, 2017; originally announced January 2017.

    Comments: 10 pages

  44. arXiv:1110.5667  [pdf, other]

    cs.AI cs.LG

    Inducing Probabilistic Programs by Bayesian Program Merging

    Authors: Irvin Hwang, Andreas Stuhlmüller, Noah D. Goodman

    Abstract: This report outlines an approach to learning generative models from data. We express models as probabilistic programs, which allows us to capture abstract patterns within the examples. By choosing our language for programs to be an extension of the algebraic data type of the examples, we can begin with a program that generates all and only the examples. We then introduce greater abstraction, and h…

    Submitted 25 October, 2011; originally announced October 2011.