Skip to main content

Showing 1–50 of 115 results for author: Jung, K

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.08702  [pdf, other

    cs.AI cs.CL cs.CV

    VLind-Bench: Measuring Language Priors in Large Vision-Language Models

    Authors: Kang-il Lee, Minbeom Kim, Seunghyun Yoon, Minsung Kim, Dongryeol Lee, Hyukhun Koh, Kyomin Jung

    Abstract: Large Vision-Language Models (LVLMs) have demonstrated outstanding performance across various multimodal tasks. However, they suffer from a problem known as language prior, where responses are generated based solely on textual patterns while disregarding image information. Addressing the issue of language prior is crucial, as it can lead to undesirable biases or hallucinations when dealing with im… ▽ More

    Submitted 10 July, 2024; v1 submitted 12 June, 2024; originally announced June 2024.

  2. arXiv:2405.10944  [pdf, other

    physics.chem-ph cs.LG

    Probabilistic transfer learning methodology to expedite high fidelity simulation of reactive flows

    Authors: Bruno S. Soriano, Ki Sung Jung, Tarek Echekki, Jacqueline H. Chen, Mohammad Khalil

    Abstract: Reduced order models based on the transport of a lower dimensional manifold representation of the thermochemical state, such as Principal Component (PC) transport and Machine Learning (ML) techniques, have been developed to reduce the computational cost associated with the Direct Numerical Simulations (DNS) of reactive flows. Both PC transport and ML normally require an abundance of data to exhibi… ▽ More

    Submitted 17 May, 2024; originally announced May 2024.

  3. arXiv:2405.02995   

    math.NA cs.SD

    Analysis about Theoretical Foundations for Method to Enhancing ASR Performance using OCR Word Frequency Differences

    Authors: Kyudan Jung, Nam-Joon Kim, Hyun Gon Ryu, Hyuk-Jae Lee

    Abstract: As interest in large language models (LLMs) grows, the importance of accuracy in automatic speech recognition (ASR) has become more pronounced. This is particularly true for lectures that include specialized terminology, where the success rate of traditional ASR models tends to be low, posing a challenging problem. A method to improve ASR performance for specialized terminology using the word freq… ▽ More

    Submitted 11 May, 2024; v1 submitted 5 May, 2024; originally announced May 2024.

    Comments: Need significant edit

  4. arXiv:2404.18063  [pdf, other

    cs.LG physics.flu-dyn

    Machine Learning Techniques for Data Reduction of CFD Applications

    Authors: Jaemoon Lee, Ki Sung Jung, Qian Gong, Xiao Li, Scott Klasky, Jacqueline Chen, Anand Rangarajan, Sanjay Ranka

    Abstract: We present an approach called guaranteed block autoencoder that leverages Tensor Correlations (GBATC) for reducing the spatiotemporal data generated by computational fluid dynamics (CFD) and other scientific applications. It uses a multidimensional block of tensors (spanning in space and time) for both input and output, capturing the spatiotemporal and interspecies relationship within a tensor. Th… ▽ More

    Submitted 28 April, 2024; originally announced April 2024.

    Comments: 10 pages, 8 figures

  5. arXiv:2404.15650  [pdf, other

    cs.CL

    Return of EM: Entity-driven Answer Set Expansion for QA Evaluation

    Authors: Dongryeol Lee, Minwoo Lee, Kyungmin Min, Joonsuk Park, Kyomin Jung

    Abstract: Recently, directly using large language models (LLMs) has been shown to be the most reliable method to evaluate QA models. However, it suffers from limited interpretability, high cost, and environmental harm. To address these, we propose to use soft EM with entity-driven answer set expansion. Our approach expands the gold answer set to include diverse surface forms, based on the observation that t… ▽ More

    Submitted 11 June, 2024; v1 submitted 24 April, 2024; originally announced April 2024.

    Comments: Under Review (9 pages, 4 figures)

  6. arXiv:2404.11916  [pdf, other

    cs.CL cs.AI

    SKIP: Skill-Localized Prompt Tuning for Inference Speed Boost-Up

    Authors: Nakyeong Yang, Junseok Kim, Jiwon Moon, Yunah Jang, Kyomin Jung

    Abstract: Prompt-tuning methods have shown comparable performance as parameter-efficient fine-tuning (PEFT) methods in various natural language understanding tasks. However, existing prompt tuning methods still utilize the entire model architecture; thus, they fail to accelerate inference speed in the application. In this paper, we propose a novel approach called SKIll-localized Prompt tuning (SKIP), which… ▽ More

    Submitted 18 April, 2024; originally announced April 2024.

    Comments: 6 pages

  7. arXiv:2404.11826  [pdf, other

    cs.CL

    AdvisorQA: Towards Helpful and Harmless Advice-seeking Question Answering with Collective Intelligence

    Authors: Minbeom Kim, Hwanhee Lee, Joonsuk Park, Hwaran Lee, Kyomin Jung

    Abstract: As the integration of large language models into daily life is on the rise, there is a clear gap in benchmarks for advising on subjective and personal dilemmas. To address this, we introduce AdvisorQA, the first benchmark developed to assess LLMs' capability in offering advice for deeply personalized concerns, utilizing the LifeProTips subreddit forum. This forum features a dynamic interaction whe… ▽ More

    Submitted 17 April, 2024; originally announced April 2024.

    Comments: 19 pages, 11 figures

  8. arXiv:2404.03991  [pdf, other

    eess.IV cs.CV cs.LG

    Towards Efficient and Accurate CT Segmentation via Edge-Preserving Probabilistic Downsampling

    Authors: Shahzad Ali, Yu Rim Lee, Soo Young Park, Won Young Tak, Soon Ki Jung

    Abstract: Downsampling images and labels, often necessitated by limited resources or to expedite network training, leads to the loss of small objects and thin boundaries. This undermines the segmentation network's capacity to interpret images accurately and predict detailed labels, resulting in diminished performance compared to processing at original resolutions. This situation exemplifies the trade-off be… ▽ More

    Submitted 5 April, 2024; originally announced April 2024.

    Comments: 5 pages (4 figures, 1 table); This work has been submitted to the IEEE Signal Processing Letters. Copyright may be transferred without notice, after which this version may no longer be accessible

  9. arXiv:2404.01954  [pdf, other

    cs.CL cs.AI

    HyperCLOVA X Technical Report

    Authors: Kang Min Yoo, Jaegeun Han, Sookyo In, Heewon Jeon, Jisu Jeong, Jaewook Kang, Hyunwook Kim, Kyung-Min Kim, Munhyong Kim, Sungju Kim, Donghyun Kwak, Hanock Kwak, Se Jung Kwon, Bado Lee, Dongsoo Lee, Gichang Lee, Jooho Lee, Baeseong Park, Seongjin Shin, Joonsang Yu, Seolki Baek, Sumin Byeon, Eungsup Cho, Dooseok Choe, Jeesung Han , et al. (371 additional authors not shown)

    Abstract: We introduce HyperCLOVA X, a family of large language models (LLMs) tailored to the Korean language and culture, along with competitive capabilities in English, math, and coding. HyperCLOVA X was trained on a balanced mix of Korean, English, and code data, followed by instruction-tuning with high-quality human-annotated datasets while abiding by strict safety guidelines reflecting our commitment t… ▽ More

    Submitted 13 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: 44 pages; updated authors list and fixed author names

  10. arXiv:2403.05814  [pdf, other

    cs.CL cs.AI

    MP2D: An Automated Topic Shift Dialogue Generation Framework Leveraging Knowledge Graphs

    Authors: Yerin Hwang, Yongil Kim, Yunah Jang, Jeesoo Bang, Hyunkyung Bae, Kyomin Jung

    Abstract: Despite advancements in on-topic dialogue systems, effectively managing topic shifts within dialogues remains a persistent challenge, largely attributed to the limited availability of training datasets. To address this issue, we propose Multi-Passage to Dialogue (MP2D), a data generation framework that automatically creates conversational question-answering datasets with natural topic transitions.… ▽ More

    Submitted 9 March, 2024; originally announced March 2024.

    Comments: 20 pages

  11. arXiv:2402.06900  [pdf, other

    cs.CL cs.AI

    Can LLMs Recognize Toxicity? Definition-Based Toxicity Metric

    Authors: Hyukhun Koh, Dohyung Kim, Minwoo Lee, Kyomin Jung

    Abstract: In the pursuit of developing Large Language Models (LLMs) that adhere to societal standards, it is imperative to detect the toxicity in the generated text. The majority of existing toxicity metrics rely on encoder models trained on specific toxicity datasets, which are susceptible to out-of-distribution (OOD) problems and depend on the dataset's definition of toxicity. In this paper, we introduce… ▽ More

    Submitted 18 June, 2024; v1 submitted 10 February, 2024; originally announced February 2024.

    Comments: 8 page long

  12. arXiv:2312.10108  [pdf, other

    cs.CV cs.AI cs.LG

    Privacy-Aware Document Visual Question Answering

    Authors: Rubèn Tito, Khanh Nguyen, Marlon Tobaben, Raouf Kerkouche, Mohamed Ali Souibgui, Kangsoo Jung, Lei Kang, Ernest Valveny, Antti Honkela, Mario Fritz, Dimosthenis Karatzas

    Abstract: Document Visual Question Answering (DocVQA) is a fast growing branch of document understanding. Despite the fact that documents contain sensitive or copyrighted information, none of the current DocVQA methods offers strong privacy guarantees. In this work, we explore privacy in the domain of DocVQA for the first time. We highlight privacy issues in state of the art multi-modal LLM models used fo… ▽ More

    Submitted 15 December, 2023; originally announced December 2023.

  13. arXiv:2312.00356  [pdf, other

    physics.chem-ph cs.LG

    Transfer learning for predicting source terms of principal component transport in chemically reactive flow

    Authors: Ki Sung Jung, Tarek Echekki, Jacqueline H. Chen, Mohammad Khalil

    Abstract: The objective of this study is to evaluate whether the number of requisite training samples can be reduced with the use of various transfer learning models for predicting, for example, the chemical source terms of the data-driven reduced-order model that represents the homogeneous ignition process of a hydrogen/air mixture. Principal component analysis is applied to reduce the dimensionality of th… ▽ More

    Submitted 1 December, 2023; originally announced December 2023.

    Comments: 41 pages, 14 figures

  14. arXiv:2311.15208  [pdf, other

    cs.CL cs.AI

    LongStory: Coherent, Complete and Length Controlled Long story Generation

    Authors: Kyeongman Park, Nakyeong Yang, Kyomin Jung

    Abstract: A human author can write any length of story without losing coherence. Also, they always bring the story to a proper ending, an ability that current language models lack. In this work, we present the LongStory for coherent, complete, and length-controlled long story generation. LongStory introduces two novel methodologies: (1) the long and short-term contexts weight calibrator (CWC) and (2) long s… ▽ More

    Submitted 26 November, 2023; originally announced November 2023.

  15. arXiv:2311.13338  [pdf, other

    cs.CV

    High-Quality Face Caricature via Style Translation

    Authors: Lamyanba Laishram, Muhammad Shaheryar, Jong Taek Lee, Soon Ki Jung

    Abstract: Caricature is an exaggerated form of artistic portraiture that accentuates unique yet subtle characteristics of human faces. Recently, advancements in deep end-to-end techniques have yielded encouraging outcomes in capturing both style and elevated exaggerations in creating face caricatures. Most of these approaches tend to produce cartoon-like results that could be more practical for real-world a… ▽ More

    Submitted 22 November, 2023; originally announced November 2023.

    Comments: 14 pages, 21 figures

  16. arXiv:2311.09820  [pdf, other

    cs.IR

    IterCQR: Iterative Conversational Query Reformulation with Retrieval Guidance

    Authors: Yunah Jang, Kang-il Lee, Hyunkyung Bae, Hwanhee Lee, Kyomin Jung

    Abstract: Conversational search aims to retrieve passages containing essential information to answer queries in a multi-turn conversation. In conversational search, reformulating context-dependent conversational queries into stand-alone forms is imperative to effectively utilize off-the-shelf retrievers. Previous methodologies for conversational query reformulation frequently depend on human-annotated rewri… ▽ More

    Submitted 8 April, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

  17. arXiv:2311.09627  [pdf, other

    cs.AI cs.CL cs.LG

    Mitigating Biases for Instruction-following Language Models via Bias Neurons Elimination

    Authors: Nakyeong Yang, Taegwan Kang, Jungkyu Choi, Honglak Lee, Kyomin Jung

    Abstract: Instruction-following language models often show undesirable biases. These undesirable biases may be accelerated in the real-world usage of language models, where a wide range of instructions is used through zero-shot example prompting. To solve this problem, we first define the bias neuron, which significantly affects biased outputs, and prove its existence empirically. Furthermore, we propose a… ▽ More

    Submitted 5 June, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

    Comments: accepted to ACL 2024

  18. arXiv:2311.09585  [pdf, other

    cs.CL

    LifeTox: Unveiling Implicit Toxicity in Life Advice

    Authors: Minbeom Kim, Jahyun Koo, Hwanhee Lee, Joonsuk Park, Hwaran Lee, Kyomin Jung

    Abstract: As large language models become increasingly integrated into daily life, detecting implicit toxicity across diverse contexts is crucial. To this end, we introduce LifeTox, a dataset designed for identifying implicit toxicity within a broad range of advice-seeking scenarios. Unlike existing safety datasets, LifeTox comprises diverse contexts derived from personal experiences through open-ended ques… ▽ More

    Submitted 18 March, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

    Comments: 11 pages, 5 figures, NAACL 2024

  19. arXiv:2311.07589  [pdf, other

    cs.CL cs.AI

    Dialogizer: Context-aware Conversational-QA Dataset Generation from Textual Sources

    Authors: Yerin Hwang, Yongil Kim, Hyunkyung Bae, Jeesoo Bang, Hwanhee Lee, Kyomin Jung

    Abstract: To address the data scarcity issue in Conversational question answering (ConvQA), a dialog inpainting method, which utilizes documents to generate ConvQA datasets, has been proposed. However, the original dialog inpainting model is trained solely on the dialog reconstruction task, resulting in the generation of questions with low contextual relevance due to insufficient learning of question-answer… ▽ More

    Submitted 9 November, 2023; originally announced November 2023.

    Comments: Accepted to EMNLP 2023 main conference

  20. arXiv:2311.04037  [pdf, other

    cs.CR cs.AI cs.LG stat.ME

    Causal Discovery Under Local Privacy

    Authors: Rūta Binkytė, Carlos Pinzón, Szilvia Lestyán, Kangsoo Jung, Héber H. Arcolezi, Catuscia Palamidessi

    Abstract: Differential privacy is a widely adopted framework designed to safeguard the sensitive information of data providers within a data set. It is based on the application of controlled noise at the interface between the server that stores and processes the data, and the data consumers. Local differential privacy is a variant that allows data providers to apply the privatization mechanism themselves on… ▽ More

    Submitted 3 May, 2024; v1 submitted 7 November, 2023; originally announced November 2023.

  21. arXiv:2311.01161  [pdf, other

    cs.CL cs.AI

    Weakly Supervised Semantic Parsing with Execution-based Spurious Program Filtering

    Authors: Kang-il Lee, Segwang Kim, Kyomin Jung

    Abstract: The problem of spurious programs is a longstanding challenge when training a semantic parser from weak supervision. To eliminate such programs that have wrong semantics but correct denotation, existing methods focus on exploiting similarities between examples based on domain-specific knowledge. In this paper, we propose a domain-agnostic filtering mechanism based on program execution results. Spec… ▽ More

    Submitted 2 November, 2023; originally announced November 2023.

    Comments: EMNLP 2023

  22. arXiv:2310.14663  [pdf, other

    eess.AS cs.CL

    DPP-TTS: Diversifying prosodic features of speech via determinantal point processes

    Authors: Seongho Joo, Hyukhun Koh, Kyomin Jung

    Abstract: With the rapid advancement in deep generative models, recent neural Text-To-Speech(TTS) models have succeeded in synthesizing human-like speech. There have been some efforts to generate speech with various prosody beyond monotonous prosody patterns. However, previous works have several limitations. First, typical TTS models depend on the scaled sampling temperature for boosting the diversity of pr… ▽ More

    Submitted 23 October, 2023; originally announced October 2023.

    Comments: EMNLP 2023

  23. arXiv:2309.16701  [pdf, other

    cs.CV cs.AI cs.CL

    Is it Really Negative? Evaluating Natural Language Video Localization Performance on Multiple Reliable Videos Pool

    Authors: Nakyeong Yang, Minsung Kim, Seunghyun Yoon, Joongbo Shin, Kyomin Jung

    Abstract: With the explosion of multimedia content in recent years, Video Corpus Moment Retrieval (VCMR), which aims to detect a video moment that matches a given natural language query from multiple videos, has become a critical problem. However, existing VCMR studies have a significant limitation since they have regarded all videos not paired with a specific query as negative, neglecting the possibility o… ▽ More

    Submitted 18 March, 2024; v1 submitted 15 August, 2023; originally announced September 2023.

    Comments: 15 pages, 10 figures

  24. arXiv:2309.13457  [pdf, other

    cs.LG cs.CV physics.comp-ph physics.flu-dyn

    Turbulence in Focus: Benchmarking Scaling Behavior of 3D Volumetric Super-Resolution with BLASTNet 2.0 Data

    Authors: Wai Tong Chung, Bassem Akoush, Pushan Sharma, Alex Tamkin, Ki Sung Jung, Jacqueline H. Chen, Jack Guo, Davy Brouzet, Mohsen Talei, Bruno Savard, Alexei Y. Poludnenko, Matthias Ihme

    Abstract: Analysis of compressible turbulent flows is essential for applications related to propulsion, energy generation, and the environment. Here, we present BLASTNet 2.0, a 2.2 TB network-of-datasets containing 744 full-domain samples from 34 high-fidelity direct numerical simulations, which addresses the current limited availability of 3D high-fidelity reacting and non-reacting compressible turbulent f… ▽ More

    Submitted 27 October, 2023; v1 submitted 23 September, 2023; originally announced September 2023.

    Comments: Accepted in Adv. in Neural Information Processing Systems 36 (NeurIPS 2023). Link: https://nips.cc/virtual/2023/poster/73433 . 55 pages, 21 figures. Keywords: Super-resolution, 3D, Neural Scaling, Physics-informed Loss, Computational Fluid Dynamics, Partial Differential Equations, Turbulent Reacting Flows, Direct Numerical Simulation, Fluid Mechanics, Combustion, Computer Vision

  25. arXiv:2309.00416  [pdf, other

    cs.LG cs.CR cs.CY stat.ML

    Advancing Personalized Federated Learning: Group Privacy, Fairness, and Beyond

    Authors: Filippo Galli, Kangsoo Jung, Sayan Biswas, Catuscia Palamidessi, Tommaso Cucinotta

    Abstract: Federated learning (FL) is a framework for training machine learning models in a distributed and collaborative manner. During training, a set of participating clients process their data stored locally, sharing only the model updates obtained by minimizing a cost function over their local inputs. FL was proposed as a stepping-stone towards privacy-preserving machine learning, but it has been shown… ▽ More

    Submitted 1 September, 2023; originally announced September 2023.

  26. arXiv:2307.10479  [pdf, other

    cs.IR cs.DS

    Fast Approximate Nearest Neighbor Search with a Dynamic Exploration Graph using Continuous Refinement

    Authors: Nico Hezel, Kai Uwe Barthel, Konstantin Schall, Klaus Jung

    Abstract: For approximate nearest neighbor search, graph-based algorithms have shown to offer the best trade-off between accuracy and search time. We propose the Dynamic Exploration Graph (DEG) which significantly outperforms existing algorithms in terms of search and exploration efficiency by combining two new ideas: First, a single undirected even regular graph is incrementally built by partially replacin… ▽ More

    Submitted 21 July, 2023; v1 submitted 19 July, 2023; originally announced July 2023.

  27. arXiv:2307.09455  [pdf, other

    cs.CL

    Pseudo Outlier Exposure for Out-of-Distribution Detection using Pretrained Transformers

    Authors: Jaeyoung Kim, Kyuheon Jung, Dongbin Na, Sion Jang, Eunbin Park, Sungchul Choi

    Abstract: For real-world language applications, detecting an out-of-distribution (OOD) sample is helpful to alert users or reject such unreliable samples. However, modern over-parameterized language models often produce overconfident predictions for both in-distribution (ID) and OOD samples. In particular, language models suffer from OOD samples with a similar semantic representation to ID samples since the… ▽ More

    Submitted 19 July, 2023; v1 submitted 18 July, 2023; originally announced July 2023.

    Comments: 12 pages, 2 figures

    MSC Class: 68T50

    Journal ref: Findings of the Association for Computational Linguistics: ACL 2023 (2023) 1469-1482

  28. arXiv:2305.14016  [pdf, other

    cs.CL

    Target-Agnostic Gender-Aware Contrastive Learning for Mitigating Bias in Multilingual Machine Translation

    Authors: Minwoo Lee, Hyukhun Koh, Kang-il Lee, Dongdong Zhang, Minsung Kim, Kyomin Jung

    Abstract: Gender bias is a significant issue in machine translation, leading to ongoing research efforts in developing bias mitigation techniques. However, most works focus on debiasing bilingual models without much consideration for multilingual systems. In this paper, we specifically target the gender bias issue of multilingual machine translation models for unambiguous cases where there is a single corre… ▽ More

    Submitted 9 November, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: Accepted to EMNLP 2023 Main Conference

  29. arXiv:2305.13808  [pdf, other

    cs.CL

    Asking Clarification Questions to Handle Ambiguity in Open-Domain QA

    Authors: Dongryeol Lee, Segwang Kim, Minwoo Lee, Hwanhee Lee, Joonsuk Park, Sang-Woo Lee, Kyomin Jung

    Abstract: Ambiguous questions persist in open-domain question answering, because formulating a precise question with a unique answer is often challenging. Previously, Min et al. (2020) have tackled this issue by generating disambiguated questions for all possible interpretations of the ambiguous question. This can be effective, but not ideal for providing an answer to the user. Instead, we propose to ask a… ▽ More

    Submitted 25 October, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: 15 pages, 4 figures, accepted to EMNLP 2023 Findings

  30. arXiv:2305.06869  [pdf, other

    cs.RO

    An Adaptive Graduated Nonconvexity Loss Function for Robust Nonlinear Least Squares Solutions

    Authors: Kyungmin Jung, Thomas Hitchcox, James Richard Forbes

    Abstract: Many problems in robotics, such as estimating the state from noisy sensor data or aligning two point clouds, can be posed and solved as least-squares problems. Unfortunately, vanilla nonminimal solvers for least-squares problems are notoriously sensitive to outliers. As such, various robust loss functions have been proposed to reduce the sensitivity to outliers. Examples of loss functions include… ▽ More

    Submitted 10 May, 2024; v1 submitted 11 May, 2023; originally announced May 2023.

  31. arXiv:2303.13099  [pdf, other

    cs.CL cs.AI

    Multi-View Zero-Shot Open Intent Induction from Dialogues: Multi Domain Batch and Proxy Gradient Transfer

    Authors: Hyukhun Koh, Haesung Pyun, Nakyeong Yang, Kyomin Jung

    Abstract: In Task Oriented Dialogue (TOD) system, detecting and inducing new intents are two main challenges to apply the system in the real world. In this paper, we suggest the semantic multi-view model to resolve these two challenges: (1) SBERT for General Embedding (GE), (2) Multi Domain Batch (MDB) for dialogue domain knowledge, and (3) Proxy Gradient Transfer (PGT) for cluster-specialized semantic. MDB… ▽ More

    Submitted 13 August, 2023; v1 submitted 23 March, 2023; originally announced March 2023.

    Comments: 8 pages, 3 figures, SIGDIAL DSTC 2023 workshop

  32. arXiv:2303.08389  [pdf, other

    cs.CL

    PR-MCS: Perturbation Robust Metric for MultiLingual Image Captioning

    Authors: Yongil Kim, Yerin Hwang, Hyeongu Yun, Seunghyun Yoon, Trung Bui, Kyomin Jung

    Abstract: Vulnerability to lexical perturbation is a critical weakness of automatic evaluation metrics for image captioning. This paper proposes Perturbation Robust Multi-Lingual CLIPScore(PR-MCS), which exhibits robustness to such perturbations, as a novel reference-free image captioning metric applicable to multiple languages. To achieve perturbation robustness, we fine-tune the text encoder of CLIP with… ▽ More

    Submitted 15 March, 2023; originally announced March 2023.

  33. Varianceflow: High-Quality and Controllable Text-to-Speech using Variance Information via Normalizing Flow

    Authors: Yoonhyung Lee, Jinhyeok Yang, Kyomin Jung

    Abstract: There are two types of methods for non-autoregressive text-to-speech models to learn the one-to-many relationship between text and speech effectively. The first one is to use an advanced generative framework such as normalizing flow (NF). The second one is to use variance information such as pitch or energy together when generating speech. For the second type, it is also possible to control the va… ▽ More

    Submitted 26 February, 2023; originally announced February 2023.

    Comments: Accepted for ICASSP 2022

  34. arXiv:2301.13012  [pdf, other

    cs.CV cs.AI cs.LG

    Key Feature Replacement of In-Distribution Samples for Out-of-Distribution Detection

    Authors: Jaeyoung Kim, Seo Taek Kong, Dongbin Na, Kyu-Hwan Jung

    Abstract: Out-of-distribution (OOD) detection can be used in deep learning-based applications to reject outlier samples from being unreliably classified by deep neural networks. Learning to classify between OOD and in-distribution samples is difficult because data comprising the former is extremely diverse. It has been observed that an auxiliary OOD dataset is most effective in training a "rejection" networ… ▽ More

    Submitted 26 December, 2022; originally announced January 2023.

    Comments: Accepted to the 37th AAAI Conference on Artificial Intelligence (AAAI 2023) Main Track

  35. arXiv:2212.10938  [pdf, other

    cs.CL

    Critic-Guided Decoding for Controlled Text Generation

    Authors: Minbeom Kim, Hwanhee Lee, Kang Min Yoo, Joonsuk Park, Hwaran Lee, Kyomin Jung

    Abstract: Steering language generation towards objectives or away from undesired content has been a long-standing goal in utilizing language models (LM). Recent work has demonstrated reinforcement learning and weighted decoding as effective approaches to achieve a higher level of language control and quality with pros and cons. In this work, we propose a novel critic decoding method for controlled language… ▽ More

    Submitted 21 December, 2022; originally announced December 2022.

    Comments: 11 pages, 6 figures

  36. arXiv:2211.14807  [pdf, other

    cs.CG

    Universal convex covering problems under translation and discrete rotations

    Authors: Mook Kwon Jung, Sang Duk Yoon, Hee-Kap Ahn, Takeshi Tokuyama

    Abstract: We consider the smallest-area universal covering of planar objects of perimeter 2 (or equivalently closed curves of length 2) allowing translation and discrete rotations. In particular, we show that the solution is an equilateral triangle of height 1 when translation and discrete rotation of $π$ are allowed. Our proof is purely geometric and elementary. We also give convex coverings of closed curv… ▽ More

    Submitted 27 November, 2022; originally announced November 2022.

    MSC Class: 52C15; 05B40 ACM Class: F.0; G.0

  37. arXiv:2209.12881  [pdf, other

    cs.CV cs.RO

    Performance Evaluation of 3D Keypoint Detectors and Descriptors on Coloured Point Clouds in Subsea Environments

    Authors: Kyungmin Jung, Thomas Hitchcox, James Richard Forbes

    Abstract: The recent development of high-precision subsea optical scanners allows for 3D keypoint detectors and feature descriptors to be leveraged on point cloud scans from subsea environments. However, the literature lacks a comprehensive survey to identify the best combination of detectors and descriptors to be used in these challenging and novel environments. This paper aims to identify the best detecto… ▽ More

    Submitted 26 February, 2024; v1 submitted 26 September, 2022; originally announced September 2022.

  38. arXiv:2208.10718  [pdf, other

    cs.LG cs.AI

    String-based Molecule Generation via Multi-decoder VAE

    Authors: Kisoo Kwon, Kuhwan Jung, Junghyun Park, Hwidong Na, Jinwoo Shin

    Abstract: In this paper, we investigate the problem of string-based molecular generation via variational autoencoders (VAEs) that have served a popular generative approach for various tasks in artificial intelligence. We propose a simple, yet effective idea to improve the performance of VAE for the task. Our main idea is to maintain multiple decoders while sharing a single encoder, i.e., it is a type of ens… ▽ More

    Submitted 22 August, 2022; originally announced August 2022.

    Comments: 7 pages, 3 figures, 4 tables

  39. Multimodal Speech Emotion Recognition using Cross Attention with Aligned Audio and Text

    Authors: Yoonhyung Lee, Seunghyun Yoon, Kyomin Jung

    Abstract: In this paper, we propose a novel speech emotion recognition model called Cross Attention Network (CAN) that uses aligned audio and text signals as inputs. It is inspired by the fact that humans recognize speech as a combination of simultaneously produced acoustic and textual signals. First, our method segments the audio and the underlying text signals into equal number of steps in an aligned way… ▽ More

    Submitted 26 July, 2022; originally announced July 2022.

    Comments: 5 pages, accepted by INTERSPEECH 2020

    Journal ref: Proc. Interspeech 2020, 2717-2721

  40. arXiv:2207.12546  [pdf, other

    cs.LG physics.flu-dyn

    The Bearable Lightness of Big Data: Towards Massive Public Datasets in Scientific Machine Learning

    Authors: Wai Tong Chung, Ki Sung Jung, Jacqueline H. Chen, Matthias Ihme

    Abstract: In general, large datasets enable deep learning models to perform with good accuracy and generalizability. However, massive high-fidelity simulation datasets (from molecular chemistry, astrophysics, computational fluid dynamics (CFD), etc. can be challenging to curate due to dimensionality and storage constraints. Lossy compression algorithms can help mitigate limitations from storage, as long as… ▽ More

    Submitted 25 July, 2022; originally announced July 2022.

    Comments: Accepted in ICML 2022 2nd AI for Science Workshop. 10 pages, 8 figures

    Journal ref: ICML 2022 2nd AI for Science Workshop

  41. Lightweight Encoder-Decoder Architecture for Foot Ulcer Segmentation

    Authors: Shahzad Ali, Arif Mahmood, Soon Ki Jung

    Abstract: Continuous monitoring of foot ulcer healing is needed to ensure the efficacy of a given treatment and to avoid any possibility of deterioration. Foot ulcer segmentation is an essential step in wound diagnosis. We developed a model that is similar in spirit to the well-established encoder-decoder and residual convolution neural networks. Our model includes a residual connection along with a channel… ▽ More

    Submitted 6 July, 2022; originally announced July 2022.

    Comments: Published version of this article is available at https://link.springer.com/chapter/10.1007/978-3-031-06381-7_17

    Journal ref: Frontiers of Computer Vision. IW-FCV 2022. Communications in Computer and Information Science, vol 1578. Springer, Cham (2022)

  42. arXiv:2206.03396  [pdf, other

    cs.LG cs.AI cs.CR

    Group privacy for personalized federated learning

    Authors: Filippo Galli, Sayan Biswas, Kangsoo Jung, Tommaso Cucinotta, Catuscia Palamidessi

    Abstract: Federated learning (FL) is a type of collaborative machine learning where participating peers/clients process their data locally, sharing only updates to the collaborative model. This enables to build privacy-aware distributed machine learning models, among others. The goal is the optimization of a statistical model's parameters by minimizing a cost function of a collection of datasets which are s… ▽ More

    Submitted 4 September, 2022; v1 submitted 7 June, 2022; originally announced June 2022.

  43. Tight Differential Privacy Guarantees for the Shuffle Model with $k$-Randomized Response

    Authors: Sayan Biswas, Kangsoo Jung, Catuscia Palamidessi

    Abstract: Most differentially private (DP) algorithms assume a central model in which a reliable third party inserts noise to queries made on datasets, or a local model where the users locally perturb their data. However, the central model is vulnerable via a single point of failure, and in the local model, the utility of the data deteriorates significantly. The recently proposed shuffle model is an interme… ▽ More

    Submitted 29 April, 2024; v1 submitted 18 May, 2022; originally announced May 2022.

    Journal ref: LNCS 14551 (2024)

  44. Tight Differential Privacy Blanket for Shuffle Model

    Authors: Sayan Biswas, Kangsoo Jung, Catuscia Palamidessi

    Abstract: With the recent bloom of focus on digital economy, the importance of personal data has seen a massive surge of late. Keeping pace with this trend, the model of data market is starting to emerge as a process to obtain high-quality personal information in exchange of incentives. To have a formal guarantee to protect the privacy of the sensitive data involved in digital economy, \emph{differential pr… ▽ More

    Submitted 9 May, 2022; originally announced May 2022.

    Comments: Extended Abstract

  45. arXiv:2205.04255  [pdf, other

    cs.CV cs.DS

    Improved Evaluation and Generation of Grid Layouts using Distance Preservation Quality and Linear Assignment Sorting

    Authors: Kai Uwe Barthel, Nico Hezel, Klaus Jung, Konstantin Schall

    Abstract: Images sorted by similarity enables more images to be viewed simultaneously, and can be very useful for stock photo agencies or e-commerce applications. Visually sorted grid layouts attempt to arrange images so that their proximity on the grid corresponds as closely as possible to their similarity. Various metrics exist for evaluating such arrangements, but there is low experimental evidence on co… ▽ More

    Submitted 11 May, 2022; v1 submitted 9 May, 2022; originally announced May 2022.

  46. arXiv:2205.04157  [pdf, other

    cs.CL cs.AI

    Task-specific Compression for Multi-task Language Models using Attribution-based Pruning

    Authors: Nakyeong Yang, Yunah Jang, Hwanhee Lee, Seohyeong Jung, Kyomin Jung

    Abstract: Multi-task language models show outstanding performance for various natural language understanding tasks with only a single model. However, these language models utilize an unnecessarily large number of model parameters, even when used only for a specific task. This paper proposes a novel training-free compression method for multi-task language models using a pruning method. Specifically, we use a… ▽ More

    Submitted 11 February, 2023; v1 submitted 9 May, 2022; originally announced May 2022.

    Comments: 11 pages, 4 figures

    Journal ref: EACL 2023 Findings

  47. arXiv:2205.02035  [pdf, other

    cs.CL

    Masked Summarization to Generate Factually Inconsistent Summaries for Improved Factual Consistency Checking

    Authors: Hwanhee Lee, Kang Min Yoo, Joonsuk Park, Hwaran Lee, Kyomin Jung

    Abstract: Despite the recent advances in abstractive summarization systems, it is still difficult to determine whether a generated summary is factual consistent with the source text. To this end, the latest approach is to train a factual consistency classifier on factually consistent and inconsistent summaries. Luckily, the former is readily available as reference summaries in existing summarization dataset… ▽ More

    Submitted 4 May, 2022; originally announced May 2022.

    Comments: NAACL 2022 Findings

  48. arXiv:2204.08263  [pdf, other

    cs.CL

    Factual Error Correction for Abstractive Summaries Using Entity Retrieval

    Authors: Hwanhee Lee, Cheoneum Park, Seunghyun Yoon, Trung Bui, Franck Dernoncourt, Juae Kim, Kyomin Jung

    Abstract: Despite the recent advancements in abstractive summarization systems leveraged from large-scale datasets and pre-trained language models, the factual correctness of the summary is still insufficient. One line of trials to mitigate this problem is to include a post-editing process that can detect and correct factual errors in the summary. In building such a post-editing system, it is strongly requi… ▽ More

    Submitted 18 April, 2022; originally announced April 2022.

    Comments: 6 pages, 3 figures

  49. Establishing the Price of Privacy in Federated Data Trading

    Authors: Kangsoo Jung, Sayan Biswas, Catuscia Palamidessi

    Abstract: Personal data is becoming one of the most essential resources in today's information-based society. Accordingly, there is a growing interest in data markets, which operate data trading services between data providers and data consumers. One issue the data markets have to address is that of the potential threats to privacy. Usually some kind of protection must be provided, which generally comes to… ▽ More

    Submitted 22 June, 2022; v1 submitted 30 November, 2021; originally announced November 2021.

    Journal ref: Protocols, Strands, and Logic 2021, pp 232-250

  50. arXiv:2111.13363  [pdf, other

    cs.CV cs.LG

    PicArrange -- Visually Sort, Search, and Explore Private Images on a Mac Computer

    Authors: Klaus Jung, Kai Uwe Barthel, Nico Hezel, Konstantin Schall

    Abstract: The native macOS application PicArrange integrates state-of-the-art image sorting and similarity search to enable users to get a better overview of their images. Many file and image management features have been added to make it a tool that addresses a full image management workflow. A modification of the Self Sorting Map algorithm enables a list-like image arrangement without loosing the visual s… ▽ More

    Submitted 26 November, 2021; originally announced November 2021.

    Comments: 5 pages, 3 figures