Showing 1–28 of 28 results for author: Koh, H

Searching in archive cs.
  1. arXiv:2406.08702  [pdf, other]

    cs.AI cs.CL cs.CV

    VLind-Bench: Measuring Language Priors in Large Vision-Language Models

    Authors: Kang-il Lee, Minbeom Kim, Seunghyun Yoon, Minsung Kim, Dongryeol Lee, Hyukhun Koh, Kyomin Jung

    Abstract: Large Vision-Language Models (LVLMs) have demonstrated outstanding performance across various multimodal tasks. However, they suffer from a problem known as language prior, where responses are generated based solely on textual patterns while disregarding image information. Addressing the issue of language prior is crucial, as it can lead to undesirable biases or hallucinations when dealing with im…

    Submitted 10 July, 2024; v1 submitted 12 June, 2024; originally announced June 2024.

  2. arXiv:2404.01628  [pdf, other]

    cs.CV cs.AI cs.LG

    Learning Equi-angular Representations for Online Continual Learning

    Authors: Minhyuk Seo, Hyunseo Koh, Wonje Jeung, Minjae Lee, San Kim, Hankook Lee, Sungjun Cho, Sungik Choi, Hyunwoo Kim, Jonghyun Choi

    Abstract: Online continual learning suffers from an underfitted solution due to insufficient training for prompt model update (e.g., single-epoch training). To address the challenge, we propose an efficient online continual learning method using the neural collapse phenomenon. In particular, we induce neural collapse to form a simplex equiangular tight frame (ETF) structure in the representation space so th…

    Submitted 2 April, 2024; originally announced April 2024.

    Comments: CVPR 2024

  3. arXiv:2402.06900  [pdf, other]

    cs.CL cs.AI

    Can LLMs Recognize Toxicity? Definition-Based Toxicity Metric

    Authors: Hyukhun Koh, Dohyung Kim, Minwoo Lee, Kyomin Jung

    Abstract: In the pursuit of developing Large Language Models (LLMs) that adhere to societal standards, it is imperative to detect the toxicity in the generated text. The majority of existing toxicity metrics rely on encoder models trained on specific toxicity datasets, which are susceptible to out-of-distribution (OOD) problems and depend on the dataset's definition of toxicity. In this paper, we introduce…

    Submitted 18 June, 2024; v1 submitted 10 February, 2024; originally announced February 2024.

    Comments: 8 page long

  4. arXiv:2401.05800  [pdf, other]

    cs.LG cs.AI

    Graph Spatiotemporal Process for Multivariate Time Series Anomaly Detection with Missing Values

    Authors: Yu Zheng, Huan Yee Koh, Ming Jin, Lianhua Chi, Haishuai Wang, Khoa T. Phan, Yi-Ping Phoebe Chen, Shirui Pan, Wei Xiang

    Abstract: The detection of anomalies in multivariate time series data is crucial for various practical applications, including smart power grids, traffic flow forecasting, and industrial process control. However, real-world time series data is usually not well-structured, posting significant challenges to existing approaches: (1) The existence of missing values in multivariate time series data along variabl…

    Submitted 11 January, 2024; originally announced January 2024.

    Comments: Accepted by Information Fusion

  5. arXiv:2312.03000  [pdf, other]

    cs.HC

    VidereX: A Navigational Application inspired by ants

    Authors: Nam Ho Koh, Doran Amos, Paul Graham, Andrew Philippides

    Abstract: Navigation is a crucial element in any person's life, whether for work, education, social living or any other miscellaneous reason; naturally, the importance of it is universally recognised and valued. One of the critical components of navigation is vision, which facilitates movement from one place to another. Navigating unfamiliar settings, especially for the blind or visually impaired, can pose…

    Submitted 3 December, 2023; originally announced December 2023.

    Comments: 6 pages, 7 figures, Workshop on Rapid and Robust Robotic Active Learning (R3AL) - Robotics: Science and Systems 2023 (RSS 2023)

  6. arXiv:2311.07343  [pdf, other]

    cs.LG

    Fine-Tuning the Retrieval Mechanism for Tabular Deep Learning

    Authors: Felix den Breejen, Sangmin Bae, Stephen Cha, Tae-Young Kim, Seoung Hyun Koh, Se-Young Yun

    Abstract: While interests in tabular deep learning has significantly grown, conventional tree-based models still outperform deep learning methods. To narrow this performance gap, we explore the innovative retrieval mechanism, a methodology that allows neural networks to refer to other data points while making predictions. Our experiments reveal that retrieval-based training, especially when fine-tuning the…

    Submitted 13 November, 2023; originally announced November 2023.

    Comments: Table Representation Learning Workshop at NeurIPS 2023

  7. arXiv:2310.14663  [pdf, other]

    eess.AS cs.CL

    DPP-TTS: Diversifying prosodic features of speech via determinantal point processes

    Authors: Seongho Joo, Hyukhun Koh, Kyomin Jung

    Abstract: With the rapid advancement in deep generative models, recent neural Text-To-Speech(TTS) models have succeeded in synthesizing human-like speech. There have been some efforts to generate speech with various prosody beyond monotonous prosody patterns. However, previous works have several limitations. First, typical TTS models depend on the scaled sampling temperature for boosting the diversity of pr…

    Submitted 23 October, 2023; originally announced October 2023.

    Comments: EMNLP 2023

  8. arXiv:2310.07984  [pdf]

    cs.AI cs.CE

    Large Language Models for Scientific Synthesis, Inference and Explanation

    Authors: Yizhen Zheng, Huan Yee Koh, Jiaxin Ju, Anh T. N. Nguyen, Lauren T. May, Geoffrey I. Webb, Shirui Pan

    Abstract: Large language models are a form of artificial intelligence systems whose primary knowledge consists of the statistical patterns, semantic relationships, and syntactical structures of language. Despite their limited forms of "knowledge", these systems are adept at numerous complex tasks including creative writing, storytelling, translation, question-answering, summarization, and computer code gen…

    Submitted 11 October, 2023; originally announced October 2023.

    Comments: Supplementary Information: https://drive.google.com/file/d/1KrpUpzuFTeMx6a6zl18lqdo8vV-UUa1Z/view?usp=sharing Github Repo: https://github.com/zyzisastudyreallyhardguy/LLM4SD

  9. Correlation-aware Spatial-Temporal Graph Learning for Multivariate Time-series Anomaly Detection

    Authors: Yu Zheng, Huan Yee Koh, Ming Jin, Lianhua Chi, Khoa T. Phan, Shirui Pan, Yi-Ping Phoebe Chen, Wei Xiang

    Abstract: Multivariate time-series anomaly detection is critically important in many applications, including retail, transportation, power grid, and water treatment plants. Existing approaches for this problem mostly employ either statistical models which cannot capture the non-linear relations well or conventional deep learning models (e.g., CNN and LSTM) that do not explicitly learn the pairwise correlati…

    Submitted 16 November, 2023; v1 submitted 17 July, 2023; originally announced July 2023.

    Comments: 17 pages, double columns, 10 tables, 3 figures. Accepted to IEEE Transactions on Neural Networks and Learning Systems (TNNLS)

  10. arXiv:2307.03759  [pdf, other]

    cs.LG cs.AI

    A Survey on Graph Neural Networks for Time Series: Forecasting, Classification, Imputation, and Anomaly Detection

    Authors: Ming Jin, Huan Yee Koh, Qingsong Wen, Daniele Zambon, Cesare Alippi, Geoffrey I. Webb, Irwin King, Shirui Pan

    Abstract: Time series are the primary data type used to record dynamic system measurements and generated in great volume by both physical sensors and online processes (virtual sensors). Time series analytics is therefore crucial to unlocking the wealth of information implicit in available data. With the recent advancements in graph neural networks (GNNs), there has been a surge in GNN-based approaches for t…

    Submitted 9 August, 2023; v1 submitted 7 July, 2023; originally announced July 2023.

    Comments: Ongoing work; 27 pages, 6 figures, 5 tables; Github page: https://github.com/KimMeen/Awesome-GNN4TS

  11. arXiv:2305.14016  [pdf, other]

    cs.CL

    Target-Agnostic Gender-Aware Contrastive Learning for Mitigating Bias in Multilingual Machine Translation

    Authors: Minwoo Lee, Hyukhun Koh, Kang-il Lee, Dongdong Zhang, Minsung Kim, Kyomin Jung

    Abstract: Gender bias is a significant issue in machine translation, leading to ongoing research efforts in developing bias mitigation techniques. However, most works focus on debiasing bilingual models without much consideration for multilingual systems. In this paper, we specifically target the gender bias issue of multilingual machine translation models for unambiguous cases where there is a single corre…

    Submitted 9 November, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: Accepted to EMNLP 2023 Main Conference

  12. arXiv:2305.10407  [pdf, other]

    cs.CL

    BAD: BiAs Detection for Large Language Models in the context of candidate screening

    Authors: Nam Ho Koh, Joseph Plata, Joyce Chai

    Abstract: Application Tracking Systems (ATS) have allowed talent managers, recruiters, and college admissions committees to process large volumes of potential candidate applications efficiently. Traditionally, this screening process was conducted manually, creating major bottlenecks due to the quantity of applications and introducing many instances of human bias. The advent of large language models (LLMs) s…

    Submitted 17 May, 2023; originally announced May 2023.

    Comments: 12 pages, 6 figures

    MSC Class: I.2; I.2.7

    ACM Class: F.2.2, I.2.7

  13. arXiv:2303.13099  [pdf, other]

    cs.CL cs.AI

    Multi-View Zero-Shot Open Intent Induction from Dialogues: Multi Domain Batch and Proxy Gradient Transfer

    Authors: Hyukhun Koh, Haesung Pyun, Nakyeong Yang, Kyomin Jung

    Abstract: In Task Oriented Dialogue (TOD) system, detecting and inducing new intents are two main challenges to apply the system in the real world. In this paper, we suggest the semantic multi-view model to resolve these two challenges: (1) SBERT for General Embedding (GE), (2) Multi Domain Batch (MDB) for dialogue domain knowledge, and (3) Proxy Gradient Transfer (PGT) for cluster-specialized semantic. MDB…

    Submitted 13 August, 2023; v1 submitted 23 March, 2023; originally announced March 2023.

    Comments: 8 pages, 3 figures, SIGDIAL DSTC 2023 workshop

  14. arXiv:2303.04623  [pdf]

    cs.LG cond-mat.dis-nn

    Continuous Function Structured in Multilayer Perceptron for Global Optimization

    Authors: Heeyuen Koh

    Abstract: The gradient information of multilayer perceptron with a linear neuron is modified with functional derivative for the global minimum search benchmarking problems. From this approach, we show that the landscape of the gradient derived from given continuous function using functional derivative can be the MLP-like form with ax+b neurons. In this extent, the suggested algorithm improves the availabili…

    Submitted 7 March, 2023; originally announced March 2023.

  15. arXiv:2302.13696  [pdf, other]

    cs.LG cs.AI cs.GT cs.NE

    Moderate Adaptive Linear Units (MoLU)

    Authors: Hankyul Koh, Joon-hyuk Ko, Wonho Jhe

    Abstract: We propose a new high-performance activation function, Moderate Adaptive Linear Units (MoLU), for the deep neural network. The MoLU is a simple, beautiful and powerful activation function that can be a good main activation function among hundreds of activation functions. Because the MoLU is made up of the elementary functions, not only it is a infinite diffeomorphism (i.e. smooth and infinitely di…

    Submitted 10 June, 2024; v1 submitted 27 February, 2023; originally announced February 2023.

    Comments: 4 pages, 5 figures

  16. arXiv:2210.16732  [pdf, other]

    cs.CL

    How Far are We from Robust Long Abstractive Summarization?

    Authors: Huan Yee Koh, Jiaxin Ju, He Zhang, Ming Liu, Shirui Pan

    Abstract: Abstractive summarization has made tremendous progress in recent years. In this work, we perform fine-grained human annotations to evaluate long document abstractive summarization systems (i.e., models and metrics) with the aim of implementing them to generate reliable summaries. For long document abstractive models, we show that the constant strive for state-of-the-art ROUGE results can lead us t…

    Submitted 29 October, 2022; originally announced October 2022.

    Comments: EMNLP 2022

  17. Does Mode of Digital Contact Tracing Affect User Willingness to Share Information? A Quantitative Study

    Authors: Camellia Zakaria, Pin Sym Foong, Chang Siang Lim, Pavithren V. S. Pakianathan, Gerald Huat Choon Koh, Simon Tangi Perrault

    Abstract: Digital contact tracing can limit the spread of infectious diseases. Nevertheless, there remain barriers to attaining sufficient adoption. In this study, we investigate how willingness to participate in contact tracing is affected by two critical factors: the modes of data collection and the type of data collected. We conducted a scenario-based survey study among 220 respondents in the United Stat…

    Submitted 24 October, 2022; originally announced October 2022.

    Comments: 18 pages, 11 figures, 13 tables

    Journal ref: In CHI Conference on Human Factors in Computing Systems, pp. 1-18. 2022

  18. arXiv:2210.01407  [pdf, other]

    cs.LG math.DS math.OC physics.app-ph

    Homotopy-based training of NeuralODEs for accurate dynamics discovery

    Authors: Joon-Hyuk Ko, Hankyul Koh, Nojun Park, Wonho Jhe

    Abstract: Neural Ordinary Differential Equations (NeuralODEs) present an attractive way to extract dynamical laws from time series data, as they bridge neural networks with the differential equation-based modeling paradigm of the physical sciences. However, these models often display long training times and suboptimal results, especially for longer duration data. While a common strategy in the literature im…

    Submitted 23 January, 2024; v1 submitted 4 October, 2022; originally announced October 2022.

    Comments: 10 pages, 5 figures, accepted at NeurIPS2023 (https://neurips.cc/virtual/2023/poster/70313)

    Journal ref: Joon-Hyuk Ko, Hankyul Koh, Nojun Park, and Wonho Jhe. Advances in Neural Information Processing Systems (2023)

  19. An Empirical Survey on Long Document Summarization: Datasets, Models and Metrics

    Authors: Huan Yee Koh, Jiaxin Ju, Ming Liu, Shirui Pan

    Abstract: Long documents such as academic articles and business reports have been the standard format to detail out important issues and complicated subjects that require extra attention. An automatic summarization system that can effectively condense long documents into short and concise texts to encapsulate the most important information would thus be significant in aiding the reader's comprehension. Rece…

    Submitted 2 July, 2022; originally announced July 2022.

    Comments: Accepted for publication by ACM Computing Surveys

  20. arXiv:2203.15355  [pdf, other]

    cs.CV cs.AI cs.LG

    Online Continual Learning on a Contaminated Data Stream with Blurry Task Boundaries

    Authors: Jihwan Bang, Hyunseo Koh, Seulki Park, Hwanjun Song, Jung-Woo Ha, Jonghyun Choi

    Abstract: Learning under a continuously changing data distribution with incorrect labels is a desirable real-world problem yet challenging. A large body of continual learning (CL) methods, however, assumes data streams with clean labels, and online learning scenarios under noisy data streams are yet underexplored. We consider a more practical CL task setup of an online learning from blurry data stream with…

    Submitted 30 March, 2022; v1 submitted 29 March, 2022; originally announced March 2022.

    Comments: Accepted paper at CVPR 2022

  21. arXiv:2110.10031  [pdf, other]

    cs.LG cs.CV

    Online Continual Learning on Class Incremental Blurry Task Configuration with Anytime Inference

    Authors: Hyunseo Koh, Dahyun Kim, Jung-Woo Ha, Jonghyun Choi

    Abstract: Despite rapid advances in continual learning, a large body of research is devoted to improving performance in the existing setups. While a handful of work do propose new continual learning setups, they still lack practicality in certain aspects. For better practicality, we first propose a novel continual learning setup that is online, task-free, class-incremental, of blurry task boundaries and sub…

    Submitted 21 March, 2022; v1 submitted 19 October, 2021; originally announced October 2021.

    Comments: to appear in ICLR2022

  22. arXiv:2110.01280  [pdf, other]

    cs.CL

    Leveraging Information Bottleneck for Scientific Document Summarization

    Authors: Jiaxin Ju, Ming Liu, Huan Yee Koh, Yuan Jin, Lan Du, Shirui Pan

    Abstract: This paper presents an unsupervised extractive approach to summarize scientific long documents based on the Information Bottleneck principle. Inspired by previous work which uses the Information Bottleneck principle for sentence compression, we extend it to document level summarization with two separate steps. In the first step, we use signal(s) as queries to retrieve the key content from the sour…

    Submitted 4 October, 2021; originally announced October 2021.

    Comments: Accepted at EMNLP 2021 Findings

  23. arXiv:2102.01033  [pdf, other]

    hep-ex cs.CV

    Scalable, End-to-End, Deep-Learning-Based Data Reconstruction Chain for Particle Imaging Detectors

    Authors: Francois Drielsma, Kazuhiro Terao, Laura Dominé, Dae Heun Koh

    Abstract: Recent inroads in Computer Vision (CV) and Machine Learning (ML) have motivated a new approach to the analysis of particle imaging detector data. Unlike previous efforts which tackled isolated CV tasks, this paper introduces an end-to-end, ML-based data reconstruction chain for Liquid Argon Time Projection Chambers (LArTPCs), the state-of-the-art in precision imaging at the intensity frontier of n…

    Submitted 1 February, 2021; originally announced February 2021.

    Comments: Third Workshop on Machine Learning and the Physical Sciences (NeurIPS 2020), Vancouver, Canada

  24. Blockchain for the Internet of Vehicles towards Intelligent Transportation Systems: A Survey

    Authors: Muhammad Baqer Mollah, Jun Zhao, Dusit Niyato, Yong Liang Guan, Chau Yuen, Sumei Sun, Kwok-Yan Lam, Leong Hai Koh

    Abstract: Internet of Vehicles (IoV) is an emerging concept that is believed to help realise the vision of intelligent transportation systems (ITS). IoV has become an important research area of impactful applications in recent years due to the rapid advancements in vehicular technologies, high throughput satellite communication, Internet of Things and cyber-physical systems. IoV enables the integration of s…

    Submitted 2 October, 2020; v1 submitted 12 July, 2020; originally announced July 2020.

    Comments: 28 Pages, 17 Figures, 4 tables

    Journal ref: IEEE Internet of Things Journal 2020

  25. arXiv:2007.03083  [pdf, other]

    physics.ins-det cs.CV eess.IV

    Scalable, Proposal-free Instance Segmentation Network for 3D Pixel Clustering and Particle Trajectory Reconstruction in Liquid Argon Time Projection Chambers

    Authors: Dae Heun Koh, Pierre Côte de Soux, Laura Dominé, François Drielsma, Ran Itay, Qing Lin, Kazuhiro Terao, Ka Vang Tsang, Tracy Usher

    Abstract: Liquid Argon Time Projection Chambers (LArTPCs) are high resolution particle imaging detectors, employed by accelerator-based neutrino oscillation experiments for high precision physics measurements. While images of particle trajectories are intuitive to analyze for physicists, the development of a high quality, automated data reconstruction chain remains challenging. One of the most critical reco…

    Submitted 6 July, 2020; originally announced July 2020.

  26. arXiv:2007.01335  [pdf, other]

    physics.ins-det cs.CV

    Clustering of Electromagnetic Showers and Particle Interactions with Graph Neural Networks in Liquid Argon Time Projection Chambers Data

    Authors: Francois Drielsma, Qing Lin, Pierre Côte de Soux, Laura Dominé, Ran Itay, Dae Heun Koh, Bradley J. Nelson, Kazuhiro Terao, Ka Vang Tsang, Tracy L. Usher

    Abstract: Liquid Argon Time Projection Chambers (LArTPCs) are a class of detectors that produce high resolution images of charged particles within their sensitive volume. In these images, the clustering of distinct particles into superstructures is of central importance to the current and future neutrino physics program. Electromagnetic (EM) activity typically exhibits spatially detached fragments of varyin…

    Submitted 14 December, 2020; v1 submitted 2 July, 2020; originally announced July 2020.

  27. arXiv:2006.14745  [pdf, other]

    hep-ex cs.CV physics.ins-det

    Point Proposal Network for Reconstructing 3D Particle Endpoints with Sub-Pixel Precision in Liquid Argon Time Projection Chambers

    Authors: Laura Dominé, Pierre Côte de Soux, François Drielsma, Dae Heun Koh, Ran Itay, Qing Lin, Kazuhiro Terao, Ka Vang Tsang, Tracy L. Usher

    Abstract: Liquid Argon Time Projection Chambers (LArTPC) are particle imaging detectors recording 2D or 3D images of trajectories of charged particles. Identifying points of interest in these images, namely the initial and terminal points of track-like particle trajectories such as muons and protons, and the initial points of electromagnetic shower-like particle trajectories such as electrons and gamma rays…

    Submitted 10 July, 2020; v1 submitted 25 June, 2020; originally announced June 2020.

    Journal ref: Phys. Rev. D 104, 032004 (2021)

  28. arXiv:1911.03298  [pdf, other]

    cs.CR cs.DC cs.NI cs.SI eess.SY

    Blockchain for Future Smart Grid: A Comprehensive Survey

    Authors: Muhammad Baqer Mollah, Jun Zhao, Dusit Niyato, Kwok-Yan Lam, Xin Zhang, Amer M. Y. M. Ghias, Leong Hai Koh, Lei Yang

    Abstract: The concept of smart grid has been introduced as a new vision of the conventional power grid to figure out an efficient way of integrating green and renewable energy technologies. In this way, Internet-connected smart grid, also called energy Internet, is also emerging as an innovative approach to ensure the energy from anywhere at any time. The ultimate goal of these developments is to build a su…

    Submitted 13 May, 2020; v1 submitted 8 November, 2019; originally announced November 2019.

    Comments: 26 pages, 13 figures, 5 tables

    Journal ref: IEEE Internet of Things Journal, 2020