Search | arXiv e-print repository

Parameterized Physics-informed Neural Networks for Parameterized PDEs

Authors: Woojin Cho, Minju Jo, Haksoo Lim, Kookjin Lee, Dongeun Lee, Sanghyun Hong, Noseong Park

Abstract: Complex physical systems are often described by partial differential equations (PDEs) that depend on parameters such as the Reynolds number in fluid mechanics. In applications such as design optimization or uncertainty quantification, solutions of those PDEs need to be evaluated at numerous points in the parameter space. While physics-informed neural networks (PINNs) have emerged as a new strong competitor as a surrogate, their usage in this scenario remains underexplored due to the inherent need for repetitive and time-consuming training. In this paper, we address this problem by proposing a novel extension, parameterized physics-informed neural networks (P$^2$INNs). P$^2$INNs enable modeling the solutions of parameterized PDEs via explicitly encoding a latent representation of PDE parameters. With the extensive empirical evaluation, we demonstrate that P$^2$INNs outperform the baselines both in accuracy and parameter efficiency on benchmark 1D and 2D parameterized PDEs and are also effective in overcoming the known "failure modes".

Submitted 18 August, 2024; originally announced August 2024.

arXiv:2408.07944 [pdf, other]

Training Spatial-Frequency Visual Prompts and Probabilistic Clusters for Accurate Black-Box Transfer Learning

Authors: Wonwoo Cho, Kangyeol Kim, Saemee Choi, Jaegul Choo

Abstract: Despite the growing prevalence of black-box pre-trained models (PTMs) such as prediction API services, there remains a significant challenge in directly applying general models to real-world scenarios due to the data distribution gap. Considering a data deficiency and constrained computational resource scenario, this paper proposes a novel parameter-efficient transfer learning framework for vision recognition models in the black-box setting. Our framework incorporates two novel training techniques. First, we align the input space (i.e., image) of PTMs to the target data distribution by generating visual prompts of spatial and frequency domain. Along with the novel spatial-frequency hybrid visual prompter, we design a novel training technique based on probabilistic clusters, which can enhance class separation in the output space (i.e., prediction probabilities). In experiments, our model demonstrates superior performance in a few-shot transfer learning setting across extensive visual recognition datasets, surpassing state-of-the-art baselines. Additionally, we show that the proposed method efficiently reduces computational costs for training and inference phases.

Submitted 15 August, 2024; originally announced August 2024.

Comments: ACM Multimedia 2024

arXiv:2408.05917 [pdf]

Inverse design of Non-parameterized Ventilated Acoustic Resonator via Variational Autoencoder with Acoustic Response-encoded Latent Space

Authors: Min Woo Cho, Seok Hyeon Hwang, Jun-Young Jang, Jin Yeong Song, Sun-kwang Hwang, Kyoung Je Cha, Dong Yong Park, Kyungjun Song, Sang Min Park

Abstract: Ventilated acoustic resonators (VARs), a type of acoustic metamaterial, have emerged as an alternative for sound attenuation in environments that require ventilation, owing to their excellent low-frequency attenuation performance and flexible shape adaptability. However, due to the non-linear acoustic responses of VARs, the VAR designs are generally obtained within a limited parametrized design space, and the design relies on the iteration of the numerical simulation which consumes a considerable amount of computational time and resources. This paper proposes an acoustic response-encoded variational autoencoder (AR-VAE), a novel variational autoencoder-based generative design model for the efficient and accurate inverse design of VAR even with non-parametrized designs. The AR-VAE matches the high-dimensional acoustic response with the VAR cross-section image in the dimension-reduced latent space, which enables the AR-VAE to generate various non-parametrized VAR cross-section images with the target acoustic response. AR-VAE generates non-parameterized VARs from target acoustic responses, which show a 25-fold reduction in mean squared error compared to conventional deep learning-based parameter searching methods while exhibiting lower average mean squared error and peak frequency variance. By combining the inverse-designed VARs by AR-VAE, a multi-cavity VAR was devised for broadband and multitarget peak frequency attenuation. The proposed design method presents a new approach for structural inverse-design with a high-dimensional non-linear physical response.

Submitted 12 August, 2024; originally announced August 2024.

arXiv:2408.03004 [pdf, other]

Reconciling Cosmological Tensions with Inelastic Dark Matter and Dark Radiation in a $\boldsymbol{U(1)_D}$ Framework

Authors: Wonsub Cho, Ki-Young Choi, Satyabrata Mahapatra

Abstract: We propose a novel and comprehensive particle physics framework that addresses multiple cosmological tensions observed in recent measurements of the Hubble parameter, $S_8$, and Lyman-$α$ forest data. Our model, termed `{\bf SIDR+$\boldsymbol{z_t}$}' (Self Interacting Dark Radiation with transition redshift), is based on an inelastic dark matter (IDM) scenario coupled with dark radiation, governed by a $U(1)_D$ gauge symmetry. This framework naturally incorporates cold dark matter (DM), SIDR, and the interactions between these components. The fluid-like behavior of the dark radiation component effectively mitigates both the Hubble and $S_8$ tensions by suppressing free-streaming effects. Simultaneously, the interacting DM-DR system attenuates the matter power spectrum at small scales, potentially reconciling discrepancies in Lyman-$α$ (Ly-$α$) observations. The inelastic nature of DM provides a distinct temperature dependence for the DM-DR interaction rate determined by the mass-splitting between the inelastic dark fermions which is crucial for resolving the Ly-$α$ discrepancies. We present a cosmologically consistent analysis of the model by solving the relevant Boltzmann equations to obtain the energy density and number density evolution of different species of the model. The DR undergoes two ``steps'' of increased energy density when the heavier dark species freeze out and become non-relativistic, transferring their entropy to the dark radiation and enhancing $ΔN_{\rm eff}$. The analysis showcases the model's potential to uphold the Big Bang Nucleosynthesis (BBN) prediction of $ΔN_{\rm eff}$ but dominantly producing additional contributions prior to recombination, while simultaneously achieving correct relic density of DM through a hybrid of freeze-in and non-thermal production.

Submitted 6 August, 2024; originally announced August 2024.

Comments: 27 pages, 7 captioned figures

arXiv:2408.00351 [pdf, other]

Hierarchically Structured Neural Bones for Reconstructing Animatable Objects from Casual Videos

Authors: Subin Jeon, In Cho, Minsu Kim, Woong Oh Cho, Seon Joo Kim

Abstract: We propose a new framework for creating and easily manipulating 3D models of arbitrary objects using casually captured videos. Our core ingredient is a novel hierarchy deformation model, which captures motions of objects with tree-structured bones. Our hierarchy system decomposes motions based on the granularity and reveals the correlations between parts without exploiting any prior structural knowledge. We further propose to regularize the bones to be positioned at the basis of motions, centers of parts, sufficiently covering related surfaces of the part. This is achieved by our bone occupancy function, which identifies whether a given 3D point is placed within the bone. Coupling the proposed components, our framework offers several clear advantages: (1) users can obtain animatable 3D models of the arbitrary objects in improved quality from their casual videos, (2) users can manipulate 3D models in an intuitive manner with minimal costs, and (3) users can interactively add or delete control points as necessary. The experimental results demonstrate the efficacy of our framework on diverse instances, in reconstruction quality, interpretability and easier manipulation. Our code is available at https://github.com/subin6/HSNB.

Submitted 1 August, 2024; originally announced August 2024.

Comments: ECCV 2024 accepted

arXiv:2407.09779 [pdf, other]

Layout-and-Retouch: A Dual-stage Framework for Improving Diversity in Personalized Image Generation

Authors: Kangyeol Kim, Wooseok Seo, Sehyun Nam, Bodam Kim, Suhyeon Jeong, Wonwoo Cho, Jaegul Choo, Youngjae Yu

Abstract: Personalized text-to-image (P-T2I) generation aims to create new, text-guided images featuring the personalized subject with a few reference images. However, balancing the trade-off relationship between prompt fidelity and identity preservation remains a critical challenge. To address the issue, we propose a novel P-T2I method called Layout-and-Retouch, consisting of two stages: 1) layout generation and 2) retouch. In the first stage, our step-blended inference utilizes the inherent sample diversity of vanilla T2I models to produce diversified layout images, while also enhancing prompt fidelity. In the second stage, multi-source attention swapping integrates the context image from the first stage with the reference image, leveraging the structure from the context image and extracting visual features from the reference image. This achieves high prompt fidelity while preserving identity characteristics. Through our extensive experiments, we demonstrate that our method generates a wide variety of images with diverse layouts while maintaining the unique identity features of the personalized objects, even with challenging text prompts. This versatility highlights the potential of our framework to handle complex conditions, significantly enhancing the diversity and applicability of personalized image synthesis.

Submitted 13 July, 2024; originally announced July 2024.

arXiv:2407.08229 [pdf, other]

Stable dark matter from Pauli blocking in the degenerate fermion background with Quantum Field Theory

Authors: Wonsub Cho, Ki-Young Choi, Junghoon Joh, Osamu Seto

Abstract: We study a mechanism to make dark matter stable based on the Pauli blocking in the fermion background. In the background where fermions occupy the states, the decay of dark matter to those final states is not allowed; as a result, DM becomes stable. We derive the evolution equations of the distribution function in the quantum field theory and compare it with the Boltzmann equation. We apply this mechanism to a realistic model of neutrino and dark matter.

Submitted 11 July, 2024; originally announced July 2024.

Comments: 19 pages, 9 figures

Report number: EPHOU-24-008

arXiv:2407.03231 [pdf]

doi 10.1021/acs.nanolett.4c01536

Dimensionality Engineering of Magnetic Anisotropy from Anomalous Hall Effect in Synthetic SrRuO3 Crystals

Authors: Seung Gyo Jeong, Seong Won Cho, Sehwan Song, Jin Young Oh, Do Gyeom Jeong, Gyeongtak Han, Hu Young Jeong, Ahmed Yousef Mohamed, Woo-suk Noh, Sungkyun Park, Jong Seok Lee, Suyoun Lee, Young-Min Kim, Deok-Yong Cho, Woo Seok Choi

Abstract: Magnetic anisotropy in atomically thin correlated heterostructures is essential for exploring quantum magnetic phases for next-generation spintronics. Whereas previous studies have mostly focused on van der Waals systems, here, we investigate the impact of dimensionality of epitaxially-grown correlated oxides down to the monolayer limit on structural, magnetic, and orbital anisotropies. By designing oxide superlattices with a correlated ferromagnetic SrRuO3 and nonmagnetic SrTiO3 layers, we observed modulated ferromagnetic behavior with the change of the SrRuO3 thickness. Especially, for three-unit-cell-thick layers, we observe a significant 1,500% improvement of coercive field in the anomalous Hall effect, which cannot be solely attributed to the dimensional crossover in ferromagnetism. The atomic-scale heterostructures further reveal the systematic modulation of anisotropy for the lattice structure and orbital hybridization, explaining the enhanced magnetic anisotropy. Our findings provide valuable insights into engineering the anisotropic hybridization of synthetic magnetic crystals, offering a tunable spin order for various applications.

Submitted 3 July, 2024; originally announced July 2024.

Comments: 23 pages

Journal ref: published 2024

arXiv:2406.18881 [pdf, other]

doi 10.1109/JSSC.2024.3435736

A Wireless, Multicolor Fluorescence Image Sensor Implant for Real-Time Monitoring in Cancer Therapy

Authors: Micah Roschelle, Rozhan Rabbani, Surin Gweon, Rohan Kumar, Alec Vercruysse, Nam Woo Cho, Matthew H. Spitzer, Ali M. Niknejad, Vladimir M. Stojanovic, Mekhail Anwar

Abstract: Real-time monitoring of dynamic biological processes in the body is critical to understanding disease progression and treatment response. This data, for instance, can help address the lower than 50% response rates to cancer immunotherapy. However, current clinical imaging modalities lack the molecular contrast, resolution, and chronic usability for rapid and accurate response assessments. Here, we present a fully wireless image sensor featuring a 2.5$\times$5 mm$^2$ CMOS integrated circuit for multicolor fluorescence imaging deep in tissue. The sensor operates wirelessly via ultrasound (US) at 5 cm depth in oil, harvesting energy with 221 mW/cm$^{2}$ incident US power density (31% of FDA limits) and backscattering data at 13 kbps with a bit error rate <$10^{-6}$. In-situ fluorescence excitation is provided by micro-laser diodes controlled with a programmable on-chip driver. An optical frontend combining a multi-bandpass interference filter and a fiber optic plate provides >6 OD excitation blocking and enables three-color imaging for detecting multiple cell types. A 36$\times$40-pixel array captures images with <125 $μ$m resolution. We demonstrate wireless, dual-color fluorescence imaging of both effector and suppressor immune cells in ex vivo mouse tumor samples with and without immunotherapy. These results show promise for providing rapid insight into therapeutic response and resistance, guiding personalized medicine.

Submitted 27 June, 2024; originally announced June 2024.

Comments: *equally contributing authors

Journal ref: IEEE J. Solid-State Circuits (2024)

arXiv:2406.12223 [pdf, other]

ToxiCloakCN: Evaluating Robustness of Offensive Language Detection in Chinese with Cloaking Perturbations

Authors: Yunze Xiao, Yujia Hu, Kenny Tsu Wei Choo, Roy Ka-wei Lee

Abstract: Detecting hate speech and offensive language is essential for maintaining a safe and respectful digital environment. This study examines the limitations of state-of-the-art large language models (LLMs) in identifying offensive content within systematically perturbed data, with a focus on Chinese, a language particularly susceptible to such perturbations. We introduce \textsf{ToxiCloakCN}, an enhanced dataset derived from ToxiCN, augmented with homophonic substitutions and emoji transformations, to test the robustness of LLMs against these cloaking perturbations. Our findings reveal that existing models significantly underperform in detecting offensive content when these perturbations are applied. We provide an in-depth analysis of how different types of offensive content are affected by these perturbations and explore the alignment between human and model explanations of offensiveness. Our work highlights the urgent need for more advanced techniques in offensive language detection to combat the evolving tactics used to evade detection mechanisms.

Submitted 17 June, 2024; originally announced June 2024.

Comments: 10 pages, 5 tables, 2 figures

arXiv:2406.11135 [pdf, other]

doi 10.1145/3656156.3663694

Towards Understanding Emotions for Engaged Mental Health Conversations

Authors: Kellie Yu Hui Sim, Kohleen Tijing Fortuno, Kenny Tsu Wei Choo

Abstract: Providing timely support and intervention is crucial in mental health settings. As the need to engage youth comfortable with texting increases, mental health providers are exploring and adopting text-based media such as chatbots, community-based forums, online therapies with licensed professionals, and helplines operated by trained responders. To support these text-based media for mental health--particularly for crisis care--we are developing a system to perform passive emotion-sensing using a combination of keystroke dynamics and sentiment analysis. Our early studies of this system posit that the analysis of short text messages and keyboard typing patterns can provide emotion information that may be used to support both clients and responders. We use our preliminary findings to discuss the way forward for applying AI to support mental health providers in providing better care.

Submitted 16 June, 2024; originally announced June 2024.

Comments: 5 pages, 1 figure, to be published in DIS Companion '24

ACM Class: H.5.2; I.2.7

arXiv:2406.05071 [pdf, other]

Massively Multiagent Minigames for Training Generalist Agents

Authors: Kyoung Whan Choe, Ryan Sullivan, Joseph Suárez

Abstract: We present Meta MMO, a collection of many-agent minigames for use as a reinforcement learning benchmark. Meta MMO is built on top of Neural MMO, a massively multiagent environment that has been the subject of two previous NeurIPS competitions. Our work expands Neural MMO with several computationally efficient minigames. We explore generalization across Meta MMO by learning to play several minigames with a single set of weights. We release the environment, baselines, and training code under the MIT license. We hope that Meta MMO will spur additional progress on Neural MMO and, more generally, will serve as a useful benchmark for many-agent generalization.

Submitted 7 June, 2024; originally announced June 2024.

arXiv:2405.20165 [pdf, other]

Randomized Exploration for Reinforcement Learning with Multinomial Logistic Function Approximation

Authors: Wooseong Cho, Taehyun Hwang, Joongkyu Lee, Min-hwan Oh

Abstract: We study reinforcement learning with multinomial logistic (MNL) function approximation where the underlying transition probability kernel of the Markov decision processes (MDPs) is parametrized by an unknown transition core with features of state and action. For the finite horizon episodic setting with inhomogeneous state transitions, we propose provably efficient algorithms with randomized exploration having frequentist regret guarantees. For our first algorithm, $\texttt{RRL-MNL}$, we adapt optimistic sampling to ensure the optimism of the estimated value function with sufficient frequency and establish that $\texttt{RRL-MNL}$ is both statistically and computationally efficient, achieving a $\tilde{O}(κ^{-1} d^{\frac{3}{2}} H^{\frac{3}{2}} \sqrt{T})$ frequentist regret bound with constant-time computational cost per episode. Here, $d$ is the dimension of the transition core, $H$ is the horizon length, $T$ is the total number of steps, and $κ$ is a problem-dependent constant. Despite the simplicity and practicality of $\texttt{RRL-MNL}$, its regret bound scales with $κ^{-1}$, which is potentially large in the worst case. To improve the dependence on $κ^{-1}$, we propose $\texttt{ORRL-MNL}$, which estimates the value function using local gradient information of the MNL transition model. We show that its frequentist regret bound is $\tilde{O}(d^{\frac{3}{2}} H^{\frac{3}{2}} \sqrt{T} + κ^{-1} d^2 H^2)$. To the best of our knowledge, these are the first randomized RL algorithms for the MNL transition model that achieve both computational and statistical efficiency. Numerical experiments demonstrate the superior performance of the proposed algorithms.

Submitted 30 May, 2024; originally announced May 2024.

arXiv:2405.06754 [pdf, other]

Wall-Street: Smart Surface-Enabled 5G mmWave for Roadside Networking

Authors: Kun Woo Cho, Prasanthi Maddala, Ivan Seskar, Kyle Jamieson

Abstract: 5G mmWave roadside networks promise high-speed wireless connectivity, but face significant challenges in maintaining reliable connections for users moving at high speed. Frequent handovers, complex beam alignment, and signal attenuation due to obstacles like car bodies lead to service interruptions and degraded performance. We present Wall-Street, a smart surface installed on vehicles to enhance 5G mmWave connectivity for users inside. Wall-Street improves mobility management by (1) steering outdoor mmWave signals into the vehicle, ensuring coverage for all users; (2) enabling simultaneous serving cell data transfer and candidate handover cell measurement, allowing seamless handovers without service interruption; and (3) combining beams from source and target cells during a handover to increase reliability. Through its flexible and diverse signal manipulation capabilities, Wall-Street provides uninterrupted high-speed connectivity for latency-sensitive applications in challenging mobile environments. We have implemented and integrated Wall-Street in the COSMOS testbed and evaluated its real-time performance with four gNBs and a mobile client inside a surface-enabled vehicle, driving on a nearby road. Wall-Street achieves a 2.5-3.4x TCP throughput improvement and a 0.4-0.8x reduction in delay over a baseline 5G Standalone handover protocol.

Submitted 10 May, 2024; originally announced May 2024.

Comments: 15 pages, 22 figures, under submission

arXiv:2405.01842 [pdf, ps, other]

SGHateCheck: Functional Tests for Detecting Hate Speech in Low-Resource Languages of Singapore

Authors: Ri Chi Ng, Nirmalendu Prakash, Ming Shan Hee, Kenny Tsu Wei Choo, Roy Ka-Wei Lee

Abstract: To address the limitations of current hate speech detection models, we introduce \textsf{SGHateCheck}, a novel framework designed for the linguistic and cultural context of Singapore and Southeast Asia. It extends the functional testing approach of HateCheck and MHC, employing large language models for translation and paraphrasing into Singapore's main languages, and refining these with native annotators. \textsf{SGHateCheck} reveals critical flaws in state-of-the-art models, highlighting their inadequacy in sensitive content moderation. This work aims to foster the development of more effective hate speech detection tools for diverse linguistic environments, particularly for Singapore and Southeast Asia contexts.

Submitted 3 May, 2024; originally announced May 2024.

arXiv:2404.17130 [pdf]

Ultralow-Power Single-Sensor-Based E-Nose System Powered by Duty Cycling and Deep Learning for Real-Time Gas Identification

Authors: Taejung Kim, Yonggi Kim, Wootaek Cho, Jong-Hyun Kwak, Jeonghoon Cho, Youjang Pyeon, Jae Joon Kim, Heungjoo Shin

Abstract: This study presents a novel, ultralow-power single-sensor-based electronic nose (e-nose) system for real-time gas identification, distinguishing itself from conventional sensor-array-based e-nose systems whose power consumption and cost increase with the number of sensors. Our system employs a single metal oxide semiconductor (MOS) sensor built on a suspended 1D nanoheater, driven by duty cycling, characterized by repeated pulsed power inputs. The sensor's ultrafast thermal response, enabled by its small size, effectively decouples the effects of temperature and surface charge exchange on the MOS nanomaterial's conductivity. This provides distinct sensing signals that alternate between responses coupled with and decoupled from the thermally enhanced conductivity, all within a single time domain during duty cycling. The magnitude and ratio of these dual responses vary depending on the gas type and concentration, facilitating the early-stage gas identification of five gas types within 30 s via a convolutional neural network (classification accuracy = 93.9%, concentration regression error = 19.8%). Additionally, the duty-cycling mode significantly reduces power consumption by up to 90%, lowering it to 160 $μ$W to heat the sensor to 250$^\circ$C. Manufactured using only wafer-level batch microfabrication processes, this innovative e-nose system promises the facile implementation of battery-driven, long-term, and cost-effective IoT monitoring systems.

Submitted 25 April, 2024; originally announced April 2024.

Comments: 67 pages, 10 figures

arXiv:2404.14873 [pdf, ps, other]

Estimating the Distribution of Parameters in Differential Equations with Repeated Cross-Sectional Data

Authors: Hyeontae Jo, Sung Woong Cho, Hyung Ju Hwang

Abstract: Differential equations are pivotal in modeling and understanding the dynamics of various systems, offering insights into their future states through parameter estimation fitted to time series data. In fields such as economy, politics, and biology, the observation data points in the time series are often independently obtained (i.e., Repeated Cross-Sectional (RCS) data). With RCS data, we found that traditional methods for parameter estimation in differential equations, such as using mean values of time trajectories or Gaussian Process-based trajectory generation, have limitations in estimating the shape of parameter distributions, often leading to a significant loss of data information. To address this issue, we introduce a novel method, Estimation of Parameter Distribution (EPD), providing accurate distribution of parameters without loss of data information. EPD operates in three main steps: generating synthetic time trajectories by randomly selecting observed values at each time point, estimating parameters of a differential equation that minimize the discrepancy between these trajectories and the true solution of the equation, and selecting the parameters depending on the scale of discrepancy. We then evaluated the performance of EPD across several models, including exponential growth, logistic population models, and target cell-limited models with delayed virus production, demonstrating its superiority in capturing the shape of parameter distributions. Furthermore, we applied EPD to real-world datasets, capturing various shapes of parameter distributions rather than a normal distribution. These results effectively address the heterogeneity within systems, marking a substantial progression in accurately modeling systems using RCS data.

Submitted 23 April, 2024; originally announced April 2024.

Comments: 16 pages, 10 figures

MSC Class: 65L08; 65D17; 68U07

arXiv:2404.11539 [pdf, other]

Evaluating Span Extraction in Generative Paradigm: A Reflection on Aspect-Based Sentiment Analysis

Authors: Soyoung Yang, Won Ik Cho

Abstract: In the era of rapid evolution of generative language models within the realm of natural language processing, there is an imperative call to revisit and reformulate evaluation methodologies, especially in the domain of aspect-based sentiment analysis (ABSA). This paper addresses the emerging challenges introduced by the generative paradigm, which has moderately blurred traditional boundaries between understanding and generation tasks. Building upon prevailing practices in the field, we analyze the advantages and shortcomings associated with the prevalent ABSA evaluation paradigms. Through an in-depth examination, supplemented by illustrative examples, we highlight the intricacies involved in aligning generative outputs with other evaluative metrics, specifically those derived from other tasks, including question answering. While we steer clear of advocating for a singular and definitive metric, our contribution lies in paving the path for a comprehensive guideline tailored for ABSA evaluations in this generative paradigm. In this position paper, we aim to provide practitioners with profound reflections, offering insights and directions that can aid in navigating this evolving landscape, ensuring evaluations that are both accurate and reflective of generative capabilities.

Submitted 17 April, 2024; originally announced April 2024.

Comments: 10 pages

arXiv:2404.09041 [pdf, other]

Three Disclaimers for Safe Disclosure: A Cardwriter for Reporting the Use of Generative AI in Writing Process

Authors: Won Ik Cho, Eunjung Cho, Hyeonji Shin

Abstract: Generative artificial intelligence (AI) and large language models (LLMs) are increasingly being used in the academic writing process. This is despite the current lack of unified framework for reporting the use of machine assistance. In this work, we propose "Cardwriter", an intuitive interface that produces a short report for authors to declare their use of generative AI in their writing process. The demo is available online, at https://cardwriter.vercel.app

Submitted 13 April, 2024; originally announced April 2024.

Comments: 6 pages; an implementation version of PaperCard project

arXiv:2404.05244 [pdf, ps, other]

On Finite Presentability of Subsemigroups of the Monogenic Free Inverse Semigroup

Authors: Jung Won Cho, Nik Ruskuc

Abstract: The monogenic free inverse semigroup $FI_1$ is not finitely presented as a semigroup due to the classic result by Schein (1975). We extend this result and prove that a finitely generated subsemigroup of $FI_1$ is finitely presented if and only if it contains only finitely many idempotents. As a consequence, we derive that an inverse subsemigroup of $FI_1$ is finitely presented as a semigroup if and only if it is a finite semilattice.

Submitted 12 June, 2024; v1 submitted 8 April, 2024; originally announced April 2024.

MSC Class: 20M05; 20M18

arXiv:2404.00405 [pdf, other]

doi 10.1145/3613905.3650786

A Taxonomy for Human-LLM Interaction Modes: An Initial Exploration

Authors: Jie Gao, Simret Araya Gebreegziabher, Kenny Tsu Wei Choo, Toby Jia-Jun Li, Simon Tangi Perrault, Thomas W. Malone

Abstract: With ChatGPT's release, conversational prompting has become the most popular form of human-LLM interaction. However, its effectiveness is limited for more complex tasks involving reasoning, creativity, and iteration. Through a systematic analysis of HCI papers published since 2021, we identified four key phases in the human-LLM interaction flow - planning, facilitating, iterating, and testing - to precisely understand the dynamics of this process. Additionally, we have developed a taxonomy of four primary interaction modes: Mode 1: Standard Prompting, Mode 2: User Interface, Mode 3: Context-based, and Mode 4: Agent Facilitator. This taxonomy was further enriched using the "5W1H" guideline method, which involved a detailed examination of definitions, participant roles (Who), the phases that happened (When), human objectives and LLM abilities (What), and the mechanics of each interaction mode (How). We anticipate this taxonomy will contribute to the future design and evaluation of human-LLM interaction.

Submitted 30 March, 2024; originally announced April 2024.

Comments: 11 pages, 4 figures, 3 tables. Accepted at CHI Late-Breaking Work 2024

arXiv:2403.13614 [pdf, ps, other]

Graph products of residually finite monoids are residually finite

Authors: Jung Won Cho, Victoria Gould, Nik Ruškuc, Dandan Yang

Abstract: We show that any graph product of residually finite monoids is residually finite. As a special case we obtain that any free product of residually finite monoids is residually finite. The corresponding results for graph products of semigroups follow.

Submitted 27 August, 2024; v1 submitted 20 March, 2024; originally announced March 2024.

MSC Class: 20M10; 20M05

arXiv:2403.05209 [pdf, other]

Overcoming Data Inequality across Domains with Semi-Supervised Domain Generalization

Authors: Jinha Park, Wonguk Cho, Taesup Kim

Abstract: While there have been considerable advancements in machine learning driven by extensive datasets, a significant disparity still persists in the availability of data across various sources and populations. This inequality across domains poses challenges in modeling for those with limited data, which can lead to profound practical and ethical concerns. In this paper, we address a representative case of data inequality problem across domains termed Semi-Supervised Domain Generalization (SSDG), in which only one domain is labeled while the rest are unlabeled. We propose a novel algorithm, ProUD, which can effectively learn domain-invariant features via domain-aware prototypes along with progressive generalization via uncertainty-adaptive mixing of labeled and unlabeled domains. Our experiments on three different benchmark datasets demonstrate the effectiveness of ProUD, outperforming all baseline models including single domain generalization and semi-supervised learning. Source code will be released upon acceptance of the paper.

Submitted 8 March, 2024; originally announced March 2024.

Comments: 20 pages, 4 figures

arXiv:2402.08187 [pdf, other]

Learning time-dependent PDE via graph neural networks and deep operator network for robust accuracy on irregular grids

Authors: Sung Woong Cho, Jae Yong Lee, Hyung Ju Hwang

Abstract: Scientific computing using deep learning has seen significant advancements in recent years. There has been growing interest in models that learn the operator from the parameters of a partial differential equation (PDE) to the corresponding solutions. Deep Operator Network (DeepONet) and Fourier Neural operator, among other models, have been designed with structures suitable for handling functions as inputs and outputs, enabling real-time predictions as surrogate models for solution operators. There has also been significant progress in the research on surrogate models based on graph neural networks (GNNs), specifically targeting the dynamics in time-dependent PDEs. In this paper, we propose GraphDeepONet, an autoregressive model based on GNNs, to effectively adapt DeepONet, which is well-known for successful operator learning. GraphDeepONet exhibits robust accuracy in predicting solutions compared to existing GNN-based PDE solver models. It maintains consistent performance even on irregular grids, leveraging the advantages inherited from DeepONet and enabling predictions on arbitrary grids. Additionally, unlike traditional DeepONet and its variants, GraphDeepONet enables time extrapolation for time-dependent PDE solutions. We also provide theoretical analysis of the universal approximation capability of GraphDeepONet in approximating continuous operators across arbitrary time intervals.

Submitted 12 February, 2024; originally announced February 2024.

Comments: 25 pages, 11 figures

MSC Class: 65D17; 68U07

arXiv:2402.05875 [pdf, ps, other]

Generators and presentations of inverse subsemigroups of the monogenic free inverse semigroup

Authors: Jung Won Cho, Nik Ruskuc

Abstract: It was proved by Oliveira and Silva (2005) that every finitely generated inverse subsemigroup of the monogenic free inverse semigroup $FI_1$ is finitely presented. The present paper continues this development, and gives generating sets and presentations for general (i.e. not necessarily finitely generated) inverse subsemigroups of $FI_1$. For an inverse semigroup $S$ and an inverse subsemigroup $T$ of $S$, we say $S$ is finitely generated modulo $T$ if there is a finite set $A$ such that $S = \langle T, A \rangle$. Likewise, we say that $S$ is finitely presented modulo $T$ if $S$ can be defined by a presentation of the form $\text{Inv}\langle X, Y \mid R, Q\rangle$, where $\text{Inv}\langle X\mid R\rangle$ is a presentation for $T$ and $Y$ and $Q$ are finite. We show that every inverse subsemigroup $S$ of $FI_1$ is finitely generated modulo its semilattice of idempotents $E(S)$. By way of contrast, we show that when $S\neq E(S)$, it can never be finitely presented modulo $E(S)$. However, in the process we establish some nice (albeit infinite) presentations for $S$ modulo $E(S)$.

Submitted 8 February, 2024; originally announced February 2024.

MSC Class: 20M05; 20M18

arXiv:2401.10421 [pdf]

doi 10.1103/PhysRevB.109.L060403

Composition dependence of bulk properties in the Co-intercalated transition-metal dichalcogenide Co$_{1/3}$TaS$_2$

Authors: Pyeongjae Park, Woonghee Cho, Chaebin Kim, Yeochan An, Maxim Avdeev, Kazuki Iida, Ryoichi Kajimoto, Je-Geun Park

Abstract: Spontaneous Hall conductivity has recently been reported in the triangular lattice antiferromagnet Co$_{1/3}$TaS$_2$ under a zero magnetic field. This phenomenon originates from the distinctive noncoplanar triple-Q magnetic ground state, possessing uniform real-space Berry curvature characterized by scalar spin chirality. We investigated the physical properties of Co$_{1/3}$TaS$_2$ by judiciously controlling the composition, revealing a drastic change in its bulk properties, even by slight variations in cobalt composition, despite the same crystal structure. For $0.299 < x < 0.325$, Co$_x$TaS$_2$ keeps all the characteristics of the ground state consistent with the previous studies -- two antiferromagnetic phase transitions at $T_{N1}$ and $T_{N2} (< T_{N1})$, a large spontaneous Hall conductivity ($σ_{xy} (H=0)$), and a weak ferromagnetic moment along the c-axis. However, samples with $x > 0.330$ exhibit distinct bulk properties, including the absence of both $σ_{xy} (H=0)$ and the weak ferromagnetic moment. Our neutron diffraction data reveal that Co$_x$TaS$_2$ with $x > 0.330$ develops coplanar helical magnetic order with $q_{m1} = (1/3, 0, 0)$. This is entirely different from what has been seen in $x < 0.325$, explaining the observed composition dependence.

Submitted 18 January, 2024; originally announced January 2024.

Comments: 15 pages, 4 figures, accepted for publication in Phys. Rev. B

Journal ref: Phys. Rev. B 109, L060403 (2024)

arXiv:2312.15949 [pdf, other]

HyperDeepONet: learning operator with complex target function space using the limited resources via hypernetwork

Authors: Jae Yong Lee, Sung Woong Cho, Hyung Ju Hwang

Abstract: Fast and accurate predictions for complex physical dynamics are a significant challenge across various applications. Real-time prediction on resource-constrained hardware is even more crucial in real-world problems. The deep operator network (DeepONet) has recently been proposed as a framework for learning nonlinear mappings between function spaces. However, the DeepONet requires many parameters and has a high computational cost when learning operators, particularly those with complex (discontinuous or non-smooth) target functions. This study proposes HyperDeepONet, which uses the expressive power of the hypernetwork to enable the learning of a complex operator with a smaller set of parameters. The DeepONet and its variant models can be thought of as a method of injecting the input function information into the target function. From this perspective, these models can be viewed as a particular case of HyperDeepONet. We analyze the complexity of DeepONet and conclude that HyperDeepONet needs relatively lower complexity to obtain the desired accuracy for operator learning. HyperDeepONet successfully learned various operators with fewer computational resources compared to other benchmarks.

Submitted 26 December, 2023; originally announced December 2023.

Comments: 26 pages, 13 figures. Published as a conference paper at Eleventh International Conference on Learning Representations (ICLR 2023)

MSC Class: 65D17; 68U07

arXiv:2312.15449 [pdf, other]

iDet3D: Towards Efficient Interactive Object Detection for LiDAR Point Clouds

Authors: Dongmin Choi, Wonwoo Cho, Kangyeol Kim, Jaegul Choo

Abstract: Accurately annotating multiple 3D objects in LiDAR scenes is laborious and challenging. While a few previous studies have attempted to leverage semi-automatic methods for cost-effective bounding box annotation, such methods have limitations in efficiently handling numerous multi-class objects. To effectively accelerate 3D annotation pipelines, we propose iDet3D, an efficient interactive 3D object detector. Supporting a user-friendly 2D interface, which can ease the cognitive burden of exploring 3D space to provide click interactions, iDet3D enables users to annotate the entire objects in each scene with minimal interactions. Taking the sparse nature of 3D point clouds into account, we design a negative click simulation (NCS) to improve accuracy by reducing false-positive predictions. In addition, iDet3D incorporates two click propagation techniques to take full advantage of user interactions: (1) dense click guidance (DCG) for keeping user-provided information throughout the network and (2) spatial click propagation (SCP) for detecting other instances of the same class based on the user-specified objects. Through our extensive experiments, we present that our method can construct precise annotations in a few clicks, which shows the practicality as an efficient annotation tool for 3D object detection.

Submitted 24 December, 2023; originally announced December 2023.

Comments: Accepted to AAAI 2024

arXiv:2312.12889 [pdf]

Singular Hall response from a correlated ferromagnetic flat nodal-line semimetal

Authors: Woohyun Cho, Yoon-Gu Kang, Jaehun Cha, Dong Hyun David Lee, Do Hoon Kiem, Jaewhan Oh, Jongho Park, Changyoung Kim, Yongsoo Yang, Yeong Kwan Kim, Myung Joon Han, Heejun Yang

Abstract: Topological quantum phases have been largely understood in weakly correlated systems, which have identified various quantum phenomena such as spin Hall effect, protected transport of helical fermions, and topological superconductivity. Robust ferromagnetic order in correlated topological materials particularly attracts attention, as it can provide a versatile platform for novel quantum devices. Here, we report singular Hall response arising from a unique band structure of flat topological nodal lines in combination with electron correlation in an itinerant, van der Waals ferromagnetic semimetal, Fe3GaTe2, with a high Curie temperature of Tc=360 K. High anomalous Hall conductivity violating the conventional scaling, resistivity upturn at low temperature, and a large Sommerfeld coefficient are observed in Fe3GaTe2, which implies heavy fermion features in this ferromagnetic topological material. Our circular dichroism in angle-resolved photoemission spectroscopy and theoretical calculations support the original electronic features in the material. Thus, low-dimensional Fe3GaTe2 with electronic correlation, topology, and room-temperature ferromagnetic order appears to be a promising candidate for robust quantum devices.

Submitted 20 December, 2023; originally announced December 2023.

arXiv:2312.12467 [pdf, other]

Learning Flexible Body Collision Dynamics with Hierarchical Contact Mesh Transformer

Authors: Youn-Yeol Yu, Jeongwhan Choi, Woojin Cho, Kookjin Lee, Nayong Kim, Kiseok Chang, Chang-Seung Woo, Ilho Kim, Seok-Woo Lee, Joon-Young Yang, Sooyoung Yoon, Noseong Park

Abstract: Recently, many mesh-based graph neural network (GNN) models have been proposed for modeling complex high-dimensional physical systems. Remarkable achievements have been made in significantly reducing the solving time compared to traditional numerical solvers. These methods are typically designed to i) reduce the computational cost in solving physical dynamics and/or ii) propose techniques to enhance the solution accuracy in fluid and rigid body dynamics. However, it remains under-explored whether they are effective in addressing the challenges of flexible body dynamics, where instantaneous collisions occur within a very short timeframe. In this paper, we present Hierarchical Contact Mesh Transformer (HCMT), which uses hierarchical mesh structures and can learn long-range dependencies (occurred by collisions) among spatially distant positions of a body -- two close positions in a higher-level mesh correspond to two distant positions in a lower-level mesh. HCMT enables long-range interactions, and the hierarchical mesh structure quickly propagates collision effects to faraway positions. To this end, it consists of a contact mesh Transformer and a hierarchical mesh Transformer (CMT and HMT, respectively). Lastly, we propose a flexible body dynamics dataset, consisting of trajectories that reflect experimental settings frequently used in the display industry for product designs. We also compare the performance of several baselines using well-known benchmark datasets. Our results show that HCMT provides significant performance improvements over existing methods. Our code is available at https://github.com/yuyudeep/hcmt.

Submitted 25 March, 2024; v1 submitted 19 December, 2023; originally announced December 2023.

Comments: Accepted at ICLR 2024

arXiv:2312.10274 [pdf, other]

Operator-learning-inspired Modeling of Neural Ordinary Differential Equations

Authors: Woojin Cho, Seunghyeon Cho, Hyundong Jin, Jinsung Jeon, Kookjin Lee, Sanghyun Hong, Dongeun Lee, Jonghyun Choi, Noseong Park

Abstract: Neural ordinary differential equations (NODEs), one of the most influential works of the differential equation-based deep learning, are to continuously generalize residual networks and opened a new field. They are currently utilized for various downstream tasks, e.g., image classification, time series classification, image generation, etc. Its key part is how to model the time-derivative of the hidden state, denoted dh(t)/dt. People have habitually used conventional neural network architectures, e.g., fully-connected layers followed by non-linear activations. In this paper, however, we present a neural operator-based method to define the time-derivative term. Neural operators were initially proposed to model the differential operator of partial differential equations (PDEs). Since the time-derivative of NODEs can be understood as a special type of the differential operator, our proposed method, called branched Fourier neural operator (BFNO), makes sense. In our experiments with general downstream tasks, our method significantly outperforms existing methods.

Submitted 15 December, 2023; originally announced December 2023.

arXiv:2312.09603 [pdf, other]

Stethoscope-guided Supervised Contrastive Learning for Cross-domain Adaptation on Respiratory Sound Classification

Authors: June-Woo Kim, Sangmin Bae, Won-Yang Cho, Byungjo Lee, Ho-Young Jung

Abstract: Despite the remarkable advances in deep learning technology, achieving satisfactory performance in lung sound classification remains a challenge due to the scarcity of available data. Moreover, the respiratory sound samples are collected from a variety of electronic stethoscopes, which could potentially introduce biases into the trained models. When a significant distribution shift occurs within the test dataset or in a practical scenario, it can substantially decrease the performance. To tackle this issue, we introduce cross-domain adaptation techniques, which transfer the knowledge from a source domain to a distinct target domain. In particular, by considering different stethoscope types as individual domains, we propose a novel stethoscope-guided supervised contrastive learning approach. This method can mitigate any domain-related disparities and thus enables the model to distinguish respiratory sounds of the recording variation of the stethoscope. The experimental results on the ICBHI dataset demonstrate that the proposed methods are effective in reducing the domain dependency and achieving the ICBHI Score of 61.71%, which is a significant improvement of 2.16% over the baseline.

Submitted 15 December, 2023; originally announced December 2023.

Comments: accepted to ICASSP 2024

arXiv:2312.07402 [pdf, other]

doi 10.1103/PhysRevB.109.165205

Electronic band structure of Sb2Te3

Authors: I. Mohelsky, J. Wyzula, F. Le Mardele, F. Abadizaman, O. Caha, A. Dubroka, X. D. Sun, C. W. Cho, B. A. Piot, M. F. Tanzim, I. Aguilera, G. Bauer, G. Springholz, M. Orlita

Abstract: Here we report on Landau level spectroscopy of an epitaxially grown thin film of the topological insulator Sb2Te3, complemented by ellipsometry and magneto-transport measurements. The observed response suggests that Sb2Te3 is a direct-gap semiconductor with the fundamental band gap located at the Γ point, or along the trigonal axis, and its width reaches Eg = 190 meV at low temperatures. Our data also indicate the presence of other low-energy extrema with a higher multiplicity in both the conduction and valence bands. The conclusions based on our experimental data are confronted with and to a great extent corroborated by the electronic band structure calculated using the GW method.

Submitted 15 March, 2024; v1 submitted 12 December, 2023; originally announced December 2023.

Comments: 11 pages, 8 figures, to be published in Phys. Rev. B

Journal ref: Phys. Rev. B 109, 165205 (2024)

arXiv:2311.15518 [pdf, other]

The Seoul National University AGN Monitoring Project III: H$β$ lag measurements of 32 luminous AGNs and the high-luminosity end of the size--luminosity relation

Authors: Jong-Hak Woo, Shu Wang, Suvendu Rakshit, Hojin Cho, Donghoon Son, Vardha N. Bennert, Elena Gallo, Edmund Hodges-Kluck, Tommaso Treu, Aaron J. Barth, Wanjin Cho, Adi Foord, Jaehyuk Geum, Hengxiao Guo, Yashashree Jadhav, Yiseul Jeon, Kyle M. Kabasares, Won-Suk Kang, Changseok Kim, Minjin Kim, Tae-Woo Kim, Huynh Anh N. Le, Matthew A. Malkan, Amit Kumar Mandal, Daeseong Park , et al. (6 additional authors not shown)

Abstract: We present the main results from a long-term reverberation mapping campaign carried out for the Seoul National University Active Galactic Nuclei (AGN) Monitoring Project. High-quality data were obtained during 2015-2021 for 32 luminous AGNs (i.e., continuum luminosity in the range of $10^{44-46}$ erg s$^{-1}$) at a regular cadence, of 20-30 days for spectroscopy and 3-5 days for photometry. We obtain time lag measurements between the variability in the H$β$ emission and the continuum for 32 AGNs; twenty-five of those have the best lag measurements based on our quality assessment, examining correlation strength, and the posterior lag distribution. Our study significantly increases the current sample of reverberation-mapped AGNs, particularly at the moderate to high luminosity end. Combining our results with literature measurements, we derive a H$β$ broad line region size--luminosity relation with a shallower slope than reported in the literature. For a given luminosity, most of our measured lags are shorter than the expectation, implying that single-epoch black hole mass estimators based on previous calibrations could suffer large systematic uncertainties.

Submitted 26 November, 2023; originally announced November 2023.

Comments: Accepted by ApJ; 39 pages, 22 figures

arXiv:2311.03736 [pdf, other]

Neural MMO 2.0: A Massively Multi-task Addition to Massively Multi-agent Learning

Authors: Joseph Suárez, Phillip Isola, Kyoung Whan Choe, David Bloomin, Hao Xiang Li, Nikhil Pinnaparaju, Nishaanth Kanna, Daniel Scott, Ryan Sullivan, Rose S. Shuman, Lucas de Alcântara, Herbie Bradley, Louis Castricato, Kirsty You, Yuhao Jiang, Qimai Li, Jiaxin Chen, Xiaolong Zhu

Abstract: Neural MMO 2.0 is a massively multi-agent environment for reinforcement learning research. The key feature of this new version is a flexible task system that allows users to define a broad range of objectives and reward signals. We challenge researchers to train agents capable of generalizing to tasks, maps, and opponents never seen during training. Neural MMO features procedurally generated maps with 128 agents in the standard setting and support for up to. Version 2.0 is a complete rewrite of its predecessor with three-fold improved performance and compatibility with CleanRL. We release the platform as free and open-source software with comprehensive documentation available at neuralmmo.github.io and an active community Discord. To spark initial research on this new platform, we are concurrently running a competition at NeurIPS 2023.

Submitted 7 November, 2023; originally announced November 2023.

arXiv:2310.14259 [pdf]

doi 10.1186/s40580-023-00360-y

Investigation of the mechanism of the anomalous Hall effects in Cr2Te3/(BiSb)2(TeSe)3 heterostructure

Authors: Seong Won Cho, In Hak Lee, Youngwoong Lee, Sangheon Kim, Yeong Gwang Khim, Seung-Young Park, Younghun Jo, Junwoo Choi, Seungwu Han, Young Jun Chang, Suyoun Lee

Abstract: The interplay between ferromagnetism and non-trivial topology has unveiled intriguing phases in the transport of charges and spins. For example, the so-called topological Hall effect (THE), featuring a hump structure in the curve of the Hall resistance (Rxy) vs. magnetic field (H), is consistently observed in heterostructures consisting of a ferromagnet (FM) and a topological insulator (TI). The origin of the hump structure is still disputed between the topological Hall effect model and the multi-component anomalous Hall effect (AHE) model. In this work, we have investigated a heterostructure consisting of BixSb2-xTeySe3-y (BSTS) and Cr2Te3 (CT), which are a well-known TI and two-dimensional FM, respectively. By using the so-called minor-loop measurement, we have found that the hump structure observed in the CT/BSTS is more likely to originate from two AHE channels. Moreover, by analyzing the scaling behavior of the amplitude of each of the two AHEs with the longitudinal resistivities of CT and BSTS, we have found that one AHE is attributed to the extrinsic contribution of CT while the other is due to the intrinsic contribution of BSTS. This implies that the proximity-induced ferromagnetic layer inside BSTS serves as a source of the intrinsic AHE, resulting in the hump structure explained by the two-AHE model.

Submitted 22 October, 2023; originally announced October 2023.

Journal ref: Nano Convergence (2023) 10:11

arXiv:2310.11551 [pdf, other]

WaveFlex: A Smart Surface for Private CBRS Wireless Cellular Networks

Authors: Fan Yi, Kun Woo Cho, Yaxiong Xie, Kyle Jamieson

Abstract: We present the design and implementation of WaveFlex, the first smart surface that enhances Private LTE/5G networks operating under the shared-license framework in the Citizens Broadband Radio Service frequency band. WaveFlex works in the presence of frequency diversity: multiple nearby base stations operating on different frequencies, as dictated by a Spectrum Access System coordinator. It also handles time dynamism: due to the dynamic sharing rules of the band, base stations occasionally switch channels, especially when priority users enter the network. Finally, WaveFlex operates independently of the network itself, requiring neither access to nor modification of the base station or mobile users, yet it remains compliant with and effective on prevailing cellular protocols. We have designed and fabricated WaveFlex on a custom multi-layer PCB, software defined radio-based network monitor, and supporting control software and hardware. Our experimental evaluation benchmarks an operational Private LTE network running at full line rate. Results demonstrate an 8.50 dB average SNR gain, and an average throughput gain of 4.36 Mbps for a single small cell, and 3.19 Mbps for four small cells, in a realistic indoor office scenario.

Submitted 17 October, 2023; originally announced October 2023.

Comments: 15 pages

arXiv:2310.09528 [pdf, other]

Hypernetwork-based Meta-Learning for Low-Rank Physics-Informed Neural Networks

Authors: Woojin Cho, Kookjin Lee, Donsub Rim, Noseong Park

Abstract: In various engineering and applied science applications, repetitive numerical simulations of partial differential equations (PDEs) for varying input parameters are often required (e.g., aircraft shape optimization over many design parameters) and solvers are required to perform rapid execution. In this study, we suggest a path that potentially opens up a possibility for physics-informed neural networks (PINNs), emerging deep-learning-based solvers, to be considered as one such solver. Although PINNs have pioneered a proper integration of deep-learning and scientific computing, they require repetitive time-consuming training of neural networks, which is not suitable for many-query scenarios. To address this issue, we propose a lightweight low-rank PINNs containing only hundreds of model parameters and an associated hypernetwork-based meta-learning algorithm, which allows efficient approximation of solutions of PDEs for varying ranges of PDE input parameters. Moreover, we show that the proposed method is effective in overcoming a challenging issue, known as "failure modes" of PINNs.

Submitted 14 October, 2023; originally announced October 2023.

arXiv:2310.06979 [pdf, other]

Modulating spin-valley relaxation in WSe$_2$ with variable thickness VOPc layers

Authors: Daphné Lubert-Perquel, Byeong Wook Cho, Alan J. Philips, Young Hee Lee, Jeffrey L. Blackburn, Justin C. Johnson

Abstract: Combining the synthetic tunability of molecular compounds with the optical selection rules of transition metal dichalcogenides (TMDC) that derive from spin-valley coupling could provide interesting opportunities for the readout of quantum information. However, little is known about the electronic and spin interactions at such interfaces and the influence on spin-valley relaxation. In this work, vanadyl phthalocyanine (VOPc) molecular layers are thermally evaporated on WSe$_2$ to explore the effect of molecular layer thickness on excited-state spin-valley polarization. The thinnest molecular layer supports an interfacial state which destroys the spin-valley polarization almost instantaneously, whereas a thicker molecular layer results in longer-lived spin-valley polarization than the WSe$_2$ monolayer alone. The mechanism appears to involve a tightly-bound species at the molecule/TMDC interface that strengthens exchange interactions and is largely avoided in thicker VOPc layers that isolate electrons from WSe$_2$ holes.

Submitted 9 August, 2024; v1 submitted 10 October, 2023; originally announced October 2023.

arXiv:2310.04824 [pdf, other]

PaperCard for Reporting Machine Assistance in Academic Writing

Authors: Won Ik Cho, Eunjung Cho, Kyunghyun Cho

Abstract: Academic writing process has benefited from various technological developments over the years including search engines, automatic translators, and editing tools that review grammar and spelling mistakes. They have enabled human writers to become more efficient in writing academic papers, for example by helping with finding relevant literature more effectively and polishing texts. While these developments have so far played a relatively assistive role, recent advances in large-scale language models (LLMs) have enabled LLMs to play a more major role in the writing process, such as coming up with research questions and generating key contents. This raises critical questions surrounding the concept of authorship in academia. ChatGPT, a question-answering system released by OpenAI in November 2022, has demonstrated a range of capabilities that could be utilised in producing academic papers. The academic community will have to address relevant pressing questions, including whether Artificial Intelligence (AI) should be merited authorship if it made significant contributions in the writing process, or whether its use should be restricted such that human authorship would not be undermined. In this paper, we aim to address such questions, and propose a framework we name "PaperCard", a documentation for human authors to transparently declare the use of AI in their writing process.

Submitted 7 October, 2023; originally announced October 2023.

Comments: Accepted at EAAMO'23 as a poster presentation

arXiv:2309.13858 [pdf, other]

Impact of Human-AI Interaction on User Trust and Reliance in AI-Assisted Qualitative Coding

Authors: Jie Gao, Junming Cao, ShunYi Yeo, Kenny Tsu Wei Choo, Zheng Zhang, Toby Jia-Jun Li, Shengdong Zhao, Simon Tangi Perrault

Abstract: While AI shows promise for enhancing the efficiency of qualitative analysis, the unique human-AI interaction resulting from varied coding strategies makes it challenging to develop a trustworthy AI-assisted qualitative coding system (AIQCs) that supports coding tasks effectively. We bridge this gap by exploring the impact of varying coding strategies on user trust and reliance on AI. We conducted a mixed-methods split-plot 3x3 study, involving 30 participants, and a follow-up study with 6 participants, exploring varying text selection and code length in the use of our AIQCs system for qualitative analysis. Our results indicate that qualitative open coding should be conceptualized as a series of distinct subtasks, each with differing levels of complexity, and therefore, should be given tailored design considerations. We further observed a discrepancy between perceived and behavioral measures, and emphasized the potential challenges of under- and over-reliance on AIQCs systems. Additional design implications were also proposed for consideration.

Submitted 24 September, 2023; originally announced September 2023.

Comments: 27 pages with references, 9 figures, 5 tables

arXiv:2309.11017 [pdf, other]

3SAT on an All-to-All-Connected CMOS Ising Solver Chip

Authors: Hüsrev Cılasun, Ziqing Zeng, Ramprasath S, Abhimanyu Kumar, Hao Lo, William Cho, Chris H. Kim, Ulya R. Karpuzcu, Sachin S. Sapatnekar

Abstract: This work solves 3SAT, a classical NP-complete problem, on a CMOS-based Ising hardware chip with all-to-all connectivity. The paper addresses practical issues in going from algorithms to hardware. It considers several degrees of freedom in mapping the 3SAT problem to the chip - using multiple Ising formulations for 3SAT; exploring multiple strategies for decomposing large problems into subproblems that can be accommodated on the Ising chip; and executing a sequence of these subproblems on CMOS hardware to obtain the solution to the larger problem. These are evaluated within a software framework, and the results are used to identify the most promising formulations and decomposition techniques. These best approaches are then mapped to the all-to-all hardware, and the performance of 3SAT is evaluated on the chip. Experimental data shows that the deployed decomposition and mapping strategies impact SAT solution quality: without our methods, the CMOS hardware cannot achieve 3SAT solutions on SATLIB benchmarks.

Submitted 19 September, 2023; originally announced September 2023.

ACM Class: B.7

arXiv:2309.06866 [pdf, other]

Magnon gap excitations in van der Waals antiferromagnet MnPSe$_3$

Authors: Dipankar Jana, D. Vaclavkova, I. Mohelsky, P. Kapuscinski, C. W. Cho, I. Breslavetz, M. Białek, J. -Ph. Ansermet, B. A. Piot, M. Orlita, C. Faugeras, M. Potemski

Abstract: Magneto-spectroscopy methods have been employed to study the zero-wavevector magnon excitations in MnPSe$_3$. Experiments carried out as a function of temperature and the applied magnetic field show that two low-energy magnon branches of MnPSe$_3$ in its antiferromagnetic phase are gapped. The observation of two low-energy magnon gaps (at 14 and 0.7 cm$^{-1}$) implies that MnPSe$_3$ is a biaxial antiferromagnet. A relatively strong out-of-plane anisotropy imposes the spin alignment to be in-plane whereas the spin directionality within the plane is governed by a factor of 2.5 $\times$ 10$^{-3}$ weaker in-plane anisotropy.

Submitted 13 September, 2023; originally announced September 2023.

Comments: 9 pages, 3 figures

arXiv:2309.01961 [pdf, other]

NICE: CVPR 2023 Challenge on Zero-shot Image Captioning

Authors: Taehoon Kim, Pyunghwan Ahn, Sangyun Kim, Sihaeng Lee, Mark Marsden, Alessandra Sala, Seung Hwan Kim, Bohyung Han, Kyoung Mu Lee, Honglak Lee, Kyounghoon Bae, Xiangyu Wu, Yi Gao, Hailiang Zhang, Yang Yang, Weili Guo, Jianfeng Lu, Youngtaek Oh, Jae Won Cho, Dong-jin Kim, In So Kweon, Junmo Kim, Wooyoung Kang, Won Young Jhoo, Byungseok Roh, et al. (17 additional authors not shown)

Abstract: In this report, we introduce NICE (New frontiers for zero-shot Image Captioning Evaluation) project and share the results and outcomes of 2023 challenge. This project is designed to challenge the computer vision community to develop robust image captioning models that advance the state-of-the-art both in terms of accuracy and fairness. Through the challenge, the image captioning models were tested using a new evaluation dataset that includes a large variety of visual concepts from many domains. There was no specific training data provided for the challenge, and therefore the challenge entries were required to adapt to new types of image descriptions that had not been seen during training. This report includes information on the newly proposed NICE dataset, evaluation methods, challenge results, and technical details of top-ranking entries. We expect that the outcomes of the challenge will contribute to the improvement of AI models on various vision-language tasks.

Submitted 10 September, 2023; v1 submitted 5 September, 2023; originally announced September 2023.

Comments: Tech report, project page https://nice.lgresearch.ai/

arXiv:2307.16887 [pdf]

Data-Based MHE for Agile Quadrotor Flight

Authors: Wonoo Choo, Erkan Kayacan

Abstract: This paper develops a data-based moving horizon estimation (MHE) method for agile quadrotors. Accurate state estimation of the system is paramount for precise trajectory control for agile quadrotors; however, the high level of aerodynamic forces experienced by the quadrotors during high-speed flights make this task extremely challenging. These complex turbulent effects are difficult to model and the unmodelled dynamics introduce inaccuracies in the state estimation. In this work, we propose a method to model these aerodynamic effects using Gaussian Processes which we integrate into the MHE to achieve efficient and accurate state estimation with minimal computational burden. Through extensive simulation and experimental studies, this method has demonstrated significant improvement in state estimation performance displaying superior robustness to poor state measurements.

Submitted 31 July, 2023; originally announced July 2023.

Comments: 8 pages, accepted in IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) 2023

arXiv:2306.16683 [pdf, other]

The Seoul National University AGN Monitoring Project IV: H$α$ reverberation mapping of 6 AGNs and the H$α$ Size-Luminosity Relation

Authors: Hojin Cho, Jong-Hak Woo, Shu Wang, Donghoon Son, Jaejin Shin, Suvendu Rakshit, Aaron J. Barth, Vardha N. Bennert, Elena Gallo, Edmund Hodges-Kluck, Tommaso Treu, Hyun-Jin Bae, Wanjin Cho, Adi Foord, Jaehyuk Geum, Yashashree Jadhav, Yiseul Jeon, Kyle M. Kabasares, Daeun Kang, Wonseok Kang, Changseok Kim, Donghwa Kim, Minjin Kim, Taewoo Kim, Huynh Anh N. Le, et al. (7 additional authors not shown)

Abstract: The broad line region (BLR) size-luminosity relation has paramount importance for estimating the mass of black holes in active galactic nuclei (AGNs). Traditionally, the size of the H$β$ BLR is often estimated from the optical continuum luminosity at 5100 Å, while the size of the H$α$ BLR and its correlation with the luminosity is much less constrained. As a part of the Seoul National University AGN Monitoring Project (SAMP) which provides six-year photometric and spectroscopic monitoring data, we present our measurements of the H$α$ lags of 6 high-luminosity AGNs. Combined with the measurements for 42 AGNs from the literature, we derive the size-luminosity relations of H$α$ BLR against broad H$α$ and 5100 Å continuum luminosities. We find the slope of the relations to be $0.61\pm0.04$ and $0.59\pm0.04$, respectively, which are consistent with the H$β$ size-luminosity relation. Moreover, we find a linear relation between the 5100 Å continuum luminosity and the broad H$α$ luminosity across 7 orders of magnitude. Using these results, we propose a new virial mass estimator based on the H$α$ broad emission line, finding that the previous mass estimates based on the scaling relations in the literature are overestimated by up to 0.7 dex at masses lower than $10^7$~M$_{\odot}$.

Submitted 29 June, 2023; originally announced June 2023.

Comments: Accepted for publication in ApJ (Jun. 25th, 2023). 21 pages, 12 figures

arXiv:2305.17680 [pdf, other]

Evaluating GPT-3 Generated Explanations for Hateful Content Moderation

Authors: Han Wang, Ming Shan Hee, Md Rabiul Awal, Kenny Tsu Wei Choo, Roy Ka-Wei Lee

Abstract: Recent research has focused on using large language models (LLMs) to generate explanations for hate speech through fine-tuning or prompting. Despite the growing interest in this area, these generated explanations' effectiveness and potential limitations remain poorly understood. A key concern is that these explanations, generated by LLMs, may lead to erroneous judgments about the nature of flagged content by both users and content moderators. For instance, an LLM-generated explanation might inaccurately convince a content moderator that a benign piece of content is hateful. In light of this, we propose an analytical framework for examining hate speech explanations and conducted an extensive survey on evaluating such explanations. Specifically, we prompted GPT-3 to generate explanations for both hateful and non-hateful content, and a survey was conducted with 2,400 unique respondents to evaluate the generated explanations. Our findings reveal that (1) human evaluators rated the GPT-generated explanations as high quality in terms of linguistic fluency, informativeness, persuasiveness, and logical soundness, (2) the persuasive nature of these explanations, however, varied depending on the prompting strategy employed, and (3) this persuasiveness may result in incorrect judgments about the hatefulness of the content. Our study underscores the need for caution in applying LLM-generated explanations for content moderation. Code and results are available at https://github.com/Social-AI-Studio/GPT3-HateEval.

Submitted 30 August, 2023; v1 submitted 28 May, 2023; originally announced May 2023.

Comments: 9 pages, 2 figures, Accepted by International Joint Conference on Artificial Intelligence(IJCAI)

ACM Class: I.2.7

arXiv:2305.17254 [pdf]

Computationally Efficient Data-Driven MPC for Agile Quadrotor Flight

Authors: Wonoo Choo, Erkan Kayacan

Abstract: This paper develops computationally efficient data-driven model predictive control (MPC) for agile quadrotor flight. Agile quadrotors in high-speed flights can experience high levels of aerodynamic effects. Modeling these turbulent aerodynamic effects is a cumbersome task and the resulting model may be overly complex and computationally infeasible. Combining Gaussian Process (GP) regression models with a simple dynamic model of the system has demonstrated significant improvements in control performance. However, direct integration of the GP models into the MPC pipeline poses a significant computational burden to the optimization process. Therefore, we present an approach to separate the GP models from the MPC pipeline by computing the model corrections using the reference trajectory and the current state measurements prior to the online MPC optimization. This method has been validated in the Gazebo simulation environment and has demonstrated up to a $50\%$ reduction in trajectory tracking error, matching the performance of the direct GP integration method with improved computational efficiency.

Submitted 26 May, 2023; originally announced May 2023.

Comments: 6 pages, accepted in ACC 2023 (American Control Conference, 2023)

arXiv:2305.14032 [pdf, other]

doi 10.21437/Interspeech.2023-1426

Patch-Mix Contrastive Learning with Audio Spectrogram Transformer on Respiratory Sound Classification

Authors: Sangmin Bae, June-Woo Kim, Won-Yang Cho, Hyerim Baek, Soyoun Son, Byungjo Lee, Changwan Ha, Kyongpil Tae, Sungnyun Kim, Se-Young Yun

Abstract: Respiratory sound contains crucial information for the early diagnosis of fatal lung diseases. Since the COVID-19 pandemic, there has been a growing interest in contact-free medical care based on electronic stethoscopes. To this end, cutting-edge deep learning models have been developed to diagnose lung diseases; however, it is still challenging due to the scarcity of medical data. In this study, we demonstrate that the pretrained model on large-scale visual and audio datasets can be generalized to the respiratory sound classification task. In addition, we introduce a straightforward Patch-Mix augmentation, which randomly mixes patches between different samples, with Audio Spectrogram Transformer (AST). We further propose a novel and effective Patch-Mix Contrastive Learning to distinguish the mixed representations in the latent space. Our method achieves state-of-the-art performance on the ICBHI dataset, outperforming the prior leading score by an improvement of 4.08%.

Submitted 22 November, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

Comments: INTERSPEECH 2023, Code URL: https://github.com/raymin0223/patch-mix_contrastive_learning

arXiv:2304.05560 [pdf, other]

CoAIcoder: Examining the Effectiveness of AI-assisted Human-to-Human Collaboration in Qualitative Analysis

Authors: Jie Gao, Kenny Tsu Wei Choo, Junming Cao, Roy Ka Wei Lee, Simon Perrault

Abstract: While AI-assisted individual qualitative analysis has been substantially studied, AI-assisted collaborative qualitative analysis (CQA), a process that involves multiple researchers working together to interpret data, remains relatively unexplored. After identifying CQA practices and design opportunities through formative interviews, we designed and implemented CoAIcoder, a tool leveraging AI to enhance human-to-human collaboration within CQA through four distinct collaboration methods. With a between-subject design, we evaluated CoAIcoder with 32 pairs of CQA-trained participants across common CQA phases under each collaboration method. Our findings suggest that while using a shared AI model as a mediator among coders could improve CQA efficiency and foster agreement more quickly in the early coding stage, it might affect the final code diversity. We also emphasize the need to consider the independence level when using AI to assist human-to-human collaboration in various CQA scenarios. Lastly, we suggest design implications for future AI-assisted CQA systems.

Submitted 24 July, 2023; v1 submitted 11 April, 2023; originally announced April 2023.

Comments: Will appear in ACM Transactions on Computer-Human Interaction (TOCHI)

Showing 1–50 of 228 results for author: Cho, W