-
Parameterized Physics-informed Neural Networks for Parameterized PDEs
Authors:
Woojin Cho,
Minju Jo,
Haksoo Lim,
Kookjin Lee,
Dongeun Lee,
Sanghyun Hong,
Noseong Park
Abstract:
Complex physical systems are often described by partial differential equations (PDEs) that depend on parameters such as the Reynolds number in fluid mechanics. In applications such as design optimization or uncertainty quantification, solutions of those PDEs need to be evaluated at numerous points in the parameter space. While physics-informed neural networks (PINNs) have emerged as a new strong c…
▽ More
Complex physical systems are often described by partial differential equations (PDEs) that depend on parameters such as the Reynolds number in fluid mechanics. In applications such as design optimization or uncertainty quantification, solutions of those PDEs need to be evaluated at numerous points in the parameter space. While physics-informed neural networks (PINNs) have emerged as a new strong competitor as a surrogate, their usage in this scenario remains underexplored due to the inherent need for repetitive and time-consuming training. In this paper, we address this problem by proposing a novel extension, parameterized physics-informed neural networks (P$^2$INNs). P$^2$INNs enable modeling the solutions of parameterized PDEs via explicitly encoding a latent representation of PDE parameters. With the extensive empirical evaluation, we demonstrate that P$^2$INNs outperform the baselines both in accuracy and parameter efficiency on benchmark 1D and 2D parameterized PDEs and are also effective in overcoming the known "failure modes".
△ Less
Submitted 18 August, 2024;
originally announced August 2024.
-
Training Spatial-Frequency Visual Prompts and Probabilistic Clusters for Accurate Black-Box Transfer Learning
Authors:
Wonwoo Cho,
Kangyeol Kim,
Saemee Choi,
Jaegul Choo
Abstract:
Despite the growing prevalence of black-box pre-trained models (PTMs) such as prediction API services, there remains a significant challenge in directly applying general models to real-world scenarios due to the data distribution gap. Considering a data deficiency and constrained computational resource scenario, this paper proposes a novel parameter-efficient transfer learning framework for vision…
▽ More
Despite the growing prevalence of black-box pre-trained models (PTMs) such as prediction API services, there remains a significant challenge in directly applying general models to real-world scenarios due to the data distribution gap. Considering a data deficiency and constrained computational resource scenario, this paper proposes a novel parameter-efficient transfer learning framework for vision recognition models in the black-box setting. Our framework incorporates two novel training techniques. First, we align the input space (i.e., image) of PTMs to the target data distribution by generating visual prompts of spatial and frequency domain. Along with the novel spatial-frequency hybrid visual prompter, we design a novel training technique based on probabilistic clusters, which can enhance class separation in the output space (i.e., prediction probabilities). In experiments, our model demonstrates superior performance in a few-shot transfer learning setting across extensive visual recognition datasets, surpassing state-of-the-art baselines. Additionally, we show that the proposed method efficiently reduces computational costs for training and inference phases.
△ Less
Submitted 15 August, 2024;
originally announced August 2024.
-
Inverse design of Non-parameterized Ventilated Acoustic Resonator via Variational Autoencoder with Acoustic Response-encoded Latent Space
Authors:
Min Woo Cho,
Seok Hyeon Hwang,
Jun-Young Jang,
Jin Yeong Song,
Sun-kwang Hwang,
Kyoung Je Cha,
Dong Yong Park,
Kyungjun Song,
Sang Min Park
Abstract:
Ventilated acoustic resonator(VAR), a type of acoustic metamaterial, emerge as an alternative for sound attenuation in environments that require ventilation, owing to its excellent low-frequency attenuation performance and flexible shape adaptability. However, due to the non-linear acoustic responses of VARs, the VAR designs are generally obtained within a limited parametrized design space, and th…
▽ More
Ventilated acoustic resonator(VAR), a type of acoustic metamaterial, emerge as an alternative for sound attenuation in environments that require ventilation, owing to its excellent low-frequency attenuation performance and flexible shape adaptability. However, due to the non-linear acoustic responses of VARs, the VAR designs are generally obtained within a limited parametrized design space, and the design relies on the iteration of the numerical simulation which consumes a considerable amount of computational time and resources. This paper proposes an acoustic response-encoded variational autoencoder (AR-VAE), a novel variational autoencoder-based generative design model for the efficient and accurate inverse design of VAR even with non-parametrized designs. The AR-VAE matches the high-dimensional acoustic response with the VAR cross-section image in the dimension-reduced latent space, which enables the AR-VAE to generate various non-parametrized VAR cross-section images with the target acoustic response. AR-VAE generates non-parameterized VARs from target acoustic responses, which show a 25-fold reduction in mean squared error compared to conventional deep learning-based parameter searching methods while exhibiting lower average mean squared error and peak frequency variance. By combining the inverse-designed VARs by AR-VAE, multi-cavity VAR was devised for broadband and multitarget peak frequency attenuation. The proposed design method presents a new approach for structural inverse-design with a high-dimensional non-linear physical response.
△ Less
Submitted 12 August, 2024;
originally announced August 2024.
-
Reconciling Cosmological Tensions with Inelastic Dark Matter and Dark Radiation in a $\boldsymbol{U(1)_D}$ Framework
Authors:
Wonsub Cho,
Ki-Young Choi,
Satyabrata Mahapatra
Abstract:
We propose a novel and comprehensive particle physics framework that addresses multiple cosmological tensions observed in recent measurements of the Hubble parameter, $S_8$, and Lyman-$α$ forest data. Our model, termed `{\bf SIDR+$\boldsymbol{z_t}$}' (Self Interacting Dark Radiation with transition redshift), is based on an inelastic dark matter (IDM) scenario coupled with dark radiation, governed…
▽ More
We propose a novel and comprehensive particle physics framework that addresses multiple cosmological tensions observed in recent measurements of the Hubble parameter, $S_8$, and Lyman-$α$ forest data. Our model, termed `{\bf SIDR+$\boldsymbol{z_t}$}' (Self Interacting Dark Radiation with transition redshift), is based on an inelastic dark matter (IDM) scenario coupled with dark radiation, governed by a $U(1)_D$ gauge symmetry. This framework naturally incorporates cold dark matter (DM), SIDR, and the interactions between these components. The fluid-like behavior of the dark radiation component, effectively mitigates both the Hubble and $S_8$ tensions by suppressing free-streaming effects. Simultaneously, the interacting DM-DR system attenuates the matter power spectrum at small scales, potentially reconciling discrepancies in Lyman-$α$ (Ly-$α$) observations. The inelastic nature of DM provides a distinct temperature dependence for the DM-DR interaction rate determined by the mass-splitting between the inelastic dark fermions which is crucial for resolving the Ly-$α$ discrepancies. We present a cosmologically consistent analysis of the model by solving the relevant Boltzmann equations to obtain the energy density and number density evolution of different species of the model. The DR undergoes two ``steps" of increased energy density when the heavier dark species freeze out and become non-relativistic, transferring their entropy to the dark radiation and enhancing $ΔN{\rm eff}$. The analysis showcases the model's potential to uphold the Big Bang Nucleosynthesis (BBN) prediction of $ΔN_{\rm eff}$ but dominantly producing additional contributions prior to recombination, while simultaneously achieving correct relic density of DM though an hybrid of freeze-in and non-thermal production.
△ Less
Submitted 6 August, 2024;
originally announced August 2024.
-
Hierarchically Structured Neural Bones for Reconstructing Animatable Objects from Casual Videos
Authors:
Subin Jeon,
In Cho,
Minsu Kim,
Woong Oh Cho,
Seon Joo Kim
Abstract:
We propose a new framework for creating and easily manipulating 3D models of arbitrary objects using casually captured videos. Our core ingredient is a novel hierarchy deformation model, which captures motions of objects with a tree-structured bones. Our hierarchy system decomposes motions based on the granularity and reveals the correlations between parts without exploiting any prior structural k…
▽ More
We propose a new framework for creating and easily manipulating 3D models of arbitrary objects using casually captured videos. Our core ingredient is a novel hierarchy deformation model, which captures motions of objects with a tree-structured bones. Our hierarchy system decomposes motions based on the granularity and reveals the correlations between parts without exploiting any prior structural knowledge. We further propose to regularize the bones to be positioned at the basis of motions, centers of parts, sufficiently covering related surfaces of the part. This is achieved by our bone occupancy function, which identifies whether a given 3D point is placed within the bone. Coupling the proposed components, our framework offers several clear advantages: (1) users can obtain animatable 3D models of the arbitrary objects in improved quality from their casual videos, (2) users can manipulate 3D models in an intuitive manner with minimal costs, and (3) users can interactively add or delete control points as necessary. The experimental results demonstrate the efficacy of our framework on diverse instances, in reconstruction quality, interpretability and easier manipulation. Our code is available at https://github.com/subin6/HSNB.
△ Less
Submitted 1 August, 2024;
originally announced August 2024.
-
Layout-and-Retouch: A Dual-stage Framework for Improving Diversity in Personalized Image Generation
Authors:
Kangyeol Kim,
Wooseok Seo,
Sehyun Nam,
Bodam Kim,
Suhyeon Jeong,
Wonwoo Cho,
Jaegul Choo,
Youngjae Yu
Abstract:
Personalized text-to-image (P-T2I) generation aims to create new, text-guided images featuring the personalized subject with a few reference images. However, balancing the trade-off relationship between prompt fidelity and identity preservation remains a critical challenge. To address the issue, we propose a novel P-T2I method called Layout-and-Retouch, consisting of two stages: 1) layout generati…
▽ More
Personalized text-to-image (P-T2I) generation aims to create new, text-guided images featuring the personalized subject with a few reference images. However, balancing the trade-off relationship between prompt fidelity and identity preservation remains a critical challenge. To address the issue, we propose a novel P-T2I method called Layout-and-Retouch, consisting of two stages: 1) layout generation and 2) retouch. In the first stage, our step-blended inference utilizes the inherent sample diversity of vanilla T2I models to produce diversified layout images, while also enhancing prompt fidelity. In the second stage, multi-source attention swapping integrates the context image from the first stage with the reference image, leveraging the structure from the context image and extracting visual features from the reference image. This achieves high prompt fidelity while preserving identity characteristics. Through our extensive experiments, we demonstrate that our method generates a wide variety of images with diverse layouts while maintaining the unique identity features of the personalized objects, even with challenging text prompts. This versatility highlights the potential of our framework to handle complex conditions, significantly enhancing the diversity and applicability of personalized image synthesis.
△ Less
Submitted 13 July, 2024;
originally announced July 2024.
-
Stable dark matter from Pauli blocking in the degenerate fermion background with Quantum Field Theory
Authors:
Wonsub Cho,
Ki-Young Choi,
Junghoon Joh,
Osamu Seto
Abstract:
We study a mechanism to make dark matter stable based on the Pauli blocking in the fermion background. In the background where fermions occupy the states, the decay of dark matter to those final states is not allowed, as a result, DM becomes stable. We derive the evolution equations of the distribution function in the quantum field theory and compare it with the Boltzmann equation. We apply this m…
▽ More
We study a mechanism to make dark matter stable based on the Pauli blocking in the fermion background. In the background where fermions occupy the states, the decay of dark matter to those final states is not allowed, as a result, DM becomes stable. We derive the evolution equations of the distribution function in the quantum field theory and compare it with the Boltzmann equation. We apply this mechanism to a realistic model of neutrino and dark matter.
△ Less
Submitted 11 July, 2024;
originally announced July 2024.
-
Dimensionality Engineering of Magnetic Anisotropy from Anomalous Hall Effect in Synthetic SrRuO3 Crystals
Authors:
Seung Gyo Jeong,
Seong Won Cho,
Sehwan Song,
Jin Young Oh,
Do Gyeom Jeong,
Gyeongtak Han,
Hu Young Jeong,
Ahmed Yousef Mohamed,
Woo-suk Noh,
Sungkyun Park,
Jong Seok Lee,
Suyoun Lee,
Young-Min Kim,
Deok-Yong Cho,
Woo Seok Choi
Abstract:
Magnetic anisotropy in atomically thin correlated heterostructures is essential for exploring quantum magnetic phases for next-generation spintronics. Whereas previous studies have mostly focused on van der Waals systems, here, we investigate the impact of dimensionality of epitaxially-grown correlated oxides down to the monolayer limit on structural, magnetic, and orbital anisotropies. By designi…
▽ More
Magnetic anisotropy in atomically thin correlated heterostructures is essential for exploring quantum magnetic phases for next-generation spintronics. Whereas previous studies have mostly focused on van der Waals systems, here, we investigate the impact of dimensionality of epitaxially-grown correlated oxides down to the monolayer limit on structural, magnetic, and orbital anisotropies. By designing oxide superlattices with a correlated ferromagnetic SrRuO3 and nonmagnetic SrTiO3 layers, we observed modulated ferromagnetic behavior with the change of the SrRuO3 thickness. Especially, for three-unit-cell-thick layers, we observe a significant 1,500% improvement of coercive field in the anomalous Hall effect, which cannot be solely attributed to the dimensional crossover in ferromagnetism. The atomic-scale heterostructures further reveal the systematic modulation of anisotropy for the lattice structure and orbital hybridization, explaining the enhanced magnetic anisotropy. Our findings provide valuable insights into engineering the anisotropic hybridization of synthetic magnetic crystals, offering a tunable spin order for various applications.
△ Less
Submitted 3 July, 2024;
originally announced July 2024.
-
A Wireless, Multicolor Fluorescence Image Sensor Implant for Real-Time Monitoring in Cancer Therapy
Authors:
Micah Roschelle,
Rozhan Rabbani,
Surin Gweon,
Rohan Kumar,
Alec Vercruysse,
Nam Woo Cho,
Matthew H. Spitzer,
Ali M. Niknejad,
Vladimir M. Stojanovic,
Mekhail Anwar
Abstract:
Real-time monitoring of dynamic biological processes in the body is critical to understanding disease progression and treatment response. This data, for instance, can help address the lower than 50% response rates to cancer immunotherapy. However, current clinical imaging modalities lack the molecular contrast, resolution, and chronic usability for rapid and accurate response assessments. Here, we…
▽ More
Real-time monitoring of dynamic biological processes in the body is critical to understanding disease progression and treatment response. This data, for instance, can help address the lower than 50% response rates to cancer immunotherapy. However, current clinical imaging modalities lack the molecular contrast, resolution, and chronic usability for rapid and accurate response assessments. Here, we present a fully wireless image sensor featuring a 2.5$\times$5 mm$^2$ CMOS integrated circuit for multicolor fluorescence imaging deep in tissue. The sensor operates wirelessly via ultrasound (US) at 5 cm depth in oil, harvesting energy with 221 mW/cm$^{2}$ incident US power density (31% of FDA limits) and backscattering data at 13 kbps with a bit error rate <$10^{-6}$. In-situ fluorescence excitation is provided by micro-laser diodes controlled with a programmable on-chip driver. An optical frontend combining a multi-bandpass interference filter and a fiber optic plate provides >6 OD excitation blocking and enables three-color imaging for detecting multiple cell types. A 36$\times$40-pixel array captures images with <125 $μ$m resolution. We demonstrate wireless, dual-color fluorescence imaging of both effector and suppressor immune cells in ex vivo mouse tumor samples with and without immunotherapy. These results show promise for providing rapid insight into therapeutic response and resistance, guiding personalized medicine.
△ Less
Submitted 27 June, 2024;
originally announced June 2024.
-
ToxiCloakCN: Evaluating Robustness of Offensive Language Detection in Chinese with Cloaking Perturbations
Authors:
Yunze Xiao,
Yujia Hu,
Kenny Tsu Wei Choo,
Roy Ka-wei Lee
Abstract:
Detecting hate speech and offensive language is essential for maintaining a safe and respectful digital environment. This study examines the limitations of state-of-the-art large language models (LLMs) in identifying offensive content within systematically perturbed data, with a focus on Chinese, a language particularly susceptible to such perturbations. We introduce \textsf{ToxiCloakCN}, an enhan…
▽ More
Detecting hate speech and offensive language is essential for maintaining a safe and respectful digital environment. This study examines the limitations of state-of-the-art large language models (LLMs) in identifying offensive content within systematically perturbed data, with a focus on Chinese, a language particularly susceptible to such perturbations. We introduce \textsf{ToxiCloakCN}, an enhanced dataset derived from ToxiCN, augmented with homophonic substitutions and emoji transformations, to test the robustness of LLMs against these cloaking perturbations. Our findings reveal that existing models significantly underperform in detecting offensive content when these perturbations are applied. We provide an in-depth analysis of how different types of offensive content are affected by these perturbations and explore the alignment between human and model explanations of offensiveness. Our work highlights the urgent need for more advanced techniques in offensive language detection to combat the evolving tactics used to evade detection mechanisms.
△ Less
Submitted 17 June, 2024;
originally announced June 2024.
-
Towards Understanding Emotions for Engaged Mental Health Conversations
Authors:
Kellie Yu Hui Sim,
Kohleen Tijing Fortuno,
Kenny Tsu Wei Choo
Abstract:
Providing timely support and intervention is crucial in mental health settings. As the need to engage youth comfortable with texting increases, mental health providers are exploring and adopting text-based media such as chatbots, community-based forums, online therapies with licensed professionals, and helplines operated by trained responders. To support these text-based media for mental health--p…
▽ More
Providing timely support and intervention is crucial in mental health settings. As the need to engage youth comfortable with texting increases, mental health providers are exploring and adopting text-based media such as chatbots, community-based forums, online therapies with licensed professionals, and helplines operated by trained responders. To support these text-based media for mental health--particularly for crisis care--we are developing a system to perform passive emotion-sensing using a combination of keystroke dynamics and sentiment analysis. Our early studies of this system posit that the analysis of short text messages and keyboard typing patterns can provide emotion information that may be used to support both clients and responders. We use our preliminary findings to discuss the way forward for applying AI to support mental health providers in providing better care.
△ Less
Submitted 16 June, 2024;
originally announced June 2024.
-
Massively Multiagent Minigames for Training Generalist Agents
Authors:
Kyoung Whan Choe,
Ryan Sullivan,
Joseph Suárez
Abstract:
We present Meta MMO, a collection of many-agent minigames for use as a reinforcement learning benchmark. Meta MMO is built on top of Neural MMO, a massively multiagent environment that has been the subject of two previous NeurIPS competitions. Our work expands Neural MMO with several computationally efficient minigames. We explore generalization across Meta MMO by learning to play several minigame…
▽ More
We present Meta MMO, a collection of many-agent minigames for use as a reinforcement learning benchmark. Meta MMO is built on top of Neural MMO, a massively multiagent environment that has been the subject of two previous NeurIPS competitions. Our work expands Neural MMO with several computationally efficient minigames. We explore generalization across Meta MMO by learning to play several minigames with a single set of weights. We release the environment, baselines, and training code under the MIT license. We hope that Meta MMO will spur additional progress on Neural MMO and, more generally, will serve as a useful benchmark for many-agent generalization.
△ Less
Submitted 7 June, 2024;
originally announced June 2024.
-
Randomized Exploration for Reinforcement Learning with Multinomial Logistic Function Approximation
Authors:
Wooseong Cho,
Taehyun Hwang,
Joongkyu Lee,
Min-hwan Oh
Abstract:
We study reinforcement learning with multinomial logistic (MNL) function approximation where the underlying transition probability kernel of the Markov decision processes (MDPs) is parametrized by an unknown transition core with features of state and action. For the finite horizon episodic setting with inhomogeneous state transitions, we propose provably efficient algorithms with randomized explor…
▽ More
We study reinforcement learning with multinomial logistic (MNL) function approximation where the underlying transition probability kernel of the Markov decision processes (MDPs) is parametrized by an unknown transition core with features of state and action. For the finite horizon episodic setting with inhomogeneous state transitions, we propose provably efficient algorithms with randomized exploration having frequentist regret guarantees. For our first algorithm, $\texttt{RRL-MNL}$, we adapt optimistic sampling to ensure the optimism of the estimated value function with sufficient frequency and establish that $\texttt{RRL-MNL}$ is both statistically and computationally efficient, achieving a $\tilde{O}(κ^{-1} d^{\frac{3}{2}} H^{\frac{3}{2}} \sqrt{T})$ frequentist regret bound with constant-time computational cost per episode. Here, $d$ is the dimension of the transition core, $H$ is the horizon length, $T$ is the total number of steps, and $κ$ is a problem-dependent constant. Despite the simplicity and practicality of $\texttt{RRL-MNL}$, its regret bound scales with $κ^{-1}$, which is potentially large in the worst case. To improve the dependence on $κ^{-1}$, we propose $\texttt{ORRL-MNL}$, which estimates the value function using local gradient information of the MNL transition model. We show that its frequentist regret bound is $\tilde{O}(d^{\frac{3}{2}} H^{\frac{3}{2}} \sqrt{T} + κ^{-1} d^2 H^2)$. To the best of our knowledge, these are the first randomized RL algorithms for the MNL transition model that achieve both computational and statistical efficiency. Numerical experiments demonstrate the superior performance of the proposed algorithms.
△ Less
Submitted 30 May, 2024;
originally announced May 2024.
-
Wall-Street: Smart Surface-Enabled 5G mmWave for Roadside Networking
Authors:
Kun Woo Cho,
Prasanthi Maddala,
Ivan Seskar,
Kyle Jamieson
Abstract:
5G mmWave roadside networks promise high-speed wireless connectivity, but face significant challenges in maintaining reliable connections for users moving at high speed. Frequent handovers, complex beam alignment, and signal attenuation due to obstacles like car bodies lead to service interruptions and degraded performance. We present Wall-Street, a smart surface installed on vehicles to enhance 5…
▽ More
5G mmWave roadside networks promise high-speed wireless connectivity, but face significant challenges in maintaining reliable connections for users moving at high speed. Frequent handovers, complex beam alignment, and signal attenuation due to obstacles like car bodies lead to service interruptions and degraded performance. We present Wall-Street, a smart surface installed on vehicles to enhance 5G mmWave connectivity for users inside. Wall-Street improves mobility management by (1) steering outdoor mmWave signals into the vehicle, ensuring coverage for all users; (2) enabling simultaneous serving cell data transfer and candidate handover cell measurement, allowing seamless handovers without service interruption; and (3) combining beams from source and target cells during a handover to increase reliability. Through its flexible and diverse signal manipulation capabilities, Wall-Street provides uninterrupted high-speed connectivity for latency-sensitive applications in challenging mobile environments. We have implemented and integrated Wall-Street in the COSMOS testbed and evaluated its real-time performance with four gNBs and a mobile client inside a surface-enabled vehicle, driving on a nearby road. Wall-Street achieves a 2.5-3.4x TCP throughput improvement and a 0.4-0.8x reduction in delay over a baseline 5G Standalone handover protocol.
△ Less
Submitted 10 May, 2024;
originally announced May 2024.
-
SGHateCheck: Functional Tests for Detecting Hate Speech in Low-Resource Languages of Singapore
Authors:
Ri Chi Ng,
Nirmalendu Prakash,
Ming Shan Hee,
Kenny Tsu Wei Choo,
Roy Ka-Wei Lee
Abstract:
To address the limitations of current hate speech detection models, we introduce \textsf{SGHateCheck}, a novel framework designed for the linguistic and cultural context of Singapore and Southeast Asia. It extends the functional testing approach of HateCheck and MHC, employing large language models for translation and paraphrasing into Singapore's main languages, and refining these with native ann…
▽ More
To address the limitations of current hate speech detection models, we introduce \textsf{SGHateCheck}, a novel framework designed for the linguistic and cultural context of Singapore and Southeast Asia. It extends the functional testing approach of HateCheck and MHC, employing large language models for translation and paraphrasing into Singapore's main languages, and refining these with native annotators. \textsf{SGHateCheck} reveals critical flaws in state-of-the-art models, highlighting their inadequacy in sensitive content moderation. This work aims to foster the development of more effective hate speech detection tools for diverse linguistic environments, particularly for Singapore and Southeast Asia contexts.
△ Less
Submitted 3 May, 2024;
originally announced May 2024.
-
Ultralow-Power Single-Sensor-Based E-Nose System Powered by Duty Cycling and Deep Learning for Real-Time Gas Identification
Authors:
Taejung Kim,
Yonggi Kim,
Wootaek Cho,
Jong-Hyun Kwak,
Jeonghoon Cho,
Youjang Pyeon,
Jae Joon Kim,
Heungjoo Shin
Abstract:
This study presents a novel, ultralow-power single-sensor-based electronic nose (e-nose) system for real-time gas identification, distinguishing itself from conventional sensor-array-based e-nose systems whose power consumption and cost increase with the number of sensors. Our system employs a single metal oxide semiconductor (MOS) sensor built on a suspended 1D nanoheater, driven by duty cycling-…
▽ More
This study presents a novel, ultralow-power single-sensor-based electronic nose (e-nose) system for real-time gas identification, distinguishing itself from conventional sensor-array-based e-nose systems whose power consumption and cost increase with the number of sensors. Our system employs a single metal oxide semiconductor (MOS) sensor built on a suspended 1D nanoheater, driven by duty cycling-characterized by repeated pulsed power inputs. The sensor's ultrafast thermal response, enabled by its small size, effectively decouples the effects of temperature and surface charge exchange on the MOS nanomaterial's conductivity. This provides distinct sensing signals that alternate between responses coupled with and decoupled from the thermally enhanced conductivity, all within a single time domain during duty cycling. The magnitude and ratio of these dual responses vary depending on the gas type and concentration, facilitating the early-stage gas identification of five gas types within 30 s via a convolutional neural network (classification accuracy = 93.9%, concentration regression error = 19.8%). Additionally, the duty-cycling mode significantly reduces power consumption by up to 90%, lowering it to 160 $μ$W to heat the sensor to 250$^\circ$C. Manufactured using only wafer-level batch microfabrication processes, this innovative e-nose system promises the facile implementation of battery-driven, long-term, and cost-effective IoT monitoring systems.
△ Less
Submitted 25 April, 2024;
originally announced April 2024.
-
Estimating the Distribution of Parameters in Differential Equations with Repeated Cross-Sectional Data
Authors:
Hyeontae Jo,
Sung Woong Cho,
Hyung Ju Hwang
Abstract:
Differential equations are pivotal in modeling and understanding the dynamics of various systems, offering insights into their future states through parameter estimation fitted to time series data. In fields such as economy, politics, and biology, the observation data points in the time series are often independently obtained (i.e., Repeated Cross-Sectional (RCS) data). With RCS data, we found tha…
▽ More
Differential equations are pivotal in modeling and understanding the dynamics of various systems, offering insights into their future states through parameter estimation fitted to time series data. In fields such as economy, politics, and biology, the observation data points in the time series are often independently obtained (i.e., Repeated Cross-Sectional (RCS) data). With RCS data, we found that traditional methods for parameter estimation in differential equations, such as using mean values of time trajectories or Gaussian Process-based trajectory generation, have limitations in estimating the shape of parameter distributions, often leading to a significant loss of data information. To address this issue, we introduce a novel method, Estimation of Parameter Distribution (EPD), providing accurate distribution of parameters without loss of data information. EPD operates in three main steps: generating synthetic time trajectories by randomly selecting observed values at each time point, estimating parameters of a differential equation that minimize the discrepancy between these trajectories and the true solution of the equation, and selecting the parameters depending on the scale of discrepancy. We then evaluated the performance of EPD across several models, including exponential growth, logistic population models, and target cell-limited models with delayed virus production, demonstrating its superiority in capturing the shape of parameter distributions. Furthermore, we applied EPD to real-world datasets, capturing various shapes of parameter distributions rather than a normal distribution. These results effectively address the heterogeneity within systems, marking a substantial progression in accurately modeling systems using RCS data.
△ Less
Submitted 23 April, 2024;
originally announced April 2024.
-
Evaluating Span Extraction in Generative Paradigm: A Reflection on Aspect-Based Sentiment Analysis
Authors:
Soyoung Yang,
Won Ik Cho
Abstract:
In the era of rapid evolution of generative language models within the realm of natural language processing, there is an imperative call to revisit and reformulate evaluation methodologies, especially in the domain of aspect-based sentiment analysis (ABSA). This paper addresses the emerging challenges introduced by the generative paradigm, which has moderately blurred traditional boundaries betwee…
▽ More
In the era of rapid evolution of generative language models within the realm of natural language processing, there is an imperative call to revisit and reformulate evaluation methodologies, especially in the domain of aspect-based sentiment analysis (ABSA). This paper addresses the emerging challenges introduced by the generative paradigm, which has moderately blurred traditional boundaries between understanding and generation tasks. Building upon prevailing practices in the field, we analyze the advantages and shortcomings associated with the prevalent ABSA evaluation paradigms. Through an in-depth examination, supplemented by illustrative examples, we highlight the intricacies involved in aligning generative outputs with other evaluative metrics, specifically those derived from other tasks, including question answering. While we steer clear of advocating for a singular and definitive metric, our contribution lies in paving the path for a comprehensive guideline tailored for ABSA evaluations in this generative paradigm. In this position paper, we aim to provide practitioners with profound reflections, offering insights and directions that can aid in navigating this evolving landscape, ensuring evaluations that are both accurate and reflective of generative capabilities.
△ Less
Submitted 17 April, 2024;
originally announced April 2024.
-
Three Disclaimers for Safe Disclosure: A Cardwriter for Reporting the Use of Generative AI in Writing Process
Authors:
Won Ik Cho,
Eunjung Cho,
Hyeonji Shin
Abstract:
Generative artificial intelligence (AI) and large language models (LLMs) are increasingly being used in the academic writing process. This is despite the current lack of unified framework for reporting the use of machine assistance. In this work, we propose "Cardwriter", an intuitive interface that produces a short report for authors to declare their use of generative AI in their writing process.…
▽ More
Generative artificial intelligence (AI) and large language models (LLMs) are increasingly being used in the academic writing process. This is despite the current lack of unified framework for reporting the use of machine assistance. In this work, we propose "Cardwriter", an intuitive interface that produces a short report for authors to declare their use of generative AI in their writing process. The demo is available online, at https://cardwriter.vercel.app
△ Less
Submitted 13 April, 2024;
originally announced April 2024.
-
On Finite Presentability of Subsemigroups of the Monogenic Free Inverse Semigroup
Authors:
Yung Won Cho,
Nik Ruskuc
Abstract:
The monogenic free inverse semigroup $FI_1$ is not finitely presented as a semigroup due to the classic result by Schein (1975). We extend this result and prove that a finitely generated subsemigroup of $FI_1$ is finitely presented if and only if it contains only finitely many idempotents. As a consequence, we derive that an inverse subsemigroup of $FI_1$ is finitely presented as a semigroup if an…
▽ More
The monogenic free inverse semigroup $FI_1$ is not finitely presented as a semigroup due to the classic result by Schein (1975). We extend this result and prove that a finitely generated subsemigroup of $FI_1$ is finitely presented if and only if it contains only finitely many idempotents. As a consequence, we derive that an inverse subsemigroup of $FI_1$ is finitely presented as a semigroup if and only if it is a finite semilattice.
△ Less
Submitted 12 June, 2024; v1 submitted 8 April, 2024;
originally announced April 2024.
-
A Taxonomy for Human-LLM Interaction Modes: An Initial Exploration
Authors:
Jie Gao,
Simret Araya Gebreegziabher,
Kenny Tsu Wei Choo,
Toby Jia-Jun Li,
Simon Tangi Perrault,
Thomas W. Malone
Abstract:
With ChatGPT's release, conversational prompting has become the most popular form of human-LLM interaction. However, its effectiveness is limited for more complex tasks involving reasoning, creativity, and iteration. Through a systematic analysis of HCI papers published since 2021, we identified four key phases in the human-LLM interaction flow - planning, facilitating, iterating, and testing - to…
▽ More
With ChatGPT's release, conversational prompting has become the most popular form of human-LLM interaction. However, its effectiveness is limited for more complex tasks involving reasoning, creativity, and iteration. Through a systematic analysis of HCI papers published since 2021, we identified four key phases in the human-LLM interaction flow - planning, facilitating, iterating, and testing - to precisely understand the dynamics of this process. Additionally, we have developed a taxonomy of four primary interaction modes: Mode 1: Standard Prompting, Mode 2: User Interface, Mode 3: Context-based, and Mode 4: Agent Facilitator. This taxonomy was further enriched using the "5W1H" guideline method, which involved a detailed examination of definitions, participant roles (Who), the phases that happened (When), human objectives and LLM abilities (What), and the mechanics of each interaction mode (How). We anticipate this taxonomy will contribute to the future design and evaluation of human-LLM interaction.
△ Less
Submitted 30 March, 2024;
originally announced April 2024.
-
Graph products of residually finite monoids are residually finite
Authors:
Jung Won Cho,
Victoria Gould,
Nik Ruškuc,
Dandan Yang
Abstract:
We show that any graph product of residually finite monoids is residually finite. As a special case we obtain that any free product of residually finite monoids is residually finite. The corresponding results for graph products of semigroups follow.
We show that any graph product of residually finite monoids is residually finite. As a special case we obtain that any free product of residually finite monoids is residually finite. The corresponding results for graph products of semigroups follow.
△ Less
Submitted 27 August, 2024; v1 submitted 20 March, 2024;
originally announced March 2024.
-
Overcoming Data Inequality across Domains with Semi-Supervised Domain Generalization
Authors:
Jinha Park,
Wonguk Cho,
Taesup Kim
Abstract:
While there have been considerable advancements in machine learning driven by extensive datasets, a significant disparity still persists in the availability of data across various sources and populations. This inequality across domains poses challenges in modeling for those with limited data, which can lead to profound practical and ethical concerns. In this paper, we address a representative case…
▽ More
While there have been considerable advancements in machine learning driven by extensive datasets, a significant disparity still persists in the availability of data across various sources and populations. This inequality across domains poses challenges in modeling for those with limited data, which can lead to profound practical and ethical concerns. In this paper, we address a representative case of data inequality problem across domains termed Semi-Supervised Domain Generalization (SSDG), in which only one domain is labeled while the rest are unlabeled. We propose a novel algorithm, ProUD, which can effectively learn domain-invariant features via domain-aware prototypes along with progressive generalization via uncertainty-adaptive mixing of labeled and unlabeled domains. Our experiments on three different benchmark datasets demonstrate the effectiveness of ProUD, outperforming all baseline models including single domain generalization and semi-supervised learning. Source code will be released upon acceptance of the paper.
△ Less
Submitted 8 March, 2024;
originally announced March 2024.
-
Learning time-dependent PDE via graph neural networks and deep operator network for robust accuracy on irregular grids
Authors:
Sung Woong Cho,
Jae Yong Lee,
Hyung Ju Hwang
Abstract:
Scientific computing using deep learning has seen significant advancements in recent years. There has been growing interest in models that learn the operator from the parameters of a partial differential equation (PDE) to the corresponding solutions. Deep Operator Network (DeepONet) and Fourier Neural operator, among other models, have been designed with structures suitable for handling functions…
▽ More
Scientific computing using deep learning has seen significant advancements in recent years. There has been growing interest in models that learn the operator from the parameters of a partial differential equation (PDE) to the corresponding solutions. Deep Operator Network (DeepONet) and Fourier Neural operator, among other models, have been designed with structures suitable for handling functions as inputs and outputs, enabling real-time predictions as surrogate models for solution operators. There has also been significant progress in the research on surrogate models based on graph neural networks (GNNs), specifically targeting the dynamics in time-dependent PDEs. In this paper, we propose GraphDeepONet, an autoregressive model based on GNNs, to effectively adapt DeepONet, which is well-known for successful operator learning. GraphDeepONet exhibits robust accuracy in predicting solutions compared to existing GNN-based PDE solver models. It maintains consistent performance even on irregular grids, leveraging the advantages inherited from DeepONet and enabling predictions on arbitrary grids. Additionally, unlike traditional DeepONet and its variants, GraphDeepONet enables time extrapolation for time-dependent PDE solutions. We also provide theoretical analysis of the universal approximation capability of GraphDeepONet in approximating continuous operators across arbitrary time intervals.
△ Less
Submitted 12 February, 2024;
originally announced February 2024.
-
Generators and presentations of inverse subsemigroups of the monogenic free inverse semigroup
Authors:
Jung Won Cho,
Nik Ruskuc
Abstract:
It was proved by Oliveira and Silva (2005) that every finitely generated inverse subsemigroup of the monogenic free inverse semigroup $FI_1$ is finitely presented. The present paper continues this development, and gives generating sets and presentations for general (i.e. not necessarily finitely generated) inverse subsemigroups of $FI_1$. For an inverse semigroup $S$ and an inverse subsemigroup…
▽ More
It was proved by Oliveira and Silva (2005) that every finitely generated inverse subsemigroup of the monogenic free inverse semigroup $FI_1$ is finitely presented. The present paper continues this development, and gives generating sets and presentations for general (i.e. not necessarily finitely generated) inverse subsemigroups of $FI_1$. For an inverse semigroup $S$ and an inverse subsemigroup $T$ of $S$, we say $S$ is finitely generated modulo $T$ if there is a finite set $A$ such that $S = \langle T, A \rangle$. Likewise, we say that $S$ is finitely presented modulo $T $ if $S$ can be defined by a presentation of the form $\text{Inv}\langle X, Y \mid R, Q\rangle$, where $\text{Inv}\langle X\mid R\rangle$ is a presentation for $T$ and $Y$ and $Q$ are finite. We show that every inverse subsemigroup $S$ of $FI_1$ is finitely generated modulo its semilattice of idempotents $E(S)$. By way of contrast, we show that when $S\neq E(S)$, it can never be finitely presented modulo $E(S)$. However, in the process we establish some nice (albeit infinite) presentations for $S$ modulo $E(S)$.
△ Less
Submitted 8 February, 2024;
originally announced February 2024.
-
Composition dependence of bulk properties in the Co-intercalated transition-metal dichalcogenide Co$_{1/3}$TaS$_2$
Authors:
Pyeongjae Park,
Woonghee Cho,
Chaebin Kim,
Yeochan An,
Maxim Avdeev,
Kazuki Iida,
Ryoichi Kajimoto,
Je-Geun Park
Abstract:
Spontaneous Hall conductivity has recently been reported in the triangular lattice antiferromagnet Co$_{1/3}$TaS$_2$ under a zero magnetic field. This phenomenon originates from the distinctive noncoplanar triple-Q magnetic ground state, possessing uniform real-space Berry curvature characterized by scalar spin chirality. We investigated the physical properties of Co$_{1/3}$TaS$_2$ by judiciously…
▽ More
Spontaneous Hall conductivity has recently been reported in the triangular lattice antiferromagnet Co$_{1/3}$TaS$_2$ under a zero magnetic field. This phenomenon originates from the distinctive noncoplanar triple-Q magnetic ground state, possessing uniform real-space Berry curvature characterized by scalar spin chirality. We investigated the physical properties of Co$_{1/3}$TaS$_2$ by judiciously controlling the composition, revealing a drastic change in its bulk properties, even by slight variations in cobalt composition, despite the same crystal structure. For $0.299 < x < 0.325$, Co$_x$TaS$_2$ keeps all the characteristics of the ground state consistent with the previous studies -- two antiferromagnetic phase transitions at $T_{N1}$ and $T_{N2} (< T_{N1})$, a large spontaneous Hall conductivity ($σ_{xy} (H=0)$), and a weak ferromagnetic moment along the c-axis. However, samples with $x > 0.330$ exhibit distinct bulk properties, including the absence of both $σ_{xy} (H=0)$ and the weak ferromagnetic moment. Our neutron diffraction data reveal that Co$_x$TaS$_2$ with $x > 0.330$ develops coplanar helical magnetic order with $q_{m1} = (1/3, 0, 0)$. This is entirely different from what has been seen in $x < 0.325$, explaining the observed composition dependence.
△ Less
Submitted 18 January, 2024;
originally announced January 2024.
-
HyperDeepONet: learning operator with complex target function space using the limited resources via hypernetwork
Authors:
Jae Yong Lee,
Sung Woong Cho,
Hyung Ju Hwang
Abstract:
Fast and accurate predictions for complex physical dynamics are a significant challenge across various applications. Real-time prediction on resource-constrained hardware is even more crucial in real-world problems. The deep operator network (DeepONet) has recently been proposed as a framework for learning nonlinear mappings between function spaces. However, the DeepONet requires many parameters a…
▽ More
Fast and accurate predictions for complex physical dynamics are a significant challenge across various applications. Real-time prediction on resource-constrained hardware is even more crucial in real-world problems. The deep operator network (DeepONet) has recently been proposed as a framework for learning nonlinear mappings between function spaces. However, the DeepONet requires many parameters and has a high computational cost when learning operators, particularly those with complex (discontinuous or non-smooth) target functions. This study proposes HyperDeepONet, which uses the expressive power of the hypernetwork to enable the learning of a complex operator with a smaller set of parameters. The DeepONet and its variant models can be thought of as a method of injecting the input function information into the target function. From this perspective, these models can be viewed as a particular case of HyperDeepONet. We analyze the complexity of DeepONet and conclude that HyperDeepONet needs relatively lower complexity to obtain the desired accuracy for operator learning. HyperDeepONet successfully learned various operators with fewer computational resources compared to other benchmarks.
△ Less
Submitted 26 December, 2023;
originally announced December 2023.
-
iDet3D: Towards Efficient Interactive Object Detection for LiDAR Point Clouds
Authors:
Dongmin Choi,
Wonwoo Cho,
Kangyeol Kim,
Jaegul Choo
Abstract:
Accurately annotating multiple 3D objects in LiDAR scenes is laborious and challenging. While a few previous studies have attempted to leverage semi-automatic methods for cost-effective bounding box annotation, such methods have limitations in efficiently handling numerous multi-class objects. To effectively accelerate 3D annotation pipelines, we propose iDet3D, an efficient interactive 3D object…
▽ More
Accurately annotating multiple 3D objects in LiDAR scenes is laborious and challenging. While a few previous studies have attempted to leverage semi-automatic methods for cost-effective bounding box annotation, such methods have limitations in efficiently handling numerous multi-class objects. To effectively accelerate 3D annotation pipelines, we propose iDet3D, an efficient interactive 3D object detector. Supporting a user-friendly 2D interface, which can ease the cognitive burden of exploring 3D space to provide click interactions, iDet3D enables users to annotate the entire objects in each scene with minimal interactions. Taking the sparse nature of 3D point clouds into account, we design a negative click simulation (NCS) to improve accuracy by reducing false-positive predictions. In addition, iDet3D incorporates two click propagation techniques to take full advantage of user interactions: (1) dense click guidance (DCG) for keeping user-provided information throughout the network and (2) spatial click propagation (SCP) for detecting other instances of the same class based on the user-specified objects. Through our extensive experiments, we present that our method can construct precise annotations in a few clicks, which shows the practicality as an efficient annotation tool for 3D object detection.
△ Less
Submitted 24 December, 2023;
originally announced December 2023.
-
Singular Hall response from a correlated ferromagnetic flat nodal-line semimetal
Authors:
Woohyun Cho,
Yoon-Gu Kang,
Jaehun Cha,
Dong Hyun David Lee,
Do Hoon Kiem,
Jaewhan Oh,
Jongho Park,
Changyoung Kim,
Yongsoo Yang,
Yeong Kwan Kim,
Myung Joon Han,
Heejun Yang
Abstract:
Topological quantum phases have been largely understood in weakly correlated systems, which have identified various quantum phenomena such as spin Hall effect, protected transport of helical fermions, and topological superconductivity. Robust ferromagnetic order in correlated topological materials particularly attracts attention, as it can provide a versatile platform for novel quantum devices. He…
▽ More
Topological quantum phases have been largely understood in weakly correlated systems, which have identified various quantum phenomena such as spin Hall effect, protected transport of helical fermions, and topological superconductivity. Robust ferromagnetic order in correlated topological materials particularly attracts attention, as it can provide a versatile platform for novel quantum devices. Here, we report singular Hall response arising from a unique band structure of flat topological nodal lines in combination with electron correlation in an itinerant, van der Waals ferromagnetic semimetal, Fe3GaTe2, with a high Curie temperature of Tc=360 K. High anomalous Hall conductivity violating the conventional scaling, resistivity upturn at low temperature, and a large Sommerfeld coefficient are observed in Fe3GaTe2, which implies heavy fermion features in this ferromagnetic topological material. Our circular dichroism in angle-resolved photoemission spectroscopy and theoretical calculations support the original electronic features in the material. Thus, low-dimensional Fe3GaTe2 with electronic correlation, topology, and room-temperature ferromagnetic order appears to be a promising candidate for robust quantum devices.
△ Less
Submitted 20 December, 2023;
originally announced December 2023.
-
Learning Flexible Body Collision Dynamics with Hierarchical Contact Mesh Transformer
Authors:
Youn-Yeol Yu,
Jeongwhan Choi,
Woojin Cho,
Kookjin Lee,
Nayong Kim,
Kiseok Chang,
Chang-Seung Woo,
Ilho Kim,
Seok-Woo Lee,
Joon-Young Yang,
Sooyoung Yoon,
Noseong Park
Abstract:
Recently, many mesh-based graph neural network (GNN) models have been proposed for modeling complex high-dimensional physical systems. Remarkable achievements have been made in significantly reducing the solving time compared to traditional numerical solvers. These methods are typically designed to i) reduce the computational cost in solving physical dynamics and/or ii) propose techniques to enhan…
▽ More
Recently, many mesh-based graph neural network (GNN) models have been proposed for modeling complex high-dimensional physical systems. Remarkable achievements have been made in significantly reducing the solving time compared to traditional numerical solvers. These methods are typically designed to i) reduce the computational cost in solving physical dynamics and/or ii) propose techniques to enhance the solution accuracy in fluid and rigid body dynamics. However, it remains under-explored whether they are effective in addressing the challenges of flexible body dynamics, where instantaneous collisions occur within a very short timeframe. In this paper, we present Hierarchical Contact Mesh Transformer (HCMT), which uses hierarchical mesh structures and can learn long-range dependencies (occurred by collisions) among spatially distant positions of a body -- two close positions in a higher-level mesh correspond to two distant positions in a lower-level mesh. HCMT enables long-range interactions, and the hierarchical mesh structure quickly propagates collision effects to faraway positions. To this end, it consists of a contact mesh Transformer and a hierarchical mesh Transformer (CMT and HMT, respectively). Lastly, we propose a flexible body dynamics dataset, consisting of trajectories that reflect experimental settings frequently used in the display industry for product designs. We also compare the performance of several baselines using well-known benchmark datasets. Our results show that HCMT provides significant performance improvements over existing methods. Our code is available at https://github.com/yuyudeep/hcmt.
△ Less
Submitted 25 March, 2024; v1 submitted 19 December, 2023;
originally announced December 2023.
-
Operator-learning-inspired Modeling of Neural Ordinary Differential Equations
Authors:
Woojin Cho,
Seunghyeon Cho,
Hyundong Jin,
Jinsung Jeon,
Kookjin Lee,
Sanghyun Hong,
Dongeun Lee,
Jonghyun Choi,
Noseong Park
Abstract:
Neural ordinary differential equations (NODEs), one of the most influential works of the differential equation-based deep learning, are to continuously generalize residual networks and opened a new field. They are currently utilized for various downstream tasks, e.g., image classification, time series classification, image generation, etc. Its key part is how to model the time-derivative of the hi…
▽ More
Neural ordinary differential equations (NODEs), one of the most influential works of the differential equation-based deep learning, are to continuously generalize residual networks and opened a new field. They are currently utilized for various downstream tasks, e.g., image classification, time series classification, image generation, etc. Its key part is how to model the time-derivative of the hidden state, denoted dh(t)/dt. People have habitually used conventional neural network architectures, e.g., fully-connected layers followed by non-linear activations. In this paper, however, we present a neural operator-based method to define the time-derivative term. Neural operators were initially proposed to model the differential operator of partial differential equations (PDEs). Since the time-derivative of NODEs can be understood as a special type of the differential operator, our proposed method, called branched Fourier neural operator (BFNO), makes sense. In our experiments with general downstream tasks, our method significantly outperforms existing methods.
△ Less
Submitted 15 December, 2023;
originally announced December 2023.
-
Stethoscope-guided Supervised Contrastive Learning for Cross-domain Adaptation on Respiratory Sound Classification
Authors:
June-Woo Kim,
Sangmin Bae,
Won-Yang Cho,
Byungjo Lee,
Ho-Young Jung
Abstract:
Despite the remarkable advances in deep learning technology, achieving satisfactory performance in lung sound classification remains a challenge due to the scarcity of available data. Moreover, the respiratory sound samples are collected from a variety of electronic stethoscopes, which could potentially introduce biases into the trained models. When a significant distribution shift occurs within t…
▽ More
Despite the remarkable advances in deep learning technology, achieving satisfactory performance in lung sound classification remains a challenge due to the scarcity of available data. Moreover, the respiratory sound samples are collected from a variety of electronic stethoscopes, which could potentially introduce biases into the trained models. When a significant distribution shift occurs within the test dataset or in a practical scenario, it can substantially decrease the performance. To tackle this issue, we introduce cross-domain adaptation techniques, which transfer the knowledge from a source domain to a distinct target domain. In particular, by considering different stethoscope types as individual domains, we propose a novel stethoscope-guided supervised contrastive learning approach. This method can mitigate any domain-related disparities and thus enables the model to distinguish respiratory sounds of the recording variation of the stethoscope. The experimental results on the ICBHI dataset demonstrate that the proposed methods are effective in reducing the domain dependency and achieving the ICBHI Score of 61.71%, which is a significant improvement of 2.16% over the baseline.
△ Less
Submitted 15 December, 2023;
originally announced December 2023.
-
Electronic band structure of Sb2Te3
Authors:
I. Mohelsky,
J. Wyzula,
F. Le Mardele,
F. Abadizaman,
O. Caha,
A. Dubroka,
X. D. Sun,
C. W. Cho,
B. A. Piot,
M. F. Tanzim,
I. Aguilera,
G. Bauer,
G. Springholz,
M. Orlita
Abstract:
Here we report on Landau level spectroscopy of an epitaxially grown thin film of the topological insulator Sb2Te3, complemented by ellipsometry and magneto-transport measurements. The observed response suggests that Sb2Te3 is a direct-gap semiconductor with the fundamental band gap located at the Γpoint, or along the trigonal axis, and its width reaches Eg = 190 meV at low temperatures. Our data a…
▽ More
Here we report on Landau level spectroscopy of an epitaxially grown thin film of the topological insulator Sb2Te3, complemented by ellipsometry and magneto-transport measurements. The observed response suggests that Sb2Te3 is a direct-gap semiconductor with the fundamental band gap located at the Γpoint, or along the trigonal axis, and its width reaches Eg = 190 meV at low temperatures. Our data also indicate the presence of other low-energy extrema with a higher multiplicity in both the conduction and valence bands. The conclusions based on our experimental data are confronted with and to a great extent corroborated by the electronic band structure calculated using the GW method.
△ Less
Submitted 15 March, 2024; v1 submitted 12 December, 2023;
originally announced December 2023.
-
The Seoul National University AGN Monitoring Project III: H$β$ lag measurements of 32 luminous AGNs and the high-luminosity end of the size--luminosity relation
Authors:
Jong-Hak Woo,
Shu Wang,
Suvendu Rakshit,
Hojin Cho,
Donghoon Son,
Vardha N. Bennert,
Elena Gallo,
Edmund Hodges-Kluck,
Tommaso Treu,
Aaron J. Barth,
Wanjin Cho,
Adi Foord,
Jaehyuk Geum,
Hengxiao Guo,
Yashashree Jadhav,
Yiseul Jeon,
Kyle M. Kabasares,
Won-Suk Kang,
Changseok Kim,
Minjin Kim,
Tae-Woo Kim,
Huynh Anh N. Le,
Matthew A. Malkan,
Amit Kumar Mandal,
Daeseong Park
, et al. (6 additional authors not shown)
Abstract:
We present the main results from a long-term reverberation mapping campaign carried out for the Seoul National University Active Galactic Nuclei (AGN) Monitoring Project. High-quality data were obtained during 2015-2021 for 32 luminous AGNs (i.e., continuum luminosity in the range of $10^{44-46}$ erg s$^{-1}$) at a regular cadence, of 20-30 days for spectroscopy and 3-5 days for photometry. We obt…
▽ More
We present the main results from a long-term reverberation mapping campaign carried out for the Seoul National University Active Galactic Nuclei (AGN) Monitoring Project. High-quality data were obtained during 2015-2021 for 32 luminous AGNs (i.e., continuum luminosity in the range of $10^{44-46}$ erg s$^{-1}$) at a regular cadence, of 20-30 days for spectroscopy and 3-5 days for photometry. We obtain time lag measurements between the variability in the H$β$ emission and the continuum for 32 AGNs; twenty-five of those have the best lag measurements based on our quality assessment, examining correlation strength, and the posterior lag distribution. Our study significantly increases the current sample of reverberation-mapped AGNs, particularly at the moderate to high luminosity end. Combining our results with literature measurements, we derive a H$β$ broad line region size--luminosity relation with a shallower slope than reported in the literature. For a given luminosity, most of our measured lags are shorter than the expectation, implying that single-epoch black hole mass estimators based on previous calibrations could suffer large systematic uncertainties.
△ Less
Submitted 26 November, 2023;
originally announced November 2023.
-
Neural MMO 2.0: A Massively Multi-task Addition to Massively Multi-agent Learning
Authors:
Joseph Suárez,
Phillip Isola,
Kyoung Whan Choe,
David Bloomin,
Hao Xiang Li,
Nikhil Pinnaparaju,
Nishaanth Kanna,
Daniel Scott,
Ryan Sullivan,
Rose S. Shuman,
Lucas de Alcântara,
Herbie Bradley,
Louis Castricato,
Kirsty You,
Yuhao Jiang,
Qimai Li,
Jiaxin Chen,
Xiaolong Zhu
Abstract:
Neural MMO 2.0 is a massively multi-agent environment for reinforcement learning research. The key feature of this new version is a flexible task system that allows users to define a broad range of objectives and reward signals. We challenge researchers to train agents capable of generalizing to tasks, maps, and opponents never seen during training. Neural MMO features procedurally generated maps…
▽ More
Neural MMO 2.0 is a massively multi-agent environment for reinforcement learning research. The key feature of this new version is a flexible task system that allows users to define a broad range of objectives and reward signals. We challenge researchers to train agents capable of generalizing to tasks, maps, and opponents never seen during training. Neural MMO features procedurally generated maps with 128 agents in the standard setting and support for up to. Version 2.0 is a complete rewrite of its predecessor with three-fold improved performance and compatibility with CleanRL. We release the platform as free and open-source software with comprehensive documentation available at neuralmmo.github.io and an active community Discord. To spark initial research on this new platform, we are concurrently running a competition at NeurIPS 2023.
△ Less
Submitted 7 November, 2023;
originally announced November 2023.
-
Investigation of the mechanism of the anomalous Hall effects in Cr2Te3/(BiSb)2(TeSe)3 heterostructure
Authors:
Seong Won Cho,
In Hak Lee,
Youngwoong Lee,
Sangheon Kim,
Yeong Gwang Khim,
Seung-Young Park,
Younghun Jo,
Junwoo Choi,
Seungwu Han,
Young Jun Chang,
Suyoun Lee
Abstract:
The interplay between ferromagnetism and the non-trivial topology has unveiled intriguing phases in the transport of charges and spins. For example, it is consistently observed the so-called topological Hall effect (THE) featuring a hump structure in the curve of the Hall resistance (Rxy) vs. a magnetic field (H) of a heterostructure consisting of a ferromagnet (FM) and a topological insulator (TI…
▽ More
The interplay between ferromagnetism and the non-trivial topology has unveiled intriguing phases in the transport of charges and spins. For example, it is consistently observed the so-called topological Hall effect (THE) featuring a hump structure in the curve of the Hall resistance (Rxy) vs. a magnetic field (H) of a heterostructure consisting of a ferromagnet (FM) and a topological insulator (TI). The origin of the hump structure is still controversial between the topological Hall effect model and the multi-component anomalous Hall effect (AHE) model. In this work, we have investigated a heterostructure consisting of BixSb2-xTeySe3-y (BSTS) and Cr2Te3 (CT), which are well-known TI and two-dimensional FM, respectively. By using the so-called minor-loop measurement, we have found that the hump structure observed in the CT/BSTS is more likely to originate from two AHE channels. Moreover, by analyzing the scaling behavior of each amplitude of two AHE with the longitudinal resistivities of CT and BSTS, we have found that one AHE is attributed to the extrinsic contribution of CT while the other is due to the intrinsic contribution of BSTS. It implies that the proximity-induced ferromagnetic layer inside BSTS serves as a source of the intrinsic AHE, resulting in the hump structure explained by the two AHE model.
△ Less
Submitted 22 October, 2023;
originally announced October 2023.
-
WaveFlex: A Smart Surface for Private CBRS Wireless Cellular Networks
Authors:
Fan Yi,
Kun Woo Cho,
Yaxiong Xie,
Kyle Jamieson
Abstract:
We present the design and implementation of WaveFlex, the first smart surface that enhances Private LTE/5G networks operating under the shared-license framework in the Citizens Broadband Radio Service frequency band. WaveFlex works in the presence of frequency diversity: multiple nearby base stations operating on different frequencies, as dictated by a Spectrum Access System coordinator. It also h…
▽ More
We present the design and implementation of WaveFlex, the first smart surface that enhances Private LTE/5G networks operating under the shared-license framework in the Citizens Broadband Radio Service frequency band. WaveFlex works in the presence of frequency diversity: multiple nearby base stations operating on different frequencies, as dictated by a Spectrum Access System coordinator. It also handles time dynamism: due to the dynamic sharing rules of the band, base stations occasionally switch channels, especially when priority users enter the network. Finally, WaveFlex operates independently of the network itself, not requiring access to nor modification of the base station or mobile users, yet it remain compliant with and effective on prevailing cellular protocols. We have designed and fabricated WaveFlex on a custom multi-layer PCB, software defined radio-based network monitor, and supporting control software and hardware. Our experimental evaluation benchmarks an operational Private LTE network running at full line rate. Results demonstrate an 8.50 dB average SNR gain, and an average throughput gain of 4.36 Mbps for a single small cell, and 3.19 Mbps for four small cells, in a realistic indoor office scenario.
△ Less
Submitted 17 October, 2023;
originally announced October 2023.
-
Hypernetwork-based Meta-Learning for Low-Rank Physics-Informed Neural Networks
Authors:
Woojin Cho,
Kookjin Lee,
Donsub Rim,
Noseong Park
Abstract:
In various engineering and applied science applications, repetitive numerical simulations of partial differential equations (PDEs) for varying input parameters are often required (e.g., aircraft shape optimization over many design parameters) and solvers are required to perform rapid execution. In this study, we suggest a path that potentially opens up a possibility for physics-informed neural net…
▽ More
In various engineering and applied science applications, repetitive numerical simulations of partial differential equations (PDEs) for varying input parameters are often required (e.g., aircraft shape optimization over many design parameters) and solvers are required to perform rapid execution. In this study, we suggest a path that potentially opens up a possibility for physics-informed neural networks (PINNs), emerging deep-learning-based solvers, to be considered as one such solver. Although PINNs have pioneered a proper integration of deep-learning and scientific computing, they require repetitive time-consuming training of neural networks, which is not suitable for many-query scenarios. To address this issue, we propose a lightweight low-rank PINNs containing only hundreds of model parameters and an associated hypernetwork-based meta-learning algorithm, which allows efficient approximation of solutions of PDEs for varying ranges of PDE input parameters. Moreover, we show that the proposed method is effective in overcoming a challenging issue, known as "failure modes" of PINNs.
△ Less
Submitted 14 October, 2023;
originally announced October 2023.
-
Modulating spin-valley relaxation in WSe$_2$ with variable thickness VOPc layers
Authors:
Daphné Lubert-Perquel,
Byeong Wook Cho,
Alan J. Philips,
Young Hee Lee,
Jeffrey L. Blackburn,
Justin C. Johnson
Abstract:
Combining the synthetic tunability of molecular compounds with the optical selection rules of transition metal dichalcogenides (TMDC) that derive from spin-valley coupling could provide interesting opportunities for the readout of quantum information. However, little is known about the electronic and spin interactions at such interfaces and the influence on spin-valley relaxation. In this work, va…
▽ More
Combining the synthetic tunability of molecular compounds with the optical selection rules of transition metal dichalcogenides (TMDC) that derive from spin-valley coupling could provide interesting opportunities for the readout of quantum information. However, little is known about the electronic and spin interactions at such interfaces and the influence on spin-valley relaxation. In this work, vanadyl phthalocyanine (VOPc) molecular layers are thermally evaporated on WSe$_2$ to explore the effect of molecular layer thickness on excited-state spin-valley polarization. The thinnest molecular layer supports an interfacial state which destroys the spin-valley polarization almost instantaneously, whereas a thicker molecular layer results in longer-lived spin-valley polarization than the WSe$_2$ monolayer alone. The mechanism appears to involve a tightly-bound species at the molecule/TMDC interface that strengthens exchange interactions and is largely avoided in thicker VOPc layers that isolate electrons from WSe$_2$ holes.
△ Less
Submitted 9 August, 2024; v1 submitted 10 October, 2023;
originally announced October 2023.
-
PaperCard for Reporting Machine Assistance in Academic Writing
Authors:
Won Ik Cho,
Eunjung Cho,
Kyunghyun Cho
Abstract:
Academic writing process has benefited from various technological developments over the years including search engines, automatic translators, and editing tools that review grammar and spelling mistakes. They have enabled human writers to become more efficient in writing academic papers, for example by helping with finding relevant literature more effectively and polishing texts. While these devel…
▽ More
Academic writing process has benefited from various technological developments over the years including search engines, automatic translators, and editing tools that review grammar and spelling mistakes. They have enabled human writers to become more efficient in writing academic papers, for example by helping with finding relevant literature more effectively and polishing texts. While these developments have so far played a relatively assistive role, recent advances in large-scale language models (LLMs) have enabled LLMs to play a more major role in the writing process, such as coming up with research questions and generating key contents. This raises critical questions surrounding the concept of authorship in academia. ChatGPT, a question-answering system released by OpenAI in November 2022, has demonstrated a range of capabilities that could be utilised in producing academic papers. The academic community will have to address relevant pressing questions, including whether Artificial Intelligence (AI) should be merited authorship if it made significant contributions in the writing process, or whether its use should be restricted such that human authorship would not be undermined. In this paper, we aim to address such questions, and propose a framework we name "PaperCard", a documentation for human authors to transparently declare the use of AI in their writing process.
△ Less
Submitted 7 October, 2023;
originally announced October 2023.
-
Impact of Human-AI Interaction on User Trust and Reliance in AI-Assisted Qualitative Coding
Authors:
Jie Gao,
Junming Cao,
ShunYi Yeo,
Kenny Tsu Wei Choo,
Zheng Zhang,
Toby Jia-Jun Li,
Shengdong Zhao,
Simon Tangi Perrault
Abstract:
While AI shows promise for enhancing the efficiency of qualitative analysis, the unique human-AI interaction resulting from varied coding strategies makes it challenging to develop a trustworthy AI-assisted qualitative coding system (AIQCs) that supports coding tasks effectively. We bridge this gap by exploring the impact of varying coding strategies on user trust and reliance on AI. We conducted…
▽ More
While AI shows promise for enhancing the efficiency of qualitative analysis, the unique human-AI interaction resulting from varied coding strategies makes it challenging to develop a trustworthy AI-assisted qualitative coding system (AIQCs) that supports coding tasks effectively. We bridge this gap by exploring the impact of varying coding strategies on user trust and reliance on AI. We conducted a mixed-methods split-plot 3x3 study, involving 30 participants, and a follow-up study with 6 participants, exploring varying text selection and code length in the use of our AIQCs system for qualitative analysis. Our results indicate that qualitative open coding should be conceptualized as a series of distinct subtasks, each with differing levels of complexity, and therefore, should be given tailored design considerations. We further observed a discrepancy between perceived and behavioral measures, and emphasized the potential challenges of under- and over-reliance on AIQCs systems. Additional design implications were also proposed for consideration.
△ Less
Submitted 24 September, 2023;
originally announced September 2023.
-
3SAT on an All-to-All-Connected CMOS Ising Solver Chip
Authors:
Hüsrev Cılasun,
Ziqing Zeng,
Ramprasath S,
Abhimanyu Kumar,
Hao Lo,
William Cho,
Chris H. Kim,
Ulya R. Karpuzcu,
Sachin S. Sapatnekar
Abstract:
This work solves 3SAT, a classical NP-complete problem, on a CMOS-based Ising hardware chip with all-to-all connectivity. The paper addresses practical issues in going from algorithms to hardware. It considers several degrees of freedom in mapping the 3SAT problem to the chip - using multiple Ising formulations for 3SAT; exploring multiple strategies for decomposing large problems into subproblems…
▽ More
This work solves 3SAT, a classical NP-complete problem, on a CMOS-based Ising hardware chip with all-to-all connectivity. The paper addresses practical issues in going from algorithms to hardware. It considers several degrees of freedom in mapping the 3SAT problem to the chip - using multiple Ising formulations for 3SAT; exploring multiple strategies for decomposing large problems into subproblems that can be accommodated on the Ising chip; and executing a sequence of these subproblems on CMOS hardware to obtain the solution to the larger problem. These are evaluated within a software framework, and the results are used to identify the most promising formulations and decomposition techniques. These best approaches are then mapped to the all-to-all hardware, and the performance of 3SAT is evaluated on the chip. Experimental data shows that the deployed decomposition and mapping strategies impact SAT solution quality: without our methods, the CMOS hardware cannot achieve 3SAT solutions on SATLIB benchmarks.
△ Less
Submitted 19 September, 2023;
originally announced September 2023.
-
Magnon gap excitations in van der Waals antiferromagnet MnPSe$_3$
Authors:
Dipankar Jana,
D. Vaclavkova,
I. Mohelsky,
P. Kapuscinski,
C. W. Cho,
I. Breslavetz,
M. Białek,
J. -Ph. Ansermet,
B. A. Piot,
M. Orlita,
C. Faugeras,
M. Potemski
Abstract:
Magneto-spectroscopy methods have been employed to study the zero-wavevector magnon excitations in MnPSe$_3$. Experiments carried out as a function of temperature and the applied magnetic field show that two low-energy magnon branches of MnPSe$_3$ in its antiferromagnetic phase are gapped. The observation of two low-energy magnon gaps (at 14 and 0.7 cm$^{-1}$) implies that MnPSe$_3$ is a biaxial a…
▽ More
Magneto-spectroscopy methods have been employed to study the zero-wavevector magnon excitations in MnPSe$_3$. Experiments carried out as a function of temperature and the applied magnetic field show that two low-energy magnon branches of MnPSe$_3$ in its antiferromagnetic phase are gapped. The observation of two low-energy magnon gaps (at 14 and 0.7 cm$^{-1}$) implies that MnPSe$_3$ is a biaxial antiferromagnet. A relatively strong out-of-plane anisotropy imposes the spin alignment to be in-plane whereas the spin directionality within the plane is governed by a factor of 2.5 $\times$ 10$^{-3}$ weaker in-plane anisotropy.
△ Less
Submitted 13 September, 2023;
originally announced September 2023.
-
NICE: CVPR 2023 Challenge on Zero-shot Image Captioning
Authors:
Taehoon Kim,
Pyunghwan Ahn,
Sangyun Kim,
Sihaeng Lee,
Mark Marsden,
Alessandra Sala,
Seung Hwan Kim,
Bohyung Han,
Kyoung Mu Lee,
Honglak Lee,
Kyounghoon Bae,
Xiangyu Wu,
Yi Gao,
Hailiang Zhang,
Yang Yang,
Weili Guo,
Jianfeng Lu,
Youngtaek Oh,
Jae Won Cho,
Dong-jin Kim,
In So Kweon,
Junmo Kim,
Wooyoung Kang,
Won Young Jhoo,
Byungseok Roh
, et al. (17 additional authors not shown)
Abstract:
In this report, we introduce NICE (New frontiers for zero-shot Image Captioning Evaluation) project and share the results and outcomes of 2023 challenge. This project is designed to challenge the computer vision community to develop robust image captioning models that advance the state-of-the-art both in terms of accuracy and fairness. Through the challenge, the image captioning models were tested…
▽ More
In this report, we introduce NICE (New frontiers for zero-shot Image Captioning Evaluation) project and share the results and outcomes of 2023 challenge. This project is designed to challenge the computer vision community to develop robust image captioning models that advance the state-of-the-art both in terms of accuracy and fairness. Through the challenge, the image captioning models were tested using a new evaluation dataset that includes a large variety of visual concepts from many domains. There was no specific training data provided for the challenge, and therefore the challenge entries were required to adapt to new types of image descriptions that had not been seen during training. This report includes information on the newly proposed NICE dataset, evaluation methods, challenge results, and technical details of top-ranking entries. We expect that the outcomes of the challenge will contribute to the improvement of AI models on various vision-language tasks.
△ Less
Submitted 10 September, 2023; v1 submitted 5 September, 2023;
originally announced September 2023.
-
Data-Based MHE for Agile Quadrotor Flight
Authors:
Wonoo Choo,
Erkan Kayacan
Abstract:
This paper develops a data-based moving horizon estimation (MHE) method for agile quadrotors. Accurate state estimation of the system is paramount for precise trajectory control for agile quadrotors; however, the high level of aerodynamic forces experienced by the quadrotors during high-speed flights make this task extremely challenging. These complex turbulent effects are difficult to model and t…
▽ More
This paper develops a data-based moving horizon estimation (MHE) method for agile quadrotors. Accurate state estimation of the system is paramount for precise trajectory control for agile quadrotors; however, the high level of aerodynamic forces experienced by the quadrotors during high-speed flights make this task extremely challenging. These complex turbulent effects are difficult to model and the unmodelled dynamics introduce inaccuracies in the state estimation. In this work, we propose a method to model these aerodynamic effects using Gaussian Processes which we integrate into the MHE to achieve efficient and accurate state estimation with minimal computational burden. Through extensive simulation and experimental studies, this method has demonstrated significant improvement in state estimation performance displaying superior robustness to poor state measurements.
△ Less
Submitted 31 July, 2023;
originally announced July 2023.
-
The Seoul National University AGN Monitoring Project IV: H$α$ reverberation mapping of 6 AGNs and the H$α$ Size-Luminosity Relation
Authors:
Hojin Cho,
Jong-Hak Woo,
Shu Wang,
Donghoon Son,
Jaejin Shin,
Suvendu Rakshit,
Aaron J. Barth,
Vardha N. Bennert,
Elena Gallo,
Edmund Hodges-Kluck,
Tommaso Treu,
Hyun-Jin Bae,
Wanjin Cho,
Adi Foord,
Jaehyuk Geum,
Yashashree Jadhav,
Yiseul Jeon,
Kyle M. Kabasares,
Daeun Kang,
Wonseok Kang,
Changseok Kim,
Donghwa Kim,
Minjin Kim,
Taewoo Kim,
Huynh Anh N. Le
, et al. (7 additional authors not shown)
Abstract:
The broad line region (BLR) size-luminosity relation has paramount importance for estimating the mass of black holes in active galactic nuclei (AGNs). Traditionally, the size of the H$β$ BLR is often estimated from the optical continuum luminosity at 5100\angstrom{} , while the size of the H$α$ BLR and its correlation with the luminosity is much less constrained. As a part of the Seoul National Un…
▽ More
The broad line region (BLR) size-luminosity relation has paramount importance for estimating the mass of black holes in active galactic nuclei (AGNs). Traditionally, the size of the H$β$ BLR is often estimated from the optical continuum luminosity at 5100\angstrom{} , while the size of the H$α$ BLR and its correlation with the luminosity is much less constrained. As a part of the Seoul National University AGN Monitoring Project (SAMP) which provides six-year photometric and spectroscopic monitoring data, we present our measurements of the H$α$ lags of 6 high-luminosity AGNs. Combined with the measurements for 42 AGNs from the literature, we derive the size-luminosity relations of H$α$ BLR against broad H$α$ and 5100\angstrom{} continuum luminosities. We find the slope of the relations to be $0.61\pm0.04$ and $0.59\pm0.04$, respectively, which are consistent with the \hb{} size-luminosity relation. Moreover, we find a linear relation between the 5100\angstrom{} continuum luminosity and the broad H$α$ luminosity across 7 orders of magnitude. Using these results, we propose a new virial mass estimator based on the H$α$ broad emission line, finding that the previous mass estimates based on the scaling relations in the literature are overestimated by up to 0.7 dex at masses lower than $10^7$~M$_{\odot}$.
△ Less
Submitted 29 June, 2023;
originally announced June 2023.
-
Evaluating GPT-3 Generated Explanations for Hateful Content Moderation
Authors:
Han Wang,
Ming Shan Hee,
Md Rabiul Awal,
Kenny Tsu Wei Choo,
Roy Ka-Wei Lee
Abstract:
Recent research has focused on using large language models (LLMs) to generate explanations for hate speech through fine-tuning or prompting. Despite the growing interest in this area, these generated explanations' effectiveness and potential limitations remain poorly understood. A key concern is that these explanations, generated by LLMs, may lead to erroneous judgments about the nature of flagged…
▽ More
Recent research has focused on using large language models (LLMs) to generate explanations for hate speech through fine-tuning or prompting. Despite the growing interest in this area, these generated explanations' effectiveness and potential limitations remain poorly understood. A key concern is that these explanations, generated by LLMs, may lead to erroneous judgments about the nature of flagged content by both users and content moderators. For instance, an LLM-generated explanation might inaccurately convince a content moderator that a benign piece of content is hateful. In light of this, we propose an analytical framework for examining hate speech explanations and conducted an extensive survey on evaluating such explanations. Specifically, we prompted GPT-3 to generate explanations for both hateful and non-hateful content, and a survey was conducted with 2,400 unique respondents to evaluate the generated explanations. Our findings reveal that (1) human evaluators rated the GPT-generated explanations as high quality in terms of linguistic fluency, informativeness, persuasiveness, and logical soundness, (2) the persuasive nature of these explanations, however, varied depending on the prompting strategy employed, and (3) this persuasiveness may result in incorrect judgments about the hatefulness of the content. Our study underscores the need for caution in applying LLM-generated explanations for content moderation. Code and results are available at https://github.com/Social-AI-Studio/GPT3-HateEval.
△ Less
Submitted 30 August, 2023; v1 submitted 28 May, 2023;
originally announced May 2023.
-
Computationally Efficient Data-Driven MPC for Agile Quadrotor Flight
Authors:
Wonoo Choo,
Erkan Kayacan
Abstract:
This paper develops computationally efficient data-driven model predictive control (MPC) for Agile quadrotor flight. Agile quadrotors in high-speed flights can experience high levels of aerodynamic effects. Modeling these turbulent aerodynamic effects is a cumbersome task and the resulting model may be overly complex and computationally infeasible. Combining Gaussian Process (GP) regression models…
▽ More
This paper develops computationally efficient data-driven model predictive control (MPC) for Agile quadrotor flight. Agile quadrotors in high-speed flights can experience high levels of aerodynamic effects. Modeling these turbulent aerodynamic effects is a cumbersome task and the resulting model may be overly complex and computationally infeasible. Combining Gaussian Process (GP) regression models with a simple dynamic model of the system has demonstrated significant improvements in control performance. However, direct integration of the GP models to the MPC pipeline poses a significant computational burden to the optimization process. Therefore, we present an approach to separate the GP models to the MPC pipeline by computing the model corrections using reference trajectory and the current state measurements prior to the online MPC optimization. This method has been validated in the Gazebo simulation environment and has demonstrated of up to $50\%$ reduction in trajectory tracking error, matching the performance of the direct GP integration method with improved computational efficiency.
△ Less
Submitted 26 May, 2023;
originally announced May 2023.
-
Patch-Mix Contrastive Learning with Audio Spectrogram Transformer on Respiratory Sound Classification
Authors:
Sangmin Bae,
June-Woo Kim,
Won-Yang Cho,
Hyerim Baek,
Soyoun Son,
Byungjo Lee,
Changwan Ha,
Kyongpil Tae,
Sungnyun Kim,
Se-Young Yun
Abstract:
Respiratory sound contains crucial information for the early diagnosis of fatal lung diseases. Since the COVID-19 pandemic, there has been a growing interest in contact-free medical care based on electronic stethoscopes. To this end, cutting-edge deep learning models have been developed to diagnose lung diseases; however, it is still challenging due to the scarcity of medical data. In this study,…
▽ More
Respiratory sound contains crucial information for the early diagnosis of fatal lung diseases. Since the COVID-19 pandemic, there has been a growing interest in contact-free medical care based on electronic stethoscopes. To this end, cutting-edge deep learning models have been developed to diagnose lung diseases; however, it is still challenging due to the scarcity of medical data. In this study, we demonstrate that the pretrained model on large-scale visual and audio datasets can be generalized to the respiratory sound classification task. In addition, we introduce a straightforward Patch-Mix augmentation, which randomly mixes patches between different samples, with Audio Spectrogram Transformer (AST). We further propose a novel and effective Patch-Mix Contrastive Learning to distinguish the mixed representations in the latent space. Our method achieves state-of-the-art performance on the ICBHI dataset, outperforming the prior leading score by an improvement of 4.08%.
△ Less
Submitted 22 November, 2023; v1 submitted 23 May, 2023;
originally announced May 2023.
-
CoAIcoder: Examining the Effectiveness of AI-assisted Human-to-Human Collaboration in Qualitative Analysis
Authors:
Jie Gao,
Kenny Tsu Wei Choo,
Junming Cao,
Roy Ka Wei Lee,
Simon Perrault
Abstract:
While AI-assisted individual qualitative analysis has been substantially studied, AI-assisted collaborative qualitative analysis (CQA)-a process that involves multiple researchers working together to interpret data-remains relatively unexplored. After identifying CQA practices and design opportunities through formative interviews, we designed and implemented CoAIcoder, a tool leveraging AI to enha…
▽ More
While AI-assisted individual qualitative analysis has been substantially studied, AI-assisted collaborative qualitative analysis (CQA)-a process that involves multiple researchers working together to interpret data-remains relatively unexplored. After identifying CQA practices and design opportunities through formative interviews, we designed and implemented CoAIcoder, a tool leveraging AI to enhance human-to-human collaboration within CQA through four distinct collaboration methods. With a between-subject design, we evaluated CoAIcoder with 32 pairs of CQA-trained participants across common CQA phases under each collaboration method. Our findings suggest that while using a shared AI model as a mediator among coders could improve CQA efficiency and foster agreement more quickly in the early coding stage, it might affect the final code diversity. We also emphasize the need to consider the independence level when using AI to assist human-to-human collaboration in various CQA scenarios. Lastly, we suggest design implications for future AI-assisted CQA systems.
△ Less
Submitted 24 July, 2023; v1 submitted 11 April, 2023;
originally announced April 2023.