-
On the training and generalization of deep operator networks
Authors:
Sanghyun Lee,
Yeonjong Shin
Abstract:
We present a novel training method for deep operator networks (DeepONets), one of the most popular neural network models for operators. DeepONets are constructed by two sub-networks, namely the branch and trunk networks. Typically, the two sub-networks are trained simultaneously, which amounts to solving a complex optimization problem in a high dimensional space. In addition, the nonconvex and non…
▽ More
We present a novel training method for deep operator networks (DeepONets), one of the most popular neural network models for operators. DeepONets are constructed by two sub-networks, namely the branch and trunk networks. Typically, the two sub-networks are trained simultaneously, which amounts to solving a complex optimization problem in a high dimensional space. In addition, the nonconvex and nonlinear nature makes training very challenging. To tackle such a challenge, we propose a two-step training method that trains the trunk network first and then sequentially trains the branch network. The core mechanism is motivated by the divide-and-conquer paradigm and is the decomposition of the entire complex training task into two subtasks with reduced complexity. Therein the Gram-Schmidt orthonormalization process is introduced which significantly improves stability and generalization ability. On the theoretical side, we establish a generalization error estimate in terms of the number of training data, the width of DeepONets, and the number of input and output sensors. Numerical examples are presented to demonstrate the effectiveness of the two-step training method, including Darcy flow in heterogeneous porous media.
△ Less
Submitted 2 September, 2023;
originally announced September 2023.
-
SGMM: Stochastic Approximation to Generalized Method of Moments
Authors:
Xiaohong Chen,
Sokbae Lee,
Yuan Liao,
Myung Hwan Seo,
Youngki Shin,
Myunghyun Song
Abstract:
We introduce a new class of algorithms, Stochastic Generalized Method of Moments (SGMM), for estimation and inference on (overidentified) moment restriction models. Our SGMM is a novel stochastic approximation alternative to the popular Hansen (1982) (offline) GMM, and offers fast and scalable implementation with the ability to handle streaming datasets in real time. We establish the almost sure c…
▽ More
We introduce a new class of algorithms, Stochastic Generalized Method of Moments (SGMM), for estimation and inference on (overidentified) moment restriction models. Our SGMM is a novel stochastic approximation alternative to the popular Hansen (1982) (offline) GMM, and offers fast and scalable implementation with the ability to handle streaming datasets in real time. We establish the almost sure convergence, and the (functional) central limit theorem for the inefficient online 2SLS and the efficient SGMM. Moreover, we propose online versions of the Durbin-Wu-Hausman and Sargan-Hansen tests that can be seamlessly integrated within the SGMM framework. Extensive Monte Carlo simulations show that as the sample size increases, the SGMM matches the standard (offline) GMM in terms of estimation accuracy and gains over computational efficiency, indicating its practical value for both large-scale and online datasets. We demonstrate the efficacy of our approach by a proof of concept using two well known empirical examples with large sample sizes.
△ Less
Submitted 30 October, 2023; v1 submitted 24 August, 2023;
originally announced August 2023.
-
Camera-Driven Representation Learning for Unsupervised Domain Adaptive Person Re-identification
Authors:
Geon Lee,
Sanghoon Lee,
Dohyung Kim,
Younghoon Shin,
Yongsang Yoon,
Bumsub Ham
Abstract:
We present a novel unsupervised domain adaption method for person re-identification (reID) that generalizes a model trained on a labeled source domain to an unlabeled target domain. We introduce a camera-driven curriculum learning (CaCL) framework that leverages camera labels of person images to transfer knowledge from source to target domains progressively. To this end, we divide target domain da…
▽ More
We present a novel unsupervised domain adaption method for person re-identification (reID) that generalizes a model trained on a labeled source domain to an unlabeled target domain. We introduce a camera-driven curriculum learning (CaCL) framework that leverages camera labels of person images to transfer knowledge from source to target domains progressively. To this end, we divide target domain dataset into multiple subsets based on the camera labels, and initially train our model with a single subset (i.e., images captured by a single camera). We then gradually exploit more subsets for training, according to a curriculum sequence obtained with a camera-driven scheduling rule. The scheduler considers maximum mean discrepancies (MMD) between each subset and the source domain dataset, such that the subset closer to the source domain is exploited earlier within the curriculum. For each curriculum sequence, we generate pseudo labels of person images in a target domain to train a reID model in a supervised way. We have observed that the pseudo labels are highly biased toward cameras, suggesting that person images obtained from the same camera are likely to have the same pseudo labels, even for different IDs. To address the camera bias problem, we also introduce a camera-diversity (CD) loss encouraging person images of the same pseudo label, but captured across various cameras, to involve more for discriminative feature learning, providing person representations robust to inter-camera variations. Experimental results on standard benchmarks, including real-to-real and synthetic-to-real scenarios, demonstrate the effectiveness of our framework.
△ Less
Submitted 23 August, 2023;
originally announced August 2023.
-
Vortex detection in atomic Bose-Einstein condensates using neural networks trained on synthetic images
Authors:
Myeonghyeon Kim,
Junhwan Kwon,
Tenzin Rabga,
Yong-il Shin
Abstract:
Quantum vortices in atomic Bose-Einstein condensates (BECs) are topological defects characterized by quantized circulation of particles around them. In experimental studies, vortices are commonly detected by time-of-flight imaging, where their density-depleted cores are enlarged. In this work, we describe a machine learning-based method for detecting vortices in experimental BEC images, particular…
▽ More
Quantum vortices in atomic Bose-Einstein condensates (BECs) are topological defects characterized by quantized circulation of particles around them. In experimental studies, vortices are commonly detected by time-of-flight imaging, where their density-depleted cores are enlarged. In this work, we describe a machine learning-based method for detecting vortices in experimental BEC images, particularly focusing on turbulent condensates containing irregularly distributed vortices. Our approach employs a convolutional neural network (CNN) trained solely on synthetic simulated images, eliminating the need for manual labeling of the vortex positions as ground truth. We find that the CNN achieves accurate vortex detection in real experimental images, thereby facilitating analysis of large experimental datasets without being constrained by specific experimental conditions. This novel approach represents a significant advancement in studying quantum vortex dynamics and streamlines the analysis process in the investigation of turbulent BECs.
△ Less
Submitted 30 October, 2023; v1 submitted 16 August, 2023;
originally announced August 2023.
-
Random spin textures in turbulent spinor Bose-Einstein condensates
Authors:
Jong Heum Jung,
Junghoon Lee,
Jongmin Kim,
Yong-il Shin
Abstract:
We numerically investigate the stationary turbulent states of spin-1 Bose-Einstein condensates under continuous spin driving. We analyze the entanglement entropy and magnetization correlation function to demonstrate the isotropic nature of the intricate spin texture that is generated in the nonequilibrium steady state. We observe a $-7/3$ power-law behavior in the spin-dependent interaction energy…
▽ More
We numerically investigate the stationary turbulent states of spin-1 Bose-Einstein condensates under continuous spin driving. We analyze the entanglement entropy and magnetization correlation function to demonstrate the isotropic nature of the intricate spin texture that is generated in the nonequilibrium steady state. We observe a $-7/3$ power-law behavior in the spin-dependent interaction energy spectrum. To gain further insight into the statistical properties of the spin texture, we introduce a spin state ensemble obtained through position projection, revealing its close resemblance to the Haar random ensemble for spin-1 systems. We also present the probability distribution of the spin vector magnitude in the turbulent condensate, which can be tested in experiments. Our numerical study highlights the characteristics of stationary turbulence in the spinor BEC system and confirms previous experimental findings by Hong et al. [Phys. Rev. A 108, 013318 (2023)].
△ Less
Submitted 3 August, 2023;
originally announced August 2023.
-
Temperature Dependence of the Optical Transition Characteristics of MAPbClBr Single Crystals
Authors:
D. Y. Park,
Y. H. Shin,
Yongmin Kim
Abstract:
Methylammonium-lead-halide compounds have emerged as promising bandgap engineering materials due to their ability to fine-tune the energy gap through halogen element mixing. We present a comprehensive investigation of the temperature-dependent photoluminescence (PL) transition characteristics exhibited by single crystals of chlorine and bromine-based methylammonium lead halides. MAPbCl3 and MAPbBr…
▽ More
Methylammonium-lead-halide compounds have emerged as promising bandgap engineering materials due to their ability to fine-tune the energy gap through halogen element mixing. We present a comprehensive investigation of the temperature-dependent photoluminescence (PL) transition characteristics exhibited by single crystals of chlorine and bromine-based methylammonium lead halides. MAPbCl3 and MAPbBr3 crystals exhibit a distinct sharp free exciton transition with an abrupt transition behavior associated with the structural phase transition as the temperature varies. However, when the two halogen elements are mixed within the crystals, no structural phase transition is observed. This study explores the temperature-dependent variations in integrated PL intensity, full-width-half-maximum, and peak transition energy of the crystals. The obtained results discuss the intricate interplay between temperature, crystal structure, and composition, providing valuable insights into the optical properties and potential applications of organic-inorganic hybrid methyl-ammonium lead halide single crystals as tunable energy gap semiconductor materials.
△ Less
Submitted 12 July, 2023;
originally announced July 2023.
-
The Waldschmidt constant of a standard $\Bbbk$-configuration in $\mathbb P^2$
Authors:
Maria Virginia Catalisano,
Giuseppe Favacchio,
Elena Guardo,
Yong-Su Shin
Abstract:
A $\Bbbk$-configuration of type $(d_1,\dots,d_s)$ is a specific set of points in $\mathbb P^2$ that has a number of algebraic and geometric properties. For example, the graded Betti numbers and Hilbert functions of all $\Bbbk$-configurations in $\mathbb P^2$ are determined by the type $(d_1,\dots,d_s)$. However the Waldschmidt constant of a $\Bbbk$-configuration in $\mathbb P^2$ of the same type m…
▽ More
A $\Bbbk$-configuration of type $(d_1,\dots,d_s)$ is a specific set of points in $\mathbb P^2$ that has a number of algebraic and geometric properties. For example, the graded Betti numbers and Hilbert functions of all $\Bbbk$-configurations in $\mathbb P^2$ are determined by the type $(d_1,\dots,d_s)$. However the Waldschmidt constant of a $\Bbbk$-configuration in $\mathbb P^2$ of the same type may vary. In this paper, we find that the Waldschmidt constant of a $\Bbbk$-configuration in $\mathbb P^2$ of type $(d_1,\dots,d_s)$ with $d_1\ge s\ge 1$ is $s$. We also find the Waldschmidt constant of a standard $\Bbbk$-configuration in $\mathbb P^2$ of type $(a,b,c)$ with $a\ge 1$ except the type $(2,3,5)$. In particular, we prove that the Waldschmidt constant of a standard $\Bbbk$-configuration in $\mathbb P^2$ of type $(1,b,c)$ with $c\ge 2b+2$ does not depend on $c$.
△ Less
Submitted 27 July, 2023; v1 submitted 12 July, 2023;
originally announced July 2023.
-
Improving Segmentation and Detection of Lesions in CT Scans Using Intensity Distribution Supervision
Authors:
Seung Yeon Shin,
Thomas C. Shen,
Ronald M. Summers
Abstract:
We propose a method to incorporate the intensity information of a target lesion on CT scans in training segmentation and detection networks. We first build an intensity-based lesion probability (ILP) function from an intensity histogram of the target lesion. It is used to compute the probability of being the lesion for each voxel based on its intensity. Finally, the computed ILP map of each input…
▽ More
We propose a method to incorporate the intensity information of a target lesion on CT scans in training segmentation and detection networks. We first build an intensity-based lesion probability (ILP) function from an intensity histogram of the target lesion. It is used to compute the probability of being the lesion for each voxel based on its intensity. Finally, the computed ILP map of each input CT scan is provided as additional supervision for network training, which aims to inform the network about possible lesion locations in terms of intensity values at no additional labeling cost. The method was applied to improve the segmentation of three different lesion types, namely, small bowel carcinoid tumor, kidney tumor, and lung nodule. The effectiveness of the proposed method on a detection task was also investigated. We observed improvements of 41.3% -> 47.8%, 74.2% -> 76.0%, and 26.4% -> 32.7% in segmenting small bowel carcinoid tumor, kidney tumor, and lung nodule, respectively, in terms of per case Dice scores. An improvement of 64.6% -> 75.5% was achieved in detecting kidney tumors in terms of average precision. The results of different usages of the ILP map and the effect of varied amount of training data are also presented.
△ Less
Submitted 11 July, 2023;
originally announced July 2023.
-
Tunable ferroelectricity in oxygen-deficient perovskites
Authors:
Yongjin Shin,
Giulia Galli
Abstract:
Using first-principles calculations, we predict that tunable ferroelectricity can be realized in oxide perovskites with the Grenier structure and ordered oxygen vacancies. Specifically, we show that $R_{1/3}A_{2/3}\mathrm{FeO}_{2.67}$ solids (where $R$ is a rare-earth ion and $A$ an alkaline-earth cation) exhibit stable polar phases, with a spontaneous polarization tunable by an appropriate choice…
▽ More
Using first-principles calculations, we predict that tunable ferroelectricity can be realized in oxide perovskites with the Grenier structure and ordered oxygen vacancies. Specifically, we show that $R_{1/3}A_{2/3}\mathrm{FeO}_{2.67}$ solids (where $R$ is a rare-earth ion and $A$ an alkaline-earth cation) exhibit stable polar phases, with a spontaneous polarization tunable by an appropriate choice of $R$ and $A$. We find that larger cations combined with small $R$ elements lead to a maximum in the polarization and to a minimum in the energy barriers required to switch the sign of the polarization. Ferroelectricity arises from cooperative distortions of octahedral and tetrahedral units, where a combination of rotational and sliding modes controls the emergence of polarization within three-dimensional connected layers. Our results indicate that polar Grenier phases of oxide perovskites are promising materials for microelectronic applications and, in general, for the study of phenomena emerging from breaking inversion symmetry in solids.
△ Less
Submitted 10 July, 2023;
originally announced July 2023.
-
Variations of the Kibble-Zurek scaling exponents of trapped Bose gases
Authors:
Tenzin Rabga,
Yangheon Lee,
Dalmin Bae,
Myeonghyeon Kim,
Yong-il Shin
Abstract:
We study the vortex nucleation dynamics in inhomogeneous atomic Bose gases quenched into a superfluid phase and investigate the dependence of the Kibble-Zurek (KZ) scaling exponent on the underlying trap configuration. For samples in a number of different inhomogeneous traps, we observe the characteristic power-law scaling of the vortex number with the thermal quench rate, as well as an enhanced v…
▽ More
We study the vortex nucleation dynamics in inhomogeneous atomic Bose gases quenched into a superfluid phase and investigate the dependence of the Kibble-Zurek (KZ) scaling exponent on the underlying trap configuration. For samples in a number of different inhomogeneous traps, we observe the characteristic power-law scaling of the vortex number with the thermal quench rate, as well as an enhanced vortex suppression in the outer regions with lower particle density, in agreement with the causality effect as encapsulated in the inhomogeneous Kibble-Zurek mechanism (IKZM). However, the measured KZ scaling exponents show significant differences from the theoretical estimates, and furthermore their trends as a function of the underlying trap configuration deviate from the IKZM prediction. We also investigate the early-time coarsening effect using a two-step quench protocol as proposed in a recent study and show that the interpretation of the measurement results without including the causality effect might be misleading. This paper provides a comprehensive study of vortex formation dynamics in quenched Bose gases confined in inhomogeneous trapping potentials and calls for a refined theoretical framework for quantitative understanding of the phase transition and defect formation processes in such inhomogeneous systems.
△ Less
Submitted 28 November, 2023; v1 submitted 30 May, 2023;
originally announced May 2023.
-
3DTeethSeg'22: 3D Teeth Scan Segmentation and Labeling Challenge
Authors:
Achraf Ben-Hamadou,
Oussama Smaoui,
Ahmed Rekik,
Sergi Pujades,
Edmond Boyer,
Hoyeon Lim,
Minchang Kim,
Minkyung Lee,
Minyoung Chung,
Yeong-Gil Shin,
Mathieu Leclercq,
Lucia Cevidanes,
Juan Carlos Prieto,
Shaojie Zhuang,
Guangshun Wei,
Zhiming Cui,
Yuanfeng Zhou,
Tudor Dascalu,
Bulat Ibragimov,
Tae-Hoon Yong,
Hong-Gi Ahn,
Wan Kim,
Jae-Hwan Han,
Byungsun Choi,
Niels van Nistelrooij
, et al. (7 additional authors not shown)
Abstract:
Teeth localization, segmentation, and labeling from intra-oral 3D scans are essential tasks in modern dentistry to enhance dental diagnostics, treatment planning, and population-based studies on oral health. However, developing automated algorithms for teeth analysis presents significant challenges due to variations in dental anatomy, imaging protocols, and limited availability of publicly accessi…
▽ More
Teeth localization, segmentation, and labeling from intra-oral 3D scans are essential tasks in modern dentistry to enhance dental diagnostics, treatment planning, and population-based studies on oral health. However, developing automated algorithms for teeth analysis presents significant challenges due to variations in dental anatomy, imaging protocols, and limited availability of publicly accessible data. To address these challenges, the 3DTeethSeg'22 challenge was organized in conjunction with the International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI) in 2022, with a call for algorithms tackling teeth localization, segmentation, and labeling from intraoral 3D scans. A dataset comprising a total of 1800 scans from 900 patients was prepared, and each tooth was individually annotated by a human-machine hybrid algorithm. A total of 6 algorithms were evaluated on this dataset. In this study, we present the evaluation results of the 3DTeethSeg'22 challenge. The 3DTeethSeg'22 challenge code can be accessed at: https://github.com/abenhamadou/3DTeethSeg22_challenge
△ Less
Submitted 29 May, 2023;
originally announced May 2023.
-
NASA's Cold Atom Laboratory: Four Years of Quantum Science Operations in Space
Authors:
Kamal Oudrhiri,
James M. Kohel,
Nate Harvey,
James R. Kellogg,
David C. Aveline,
Roy L. Butler,
Javier Bosch-Lluis,
John L. Callas,
Leo Y. Cheng,
Arvid P. Croonquist,
Walker L. Dula,
Ethan R. Elliott,
Jose E. Fernandez,
Jorge Gonzales,
Raymond J. Higuera,
Shahram Javidnia,
Sandy M. Kwan,
Norman E. Lay,
Dennis K. Lee,
Irena Li,
Gregory J. Miles,
Michael T. Pauken,
Kelly L. Perry,
Leah E. Phillips,
Diane C. Malarik
, et al. (14 additional authors not shown)
Abstract:
The Cold Atom Laboratory (CAL) is a quantum facility for studying ultra-cold gases in the microgravity environment of the International Space Station. It enables research in a temperature regime and force-free environment inaccessible to terrestrial laboratories. In the microgravity environment, observation times over a few seconds and temperatures below 100 pK are achievable, unlocking the potent…
▽ More
The Cold Atom Laboratory (CAL) is a quantum facility for studying ultra-cold gases in the microgravity environment of the International Space Station. It enables research in a temperature regime and force-free environment inaccessible to terrestrial laboratories. In the microgravity environment, observation times over a few seconds and temperatures below 100 pK are achievable, unlocking the potential to observe new quantum phenomena. CAL launched to the International Space Station in May 2018 and has been operating since then as the world's first multi-user facility for studying ultra\-cold atoms in space. CAL is the first quantum science facility to produce the fifth state of matter called a Bose-Einstein condensate with rubidium-87 and potassium-41 in Earth orbit. We will give an overview of CAL's operational setup, outline its contributions to date, present planned upgrades for the next few years, and consider design choices for microgravity BEC successor-mission planning.
△ Less
Submitted 22 May, 2023;
originally announced May 2023.
-
Automated Smell Detection and Recommendation in Natural Language Requirements
Authors:
Alvaro Veizaga,
Seung Yeob Shin,
Lionel C. Briand
Abstract:
Requirement specifications are typically written in natural language (NL) due to its usability across multiple domains and understandability by all stakeholders. However, unstructured NL is prone to quality problems (e.g., ambiguity) when writing requirements, which can result in project failures. To address this issue, we present a tool, named Paska, that takes as input any NL requirements, autom…
▽ More
Requirement specifications are typically written in natural language (NL) due to its usability across multiple domains and understandability by all stakeholders. However, unstructured NL is prone to quality problems (e.g., ambiguity) when writing requirements, which can result in project failures. To address this issue, we present a tool, named Paska, that takes as input any NL requirements, automatically detects quality problems as smells in the requirements, and offers recommendations to improve their quality. Our approach relies on natural language processing (NLP) techniques and a state-of-the-art controlled natural language (CNL) for requirements (Rimay), to detect smells and suggest recommendations using patterns defined in Rimay to improve requirement quality. We evaluated Paska through an industrial case study in the financial domain involving 13 systems and 2725 annotated requirements. The results show that our tool is accurate in detecting smells (89% precision and recall) and suggesting appropriate Rimay pattern recommendations (96% precision and 94% recall).
△ Less
Submitted 25 November, 2023; v1 submitted 11 May, 2023;
originally announced May 2023.
-
What can a GNOME do? Search targets for the Global Network of Optical Magnetometers for Exotic physics searches
Authors:
S. Afach,
D. Aybas Tumturk,
H. Bekker,
B. C. Buchler,
D. Budker,
K. Cervantes,
A. Derevianko,
J. Eby,
N. L. Figueroa,
R. Folman,
D. Gavil'an Martin,
M. Givon,
Z. D. Grujic,
H. Guo,
P. Hamilton,
M. P. Hedges,
D. F. Jackson Kimball,
S. Khamis,
D. Kim,
E. Klinger,
A. Kryemadhi,
X. Liu,
G. Lukasiewicz,
H. Masia-Roig,
M. Padniuk
, et al. (28 additional authors not shown)
Abstract:
Numerous observations suggest that there exist undiscovered beyond-the-Standard-Model particles and fields. Because of their unknown nature, these exotic particles and fields could interact with Standard Model particles in many different ways and assume a variety of possible configurations. Here we present an overview of the Global Network of Optical Magnetometers for Exotic physics searches (GNOM…
▽ More
Numerous observations suggest that there exist undiscovered beyond-the-Standard-Model particles and fields. Because of their unknown nature, these exotic particles and fields could interact with Standard Model particles in many different ways and assume a variety of possible configurations. Here we present an overview of the Global Network of Optical Magnetometers for Exotic physics searches (GNOME), our ongoing experimental program designed to test a wide range of exotic physics scenarios. The GNOME experiment utilizes a worldwide network of shielded atomic magnetometers (and, more recently, comagnetometers) to search for spatially and temporally correlated signals due to torques on atomic spins from exotic fields of astrophysical origin. We survey the temporal characteristics of a variety of possible signals currently under investigation such as those from topological defect dark matter (axion-like particle domain walls), axion-like particle stars, solitons of complex-valued scalar fields (Q-balls), stochastic fluctuations of bosonic dark matter fields, a solar axion-like particle halo, and bursts of ultralight bosonic fields produced by cataclysmic astrophysical events such as binary black hole mergers.
△ Less
Submitted 4 May, 2023; v1 submitted 2 May, 2023;
originally announced May 2023.
-
Development of an eReaxFF Force Field for BZY20 Solid Oxide Electrocatalysis
Authors:
Md Jamil Hossain,
Prashik Gaikwad,
Yun Kyung Shin,
Jessica Schulze,
Kate Penrod,
Meng Li,
Yuxiao Lin,
Gorakh Pawar,
Adri C. T. van Duin
Abstract:
Electrocatalysis is a catalytic process where the rate of an electrochemical reaction occurring at the electrode-electrolyte interface can be controlled by varying the electrical potential. Electrocatalysis can be applied to generate hydrogen which can be stored for future use in fuel cells for clean electricity. The use of solid oxide in electrocatalysis specially in hydrogen evolution reaction i…
▽ More
Electrocatalysis is a catalytic process where the rate of an electrochemical reaction occurring at the electrode-electrolyte interface can be controlled by varying the electrical potential. Electrocatalysis can be applied to generate hydrogen which can be stored for future use in fuel cells for clean electricity. The use of solid oxide in electrocatalysis specially in hydrogen evolution reaction is promising. However, further improvements are essential in order to meet the ever-increasing global energy demand. Improvement of the performance of these high energy chemical systems is directly linked to the understanding and improving the complex physical and chemical phenomena and exchanges that take place at their different interfaces. To enable large length and time scale atomistic simulations of solid oxide electrocatalysis for hydrogen generation, we developed an eReaxFF force field for barium zirconate doped with 20 mol% of yttrium (BZY20). All parameters for the eReaxFF were optimized to reproduce quantum mechanical (QM) calculations on relevant condensed phase and cluster systems describing oxygen vacancies, vacancy migrations, water adsorption, water splitting and hydrogen generation on the surfaces of the BZY20 solid oxide. Using the developed force field, we performed zero-voltage molecular dynamics simulations to observe water adsorption and the eventual hydrogen production. Based on our simulation results, we conclude that this force field sets a stage for the introduction of explicit electron concept in order to simulate electron conductivity, electron leakage and non-zero-voltage effects on hydrogen generation. Overall, we demonstrate how atomistic-scale simulations can enhance our understanding of processes at interfaces in solid oxide materials.
△ Less
Submitted 24 April, 2023;
originally announced April 2023.
-
Half-quantum vortex generation in a two-component Bose-Einstein condensate by an oscillatory magnetic obstacle
Authors:
Jong Heum Jung,
Yong-il Shin
Abstract:
We numerically investigate the dynamics of vortex generation in a two-dimensional, twocomponent Bose-Einstein condensate subjected to an oscillatory magnetic obstacle. The obstacle creates both repulsive and attractive Gaussian potentials for the two symmetric spin-$\uparrow$ and $\downarrow$ components, respectively. We demonstrate that, as the oscillating frequency f increases, two distinct crit…
▽ More
We numerically investigate the dynamics of vortex generation in a two-dimensional, twocomponent Bose-Einstein condensate subjected to an oscillatory magnetic obstacle. The obstacle creates both repulsive and attractive Gaussian potentials for the two symmetric spin-$\uparrow$ and $\downarrow$ components, respectively. We demonstrate that, as the oscillating frequency f increases, two distinct critical dynamics arise in the generation of half-quantum vortices (HQVs) with different spin circulations. Spin-$\uparrow$ vortices are nucleated directly from the moving obstacle at low f, while spin-$\downarrow$ vortices are created at high f by breaking a spin wave pulse in front of the obstacle. We find that vortex generation is suppressed for sufficiently weak obstacles, in agreement with recent experimental results by Kim et al. [Phys. Rev. Lett. 127, 095302 (2021)]. This suppression is caused by the finite sweeping distance of the oscillating obstacle and the reduction in friction in a supersonic regime. Finally, we show that the characteristic length scale of the HQV generation dynamics is determined by the spin healing length of the system.
△ Less
Submitted 23 April, 2023;
originally announced April 2023.
-
MESAHA-Net: Multi-Encoders based Self-Adaptive Hard Attention Network with Maximum Intensity Projections for Lung Nodule Segmentation in CT Scan
Authors:
Muhammad Usman,
Azka Rehman,
Abdullah Shahid,
Siddique Latif,
Shi Sub Byon,
Sung Hyun Kim,
Tariq Mahmood Khan,
Yeong Gil Shin
Abstract:
Accurate lung nodule segmentation is crucial for early-stage lung cancer diagnosis, as it can substantially enhance patient survival rates. Computed tomography (CT) images are widely employed for early diagnosis in lung nodule analysis. However, the heterogeneity of lung nodules, size diversity, and the complexity of the surrounding environment pose challenges for developing robust nodule segmenta…
▽ More
Accurate lung nodule segmentation is crucial for early-stage lung cancer diagnosis, as it can substantially enhance patient survival rates. Computed tomography (CT) images are widely employed for early diagnosis in lung nodule analysis. However, the heterogeneity of lung nodules, size diversity, and the complexity of the surrounding environment pose challenges for developing robust nodule segmentation methods. In this study, we propose an efficient end-to-end framework, the multi-encoder-based self-adaptive hard attention network (MESAHA-Net), for precise lung nodule segmentation in CT scans. MESAHA-Net comprises three encoding paths, an attention block, and a decoder block, facilitating the integration of three types of inputs: CT slice patches, forward and backward maximum intensity projection (MIP) images, and region of interest (ROI) masks encompassing the nodule. By employing a novel adaptive hard attention mechanism, MESAHA-Net iteratively performs slice-by-slice 2D segmentation of lung nodules, focusing on the nodule region in each slice to generate 3D volumetric segmentation of lung nodules. The proposed framework has been comprehensively evaluated on the LIDC-IDRI dataset, the largest publicly available dataset for lung nodule segmentation. The results demonstrate that our approach is highly robust for various lung nodule types, outperforming previous state-of-the-art techniques in terms of segmentation accuracy and computational complexity, rendering it suitable for real-time clinical implementation.
△ Less
Submitted 4 April, 2023;
originally announced April 2023.
-
Optimal Delegation in Markets for Matching with Signaling
Authors:
Seungjin Han,
Alex Sam,
Youngki Shin
Abstract:
This paper studies a delegation problem faced by the planner who wants to regulate receivers' reaction choices in markets for matching between receivers and senders with signaling. We provide a noble insight into the planner's willingness to delegate and the design of optimal (reaction) interval delegation as a solution to the planner's general mechanism design problem. The relative heterogeneity…
▽ More
This paper studies a delegation problem faced by the planner who wants to regulate receivers' reaction choices in markets for matching between receivers and senders with signaling. We provide a noble insight into the planner's willingness to delegate and the design of optimal (reaction) interval delegation as a solution to the planner's general mechanism design problem. The relative heterogeneity of receiver types and the productivity of the sender' signal are crucial in deriving optimal interval delegation in the presence of the trade-off between matching efficiency and signaling costs.
△ Less
Submitted 16 March, 2023;
originally announced March 2023.
-
Cross-speaker Emotion Transfer by Manipulating Speech Style Latents
Authors:
Suhee Jo,
Younggun Lee,
Yookyung Shin,
Yeongtae Hwang,
Taesu Kim
Abstract:
In recent years, emotional text-to-speech has shown considerable progress. However, it requires a large amount of labeled data, which is not easily accessible. Even if it is possible to acquire an emotional speech dataset, there is still a limitation in controlling emotion intensity. In this work, we propose a novel method for cross-speaker emotion transfer and manipulation using vector arithmetic…
▽ More
In recent years, emotional text-to-speech has shown considerable progress. However, it requires a large amount of labeled data, which is not easily accessible. Even if it is possible to acquire an emotional speech dataset, there is still a limitation in controlling emotion intensity. In this work, we propose a novel method for cross-speaker emotion transfer and manipulation using vector arithmetic in latent style space. By leveraging only a few labeled samples, we generate emotional speech from reading-style speech without losing the speaker identity. Furthermore, emotion strength is readily controllable using a scalar value, providing an intuitive way for users to manipulate speech. Experimental results show the proposed method affords superior performance in terms of expressiveness, naturalness, and controllability, preserving speaker identity.
△ Less
Submitted 14 March, 2023;
originally announced March 2023.
-
Accurate Real-time Polyp Detection in Videos from Concatenation of Latent Features Extracted from Consecutive Frames
Authors:
Hemin Ali Qadir,
Younghak Shin,
Jacob Bergsland,
Ilangko Balasingham
Abstract:
An efficient deep learning model that can be implemented in real-time for polyp detection is crucial to reducing polyp miss-rate during screening procedures. Convolutional neural networks (CNNs) are vulnerable to small changes in the input image. A CNN-based model may miss the same polyp appearing in a series of consecutive frames and produce unsubtle detection output due to changes in camera pose…
▽ More
An efficient deep learning model that can be implemented in real-time for polyp detection is crucial to reducing polyp miss-rate during screening procedures. Convolutional neural networks (CNNs) are vulnerable to small changes in the input image. A CNN-based model may miss the same polyp appearing in a series of consecutive frames and produce unsubtle detection output due to changes in camera pose, lighting condition, light reflection, etc. In this study, we attempt to tackle this problem by integrating temporal information among neighboring frames. We propose an efficient feature concatenation method for a CNN-based encoder-decoder model without adding complexity to the model. The proposed method incorporates extracted feature maps of previous frames to detect polyps in the current frame. The experimental results demonstrate that the proposed method of feature concatenation improves the overall performance of automatic polyp detection in videos. The following results are obtained on a public video dataset: sensitivity 90.94\%, precision 90.53\%, and specificity 92.46%
△ Less
Submitted 10 March, 2023;
originally announced March 2023.
-
Stress Testing Control Loops in Cyber-Physical Systems
Authors:
Claudio Mandrioli,
Seung Yeob Shin,
Martina Maggio,
Domenico Bianculli,
Lionel Briand
Abstract:
Cyber-Physical Systems (CPSs) are often safety-critical and deployed in uncertain environments. Identifying scenarios where CPSs do not comply with requirements is fundamental but difficult due to the multidisciplinary nature of CPSs. We investigate the testing of control-based CPSs, where control and software engineers develop the software collaboratively. Control engineers make design assumption…
▽ More
Cyber-Physical Systems (CPSs) are often safety-critical and deployed in uncertain environments. Identifying scenarios where CPSs do not comply with requirements is fundamental but difficult due to the multidisciplinary nature of CPSs. We investigate the testing of control-based CPSs, where control and software engineers develop the software collaboratively. Control engineers make design assumptions during system development to leverage control theory and obtain guarantees on CPS behaviour. In the implemented system, however, such assumptions are not always satisfied, and their falsification can lead to loss of guarantees. We define stress testing of control-based CPSs as generating tests to falsify such design assumptions. We highlight different types of assumptions, focusing on the use of linearised physics models. To generate stress tests falsifying such assumptions, we leverage control theory to qualitatively characterise the input space of a control-based CPS. We propose a novel test parametrisation for control-based CPSs and use it with the input space characterisation to develop a stress testing approach. We evaluate our approach on three case study systems, including a drone, a continuous-current motor (in five configurations), and an aircraft.Our results show the effectiveness of the proposed testing approach in falsifying the design assumptions and highlighting the causes of assumption violations.
△ Less
Submitted 18 September, 2023; v1 submitted 27 February, 2023;
originally announced February 2023.
-
Probabilistic Safe WCET Estimation for Weakly Hard Real-Time Systems at Design Stages
Authors:
Jaekwon Lee,
Seung Yeob Shin,
Lionel Briand,
Shiva Nejati
Abstract:
Weakly hard real-time systems can, to some degree, tolerate deadline misses, but their schedulability still needs to be analyzed to ensure their quality of service. Such analysis usually occurs at early design stages to provide implementation guidelines to engineers so that they can make better design decisions. Estimating worst-case execution times (WCET) is a key input to schedulability analysis…
▽ More
Weakly hard real-time systems can, to some degree, tolerate deadline misses, but their schedulability still needs to be analyzed to ensure their quality of service. Such analysis usually occurs at early design stages to provide implementation guidelines to engineers so that they can make better design decisions. Estimating worst-case execution times (WCET) is a key input to schedulability analysis. However, early on during system design, estimating WCET values is challenging and engineers usually determine them as plausible ranges based on their domain knowledge. Our approach aims at finding restricted, safe WCET sub-ranges given a set of ranges initially estimated by experts in the context of weakly hard real-time systems. To this end, we leverage (1) multi-objective search aiming at maximizing the violation of weakly hard constraints in order to find worst-case scheduling scenarios and (2) polynomial logistic regression to infer safe WCET ranges with a probabilistic interpretation. We evaluated our approach by applying it to an industrial system in the satellite domain and several realistic synthetic systems. The results indicate that our approach significantly outperforms a baseline relying on random search without learning, and estimates safe WCET ranges with a high degree of confidence in practical time (< 23h).
△ Less
Submitted 11 August, 2023; v1 submitted 20 February, 2023;
originally announced February 2023.
-
Simple U-net Based Synthetic Polyp Image Generation: Polyp to Negative and Negative to Polyp
Authors:
Hemin Ali Qadir,
Ilangko Balasingham,
Younghak Shin
Abstract:
Synthetic polyp generation is a good alternative to overcome the privacy problem of medical data and the lack of various polyp samples. In this study, we propose a deep learning-based polyp image generation framework that generates synthetic polyp images that are similar to real ones. We suggest a framework that converts a given polyp image into a negative image (image without a polyp) using a sim…
▽ More
Synthetic polyp generation is a good alternative to overcome the privacy problem of medical data and the lack of various polyp samples. In this study, we propose a deep learning-based polyp image generation framework that generates synthetic polyp images that are similar to real ones. We suggest a framework that converts a given polyp image into a negative image (image without a polyp) using a simple conditional GAN architecture and then converts the negative image into a new-looking polyp image using the same network. In addition, by using the controllable polyp masks, polyps with various characteristics can be generated from one input condition. The generated polyp images can be used directly as training images for polyp detection and segmentation without additional labeling. To quantitatively assess the quality of generated synthetic polyps, we use public polyp image and video datasets combined with the generated synthetic images to examine the performance improvement of several detection and segmentation models. Experimental results show that we obtain performance gains when the generated polyp images are added to the training set.
△ Less
Submitted 20 February, 2023;
originally announced February 2023.
-
SEMI-PointRend: Improved Semiconductor Wafer Defect Classification and Segmentation as Rendering
Authors:
MinJin Hwang,
Bappaditya Dey,
Enrique Dehaerne,
Sandip Halder,
Young-han Shin
Abstract:
In this study, we applied the PointRend (Point-based Rendering) method to semiconductor defect segmentation. PointRend is an iterative segmentation algorithm inspired by image rendering in computer graphics, a new image segmentation method that can generate high-resolution segmentation masks. It can also be flexibly integrated into common instance segmentation meta-architecture such as Mask-RCNN a…
▽ More
In this study, we applied the PointRend (Point-based Rendering) method to semiconductor defect segmentation. PointRend is an iterative segmentation algorithm inspired by image rendering in computer graphics, a new image segmentation method that can generate high-resolution segmentation masks. It can also be flexibly integrated into common instance segmentation meta-architecture such as Mask-RCNN and semantic meta-architecture such as FCN. We implemented a model, termed as SEMI-PointRend, to generate precise segmentation masks by applying the PointRend neural network module. In this paper, we focus on comparing the defect segmentation predictions of SEMI-PointRend and Mask-RCNN for various defect types (line-collapse, single bridge, thin bridge, multi bridge non-horizontal). We show that SEMI-PointRend can outperforms Mask R-CNN by up to 18.8% in terms of segmentation mean average precision.
△ Less
Submitted 19 February, 2023;
originally announced February 2023.
-
Improved Learning-Augmented Algorithms for the Multi-Option Ski Rental Problem via Best-Possible Competitive Analysis
Authors:
Yongho Shin,
Changyeol Lee,
Gukryeol Lee,
Hyung-Chan An
Abstract:
In this paper, we present improved learning-augmented algorithms for the multi-option ski rental problem. Learning-augmented algorithms take ML predictions as an added part of the input and incorporates these predictions in solving the given problem. Due to their unique strength that combines the power of ML predictions with rigorous performance guarantees, they have been extensively studied in th…
▽ More
In this paper, we present improved learning-augmented algorithms for the multi-option ski rental problem. Learning-augmented algorithms take ML predictions as an added part of the input and incorporates these predictions in solving the given problem. Due to their unique strength that combines the power of ML predictions with rigorous performance guarantees, they have been extensively studied in the context of online optimization problems. Even though ski rental problems are one of the canonical problems in the field of online optimization, only deterministic algorithms were previously known for multi-option ski rental, with or without learning augmentation. We present the first randomized learning-augmented algorithm for this problem, surpassing previous performance guarantees given by deterministic algorithms. Our learning-augmented algorithm is based on a new, provably best-possible randomized competitive algorithm for the problem. Our results are further complemented by lower bounds for deterministic and randomized algorithms, and computational experiments evaluating our algorithms' performance improvements.
△ Less
Submitted 14 February, 2023;
originally announced February 2023.
-
Chiral electroluminescence from thin-film perovskite metacavities
Authors:
Seongheon Kim,
Soo-Chan An,
Younggon Kim,
Yun Seop Shin,
Alexander A. Antonov,
In Cheol Seo,
Byung Hoon Woo,
Yeonsoo Lim,
Maxim V. Gorkunov,
Yuri S. Kivshar,
Jin Young Kim,
Young Chul Jun
Abstract:
Chiral light sources realized in ultracompact device platforms are highly desirable for various applications. Among active media employed for thin-film emission devices, lead-halide perovskites have been extensively studied for photoluminescence due to their exceptional properties. However, up to date, there have been no demonstrations of chiral electroluminescence with a substantial degree of cir…
▽ More
Chiral light sources realized in ultracompact device platforms are highly desirable for various applications. Among active media employed for thin-film emission devices, lead-halide perovskites have been extensively studied for photoluminescence due to their exceptional properties. However, up to date, there have been no demonstrations of chiral electroluminescence with a substantial degree of circular polarization (DCP), being critical for the development of practical devices. Here, we propose a new concept of chiral light sources based on a thin-film perovskite metacavity and experimentally demonstrate chiral electroluminescence with DCP approaching 0.38. We design a metacavity created by a metal and a dielectric metasurface supporting photonic eigenstates with close-to-maximum chiral response. Chiral cavity modes facilitate asymmetric electroluminescence of pairs of left and right circularly polarized waves propagating in the opposite oblique directions. The proposed ultracompact light sources are especially advantageous for many applications requiring chiral light beams of both helicities.
△ Less
Submitted 10 February, 2023;
originally announced February 2023.
-
Search for the Sagittarius Tidal Stream of Axion Dark Matter around 4.55 $μ$eV
Authors:
Andrew K. Yi,
Saebyeok Ahn,
Çağlar Kutlu,
JinMyeong Kim,
Byeong Rok Ko,
Boris I. Ivanov,
HeeSu Byun,
Arjan F. van Loo,
SeongTae Park,
Junu Jeong,
Ohjoon Kwon,
Yasunobu Nakamura,
Sergey V. Uchaikin,
Jihoon Choi,
Soohyung Lee,
MyeongJae Lee,
Yun Chang Shin,
Jinsu Kim,
Doyu Lee,
Danho Ahn,
SungJae Bae,
Jiwon Lee,
Younggeun Kim,
Violeta Gkika,
Ki Woong Lee
, et al. (7 additional authors not shown)
Abstract:
We report the first search for the Sagittarius tidal stream of axion dark matter around 4.55 $μ$eV using CAPP-12TB haloscope data acquired in March of 2022. Our result excluded the Sagittarius tidal stream of Dine-Fischler-Srednicki-Zhitnitskii and Kim-Shifman-Vainshtein-Zakharov axion dark matter densities of $ρ_a\gtrsim0.184$ and $\gtrsim0.025$ GeV/cm$^{3}$, respectively, over a mass range from…
▽ More
We report the first search for the Sagittarius tidal stream of axion dark matter around 4.55 $μ$eV using CAPP-12TB haloscope data acquired in March of 2022. Our result excluded the Sagittarius tidal stream of Dine-Fischler-Srednicki-Zhitnitskii and Kim-Shifman-Vainshtein-Zakharov axion dark matter densities of $ρ_a\gtrsim0.184$ and $\gtrsim0.025$ GeV/cm$^{3}$, respectively, over a mass range from 4.51 to 4.59 $μ$eV at a 90% confidence level.
△ Less
Submitted 13 July, 2023; v1 submitted 2 February, 2023;
originally announced February 2023.
-
Development and Application of a ReaxFF Reactive Force Field for Ni-Doped MoS$_2$
Authors:
Karen Mohammadtabar,
Enrique Guerrero,
Sergio Romero Garcia,
Yun Kyung Shin,
Adri C. T. van Duin,
David A. Strubbe,
Ashlie Martini
Abstract:
The properties of $\mathrm{MoS_2}$ can be tuned or optimized through doping. In particular, Ni doping has been shown to improve the performance of $\mathrm{MoS_2}$ for various applications, including catalysis and tribology. To enable investigation of Ni-doped $\mathrm{MoS_2}$ with reactive molecular dynamics simulations, we developed a new ReaxFF force field to describe this material. The force f…
▽ More
The properties of $\mathrm{MoS_2}$ can be tuned or optimized through doping. In particular, Ni doping has been shown to improve the performance of $\mathrm{MoS_2}$ for various applications, including catalysis and tribology. To enable investigation of Ni-doped $\mathrm{MoS_2}$ with reactive molecular dynamics simulations, we developed a new ReaxFF force field to describe this material. The force field parameters were optimized to match a large set of density-functional theory (DFT) calculations of 2H-$\mathrm{MoS_2}$ doped with Ni, at four different sites (Mo-substituted, S-substituted, octahedral intercalation, and tetrahedral intercalation), under uniaxial, biaxial, triaxial, and shear strain. The force field was evaluated by comparing ReaxFF- and DFT-relaxed structural parameters, the tetrahedral/octahedral energy difference in doped 2H, energies of doped 1H and 1T monolayers, and doped 2H structures with vacancies. We demonstrated the application of the force field with reactive simulations of sputtering deposition and annealing of Ni-doped $\mathrm{MoS_2}$ films. Results show that the developed force field can successfully model the phase transition of Ni-doped $\mathrm{MoS_2}$ from amorphous to crystalline. The newly developed force field can be used in subsequent investigations to study the properties and behavior of Ni-doped $\mathrm{MoS_2}$ using reactive molecular dynamics simulations.
△ Less
Submitted 12 June, 2024; v1 submitted 2 February, 2023;
originally announced February 2023.
-
Spin-driven stationary turbulence in spinor Bose-Einstein condensates
Authors:
Deokhwa Hong,
Junghoon Lee,
Jongmin Kim,
Jong Heum Jung,
Kyuhwan Lee,
Seji Kang,
Yong-il Shin
Abstract:
We report the observation of stationary turbulence in antiferromagnetic spin-1 Bose-Einstein condensates driven by a radio-frequency magnetic field. The magnetic driving injects energy into the system by spin rotation and the energy is dissipated via dynamic instability, resulting in the emergence of an irregular spin texture in the condensate. Under continuous driving, the spinor condensate evolv…
▽ More
We report the observation of stationary turbulence in antiferromagnetic spin-1 Bose-Einstein condensates driven by a radio-frequency magnetic field. The magnetic driving injects energy into the system by spin rotation and the energy is dissipated via dynamic instability, resulting in the emergence of an irregular spin texture in the condensate. Under continuous driving, the spinor condensate evolves into a nonequilibrium steady state with characteristic spin turbulence, while the low energy scale of spin excitations ensures that the sample's lifetime is minimally affected. When the driving strength is on par with the system's spin interaction energy and the quadratic Zeeman energy, remarkably, the stationary turbulent state exhibits spin-isotropic features in spin composition and spatial spin texture. We numerically show that ambient field fluctuations play a crucial role in sustaining the turbulent state within the system. These results open up new avenues for exploring quantum turbulence in spinor superfluid systems.
△ Less
Submitted 13 July, 2023; v1 submitted 2 February, 2023;
originally announced February 2023.
-
Towards Equitable Representation in Text-to-Image Synthesis Models with the Cross-Cultural Understanding Benchmark (CCUB) Dataset
Authors:
Zhixuan Liu,
Youeun Shin,
Beverley-Claire Okogwu,
Youngsik Yun,
Lia Coleman,
Peter Schaldenbrand,
Jihie Kim,
Jean Oh
Abstract:
It has been shown that accurate representation in media improves the well-being of the people who consume it. By contrast, inaccurate representations can negatively affect viewers and lead to harmful perceptions of other cultures. To achieve inclusive representation in generated images, we propose a culturally-aware priming approach for text-to-image synthesis using a small but culturally curated…
▽ More
It has been shown that accurate representation in media improves the well-being of the people who consume it. By contrast, inaccurate representations can negatively affect viewers and lead to harmful perceptions of other cultures. To achieve inclusive representation in generated images, we propose a culturally-aware priming approach for text-to-image synthesis using a small but culturally curated dataset that we collected, known here as Cross-Cultural Understanding Benchmark (CCUB) Dataset, to fight the bias prevalent in giant datasets. Our proposed approach is comprised of two fine-tuning techniques: (1) Adding visual context via fine-tuning a pre-trained text-to-image synthesis model, Stable Diffusion, on the CCUB text-image pairs, and (2) Adding semantic context via automated prompt engineering using the fine-tuned large language model, GPT-3, trained on our CCUB culturally-aware text data. CCUB dataset is curated and our approach is evaluated by people who have a personal relationship with that particular culture. Our experiments indicate that priming using both text and image is effective in improving the cultural relevance and decreasing the offensiveness of generated images while maintaining quality.
△ Less
Submitted 26 April, 2023; v1 submitted 27 January, 2023;
originally announced January 2023.
-
Magnetoelasticity-driven phase inversion of ultrafast spin precession in NixFe100-x thin films
Authors:
Yooleemi Shin,
Seongsoo Yoon,
Jung-Il Hong,
Ji-Wan Kim
Abstract:
We present strong evidences for the deterministic role of magnetoelasticity in ultrafast spin dynamics of ferromagnetic NixFe100-x alloy films. Without a change in the crystal structure, we observed sudden Pi-phase inversion of the spin precession in the range of x = 87.0 - 97.5. In addition, it was found that the phase was continuously changed and reversed its sign by varying the pump fluence. Th…
▽ More
We present strong evidences for the deterministic role of magnetoelasticity in ultrafast spin dynamics of ferromagnetic NixFe100-x alloy films. Without a change in the crystal structure, we observed sudden Pi-phase inversion of the spin precession in the range of x = 87.0 - 97.5. In addition, it was found that the phase was continuously changed and reversed its sign by varying the pump fluence. These cannot be explained simply by temperature dependence of magnetocrystalline, demagnetizing, and Zeeman fields which have been conventionally considered so far in describing the spin dynamics. Through the temperature- and composition-dependent simulations adding the magnetoelastic field generated from the lattice thermal strain, we revealed that the conventional and magnetoelastic fields were competing around x = 95.3, where the spin dynamics showed the largest phase shift. For analytic understanding, we further show that the temperature-dependent interplay of the Curie temperature, saturation magnetization, and magnetostriction, which are demonstrated to be the most important macroscopic parameters, determines the ultrafast spin dynamics. Our extensive study emphasizes that magnetoelasticity is the key ingredient for fully understanding the driving mechanism of ultrafast spin dynamics.
△ Less
Submitted 22 December, 2022;
originally announced December 2022.
-
Young wall construction of level-1 highest weight crystals over $U_q(D_4^{(3)})$ and $U_q(G_2^{(1)})$
Authors:
Zhaobing Fan,
Shaolong Han,
Seok-Jin Kang,
Yong-Su Shin
Abstract:
With the help of path realization and affine energy function, we give a Young wall construction of level-1 highest weight crystals $B(λ)$ over $U_{q}(G_{2}^{(1)})$ and $U_{q}(D_{4}^{(3)})$. Our construction is based on four different shapes of colored blocks, $\mathbf O$-block, $\mathbf I$-block, $\mathbf L$-block and $\mathbf{LL}$-block, obtained by cutting the unit cube in three different ways.
With the help of path realization and affine energy function, we give a Young wall construction of level-1 highest weight crystals $B(λ)$ over $U_{q}(G_{2}^{(1)})$ and $U_{q}(D_{4}^{(3)})$. Our construction is based on four different shapes of colored blocks, $\mathbf O$-block, $\mathbf I$-block, $\mathbf L$-block and $\mathbf{LL}$-block, obtained by cutting the unit cube in three different ways.
△ Less
Submitted 25 February, 2023; v1 submitted 13 November, 2022;
originally announced November 2022.
-
MEDS-Net: Self-Distilled Multi-Encoders Network with Bi-Direction Maximum Intensity projections for Lung Nodule Detection
Authors:
Muhammad Usman,
Azka Rehman,
Abdullah Shahid,
Siddique Latif,
Shi Sub Byon,
Byoung Dai Lee,
Sung Hyun Kim,
Byung il Lee,
Yeong Gil Shin
Abstract:
In this study, we propose a lung nodule detection scheme which fully incorporates the clinic workflow of radiologists. Particularly, we exploit Bi-Directional Maximum intensity projection (MIP) images of various thicknesses (i.e., 3, 5 and 10mm) along with a 3D patch of CT scan, consisting of 10 adjacent slices to feed into self-distillation-based Multi-Encoders Network (MEDS-Net). The proposed ar…
▽ More
In this study, we propose a lung nodule detection scheme which fully incorporates the clinic workflow of radiologists. Particularly, we exploit Bi-Directional Maximum intensity projection (MIP) images of various thicknesses (i.e., 3, 5 and 10mm) along with a 3D patch of CT scan, consisting of 10 adjacent slices to feed into self-distillation-based Multi-Encoders Network (MEDS-Net). The proposed architecture first condenses 3D patch input to three channels by using a dense block which consists of dense units which effectively examine the nodule presence from 2D axial slices. This condensed information, along with the forward and backward MIP images, is fed to three different encoders to learn the most meaningful representation, which is forwarded into the decoded block at various levels. At the decoder block, we employ a self-distillation mechanism by connecting the distillation block, which contains five lung nodule detectors. It helps to expedite the convergence and improves the learning ability of the proposed architecture. Finally, the proposed scheme reduces the false positives by complementing the main detector with auxiliary detectors. The proposed scheme has been rigorously evaluated on 888 scans of LUNA16 dataset and obtained a CPM score of 93.6\%. The results demonstrate that incorporating of bi-direction MIP images enables MEDS-Net to effectively distinguish nodules from surroundings which help to achieve the sensitivity of 91.5% and 92.8% with false positives rate of 0.25 and 0.5 per scan, respectively.
△ Less
Submitted 26 December, 2022; v1 submitted 30 October, 2022;
originally announced November 2022.
-
PAGE: Prototype-Based Model-Level Explanations for Graph Neural Networks
Authors:
Yong-Min Shin,
Sun-Woo Kim,
Won-Yong Shin
Abstract:
Aside from graph neural networks (GNNs) attracting significant attention as a powerful framework revolutionizing graph representation learning, there has been an increasing demand for explaining GNN models. Although various explanation methods for GNNs have been developed, most studies have focused on instance-level explanations, which produce explanations tailored to a given graph instance. In ou…
▽ More
Aside from graph neural networks (GNNs) attracting significant attention as a powerful framework revolutionizing graph representation learning, there has been an increasing demand for explaining GNN models. Although various explanation methods for GNNs have been developed, most studies have focused on instance-level explanations, which produce explanations tailored to a given graph instance. In our study, we propose Prototype-bAsed GNN-Explainer (PAGE), a novel model-level GNN explanation method that explains what the underlying GNN model has learned for graph classification by discovering human-interpretable prototype graphs. Our method produces explanations for a given class, thus being capable of offering more concise and comprehensive explanations than those of instance-level explanations. First, PAGE selects embeddings of class-discriminative input graphs on the graph-level embedding space after clustering them. Then, PAGE discovers a common subgraph pattern by iteratively searching for high matching node tuples using node-level embeddings via a prototype scoring function, thereby yielding a prototype graph as our explanation. Using six graph classification datasets, we demonstrate that PAGE qualitatively and quantitatively outperforms the state-of-the-art model-level explanation method. We also carry out systematic experimental studies by demonstrating the relationship between PAGE and instance-level explanation methods, the robustness of PAGE to input data scarce environments, and the computational efficiency of the proposed prototype scoring function in PAGE.
△ Less
Submitted 19 March, 2024; v1 submitted 31 October, 2022;
originally announced October 2022.
-
Learning Failure-Inducing Models for Testing Software-Defined Networks
Authors:
Raphaël Ollando,
Seung Yeob Shin,
Lionel C. Briand
Abstract:
Software-defined networks (SDN) enable flexible and effective communication systems that are managed by centralized software controllers. However, such a controller can undermine the underlying communication network of an SDN-based system and thus must be carefully tested. When an SDN-based system fails, in order to address such a failure, engineers need to precisely understand the conditions unde…
▽ More
Software-defined networks (SDN) enable flexible and effective communication systems that are managed by centralized software controllers. However, such a controller can undermine the underlying communication network of an SDN-based system and thus must be carefully tested. When an SDN-based system fails, in order to address such a failure, engineers need to precisely understand the conditions under which it occurs. In this article, we introduce a machine learning-guided fuzzing method, named FuzzSDN, aiming at both (1) generating effective test data leading to failures in SDN-based systems and (2) learning accurate failure-inducing models that characterize conditions under which such system fails. To our knowledge, no existing work simultaneously addresses these two objectives for SDNs. We evaluate FuzzSDN by applying it to systems controlled by two open-source SDN controllers. Further, we compare FuzzSDN with two state-of-the-art methods for fuzzing SDNs and two baselines for learning failure-inducing models. Our results show that (1) compared to the state-of-the-art methods, FuzzSDN generates at least 12 times more failures, within the same time budget, with a controller that is fairly robust to fuzzing and (2) our failure-inducing models have, on average, a precision of 98% and a recall of 86%, significantly outperforming the baselines.
△ Less
Submitted 8 January, 2024; v1 submitted 27 October, 2022;
originally announced October 2022.
-
Axion Dark Matter Search around 4.55 $μ$eV with Dine-Fischler-Srednicki-Zhitnitskii Sensitivity
Authors:
Andrew K. Yi,
Saebyeok Ahn,
Çağlar Kutlu,
JinMyeong Kim,
Byeong Rok Ko,
Boris I. Ivanov,
HeeSu Byun,
Arjan F. van Loo,
SeongTae Park,
Junu Jeong,
Ohjoon Kwon,
Yasunobu Nakamura,
Sergey V. Uchaikin,
Jihoon Choi,
Soohyung Lee,
MyeongJae Lee,
Yun Chang Shin,
Jinsu Kim,
Doyu Lee,
Danho Ahn,
SungJae Bae,
Jiwon Lee,
Younggeun Kim,
Violeta Gkika,
Ki Woong Lee
, et al. (7 additional authors not shown)
Abstract:
We report an axion dark matter search at Dine-Fischler-Srednicki-Zhitnitskii sensitivity with the CAPP-12TB haloscope, assuming axions contribute 100\% of the local dark matter density.
The search excluded the axion--photon coupling $g_{aγγ}$ down to about $6.2\times10^{-16}$ GeV$^{-1}$ over the axion mass range between 4.51 and 4.59 $μ$eV at a 90\% confidence level.
The achieved experimental…
▽ More
We report an axion dark matter search at Dine-Fischler-Srednicki-Zhitnitskii sensitivity with the CAPP-12TB haloscope, assuming axions contribute 100\% of the local dark matter density.
The search excluded the axion--photon coupling $g_{aγγ}$ down to about $6.2\times10^{-16}$ GeV$^{-1}$ over the axion mass range between 4.51 and 4.59 $μ$eV at a 90\% confidence level.
The achieved experimental sensitivity can also exclude Kim-Shifman-Vainshtein-Zakharov axion dark matter that makes up just 13\% of the local dark matter density.
The CAPP-12TB haloscope will continue the search over a wide range of axion masses.
△ Less
Submitted 16 February, 2023; v1 submitted 19 October, 2022;
originally announced October 2022.
-
Improving GANs with a Feature Cycling Generator
Authors:
Seung Park,
Yong-Goo Shin
Abstract:
Generative adversarial networks (GANs), built with a generator and discriminator, significantly have advanced image generation. Typically, existing papers build their generators by stacking up multiple residual blocks since it makes ease the training of generators. However, some recent papers commented on the limitation of the residual block and proposed a new architectural unit that improves the…
▽ More
Generative adversarial networks (GANs), built with a generator and discriminator, significantly have advanced image generation. Typically, existing papers build their generators by stacking up multiple residual blocks since it makes ease the training of generators. However, some recent papers commented on the limitation of the residual block and proposed a new architectural unit that improves the GANs performance. Following this trend, this paper presents a novel unit, called feature cycling block (FCB), which achieves impressive results in the image generation task. Specifically, the FCB has two branches: one is a memory branch and the other is an image branch. The memory branch keeps meaningful information at each stage of the generator, whereas the image branch takes some useful features from the memory branch to produce a high-quality image. To show the capability of the proposed method, we conducted extensive experiments using various datasets including CIFAR-10, CIFAR-100, FFHQ, AFHQ, and subsets of LSUN. Experimental results demonstrate the substantial superiority of our approach over the baseline without incurring any objective functions or training skills. For instance, the proposed method improves Frechet inception distance (FID) of StyleGAN2 from 4.89 to 3.72 on the FFHQ dataset and from 6.64 to 5.57 on the LSUN Bed dataset. We believe that the pioneering attempt presented in this paper could inspire the community with better-designed generator architecture and with training objectives or skills compatible with the proposed method.
△ Less
Submitted 17 February, 2023; v1 submitted 18 October, 2022;
originally announced October 2022.
-
Meta-Query-Net: Resolving Purity-Informativeness Dilemma in Open-set Active Learning
Authors:
Dongmin Park,
Yooju Shin,
Jihwan Bang,
Youngjun Lee,
Hwanjun Song,
Jae-Gil Lee
Abstract:
Unlabeled data examples awaiting annotations contain open-set noise inevitably. A few active learning studies have attempted to deal with this open-set noise for sample selection by filtering out the noisy examples. However, because focusing on the purity of examples in a query set leads to overlooking the informativeness of the examples, the best balancing of purity and informativeness remains an…
▽ More
Unlabeled data examples awaiting annotations contain open-set noise inevitably. A few active learning studies have attempted to deal with this open-set noise for sample selection by filtering out the noisy examples. However, because focusing on the purity of examples in a query set leads to overlooking the informativeness of the examples, the best balancing of purity and informativeness remains an important question. In this paper, to solve this purity-informativeness dilemma in open-set active learning, we propose a novel Meta-Query-Net,(MQ-Net) that adaptively finds the best balancing between the two factors. Specifically, by leveraging the multi-round property of active learning, we train MQ-Net using a query set without an additional validation set. Furthermore, a clear dominance relationship between unlabeled examples is effectively captured by MQ-Net through a novel skyline regularization. Extensive experiments on multiple open-set active learning scenarios demonstrate that the proposed MQ-Net achieves 20.14% improvement in terms of accuracy, compared with the state-of-the-art methods.
△ Less
Submitted 11 January, 2023; v1 submitted 13 October, 2022;
originally announced October 2022.
-
Medha: Microcoded Hardware Accelerator for computing on Encrypted Data
Authors:
Ahmet Can Mert,
Aikata,
Sunmin Kwon,
Youngsam Shin,
Donghoon Yoo,
Yongwoo Lee,
Sujoy Sinha Roy
Abstract:
Homomorphic encryption (HE) enables computation on encrypted data, and hence it has a great potential in privacy-preserving outsourcing of computations to the cloud. Hardware acceleration of HE is crucial as software implementations are very slow. In this paper, we present design methodologies for building a programmable hardware accelerator for speeding up the cloud-side homomorphic evaluations o…
▽ More
Homomorphic encryption (HE) enables computation on encrypted data, and hence it has a great potential in privacy-preserving outsourcing of computations to the cloud. Hardware acceleration of HE is crucial as software implementations are very slow. In this paper, we present design methodologies for building a programmable hardware accelerator for speeding up the cloud-side homomorphic evaluations on encrypted data. First, we propose a divide-and-conquer technique that enables homomorphic evaluations in a large polynomial ring $R_{Q,2N}$ to use a hardware accelerator that has been built for the smaller ring $R_{Q,N}$. The technique makes it possible to use a single hardware accelerator flexibly for supporting several HE parameter sets. Next, we present several architectural design methods that we use to realize the flexible and instruction-set accelerator architecture, which we call `Medha'. At every level of the implementation hierarchy, we explore possibilities for parallel processing. Starting from hardware-friendly parallel algorithms for the basic building blocks, we gradually build heavily parallel RNS polynomial arithmetic units. Next, many of these parallel units are interconnected elegantly so that their interconnections require the minimum number of nets, therefore making the overall architecture placement-friendly on the platform. For Medha, we take a memory-conservative design approach and get rid of any off-chip memory access during homomorphic evaluations. Finally, we implement Medha in a Xilinx Alveo U250 FPGA and measure timing performances of the microcoded homomorphic addition, multiplication, key-switching, and rescaling for the leveled HE scheme RNS-HEAAN at 200 MHz clock frequency. For two large parameter sets, Medha achieves accelerations by up to 68x and 78x times respectively compared to a highly optimized software implementation Microsoft SEAL running at 2.3 GHz.
△ Less
Submitted 12 October, 2022; v1 submitted 11 October, 2022;
originally announced October 2022.
-
Minimum critical velocity of a Gaussian obstacle in a Bose-Einstein condensate
Authors:
Haneul Kwak,
Jong Heum Jung,
Yong-il Shin
Abstract:
When a superfluid flows past an obstacle, quantized vortices can be created in the wake above a certain critical velocity. In the experiment by Kwon et al. [Phys. Rev. A 91, 053615 (2015)], the critical velocity $v_c$ was measured for atomic Bose-Einstein condensates (BECs) using a moving repulsive Gaussian potential and $v_c$ was minimized when the potential height $V_0$ of the obstacle was close…
▽ More
When a superfluid flows past an obstacle, quantized vortices can be created in the wake above a certain critical velocity. In the experiment by Kwon et al. [Phys. Rev. A 91, 053615 (2015)], the critical velocity $v_c$ was measured for atomic Bose-Einstein condensates (BECs) using a moving repulsive Gaussian potential and $v_c$ was minimized when the potential height $V_0$ of the obstacle was close to the condensate chemical potential $μ$. Here we numerically investigate the evolution of the critical vortex shedding in a two-dimensional BEC with increasing $V_0$ and show that the minimum $v_c$ at the critical strength $V_{0c}\approx μ$ results from the local density reduction and vortex pinning effect of the repulsive obstacle. The spatial distribution of the superflow around the moving obstacle just below $v_c$ is examined. The particle density at the tip of the obstacle decreases as $V_0$ increases to $V_{c0}$ and at the critical strength, a vortex dipole is suddenly formed and dragged by the moving obstacle, indicating the onset of vortex pinning. The minimum $v_c$ exhibits power-law scaling with the obstacle size $σ$ as $v_c\sim σ^{-γ}$ with $γ\approx 1/2$.
△ Less
Submitted 13 February, 2023; v1 submitted 9 October, 2022;
originally announced October 2022.
-
Dual-Stage Deeply Supervised Attention-based Convolutional Neural Networks for Mandibular Canal Segmentation in CBCT Scans
Authors:
Azka Rehman,
Muhammad Usman,
Rabeea Jawaid,
Amal Muhammad Saleem,
Shi Sub Byon,
Sung Hyun Kim,
Byoung Dai Lee,
Byung il Lee,
Yeong Gil Shin
Abstract:
Accurate segmentation of mandibular canals in lower jaws is important in dental implantology. Medical experts determine the implant position and dimensions manually from 3D CT images to avoid damaging the mandibular nerve inside the canal. In this paper, we propose a novel dual-stage deep learning-based scheme for the automatic segmentation of the mandibular canal. Particularly, we first enhance t…
▽ More
Accurate segmentation of mandibular canals in lower jaws is important in dental implantology. Medical experts determine the implant position and dimensions manually from 3D CT images to avoid damaging the mandibular nerve inside the canal. In this paper, we propose a novel dual-stage deep learning-based scheme for the automatic segmentation of the mandibular canal. Particularly, we first enhance the CBCT scans by employing the novel histogram-based dynamic windowing scheme, which improves the visibility of mandibular canals. After enhancement, we design 3D deeply supervised attention U-Net architecture for localizing the volumes of interest (VOIs), which contain the mandibular canals (i.e., left and right canals). Finally, we employed the multi-scale input residual U-Net architecture (MS-R-UNet) to segment the mandibular canals using VOIs accurately. The proposed method has been rigorously evaluated on 500 scans. The results demonstrate that our technique outperforms the current state-of-the-art segmentation performance and robustness methods.
△ Less
Submitted 2 November, 2022; v1 submitted 6 October, 2022;
originally announced October 2022.
-
Fast Inference for Quantile Regression with Tens of Millions of Observations
Authors:
Sokbae Lee,
Yuan Liao,
Myung Hwan Seo,
Youngki Shin
Abstract:
Big data analytics has opened new avenues in economic research, but the challenge of analyzing datasets with tens of millions of observations is substantial. Conventional econometric methods based on extreme estimators require large amounts of computing resources and memory, which are often not readily available. In this paper, we focus on linear quantile regression applied to "ultra-large" datase…
▽ More
Big data analytics has opened new avenues in economic research, but the challenge of analyzing datasets with tens of millions of observations is substantial. Conventional econometric methods based on extreme estimators require large amounts of computing resources and memory, which are often not readily available. In this paper, we focus on linear quantile regression applied to "ultra-large" datasets, such as U.S. decennial censuses. A fast inference framework is presented, utilizing stochastic subgradient descent (S-subGD) updates. The inference procedure handles cross-sectional data sequentially: (i) updating the parameter estimate with each incoming "new observation", (ii) aggregating it as a $\textit{Polyak-Ruppert}$ average, and (iii) computing a pivotal statistic for inference using only a solution path. The methodology draws from time-series regression to create an asymptotically pivotal statistic through random scaling. Our proposed test statistic is calculated in a fully online fashion and critical values are calculated without resampling. We conduct extensive numerical studies to showcase the computational merits of our proposed inference. For inference problems as large as $(n, d) \sim (10^7, 10^3)$, where $n$ is the sample size and $d$ is the number of regressors, our method generates new insights, surpassing current inference methods in computation. Our method specifically reveals trends in the gender gap in the U.S. college wage premium using millions of observations, while controlling over $10^3$ covariates to mitigate confounding effects.
△ Less
Submitted 31 October, 2023; v1 submitted 28 September, 2022;
originally announced September 2022.
-
Statistical Treatment Rules under Social Interaction
Authors:
Seungjin Han,
Julius Owusu,
Youngki Shin
Abstract:
In this paper we study treatment assignment rules in the presence of social interaction. We construct an analytical framework under the anonymous interaction assumption, where the decision problem becomes choosing a treatment fraction. We propose a multinomial empirical success (MES) rule that includes the empirical success rule of Manski (2004) as a special case. We investigate the non-asymptotic…
▽ More
In this paper we study treatment assignment rules in the presence of social interaction. We construct an analytical framework under the anonymous interaction assumption, where the decision problem becomes choosing a treatment fraction. We propose a multinomial empirical success (MES) rule that includes the empirical success rule of Manski (2004) as a special case. We investigate the non-asymptotic bounds of the expected utility based on the MES rule. Finally, we prove that the MES rule achieves the asymptotic optimality with the minimax regret criterion.
△ Less
Submitted 9 November, 2022; v1 submitted 19 September, 2022;
originally announced September 2022.
-
Movement Detection of Tongue and Related Body Parts Using IR-UWB Radar
Authors:
Sunghwa Lee,
Younghoon Shin
Abstract:
Because an impulse radio ultra-wideband (IR-UWB) radar can detect targets with high accuracy, work through occluding materials, and operate without contact, it is an attractive hardware solution for building silent speech interfaces, which are non-audio-based speech communication devices. As tongue movement is strongly engaged in pronunciation, detecting its movement is crucial for developing sile…
▽ More
Because an impulse radio ultra-wideband (IR-UWB) radar can detect targets with high accuracy, work through occluding materials, and operate without contact, it is an attractive hardware solution for building silent speech interfaces, which are non-audio-based speech communication devices. As tongue movement is strongly engaged in pronunciation, detecting its movement is crucial for developing silent speech interfaces. In this study, we attempted to classify the motionless and moving states of an invisible tongue and its related body parts using an IR-UWB radar whose antennas were pointed toward the participant's chin. Using the proposed feature extraction algorithm and a Gaussian mixture model - hidden Markov model, we classified two states of the invisible tongue of four individual participants with a minimum accuracy of 90%.
△ Less
Submitted 5 September, 2022;
originally announced September 2022.
-
Suppression of Spontaneous Defect Formation in Inhomogeneous Bose Gases
Authors:
Myeonghyeon Kim,
Tenzin Rabga,
Yangheon Lee,
Junhong Goo,
Dalmin Bae,
Yong-il Shin
Abstract:
In phase transition dynamics involving symmetry breaking, topological defects can be spontaneously created but it is suppressed in a spatially inhomogeneous system due to the spreading of the ordered phase information. We demonstrate the defect suppression effect in a trapped atomic Bose gas which is quenched into a superfluid phase. The spatial distribution of created defects is measured for vari…
▽ More
In phase transition dynamics involving symmetry breaking, topological defects can be spontaneously created but it is suppressed in a spatially inhomogeneous system due to the spreading of the ordered phase information. We demonstrate the defect suppression effect in a trapped atomic Bose gas which is quenched into a superfluid phase. The spatial distribution of created defects is measured for various quench times and it is shown that for slower quenches, the spontaneous defect production is relatively more suppressed in the sample's outer region with higher atomic density gradient. The power-law scaling of the local defect density with the quench time is enhanced in the outer region, which is consistent with the Kibble-Zurek mechanism including the causality effect due to the spatial inhomogeneity of the system. This work opens an avenue in the study of nonequilibrium phase transition dynamics using the defect position information.
△ Less
Submitted 3 August, 2022;
originally announced August 2022.
-
Improving Small Lesion Segmentation in CT Scans using Intensity Distribution Supervision: Application to Small Bowel Carcinoid Tumor
Authors:
Seung Yeon Shin,
Thomas C. Shen,
Stephen A. Wank,
Ronald M. Summers
Abstract:
Finding small lesions is very challenging due to lack of noticeable features, severe class imbalance, as well as the size itself. One approach to improve small lesion segmentation is to reduce the region of interest and inspect it at a higher sensitivity rather than performing it for the entire region. It is usually implemented as sequential or joint segmentation of organ and lesion, which require…
▽ More
Finding small lesions is very challenging due to lack of noticeable features, severe class imbalance, as well as the size itself. One approach to improve small lesion segmentation is to reduce the region of interest and inspect it at a higher sensitivity rather than performing it for the entire region. It is usually implemented as sequential or joint segmentation of organ and lesion, which requires additional supervision on organ segmentation. Instead, we propose to utilize an intensity distribution of a target lesion at no additional labeling cost to effectively separate regions where the lesions are possibly located from the background. It is incorporated into network training as an auxiliary task. We applied the proposed method to segmentation of small bowel carcinoid tumors in CT scans. We observed improvements for all metrics (33.5% $\rightarrow$ 38.2%, 41.3% $\rightarrow$ 47.8%, 30.0% $\rightarrow$ 35.9% for the global, per case, and per tumor Dice scores, respectively.) compared to the baseline method, which proves the validity of our idea. Our method can be one option for explicitly incorporating intensity distribution information of a target in network training.
△ Less
Submitted 29 July, 2022;
originally announced July 2022.
-
Graph-Based Small Bowel Path Tracking with Cylindrical Constraints
Authors:
Seung Yeon Shin,
Sungwon Lee,
Ronald M. Summers
Abstract:
We present a new graph-based method for small bowel path tracking based on cylindrical constraints. A distinctive characteristic of the small bowel compared to other organs is the contact between parts of itself along its course, which makes the path tracking difficult together with the indistinct appearance of the wall. It causes the tracked path to easily cross over the walls when relying on low…
▽ More
We present a new graph-based method for small bowel path tracking based on cylindrical constraints. A distinctive characteristic of the small bowel compared to other organs is the contact between parts of itself along its course, which makes the path tracking difficult together with the indistinct appearance of the wall. It causes the tracked path to easily cross over the walls when relying on low-level features like the wall detection. To circumvent this, a series of cylinders that are fitted along the course of the small bowel are used to guide the tracking to more reliable directions. It is implemented as soft constraints using a new cost function. The proposed method is evaluated against ground-truth paths that are all connected from start to end of the small bowel for 10 abdominal CT scans. The proposed method showed clear improvements compared to the baseline method in tracking the path without making an error. Improvements of 6.6% and 17.0%, in terms of the tracked length, were observed for two different settings related to the small bowel segmentation.
△ Less
Submitted 28 July, 2022;
originally announced July 2022.
-
Extraction of Coronary Vessels in Fluoroscopic X-Ray Sequences Using Vessel Correspondence Optimization
Authors:
Seung Yeon Shin,
Soochahn Lee,
Kyoung Jin Noh,
Il Dong Yun,
Kyoung Mu Lee
Abstract:
We present a method to extract coronary vessels from fluoroscopic x-ray sequences. Given the vessel structure for the source frame, vessel correspondence candidates in the subsequent frame are generated by a novel hierarchical search scheme to overcome the aperture problem. Optimal correspondences are determined within a Markov random field optimization framework. Post-processing is performed to e…
▽ More
We present a method to extract coronary vessels from fluoroscopic x-ray sequences. Given the vessel structure for the source frame, vessel correspondence candidates in the subsequent frame are generated by a novel hierarchical search scheme to overcome the aperture problem. Optimal correspondences are determined within a Markov random field optimization framework. Post-processing is performed to extract vessel branches newly visible due to the inflow of contrast agent. Quantitative and qualitative evaluation conducted on a dataset of 18 sequences demonstrates the effectiveness of the proposed method.
△ Less
Submitted 27 July, 2022;
originally announced July 2022.
-
Accurate Ground-Truth Depth Image Generation via Overfit Training of Point Cloud Registration using Local Frame Sets
Authors:
Jiwan Kim,
Minchang Kim,
Yeong-Gil Shin,
Minyoung Chung
Abstract:
Accurate three-dimensional perception is a fundamental task in several computer vision applications. Recently, commercial RGB-depth (RGB-D) cameras have been widely adopted as single-view depth-sensing devices owing to their efficient depth-sensing abilities. However, the depth quality of most RGB-D sensors remains insufficient owing to the inherent noise from a single-view environment. Recently,…
▽ More
Accurate three-dimensional perception is a fundamental task in several computer vision applications. Recently, commercial RGB-depth (RGB-D) cameras have been widely adopted as single-view depth-sensing devices owing to their efficient depth-sensing abilities. However, the depth quality of most RGB-D sensors remains insufficient owing to the inherent noise from a single-view environment. Recently, several studies have focused on the single-view depth enhancement of RGB-D cameras. Recent research has proposed deep-learning-based approaches that typically train networks using high-quality supervised depth datasets, which indicates that the quality of the ground-truth (GT) depth dataset is a top-most important factor for accurate system; however, such high-quality GT datasets are difficult to obtain. In this study, we developed a novel method for high-quality GT depth generation based on an RGB-D stream dataset. First, we defined consecutive depth frames in a local spatial region as a local frame set. Then, the depth frames were aligned to a certain frame in the local frame set using an unsupervised point cloud registration scheme. The registration parameters were trained based on an overfit-training scheme, which was primarily used to construct a single GT depth image for each frame set. The final GT depth dataset was constructed using several local frame sets, and each local frame set was trained independently. The primary advantage of this study is that a high-quality GT depth dataset can be constructed under various scanning environments using only the RGB-D stream dataset. Moreover, our proposed method can be used as a new benchmark GT dataset for accurate performance evaluations. We evaluated our GT dataset on previously benchmarked GT depth datasets and demonstrated that our method is superior to state-of-the-art depth enhancement frameworks.
△ Less
Submitted 26 July, 2022; v1 submitted 14 July, 2022;
originally announced July 2022.
-
Text-driven Emotional Style Control and Cross-speaker Style Transfer in Neural TTS
Authors:
Yookyung Shin,
Younggun Lee,
Suhee Jo,
Yeongtae Hwang,
Taesu Kim
Abstract:
Expressive text-to-speech has shown improved performance in recent years. However, the style control of synthetic speech is often restricted to discrete emotion categories and requires training data recorded by the target speaker in the target style. In many practical situations, users may not have reference speech recorded in target emotion but still be interested in controlling speech style just…
▽ More
Expressive text-to-speech has shown improved performance in recent years. However, the style control of synthetic speech is often restricted to discrete emotion categories and requires training data recorded by the target speaker in the target style. In many practical situations, users may not have reference speech recorded in target emotion but still be interested in controlling speech style just by typing text description of desired emotional style. In this paper, we propose a text-based interface for emotional style control and cross-speaker style transfer in multi-speaker TTS. We propose the bi-modal style encoder which models the semantic relationship between text description embedding and speech style embedding with a pretrained language model. To further improve cross-speaker style transfer on disjoint, multi-style datasets, we propose the novel style loss. The experimental results show that our model can generate high-quality expressive speech even in unseen style.
△ Less
Submitted 13 July, 2022;
originally announced July 2022.