-
High-temperature superconductivity in hydrides: experimental evidence and details
Authors:
M. I. Eremets,
V. S. Minkov,
A. P. Drozdov,
P. P. Kong,
V. Ksenofontov,
S. I. Shylin,
S. L. Bud'ko,
R. Prozorov,
F. F. Balakirev,
Dan Sun,
S. Mozaffari,
L. Balicas
Abstract:
Since the discovery of superconductivity at 200 K in H3S [1] similar or higher transition temperatures, Tcs, have been reported for various hydrogen-rich compounds under ultra-high pressures [2]. Superconductivity was experimentally proved by different methods, including electrical resistance, magnetic susceptibility, optical infrared, and nuclear resonant scattering measurements. The crystal structures of superconducting phases were determined by X-ray diffraction. Numerous electrical transport measurements demonstrate the typical behaviour of a conventional phonon-mediated superconductor: zero resistance below Tc, the shift of Tc to lower temperatures under external magnetic fields, and pronounced isotope effect. Remarkably, the results are in good agreement with the theoretical predictions, which describe superconductivity in hydrides within the framework of the conventional BCS theory. However, despite this acknowledgment, experimental evidence for the superconducting state in these compounds has recently been treated with criticism [3, 4], which apparently stems from misunderstanding and misinterpretation of complicated experiments performed under very high pressures. Here, we describe in greater detail the experiments revealing high-temperature superconductivity in hydrides under high pressures. We show that the arguments against superconductivity [3, 4] can be either refuted or explained. The experiments on the high-temperature superconductivity in hydrides clearly contradict the theory of hole superconductivity [4] and eliminate it [3].
Submitted 13 January, 2022;
originally announced January 2022.
-
Deep Hash Distillation for Image Retrieval
Authors:
Young Kyun Jang,
Geonmo Gu,
Byungsoo Ko,
Isaac Kang,
Nam Ik Cho
Abstract:
In hash-based image retrieval systems, degraded or transformed inputs usually generate different codes from the original, deteriorating the retrieval accuracy. To mitigate this issue, data augmentation can be applied during training. However, even if augmented samples of an image are similar in real feature space, the quantization can scatter them far away in Hamming space. This results in representation discrepancies that can impede training and degrade performance. In this work, we propose a novel self-distilled hashing scheme to minimize the discrepancy while exploiting the potential of augmented data. By transferring the hash knowledge of the weakly-transformed samples to the strong ones, we make the hash code insensitive to various transformations. We also introduce hash proxy-based similarity learning and binary cross entropy-based quantization loss to provide fine quality hash codes. Ultimately, we construct a deep hashing framework that not only improves the existing deep hashing approaches, but also achieves the state-of-the-art retrieval results. Extensive experiments are conducted and confirm the effectiveness of our work.
Submitted 13 July, 2022; v1 submitted 16 December, 2021;
originally announced December 2021.
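The quantization issue described in the abstract above can be illustrated with a toy example: two feature vectors that are close in real space can end up far apart in Hamming space once they are binarized. This is a minimal sketch, not the authors' method; the vectors, the `binarize` helper, and the sign-threshold quantization rule are all assumptions made for illustration.

```python
import math


def binarize(features):
    """Quantize real-valued features to bits by sign (assumed rule, for illustration)."""
    return [1 if x >= 0 else 0 for x in features]


def hamming(code_a, code_b):
    """Count the positions where two binary codes differ."""
    return sum(a != b for a, b in zip(code_a, code_b))


def euclidean(vec_a, vec_b):
    """Euclidean distance between two real-valued vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(vec_a, vec_b)))


# Hypothetical features of an image and of a lightly augmented copy.
# Most components sit near zero, so small perturbations flip their sign.
original = [0.02, -0.01, 0.03, -0.02, 0.90, -0.80, 0.01, -0.03]
augmented = [-0.02, 0.01, -0.03, 0.02, 0.88, -0.79, -0.01, 0.03]

real_distance = euclidean(original, augmented)
code_distance = hamming(binarize(original), binarize(augmented))

print(f"Euclidean distance: {real_distance:.3f}")
print(f"Hamming distance:   {code_distance} of {len(original)} bits")
```

In this toy case the real-space distance is small while most bits differ, which is the kind of discrepancy a distillation or quantization loss would aim to reduce.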
-
3D Shape Variational Autoencoder Latent Disentanglement via Mini-Batch Feature Swapping for Bodies and Faces
Authors:
Simone Foti,
Bongjin Koo,
Danail Stoyanov,
Matthew J. Clarkson
Abstract:
Learning a disentangled, interpretable, and structured latent representation in 3D generative models of faces and bodies is still an open problem. The problem is particularly acute when control over identity features is required. In this paper, we propose an intuitive yet effective self-supervised approach to train a 3D shape variational autoencoder (VAE) which encourages a disentangled latent representation of identity features. Curating the mini-batch generation by swapping arbitrary features across different shapes allows to define a loss function leveraging known differences and similarities in the latent representations. Experimental results conducted on 3D meshes show that state-of-the-art methods for latent disentanglement are not able to disentangle identity features of faces and bodies. Our proposed method properly decouples the generation of such features while maintaining good representation and reconstruction capabilities.
Submitted 23 March, 2022; v1 submitted 24 November, 2021;
originally announced November 2021.
-
Beyond NDCG: behavioral testing of recommender systems with RecList
Authors:
Patrick John Chia,
Jacopo Tagliabue,
Federico Bianchi,
Chloe He,
Brian Ko
Abstract:
As with most Machine Learning systems, recommender systems are typically evaluated through performance metrics computed over held-out data points. However, real-world behavior is undoubtedly nuanced: ad hoc error analysis and deployment-specific tests must be employed to ensure the desired quality in actual deployments. In this paper, we propose RecList, a behavioral-based testing methodology. RecList organizes recommender systems by use case and introduces a general plug-and-play procedure to scale up behavioral testing. We demonstrate its capabilities by analyzing known algorithms and black-box commercial systems, and we release RecList as an open source, extensible package for the community.
Submitted 27 March, 2022; v1 submitted 18 November, 2021;
originally announced November 2021.
-
Development of an FPGA-based realtime DAQ system for axion haloscope experiments
Authors:
MyeongJae Lee,
ByeongRok Ko,
Saebyeok Ahn
Abstract:
A real-time Data Acquisition (DAQ) system for the CULTASK axion haloscope experiment was constructed and tested. The CULTASK is an experiment to search for cosmic axions using resonant cavities, to detect photons from axion conversion through the inverse Primakoff effect in a few GHz frequency range in a very high magnetic field and at an ultra low temperature. The constructed DAQ system utilizes a Field Programmable Gate Array (FPGA) for data processing and Fast Fourier Transformation. This design along with a custom Ethernet packet designed for real-time data transfer enables 100% DAQ efficiency, which is the key feature compared with a commercial spectrum analyzer. This DAQ system is optimally designed for RF signal detection in the axion experiment, with 100 Hz frequency resolution and 500 kHz analysis window. The noise level of the DAQ system averaged over 100,000 measurements is around -111.7 dBm. From a pseudo-data analysis, an improvement of the signal-to-noise ratio due to repeating and averaging the measurements using this real-time DAQ system was confirmed.
Submitted 6 November, 2021;
originally announced November 2021.
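The signal-to-noise improvement from repeated, averaged measurements mentioned in the abstract above can be sketched with a toy simulation. Only the 100 Hz resolution and 500 kHz window come from the abstract; the flat baseline, the Gaussian per-bin noise model, and the helper names are assumptions made for illustration and do not represent the authors' pseudo-data analysis.

```python
import random
import statistics

SPAN_HZ = 500_000       # analysis window quoted in the abstract
RESOLUTION_HZ = 100     # frequency resolution quoted in the abstract
N_BINS = SPAN_HZ // RESOLUTION_HZ


def noisy_spectrum(rng, n_bins, noise_sigma=1.0):
    """One simulated spectrum: flat baseline plus Gaussian noise (toy model)."""
    return [10.0 + rng.gauss(0.0, noise_sigma) for _ in range(n_bins)]


def averaged_spectrum(rng, n_bins, n_avg):
    """Average n_avg independent simulated spectra bin by bin."""
    acc = [0.0] * n_bins
    for _ in range(n_avg):
        for i, value in enumerate(noisy_spectrum(rng, n_bins)):
            acc[i] += value
    return [value / n_avg for value in acc]


rng = random.Random(0)
sigma_single = statistics.pstdev(averaged_spectrum(rng, N_BINS, 1))
sigma_avg = statistics.pstdev(averaged_spectrum(rng, N_BINS, 100))
improvement = sigma_single / sigma_avg

print(f"bins per spectrum: {N_BINS}")
print(f"noise reduction after 100 averages: {improvement:.1f}x")
```

Under this noise model the bin-to-bin fluctuation should shrink roughly as the square root of the number of averages, so 100 averages give close to a tenfold reduction.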
-
Fast DAQ system with image rejection for axion dark matter searches
Authors:
S. Ahn,
M. J. Lee,
A. K. Yi,
B. Yeo,
B. R. Ko,
Y. K. Semertzidis
Abstract:
A fast data acquisition (DAQ) system for axion dark matter searches utilizing a microwave resonant cavity, also known as axion haloscope searches, has been developed with a two-channel digitizer that can sample 16-bit amplitudes at rates up to 180 MSamples/s. First, we realized a practical DAQ efficiency of greater than 99% for a single DAQ channel, where the DAQ process includes the online fast Fourier transforms (FFTs). Using an IQ mixer and two parallel DAQ channels, we then also implemented a software-based image rejection without losing the DAQ efficiency. This work extends our continuing effort to improve the figure of merit in axion haloscope searches, the scanning rate.
Submitted 17 April, 2022; v1 submitted 16 September, 2021;
originally announced September 2021.
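One textbook way to reject the image band in software, given the I and Q outputs of a mixer on two DAQ channels, is to treat them as a single complex signal before the Fourier transform. The sketch below illustrates that idea only; it assumes an ideal, perfectly balanced IQ mixer and a noiseless tone, uses a naive DFT for self-containment, and is not taken from the paper's implementation.

```python
import cmath
import math


def dft(samples):
    """Naive discrete Fourier transform (O(n^2)); adequate for a small demo."""
    n = len(samples)
    return [
        sum(s * cmath.exp(-2j * math.pi * k * t / n) for t, s in enumerate(samples))
        for k in range(n)
    ]


N = 64                       # number of samples (hypothetical)
SIGNAL_BIN = 5               # tone placed exactly on a DFT bin
IMAGE_BIN = N - SIGNAL_BIN   # mirror bin where the image shows up

# Ideal I and Q channels for a tone above the local oscillator frequency.
i_channel = [math.cos(2 * math.pi * SIGNAL_BIN * t / N) for t in range(N)]
q_channel = [math.sin(2 * math.pi * SIGNAL_BIN * t / N) for t in range(N)]

# Single real channel: signal and image bins are indistinguishable.
real_only = [abs(x) for x in dft(i_channel)]

# Complex combination I + jQ: the image bin is suppressed.
iq = [abs(x) for x in dft([complex(i, q) for i, q in zip(i_channel, q_channel)])]

print(f"real only: signal={real_only[SIGNAL_BIN]:.1f}, image={real_only[IMAGE_BIN]:.1f}")
print(f"I + jQ:    signal={iq[SIGNAL_BIN]:.1f}, image={iq[IMAGE_BIN]:.1f}")
```

In practice the two channels are never perfectly matched, so a real system would also need amplitude and phase corrections before the combination; those are omitted here.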
-
Metasurface Holography over 90% Efficiency in the Visible via Nanoparticle-Embedded-Resin Printing
Authors:
Joohoon Kim,
Dong Kyo Oh,
Hongyoon Kim,
Gwanho Yoon,
Chunghwan Jung,
Jae Kyung Kim,
Trevon Badloe,
Seokwoo Kim,
Younghwan Yang,
Jihae Lee,
Byoungsu Ko,
Jong G. Ok,
Junsuk Rho
Abstract:
Metasurface holography, the reconstruction of holographic images by modulating the spatial amplitude and phase of light using metasurfaces, has emerged as a next-generation display technology. However, conventional fabrication techniques used to realize metaholograms are limited by their small patterning areas, high manufacturing costs, and low throughput, which hinder their practical use. Herein, we demonstrate a high efficiency hologram using a one-step nanomanufacturing method with a titanium dioxide nanoparticle-embedded-resin, allowing for high-throughput and low-cost fabrication. At a single wavelength, a record high 96.4% theoretical efficiency is demonstrated with an experimentally measured conversion efficiency of 90.6% and zero-order diffraction of 7.3% producing an ultrahigh-efficiency, twin-image free hologram, that can even be directly observed under ambient light conditions. Moreover, we design a broadband meta-atom with an average efficiency of 76.0% and experimentally demonstrate a metahologram with an average efficiency of 62.4% at visible wavelengths from 450 to 650 nm.
Submitted 2 September, 2021;
originally announced September 2021.
-
Performance of a Triple-GEM Demonstrator in $pp$ Collisions at the CMS Detector
Authors:
M. Abbas,
M. Abbrescia,
H. Abdalla,
A. Abdelalim,
S. AbuZeid,
A. Agapitos,
A. Ahmad,
A. Ahmed,
W. Ahmed,
C. Aimè,
C. Aruta,
I. Asghar,
P. Aspell,
C. Avila,
J. Babbar,
Y. Ban,
R. Band,
S. Bansal,
L. Benussi,
V. Bhatnagar,
M. Bianco,
S. Bianco,
K. Black,
L. Borgonovi,
O. Bouhali
, et al. (156 additional authors not shown)
Abstract:
After the Phase-2 high-luminosity upgrade to the Large Hadron Collider (LHC), the collision rate and therefore the background rate will significantly increase, particularly in the high $η$ region. To improve both the tracking and triggering of muons, the Compact Muon Solenoid (CMS) Collaboration plans to install triple-layer Gas Electron Multiplier (GEM) detectors in the CMS muon endcaps. Demonstrator GEM detectors were installed in CMS during 2017 to gain operational experience and perform a preliminary investigation of detector performance. We present the results of triple-GEM detector performance studies performed in situ during normal CMS and LHC operations in 2018. The distribution of cluster size and the efficiency to reconstruct high $p_T$ muons in proton--proton collisions are presented as well as the measurement of the environmental background rate to produce hits in the GEM detector.
Submitted 22 September, 2021; v1 submitted 20 July, 2021;
originally announced July 2021.
-
Heavily Augmented Sound Event Detection utilizing Weak Predictions
Authors:
Hyeonuk Nam,
Byeong-Yun Ko,
Gyeong-Tae Lee,
Seong-Hu Kim,
Won-Ho Jung,
Sang-Min Choi,
Yong-Hwa Park
Abstract:
The performances of Sound Event Detection (SED) systems are greatly limited by the difficulty in generating large strongly labeled datasets. In this work, we used two main approaches to overcome the lack of strongly labeled data. First, we applied heavy data augmentation on input features. Data augmentation methods used include not only conventional methods used in speech/audio domains but also our proposed method named FilterAugment. Second, we propose two methods to utilize weak predictions to enhance weakly supervised SED performance. As a result, we obtained the best PSDS1 of 0.4336 and best PSDS2 of 0.8161 on the DESED real validation dataset. This work is submitted to DCASE 2021 Task4 and is ranked in 3rd place. Code available: https://github.com/frednam93/FilterAugSED.
Submitted 14 September, 2021; v1 submitted 8 July, 2021;
originally announced July 2021.
-
Modeling the triple-GEM detector response to background particles for the CMS Experiment
Authors:
M. Abbas,
M. Abbrescia,
H. Abdalla,
A. Abdelalim,
S. AbuZeid,
A. Agapitos,
A. Ahmad,
A. Ahmed,
W. Ahmed,
C. Aimè,
C. Aruta,
I. Asghar,
P. Aspell,
C. Avila,
I. Azhgirey,
J. Babbar,
Y. Ban,
R. Band,
S. Bansal,
L. Benussi,
V. Bhatnagar,
M. Bianco,
S. Bianco,
K. Black,
L. Borgonovi
, et al. (164 additional authors not shown)
Abstract:
An estimate of environmental background hit rate on triple-GEM chambers is performed using Monte Carlo (MC) simulation and compared to data taken by test chambers installed in the CMS experiment (GE1/1) during Run-2 at the Large Hadron Collider (LHC). The hit rate is measured using data collected with proton-proton collisions at 13 TeV and a luminosity of 1.5$\times10^{34}$ cm$^{-2}$ s$^{-1}$. The simulation framework uses a combination of the FLUKA and Geant4 packages to obtain the hit rate. FLUKA provides the radiation environment around the GE1/1 chambers, which is comprised of the particle flux with momentum direction and energy spectra ranging from $10^{-11}$ to $10^{4}$ MeV for neutrons, $10^{-3}$ to $10^{4}$ MeV for $γ$'s, $10^{-2}$ to $10^{4}$ MeV for $e^{\pm}$, and $10^{-1}$ to $10^{4}$ MeV for charged hadrons. Geant4 provides an estimate of detector response (sensitivity) based on an accurate description of detector geometry, material composition and interaction of particles with the various detector layers. The MC simulated hit rate is estimated as a function of the perpendicular distance from the beam line and agrees with data within the assigned uncertainties of 10-14.5%. This simulation framework can be used to obtain a reliable estimate of background rates expected at the High Luminosity LHC.
Submitted 8 July, 2021;
originally announced July 2021.
-
MIMO Operations in Molecular Communications: Theory, Prototypes, and Open Challenges
Authors:
Bon-Hong Koo,
Changmin Lee,
Ali E. Pusane,
Tuna Tugcu,
Chan-Byoung Chae
Abstract:
The Internet of Bio-nano Things is a significant development for next generation communication technologies. Because conventional wireless communication technologies face challenges in realizing new applications (e.g., in-body area networks for health monitoring) and necessitate the substitution of information carriers, researchers have shifted their interest to molecular communications (MC). Although remarkable progress has been made in this field over the last decade, advances have been far from acceptable for the achievement of its application objectives. A crucial problem of MC is the low data rate and high error rate inherent in particle dynamics specifications, in contrast to wave-based conventional communications. Therefore, it is important to investigate the resources by which MC can obtain additional information paths and provide strategies to exploit these resources. This study aims to examine techniques involving resource aggregation and exploitation to provide prospective directions for future progress in MC. In particular, we focus on state-of-the-art studies on multiple-input multiple-output (MIMO) systems. We discuss the possible advantages of applying MIMO to various MC system models. Furthermore, we survey various studies that aimed to achieve MIMO gains for the respective models, from theoretical background to prototypes. Finally, we conclude this study by summarizing the challenges that need to be addressed.
Submitted 5 July, 2021;
originally announced July 2021.
-
MultiCite: Modeling realistic citations requires moving beyond the single-sentence single-label setting
Authors:
Anne Lauscher,
Brandon Ko,
Bailey Kuehl,
Sophie Johnson,
David Jurgens,
Arman Cohan,
Kyle Lo
Abstract:
Citation context analysis (CCA) is an important task in natural language processing that studies how and why scholars discuss each others' work. Despite decades of study, traditional frameworks for CCA have largely relied on overly-simplistic assumptions of how authors cite, which ignore several important phenomena. For instance, scholarly papers often contain rich discussions of cited work that span multiple sentences and express multiple intents concurrently. Yet, CCA is typically approached as a single-sentence, single-label classification task, and thus existing datasets fail to capture this interesting discourse. In our work, we address this research gap by proposing a novel framework for CCA as a document-level context extraction and labeling task. We release MultiCite, a new dataset of 12,653 citation contexts from over 1,200 computational linguistics papers. Not only is it the largest collection of expert-annotated citation contexts to-date, MultiCite contains multi-sentence, multi-label citation contexts within full paper texts. Finally, we demonstrate how our dataset, while still usable for training classic CCA models, also supports the development of new types of models for CCA beyond fixed-width text classification. We release our code and dataset at https://github.com/allenai/multicite.
Submitted 31 July, 2021; v1 submitted 1 July, 2021;
originally announced July 2021.
-
Towards Light-weight and Real-time Line Segment Detection
Authors:
Geonmo Gu,
Byungsoo Ko,
SeoungHyun Go,
Sung-Hyun Lee,
Jingeun Lee,
Minchul Shin
Abstract:
Previous deep learning-based line segment detection (LSD) suffers from the immense model size and high computational cost for line prediction. This constrains them from real-time inference on computationally restricted environments. In this paper, we propose a real-time and light-weight line segment detector for resource-constrained environments named Mobile LSD (M-LSD). We design an extremely efficient LSD architecture by minimizing the backbone network and removing the typical multi-module process for line prediction found in previous methods. To maintain competitive performance with a light-weight network, we present novel training schemes: Segments of Line segment (SoL) augmentation, matching and geometric loss. SoL augmentation splits a line segment into multiple subparts, which are used to provide auxiliary line data during the training process. Moreover, the matching and geometric loss allow a model to capture additional geometric cues. Compared with TP-LSD-Lite, previously the best real-time LSD method, our model (M-LSD-tiny) achieves competitive performance with 2.5% of model size and an increase of 130.5% in inference speed on GPU. Furthermore, our model runs at 56.8 FPS and 48.6 FPS on the latest Android and iPhone mobile devices, respectively. To the best of our knowledge, this is the first real-time deep LSD available on mobile devices. Our code is available.
Submitted 26 April, 2022; v1 submitted 31 May, 2021;
originally announced June 2021.
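The Segments of Line segment (SoL) augmentation described in the abstract above splits a line segment into multiple subparts that serve as auxiliary training data. A minimal sketch of that splitting step follows; the equal-length, non-overlapping split and the `split_segment` helper are assumptions for illustration, and the paper's actual scheme may differ.

```python
def split_segment(start, end, parts):
    """Split the segment start -> end into `parts` equal sub-segments.

    Points are (x, y) tuples. Equal, non-overlapping parts are an
    assumption of this sketch, not a detail confirmed by the abstract.
    """
    if parts < 1:
        raise ValueError("parts must be at least 1")
    (x0, y0), (x1, y1) = start, end
    points = [
        (x0 + (x1 - x0) * k / parts, y0 + (y1 - y0) * k / parts)
        for k in range(parts + 1)
    ]
    return list(zip(points[:-1], points[1:]))


# Hypothetical ground-truth segment from (0, 0) to (4, 2), split in two.
subparts = split_segment((0.0, 0.0), (4.0, 2.0), 2)
for sub_start, sub_end in subparts:
    print(sub_start, "->", sub_end)
```

Each sub-segment lies on the original line, so it can be added to the training targets alongside the full segment without extra annotation.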
-
Shocked Molecular Hydrogen and Broad CO lines from the Interacting Supernova Remnant HB 3
Authors:
J. Rho,
T. H. Jarrett,
L. N. Tram,
W. Lim,
W. T. Reach,
J. Bieging,
H. -G. Lee,
B. -C. Koo,
B. Whitney
Abstract:
We present the detections of shocked molecular hydrogen (H2) gas in near- and mid-infrared and broad CO in millimeter from the mixed-morphology supernova remnant (SNR) HB 3 (G132.7+1.3) using Palomar WIRC, the Spitzer GLIMPSE360 and WISE surveys, and HHSMT. Our near-infrared narrow-band filter H2 2.12 micron images of HB 3 show that both Spitzer IRAC and WISE 4.6 micron emission originates from shocked H2 gas. The morphology of H2 exhibits thin filamentary structures and a large scale of interaction sites between the HB 3 and nearby molecular clouds. Half of HB 3, the southern and eastern shell of the SNR, emits H2 in a shape of a "butterfly" or "W", indicating the interaction sites between the SNR and dense molecular clouds. Interestingly, the H2 emitting region in the southeast is also co-spatial to the interacting area between HB 3 and the H II regions of the W3 complex, where we identified star-forming activity.
We further explore the interaction between HB 3 and dense molecular clouds with detections of broad CO(3-2) and CO(2-1) molecular lines from the southern and southeastern shells along the H2 emitting region. The widths of the broad lines are 8-20 km/s; the detection of such broad lines is unambiguous, dynamic evidence of the interactions between the SNR and clouds. The CO broad lines are from two branches of the bright, southern H2 shell. We apply the Paris-Durham shock model to the CO line profiles, which infer the shock velocities of 20 - 40 km/s, relatively low densities of 10^{3-4} cm^{-3} and strong (>200 micro Gauss) magnetic fields.
Submitted 10 August, 2021; v1 submitted 21 May, 2021;
originally announced May 2021.
-
Loss-Based Variational Bayes Prediction
Authors:
David T. Frazier,
Ruben Loaiza-Maya,
Gael M. Martin,
Bonsoo Koo
Abstract:
We propose a new approach to Bayesian prediction that caters for models with a large number of parameters and is robust to model misspecification. Given a class of high-dimensional (but parametric) predictive models, this new approach constructs a posterior predictive using a variational approximation to a generalized posterior that is directly focused on predictive accuracy. The theoretical behavior of the new prediction approach is analyzed and a form of optimality demonstrated. Applications to both simulated and empirical data using high-dimensional Bayesian neural network and autoregressive mixture models demonstrate that the approach provides more accurate results than various alternatives, including misspecified likelihood-based predictions.
Submitted 12 May, 2022; v1 submitted 28 April, 2021;
originally announced April 2021.
-
RTIC: Residual Learning for Text and Image Composition using Graph Convolutional Network
Authors:
Minchul Shin,
Yoonjae Cho,
Byungsoo Ko,
Geonmo Gu
Abstract:
In this paper, we study the compositional learning of images and texts for image retrieval. The query is given in the form of an image and text that describes the desired modifications to the image; the goal is to retrieve the target image that satisfies the given modifications and resembles the query by composing information in both the text and image modalities. To remedy this, we propose a novel architecture designed for the image-text composition task and show that the proposed structure can effectively encode the differences between the source and target images conditioned on the text. Furthermore, we introduce a new joint training technique based on the graph convolutional network that is generally applicable for any existing composition methods in a plug-and-play manner. We found that the proposed technique consistently improves performance and achieves state-of-the-art scores on various benchmarks. To avoid misleading experimental results caused by trivial training hyper-parameters, we reproduce all individual baselines and train models with a unified training environment. We expect this approach to suppress undesirable effects from irrelevant components and emphasize the image-text composition module's ability. Also, we achieve the state-of-the-art score without restricting the training environment, which implies the superiority of our method considering the gains from hyper-parameter tuning. The code, including all the baseline methods, is released at https://github.com/nashory/rtic-gcn-pytorch.
Submitted 25 October, 2021; v1 submitted 7 April, 2021;
originally announced April 2021.
-
Learning with Memory-based Virtual Classes for Deep Metric Learning
Authors:
Byungsoo Ko,
Geonmo Gu,
Han-Gyu Kim
Abstract:
The core of deep metric learning (DML) involves learning visual similarities in high-dimensional embedding space. One of the main challenges is to generalize from seen classes of training data to unseen classes of test data. Recent works have focused on exploiting past embeddings to increase the number of instances for the seen classes. Such methods achieve performance improvement via augmentation, while the strong focus on seen classes still remains. This can be undesirable for DML, where training and test data exhibit entirely different classes. In this work, we present a novel training strategy for DML called MemVir. Unlike previous works, MemVir memorizes both embedding features and class weights to utilize them as additional virtual classes. The exploitation of virtual classes not only utilizes augmented information for training but also alleviates a strong focus on seen classes for better generalization. Moreover, we embed the idea of curriculum learning by slowly adding virtual classes for a gradual increase in learning difficulty, which improves the learning stability as well as the final performance. MemVir can be easily applied to many existing loss functions without any modification. Extensive experimental results on famous benchmarks demonstrate the superiority of MemVir over state-of-the-art competitors. Code of MemVir is publicly available.
Submitted 8 October, 2021; v1 submitted 31 March, 2021;
originally announced March 2021.
-
Proxy Synthesis: Learning with Synthetic Classes for Deep Metric Learning
Authors:
Geonmo Gu,
Byungsoo Ko,
Han-Gyu Kim
Abstract:
One of the main purposes of deep metric learning is to construct an embedding space that has well-generalized embeddings on both seen (training) classes and unseen (test) classes. Most existing works have tried to achieve this using different types of metric objectives and hard sample mining strategies with given training data. However, learning with only the training data can be overfitted to the seen classes, leading to the lack of generalization capability on unseen classes. To address this problem, we propose a simple regularizer called Proxy Synthesis that exploits synthetic classes for stronger generalization in deep metric learning. The proposed method generates synthetic embeddings and proxies that work as synthetic classes, and they mimic unseen classes when computing proxy-based losses. Proxy Synthesis derives an embedding space considering class relations and smooth decision boundaries for robustness on unseen classes. Our method is applicable to any proxy-based losses, including softmax and its variants. Extensive experiments on four famous benchmarks in image retrieval tasks demonstrate that Proxy Synthesis significantly boosts the performance of proxy-based losses and achieves state-of-the-art performance.
Submitted 29 March, 2021;
originally announced March 2021.
-
Efficient Scheduling of Data Augmentation for Deep Reinforcement Learning
Authors:
Byungchan Ko,
Jungseul Ok
Abstract:
In deep reinforcement learning (RL), data augmentation is widely considered as a tool to induce a set of useful priors about semantic consistency and improve sample efficiency and generalization performance. However, even when the prior is useful for generalization, distilling it into the RL agent often interferes with RL training and degrades sample efficiency. Meanwhile, the agent is forgetful of the prior due to the non-stationary nature of RL. These observations suggest two extreme schedules of distillation: (i) over the entire training; or (ii) only at the end. Hence, we devise a stand-alone network distillation method to inject the consistency prior at any time (even after RL), and a simple yet efficient framework to automatically schedule the distillation. Specifically, the proposed framework first focuses on mastering train environments regardless of generalization by adaptively deciding which augmentation, if any, to use for the training. After this, we add the distillation to extract the remaining benefits for generalization from all the augmentations, which requires no additional new samples. In our experiments, we demonstrate the utility of the proposed framework, in particular of postponing the augmentation to the end of RL training.
Submitted 18 October, 2022; v1 submitted 17 February, 2021;
originally announced February 2021.
-
Isogeometric Configuration Design Optimization of Three-dimensional Curved Beam Structures for Maximal Fundamental Frequency
Authors:
Myung-Jin Choi,
Jae-Hyun Kim,
Bonyong Koo,
Seonho Cho
Abstract:
This paper presents a configuration design optimization method for three-dimensional curved beam built-up structures having maximized fundamental eigenfrequency. We develop the method of computation of design velocity field and optimal design of beam structures constrained on a curved surface, where both designs of the embedded beams and the curved surface are simultaneously varied during the optimal design process. A shear-deformable beam model is used in the response analyses of structural vibrations within an isogeometric framework using the NURBS basis functions. An analytical design sensitivity expression of repeated eigenvalues is derived. The developed method is demonstrated through several illustrative examples.
Submitted 23 January, 2021;
originally announced January 2021.
-
Quasi-one-dimensional magnetism in the spin-$\frac12$ antiferromagnet BaNa$_{2}$Cu(VO$_{4}$)$_{2}$
Authors:
Sebin J. Sebastian,
K. Somesh,
M. Nandi,
N. Ahmed,
P. Bag,
M. Baenitz,
B. Koo,
J. Sichelschmidt,
A. A. Tsirlin,
Y. Furukawa,
R. Nath
Abstract:
We report the synthesis and magnetic properties of the quasi-one-dimensional spin-$\frac{1}{2}$ Heisenberg antiferromagnetic chain compound BaNa$_2$Cu(VO$_4$)$_2$. This orthovanadate has a centrosymmetric crystal structure, $C2/c$, where the magnetic Cu$^{2+}$ ions form spin chains. These chains are arranged in layers, with the chain direction changing by 62$^\circ$ between two successive layers. Alternatively, the spin lattice can be viewed as anisotropic triangular layers upon taking the inter-chain interactions into consideration. Despite this potential structural complexity, temperature-dependent magnetic susceptibility, heat capacity, ESR intensity, and NMR shift agree well with the uniform spin-$1/2$ Heisenberg chain model with an intrachain coupling of $J/k_{\rm B} \simeq 5.6$ K. The saturation field obtained from the magnetic isotherm measurement consistently reproduces the value of $J/k_{\rm B}$. Further, the $^{51}$V NMR spin-lattice relaxation rate mimics the 1D character in the intermediate temperature range, whereas magnetic long-range order sets in below $T_{\rm N} \simeq 0.25$ K. The effective interchain coupling is estimated to be $J_{\perp}/k_{\rm B} \simeq 0.1$ K. The theoretical estimation of exchange couplings using band-structure calculations corroborates our experimental findings and unambiguously establishes the 1D character of the compound. Finally, the spin lattice of BaNa$_2$Cu(VO$_4$)$_2$ is compared with the chemically similar but not isostructural compound BaAg$_2$Cu(VO$_4$)$_2$.
Submitted 13 January, 2021;
originally announced January 2021.
-
Gapless quantum spin liquid in the triangular system Sr$_{3}$CuSb$_{2}$O$_{9}$
Authors:
S. Kundu,
Aga Shahee,
Atasi Chakraborty,
K. M. Ranjith,
B. Koo,
Jörg Sichelschmidt,
Mark T. F. Telling,
P. K. Biswas,
M. Baenitz,
I. Dasgupta,
Sumiran Pujari,
A. V. Mahajan
Abstract:
We report gapless quantum spin liquid behavior in the layered triangular Sr$_{3}$CuSb$_{2}$O$_{9}$ (SCSO) system. X-ray diffraction shows superlattice reflections associated with atomic site ordering into triangular Cu planes well-separated by Sb planes. Muon spin relaxation ($μ$SR) measurements show that the $S = \frac{1}{2}$ moments at the magnetically active Cu sites remain dynamic down to 65 mK in spite of a large antiferromagnetic exchange scale evidenced by a large Curie-Weiss temperature $θ_{\mathrm{cw}} \simeq -143$ K as extracted from the bulk susceptibility. Specific heat measurements also show no sign of long-range order down to 0.35 K. The magnetic specific heat ($C_{\mathrm{m}}$) below 5 K reveals a $C_{\mathrm{m}} = γT + αT^{2}$ behavior. The significant $T^{2}$ contribution to the magnetic specific heat invites a phenomenology in terms of the so-called Dirac spinon excitations with a linear dispersion. From the low-$T$ specific heat data, we estimate the dominant exchange scale to be $\sim 36$ K using a Dirac spin liquid ansatz which is not far from the values inferred from microscopic density functional theory calculations ($\sim 45$ K) as well as high-temperature susceptibility analysis ($\sim 70$ K). The linear specific heat coefficient is about 18 mJ/mol-K$^2$ which is somewhat larger than for typical Fermi liquids.
Submitted 28 December, 2020; v1 submitted 2 December, 2020;
originally announced December 2020.
-
A Secure Deep Probabilistic Dynamic Thermal Line Rating Prediction
Authors:
N. Safari,
S. M. Mazhari,
C. Y. Chung,
S. B. Ko
Abstract:
Accurate short-term prediction of overhead line (OHL) transmission ampacity can directly affect the efficiency of power system operation and planning. Any overestimation of the dynamic thermal line rating (DTLR) can lead to lifetime degradation and failure of OHLs, safety hazards, etc. This paper presents a secure yet sharp probabilistic prediction model for the hour-ahead forecasting of the DTLR. The security of the proposed DTLR limits the frequency of DTLR prediction exceeding the actual DTLR. The model is based on an augmented deep learning architecture that makes use of a wide range of predictors, including historical climatology data and latent variables obtained during DTLR calculation. Furthermore, by introducing a customized cost function, the deep neural network is trained to consider the DTLR security based on the required probability of exceedance while minimizing deviations of the predicted DTLRs from the actual values. The proposed probabilistic DTLR is developed and verified using recorded experimental data. The simulation results validate the superiority of the proposed DTLR compared to state-of-the-art prediction models using well-known evaluation metrics.
Submitted 21 November, 2020;
originally announced November 2020.
-
High-resolution Near-infrared Spectroscopic Study of Galactic Supernova Remnants. I. Kinematic Distances
Authors:
Yong-Hyun Lee,
Bon-Chul Koo,
Jae-Joon Lee
Abstract:
We have carried out high-resolution near-infrared spectroscopic observations toward 16 Galactic supernova remnants (SNRs) showing strong H$_{2}$ emission features. A dozen bright H$_{2}$ emission lines are clearly detected for individual SNRs, and we have measured their central velocities, line widths, and fluxes. For all SNRs except one (G9.9$-$0.8), the H$_{2}$ line ratios are well consistent with that of thermal excitation at $T\sim2000$ K, indicating that the H$_{2}$ emission lines are most likely from shock-excited gas and therefore that they are physically associated with the remnants. The kinematic distances to the 15 SNRs are derived from the central velocities of the H$_{2}$ lines using a Galactic rotation model. We derive for the first time the kinematic distances to four SNRs: G13.5$+$0.2, G16.0$-$0.5, G32.1$-$0.9, and G33.2$-$0.6. Among the remaining 11 SNRs, the central velocities of the H$_{2}$ emission lines for six SNRs are well consistent ($\pm5$ km s$^{-1}$) with those obtained in previous radio observations, while for the other five SNRs (G18.1$-$0.1, G18.9$-$1.1, Kes 69, 3C 396, W49B) they are significantly different. We discuss the velocity discrepancies in these five SNRs. In G9.9$-$0.8, the H$_{2}$ emission shows nonthermal line ratios and narrow line width ($\sim 4$ km s$^{-1}$), and we discuss its origin.
Submitted 15 November, 2020;
originally announced November 2020.
-
Radiative Supernova Remnants and Supernova Feedback
Authors:
Bon-Chul Koo,
Chang-Goo Kim,
Sangwook Park,
Eve C. Ostriker
Abstract:
Supernova (SN) explosions are a major feedback mechanism regulating star formation in galaxies through their momentum input. We review the observations of SNRs in radiative stages in the Milky Way to validate the theoretical results on the momentum/energy injection from a single SN explosion. For seven SNRs where we can observe fast-expanding, atomic radiative shells, we show that the shell momentum inferred from HI 21 cm line observations is in the range of (0.5--4.5)$\times 10^5$ $M_\odot$ km s$^{-1}$. In two SNRs (W44 and IC 443), shocked molecular gas with momentum comparable to that of the atomic SNR shells has been also observed. We compare the momentum and kinetic/thermal energy of these seven SNRs with the results from 1D and 3D numerical simulations. The observation-based momentum and kinetic energy agree well with the expected momentum/energy input from an SN explosion of $\sim 10^{51}$ erg. It is much more difficult to use data/model comparisons of thermal energy to constrain the initial explosion energy, however, due to rapid cooling and complex physics at the hot/cool interface in radiative SNRs. We discuss the observational and theoretical uncertainties of these global parameters and explosion energy estimates for SNRs in complex environments.
Submitted 12 November, 2020;
originally announced November 2020.
-
Auxiliary Sequence Labeling Tasks for Disfluency Detection
Authors:
Dongyub Lee,
Byeongil Ko,
Myeong Cheol Shin,
Taesun Whang,
Daniel Lee,
Eun Hwa Kim,
EungGyun Kim,
Jaechoon Jo
Abstract:
Detecting disfluencies in spontaneous speech is an important preprocessing step in natural language processing and speech recognition applications. Existing works for disfluency detection have focused on designing a single objective only for disfluency detection, while auxiliary objectives utilizing linguistic information of a word such as named entity or part-of-speech information can be effective. In this paper, we focus on detecting disfluencies in spoken transcripts and propose a method utilizing named entity recognition (NER) and part-of-speech (POS) tagging as auxiliary sequence labeling (SL) tasks for disfluency detection. First, we investigate cases in which utilizing linguistic information of a word can prevent mispredicting important words and can be helpful for the correct detection of disfluencies. Second, we show that training a disfluency detection model with auxiliary SL tasks can improve its F-score in disfluency detection. Then, we analyze which auxiliary SL tasks are influential depending on baseline models. Experimental results on the widely used English Switchboard dataset show that our method outperforms the previous state-of-the-art in disfluency detection.
Submitted 5 April, 2021; v1 submitted 23 October, 2020;
originally announced November 2020.
-
Intraoperative Liver Surface Completion with Graph Convolutional VAE
Authors:
Simone Foti,
Bongjin Koo,
Thomas Dowrick,
Joao Ramalhinho,
Moustafa Allam,
Brian Davidson,
Danail Stoyanov,
Matthew J. Clarkson
Abstract:
In this work we propose a method based on geometric deep learning to predict the complete surface of the liver, given a partial point cloud of the organ obtained during the surgical laparoscopic procedure. We introduce a new data augmentation technique that randomly perturbs shapes in their frequency domain to compensate for the limited size of our dataset. The core of our method is a variational autoencoder (VAE) that is trained to learn a latent space for complete shapes of the liver. At inference time, the generative part of the model is embedded in an optimisation procedure where the latent representation is iteratively updated to generate a model that matches the intraoperative partial point cloud. The effect of this optimisation is a progressive non-rigid deformation of the initially generated shape. Our method is qualitatively evaluated on real data and quantitatively evaluated on synthetic data. We compared it with a state-of-the-art rigid registration algorithm, which our method outperformed in visible areas.
Submitted 12 July, 2021; v1 submitted 8 September, 2020;
originally announced September 2020.
-
CAPP-8TB: Axion Dark Matter Search Experiment around 6.7 $μ$eV
Authors:
J. Choi,
S. Ahn,
B. R. Ko,
S. Lee,
Y. K. Semertzidis
Abstract:
CAPP-8TB is an axion dark matter search experiment dedicated to an axion mass search near 6.7 $μ$eV. The experiment uses a microwave resonant cavity under a strong magnetic field of 8 T produced by a superconducting solenoid magnet in a dilution refrigerator. We describe the experimental configuration used to search for a mass range of 6.62 to 6.82 $μ$eV in the first phase of the experiment. We also discuss the next phase of the experiment and its prospects.
Submitted 14 July, 2020;
originally announced July 2020.
-
An Effective Pipeline for a Real-world Clothes Retrieval System
Authors:
Yang-Ho Ji,
HeeJae Jun,
Insik Kim,
Jongtack Kim,
Youngjoon Kim,
Byungsoo Ko,
Hyong-Keun Kook,
Jingeun Lee,
Sangwon Lee,
Sanghyuk Park
Abstract:
In this paper, we propose an effective pipeline for a clothes retrieval system that is robust on large-scale real-world fashion data. Our proposed method consists of three components: detection, retrieval, and post-processing. We first conduct a detection task for precise retrieval on target clothes, then retrieve the corresponding items with the metric learning-based model. To improve the retrieval robustness against noise and misleading bounding boxes, we apply post-processing methods such as weighted boxes fusion and feature concatenation. With the proposed methodology, we achieved 2nd place in the DeepFashion2 Clothes Retrieval 2020 challenge.
Submitted 26 May, 2020;
originally announced May 2020.
-
Reference and Document Aware Semantic Evaluation Methods for Korean Language Summarization
Authors:
Dongyub Lee,
Myeongcheol Shin,
Taesun Whang,
Seungwoo Cho,
Byeongil Ko,
Daniel Lee,
Eunggyun Kim,
Jaechoon Jo
Abstract:
Text summarization refers to the process that generates a shorter form of text from the source document preserving salient information. Many existing works for text summarization are generally evaluated by using recall-oriented understudy for gisting evaluation (ROUGE) scores. However, as ROUGE scores are computed based on n-gram overlap, they do not reflect semantic meaning correspondences between generated and reference summaries. Because Korean is an agglutinative language that combines various morphemes into a word that express several meanings, ROUGE is not suitable for Korean summarization. In this paper, we propose evaluation metrics that reflect semantic meanings of a reference summary and the original document, Reference and Document Aware Semantic Score (RDASS). We then propose a method for improving the correlation of the metrics with human judgment. Evaluation results show that the correlation with human judgment is significantly higher for our evaluation metrics than for ROUGE scores.
Submitted 1 November, 2020; v1 submitted 29 April, 2020;
originally announced May 2020.
-
Unbiased Spectroscopic Study of the Cygnus Loop with LAMOST. I. Optical Properties of Emission Lines and the Global Spectrum
Authors:
Ji Yeon Seok,
Bon-Chul Koo,
Gang Zhao,
John C. Raymond
Abstract:
We present an unbiased spectroscopic study of the Galactic supernova remnant (SNR) Cygnus Loop using the Large Sky Area Multi-object Fiber Spectroscopic Telescope (LAMOST) DR5. LAMOST features both a large field of view and a large aperture, which allow us to simultaneously obtain 4000 spectra at $\sim$3700-9000 Å with R$\approx$1800. The Cygnus Loop is a prototype of middle-aged SNRs, which has the advantages of being bright, large in angular size, and relatively unobscured by dust. Along the line of sight to the Cygnus Loop, 2747 LAMOST DR5 spectra are found in total, which are spatially distributed over the entire remnant. This spectral sample is free of the selection bias of most previous studies, which often focus on bright filaments or regions bright in [O III]. Visual inspection verifies that 368 spectra (13$\%$ of the total) show clear spectral features that confirm their association with the remnant. In addition, 176 spectra with line emission are of ambiguous origin but have a possible association with the SNR. In particular, the 154 spectra dominated by the SNR emission are further analyzed by identifying emission lines and measuring their intensities. We examine distributions of physical properties such as electron density and temperature, which vary significantly inside the remnant, using theoretical models. By combining a large number of the LAMOST spectra, a global spectrum representing the Cygnus Loop is constructed, which presents characteristics of radiative shocks. Finally, we discuss the effect of the unbiased spectral sample on the global spectrum and its implications for understanding a spatially unresolved SNR in a distant galaxy.
Submitted 19 April, 2020;
originally announced April 2020.
-
Improved axion haloscope search analysis
Authors:
S. Ahn,
S. Lee,
J. Choi,
B. R. Ko,
Y. K. Semertzidis
Abstract:
One of the most significant and practical figures of merit in axion haloscope searches is the scanning rate, because of the unknown axion mass. Under the best experimental parameters, the only way to improve the figure of merit is to increase the experimentally designed signal to noise ratio in the axion haloscope search analysis procedure. In this paper, we report an improved axion haloscope search analysis using the data taken by the CAPP-8TB haloscope. By correcting for the background biased by the background parametrizations in the presence of axion signals, we realized a signal to noise ratio efficiency of about 100\%. Given the axion haloscope search analyses to date, the scanning rate can be improved by 21\%, with about a 10\% improvement in the signal to noise ratio. This improvement is another low cost innovation in axion haloscope searches, where all the experimental parameters are currently at their best.
Submitted 7 April, 2021; v1 submitted 16 April, 2020;
originally announced April 2020.
-
Embedding Expansion: Augmentation in Embedding Space for Deep Metric Learning
Authors:
Byungsoo Ko,
Geonmo Gu
Abstract:
Learning the distance metric between pairs of samples has been studied for image retrieval and clustering. With the remarkable success of pair-based metric learning losses, recent works have proposed the use of generated synthetic points on metric learning losses for augmentation and generalization. However, these methods require additional generative networks along with the main network, which can lead to a larger model size, slower training speed, and harder optimization. Meanwhile, post-processing techniques, such as query expansion and database augmentation, have proposed the combination of feature points to obtain additional semantic information. In this paper, inspired by query expansion and database augmentation, we propose an augmentation method in an embedding space for pair-based metric learning losses, called embedding expansion. The proposed method generates synthetic points containing augmented information by a combination of feature points and performs hard negative pair mining to learn with the most informative feature representations. Because of its simplicity and flexibility, it can be used for existing metric learning losses without affecting model size, training speed, or optimization difficulty. Finally, the combination of embedding expansion and representative metric learning losses outperforms the state-of-the-art losses and previous sample generation methods in both image retrieval and clustering tasks. The implementation is publicly available.
Submitted 23 April, 2020; v1 submitted 5 March, 2020;
originally announced March 2020.
-
Many-to-Many Voice Conversion using Conditional Cycle-Consistent Adversarial Networks
Authors:
Shindong Lee,
BongGu Ko,
Keonnyeong Lee,
In-Chul Yoo,
Dongsuk Yook
Abstract:
Voice conversion (VC) refers to transforming the speaker characteristics of an utterance without altering its linguistic contents. Many works on voice conversion require parallel training data, which is highly expensive to acquire. Recently, the cycle-consistent adversarial network (CycleGAN), which does not require parallel training data, has been applied to voice conversion, showing state-of-the-art performance. The CycleGAN-based voice conversion, however, can be used only for a pair of speakers, i.e., one-to-one voice conversion between two speakers. In this paper, we extend the CycleGAN by conditioning the network on speakers. As a result, the proposed method can perform many-to-many voice conversion among multiple speakers using a single generative adversarial network (GAN). Compared to building multiple CycleGANs for each pair of speakers, the proposed method reduces the computational and spatial cost significantly without compromising the sound quality of the converted voice. Experimental results using the VCC2018 corpus confirm the efficiency of the proposed method.
Submitted 15 February, 2020;
originally announced February 2020.
-
Symmetrical Synthesis for Deep Metric Learning
Authors:
Geonmo Gu,
Byungsoo Ko
Abstract:
Deep metric learning aims to learn embeddings that contain semantic similarity information among data points. To learn better embeddings, methods to generate synthetic hard samples have been proposed. Existing methods of synthetic hard sample generation adopt autoencoders or generative adversarial networks, but this leads to more hyper-parameters, harder optimization, and slower training speed. In this paper, we address these problems by proposing a novel method of synthetic hard sample generation called symmetrical synthesis. Given two original feature points from the same class, the proposed method first generates synthetic points with each other as an axis of symmetry. Second, it performs hard negative pair mining within the original and synthetic points to select a more informative negative pair for computing the metric learning loss. Our proposed method is hyper-parameter free and plug-and-play for existing metric learning losses without network modification. We demonstrate the superiority of our proposed method over existing methods for a variety of loss functions on clustering and image retrieval tasks. Our implementation is publicly available.
Submitted 23 April, 2020; v1 submitted 30 January, 2020;
originally announced January 2020.
-
Overcoming Noisy and Irrelevant Data in Federated Learning
Authors:
Tiffany Tuor,
Shiqiang Wang,
Bong Jun Ko,
Changchang Liu,
Kin K. Leung
Abstract:
Many image and vision applications require a large amount of data for model training. Collecting all such data at a central location can be challenging due to data privacy and communication bandwidth restrictions. Federated learning is an effective way of training a machine learning model in a distributed manner from local data collected by client devices, which does not require exchanging the raw data among clients. A challenge is that among the large variety of data collected at each client, it is likely that only a subset is relevant for a learning task while the rest of the data has a negative impact on model training. Therefore, before starting the learning process, it is important to select the subset of data that is relevant to the given federated learning task. In this paper, we propose a method for selecting relevant data in a distributed manner, where we use a benchmark model trained on a small, task-specific benchmark dataset to evaluate the relevance of individual data samples at each client and select the data with sufficiently high relevance. Then, each client uses only the selected subset of its data in the federated learning process. The effectiveness of our proposed approach is evaluated on multiple real-world image datasets in a simulated system with a large number of clients, showing up to $25\%$ improvement in model accuracy compared to training with all data.
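The selection step can be sketched as follows. The abstract does not state how relevance is scored or how the cut-off is chosen, so this sketch assumes a per-sample loss under a linear benchmark model and a fixed loss threshold; the names `per_sample_loss`, `select_relevant`, and `max_loss` are hypothetical.

```python
import numpy as np

def per_sample_loss(weights, features, labels):
    """Binary cross-entropy of a (hypothetical) linear benchmark model, per sample."""
    logits = features @ weights
    probs = 1.0 / (1.0 + np.exp(-logits))
    eps = 1e-12  # guards against log(0)
    return -(labels * np.log(probs + eps) + (1 - labels) * np.log(1 - probs + eps))

def select_relevant(weights, features, labels, max_loss):
    """Indices of local samples whose benchmark loss is at most `max_loss` (assumed rule)."""
    losses = per_sample_loss(weights, features, labels)
    return np.flatnonzero(losses <= max_loss)

# Toy client data: the last two samples contradict the benchmark model.
benchmark_weights = np.array([2.0, -2.0])
client_features = np.array([[1.0, 0.0], [0.0, 1.0], [1.0, 0.0], [0.0, 1.0]])
client_labels = np.array([1.0, 0.0, 0.0, 1.0])

kept = select_relevant(benchmark_weights, client_features, client_labels, max_loss=0.5)
```

Each client would run this locally, so only the benchmark model, not the raw data, has to be shared.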
Submitted 22 June, 2020; v1 submitted 22 January, 2020;
originally announced January 2020.
-
Axion Dark Matter Search around 6.7 $μ$eV
Authors:
S. Lee,
S. Ahn,
J. Choi,
B. R. Ko,
Y. K. Semertzidis
Abstract:
An axion dark matter search with the CAPP-8TB haloscope is reported. Our results are sensitive to axion-photon coupling $g_{aγγ}$ down to the QCD axion band over the axion mass range between 6.62 and 6.82 $μ$eV at a 90\% confidence level, which is the most sensitive result in the mass range to date.
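For orientation, the quoted mass range can be converted to the photon frequency a haloscope cavity would need to be tuned to, using the standard relation $f = m_a c^2 / h$. The abstract does not quote frequencies, so the numbers produced here are derived values, and the function name is hypothetical.

```python
PLANCK_EV_S = 4.135667696e-15  # Planck constant in eV*s (CODATA 2018)

def axion_mass_to_frequency_hz(mass_micro_ev):
    """Photon frequency in Hz for an axion rest energy given in micro-eV."""
    return mass_micro_ev * 1e-6 / PLANCK_EV_S

f_low = axion_mass_to_frequency_hz(6.62)
f_high = axion_mass_to_frequency_hz(6.82)
print(f"{f_low / 1e9:.3f} GHz to {f_high / 1e9:.3f} GHz")
```

Both ends of the range fall in the microwave band, at roughly 1.6 GHz.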
Submitted 15 March, 2020; v1 submitted 14 January, 2020;
originally announced January 2020.
-
Interpretation and Simplification of Deep Forest
Authors:
Sangwon Kim,
Mira Jeong,
Byoung Chul Ko
Abstract:
This paper proposes a new method for interpreting and simplifying a black box model of a deep random forest (RF) using a proposed rule elimination. In deep RF, a large number of decision trees are connected to multiple layers, thereby making an analysis difficult. It has high performance similar to that of a deep neural network (DNN), but achieves better generalizability. Therefore, in this study, we consider quantifying the feature contributions and frequency of the fully trained deep RF in the form of a decision rule set. The feature contributions provide a basis for determining how features affect the decision process in a rule set. Model simplification is achieved by eliminating unnecessary rules by measuring the feature contributions. Consequently, the simplified model has fewer parameters and rules than before. Experimental results show that a feature contribution analysis allows a black box model to be decomposed for quantitatively interpreting a rule set. The proposed method was successfully applied to various deep RF models and benchmark datasets while maintaining a robust performance despite the elimination of a large number of rules.
Submitted 11 December, 2020; v1 submitted 14 January, 2020;
originally announced January 2020.
-
Revealing The CO X-factor In Dark Molecular Gas through Sensitive ALMA Absorption Observations
Authors:
Gan Luo,
Di Li,
Ningyu Tang,
J. R. Dawson,
John M. Dickey,
L. Bronfman,
Sheng-Li Qin,
Steven J. Gibson,
Richard Plambeck,
Ricardo Finger,
Anne Green,
Diego Mardones,
Bon-Chul Koo,
Nadia Lo
Abstract:
Carbon-bearing molecules, particularly CO, have been widely used as tracers of molecular gas in the interstellar medium (ISM). In this work, we aim to study the properties of molecules in diffuse, cold environments, where CO tends to be under-abundant and/or sub-thermally excited. We performed one of the most sensitive (down to $\mathrm{τ_{rms}^{CO} \sim 0.002}$ and $\mathrm{τ_{rms}^{HCO^+} \sim 0.0008}$) sub-millimeter molecular absorption line observations towards 13 continuum sources with ALMA. CO absorption was detected in diffuse ISM down to $\mathrm{A_v< 0.32\,mag}$ and HCO$^+$ down to $\mathrm{A_v < 0.2\,mag}$, where atomic gas and dark molecular gas (DMG) start to dominate. Multiple transitions measured in absorption toward 3C454.3 allow for a direct determination of excitation temperatures $\mathrm{T_{ex}}$ of 4.1\,K and 2.7\,K for CO and HCO$^+$, respectively, which are close to the cosmic microwave background (CMB) temperature and provide an explanation for their being undercounted in emission surveys. A stronger linear correlation was found between $\mathrm{N_{HCO^+}}$ and $\mathrm{N_{H_2}}$ (Pearson correlation coefficient P $\sim$ 0.93) than between $\mathrm{N_{CO}}$ and $\mathrm{N_{H_2}}$ (P $\sim$ 0.33), suggesting that HCO$^+$ is a better tracer of H$_2$ than CO in diffuse gas. The derived CO-to-H$_2$ conversion factor (the CO X-factor) of (14 $\pm$ 3) $\times$ 10$^{20}$ cm$^{-2}$ (K km s$^{-1}$)$^{-1}$ is approximately 6 times larger than the average value found in the Milky Way.
Submitted 18 December, 2019;
originally announced December 2019.
-
Detection of Pristine Circumstellar Material of the Cassiopeia A Supernova
Authors:
Bon-Chul Koo,
Hyun-Jeong Kim,
Heeyoung Oh,
John C. Raymond,
Sung-Chul Yoon,
Yong-Hyun Lee,
Daniel T. Jaffe
Abstract:
Cassiopeia A is a nearby young supernova remnant that provides a unique laboratory for the study of core-collapse supernova explosions. Cassiopeia A is known to be a Type IIb supernova from the optical spectrum of its light echo, but the immediate progenitor of the supernova remains uncertain. Here we report results of near-infrared, high-resolution spectroscopic observations of Cassiopeia A where we detected the pristine circumstellar material of the supernova progenitor. Our observations revealed a strong emission line of iron (Fe) from a circumstellar clump that has not yet been processed by the supernova shock wave. A comprehensive analysis of the observed spectra, together with an HST image, indicates that the majority of Fe in this unprocessed circumstellar material is in the gas phase, not depleted onto dust grains as in the general interstellar medium. This result is consistent with a theoretical model of dust condensation in material that is heavily enriched with CNO-cycle products, supporting the idea that the clump originated near the He core of the progenitor. It has been recently found that Type IIb supernovae can result from the explosion of a blue supergiant with a thin hydrogen envelope, and our results support such a scenario for Cassiopeia A.
Submitted 4 December, 2019;
originally announced December 2019.
-
Axion Dark Matter Research with IBS/CAPP
Authors:
Yannis K. Semertzidis,
Jihn E. Kim,
SungWoo Youn,
Jihoon Choi,
Woohyun Chung,
Selcuk Haciomeroglu,
Dongmin Kim,
Jingeun Kim,
ByeongRok Ko,
Ohjoon Kwon,
Andrei Matlashov,
Lino Miceli,
Hiroaki Natori,
Seongtae Park,
MyeongJae Lee,
Soohyung Lee,
Elena Sala,
Yunchang Shin,
Taehyeon Seong,
Sergey Uchaykin,
Danho Ahn,
Saebyeok Ahn,
Seung Pyo Chang,
Wheeyeon Cheong,
Hoyong Jeong
, et al. (12 additional authors not shown)
Abstract:
The axion, a consequence of the PQ mechanism, has been considered the most elegant solution to the strong-CP problem and a compelling candidate for cold dark matter. The Center for Axion and Precision Physics Research (CAPP) of the Institute for Basic Science (IBS) was established on 16 October 2013 with the main objective of launching state-of-the-art axion experiments in South Korea. Relying on the haloscope technique, our strategy is to run several experiments in parallel to explore a wide range of axion masses with sensitivities better than the QCD axion models. We utilize not only advanced technologies, such as high-field large-volume superconducting (SC) magnets, ultra-low-temperature dilution refrigerators, and nearly quantum-limited noise amplifiers, but also some unique features developed solely at the Center, including high-quality SC resonant cavities surviving high magnetic fields and efficient cavity geometries to reach high-frequency regions. Our goal is to probe axion dark matter in the frequency range of 1-10 GHz in the first phase and then ultimately up to 25 GHz, even in a scenario where axions constitute only 10% of the local dark matter halo. In this report, the current status and future prospects of the experiments and R&D activities at IBS/CAPP are described.
Submitted 25 October, 2019;
originally announced October 2019.
-
CAPP-8TB: Search for Axion Dark Matter in a Mass Range of 6.62 to 7.04 $μ$eV
Authors:
Soohyung Lee,
Saebyeok Ahn,
Jihoon Choi,
Byeong Rok Ko,
Yannis K. Semertzidis
Abstract:
The axion is a hypothetical particle proposed to solve the strong $CP$ problem, and also a candidate for dark matter. This non-relativistic particle in the galactic halo can be converted into a photon under a strong magnetic field and detected with a microwave resonant cavity. Relying on this detection method, many experiments have excluded some mass regions with certain sensitivities in terms of axion-photon coupling ($g_{aγγ}$) for decades, but no axion dark matter has been discovered to date. CAPP-8TB is an axion haloscope experiment at IBS/CAPP designed to search for the axion in a mass range of 6.62 to 7.04 $μ$eV. The experiment aims for the most sensitive axion dark matter search in this particular mass range with its first-phase sensitivity reaching the QCD axion band. In this presentation, we discuss the overview of the experiment, and present the first result. We also discuss an upgrade of the experiment to achieve higher sensitivity.
Submitted 14 October, 2019; v1 submitted 30 September, 2019;
originally announced October 2019.
-
Model Pruning Enables Efficient Federated Learning on Edge Devices
Authors:
Yuang Jiang,
Shiqiang Wang,
Victor Valls,
Bong Jun Ko,
Wei-Han Lee,
Kin K. Leung,
Leandros Tassiulas
Abstract:
Federated learning (FL) allows model training from local data collected by edge/mobile devices while preserving data privacy, which has wide applicability to image and vision applications. A challenge is that client devices in FL usually have much more limited computation and communication resources compared to servers in a datacenter. To overcome this challenge, we propose PruneFL -- a novel FL approach with adaptive and distributed parameter pruning, which adapts the model size during FL to reduce both communication and computation overhead and minimize the overall training time, while maintaining accuracy similar to that of the original model. PruneFL includes initial pruning at a selected client and further pruning as part of the FL process. The model size is adapted during this process, which includes maximizing the approximate empirical risk reduction divided by the time of one FL round. Our experiments with various datasets on edge devices (e.g., Raspberry Pi) show that: (i) we significantly reduce the training time compared to conventional FL and various other pruning-based methods; (ii) the pruned model with automatically determined size converges to an accuracy that is very similar to that of the original model, and it is also a lottery ticket of the original model.
Submitted 6 April, 2022; v1 submitted 26 September, 2019;
originally announced September 2019.
-
3D characterization of the primary Al3Sc phases in an Al-Sc alloy using Synchrotron X-ray tomography and electron microscopy
Authors:
Yuliang Zhao,
Weiwen Zhang,
Billy Koe,
Wenjia Du,
Mengmeng Wang,
Weilin Wang,
Elodie Boller,
Alexander Rack,
Zhenzhong Sun,
Jiawei Mi
Abstract:
The three-dimensional structures of the primary Al3Sc particles in an Al-2Sc master alloy were studied by synchrotron X-ray microtomography and by scanning and transmission electron microscopy. The Al3Sc phases were found to take the form of either a single cube or a cluster of cubes. The surface area and equivalent diameter of the Al3Sc cubes increased with increasing cube volume, whereas the specific surface area decreased. The primary Al3Sc cubes and the Al matrix have the same crystal orientation, indicating that the Al3Sc phases are the heterogeneous nucleation sites for Al. The experimental results show that α-Al2O3 particles are possible nucleation sites for the Al3Sc cubes.
Submitted 20 September, 2019;
originally announced September 2019.
-
Exploring the pattern of the Galactic HI foreground of GRBs with the ATCA
Authors:
H. Denes,
P. A. Jones,
L. V. Toth,
S. Zahorecz,
B-C. Koo,
S. Pinter,
I. I. Racz,
L. G. Balazs,
M. R. Cunningham,
Y. Doi,
I. Horvath,
T. Kovacs,
T. Onishi,
N. Suleiman,
Z. Bagoly
Abstract:
The afterglow of a gamma ray burst (GRB) can give us valuable insight into the properties of its host galaxy. To correctly interpret the spectra of the afterglow we need to have a good understanding of the foreground interstellar medium (ISM) in our own Galaxy. The common practice to correct for the foreground is to use neutral hydrogen (HI) data from the Leiden/Argentina/Bonn (LAB) survey. However, the poor spatial resolution of the single dish data may have a significant effect on the derived column densities. To investigate this, we present new high-resolution HI observations with the Australia Telescope Compact Array (ATCA) towards 4 GRBs. We combine the interferometric ATCA data with single dish data from the Galactic All Sky Survey (GASS) and derive new Galactic HI column densities towards the GRBs. We use these new foreground column densities to fit the Swift XRT X-ray spectra and calculate new intrinsic hydrogen column density values for the GRB host galaxies. We find that the new ATCA data show higher Galactic HI column densities compared to the previous single dish data, which results in lower intrinsic column densities for the hosts. We investigate the line-of-sight optical depth near the GRBs and find that it may not be negligible towards one of the GRBs, which indicates that the intrinsic hydrogen column density of its host galaxy may be even lower. In addition, we compare our results to column densities derived from far-infrared data and find reasonable agreement with the HI data.
Submitted 2 September, 2019;
originally announced September 2019.
-
Bose-Einstein condensation of triplons close to the quantum critical point in the quasi-one-dimensional spin-$1/2$ antiferromagnet NaVOPO$_4$
Authors:
Prashanta K. Mukharjee,
K. M. Ranjith,
B. Koo,
J. Sichelschmidt,
M. Baenitz,
Y. Skourski,
Y. Inagaki,
Y. Furukawa,
A. A. Tsirlin,
R. Nath
Abstract:
Structural and magnetic properties of a quasi-one-dimensional spin-$1/2$ compound NaVOPO$_4$ are explored by x-ray diffraction, magnetic susceptibility, high-field magnetization, specific heat, electron spin resonance, and $^{31}$P nuclear magnetic resonance measurements, as well as complementary \textit{ab initio} calculations. Whereas magnetic susceptibility of NaVOPO$_4$ may be compatible with the gapless uniform spin chain model, detailed examination of the crystal structure reveals a weak alternation of the exchange couplings with the alternation ratio $α\simeq 0.98$ and the ensuing zero-field spin gap $Δ_{0}/k_{\rm B} \simeq 2.4$~K directly probed by field-dependent magnetization measurements. No long-range order is observed down to 50\,mK in zero field. However, applied fields above the critical field $H_{\rm c1}\simeq 1.6$\,T give rise to a magnetic ordering transition with the phase boundary $T_{\rm N} \propto (H - H_{\rm c1})^{\frac{1}{φ}}$, where $φ\simeq 1.8$ is close to the value expected for Bose-Einstein condensation of triplons. With its weak alternation of the exchange couplings and small spin gap, NaVOPO$_4$ lies close to the quantum critical point.
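A power-law phase boundary of this form becomes a straight line with slope $1/φ$ on a log-log plot of $T_{\rm N}$ against $H - H_{\rm c1}$. The sketch below generates noise-free synthetic points from the quoted values and recovers the exponent with a linear fit; the data are not measurements, and the amplitude and field grid are hypothetical choices made only for illustration.

```python
import numpy as np

H_C1 = 1.6  # critical field in tesla, as quoted in the abstract
PHI = 1.8   # exponent as quoted in the abstract

def t_n(field, amplitude=1.0):
    """T_N = amplitude * (H - H_c1)^(1/phi); `amplitude` is a hypothetical scale."""
    return amplitude * (field - H_C1) ** (1.0 / PHI)

# Synthetic, noise-free boundary points above the critical field (not real data).
fields = np.linspace(1.8, 3.0, 13)
temps = t_n(fields)

# Linear fit in log-log space: log T_N = (1/phi) * log(H - H_c1) + const.
slope, intercept = np.polyfit(np.log(fields - H_C1), np.log(temps), 1)
phi_fit = 1.0 / slope
```

With real data the fitted exponent depends on the assumed $H_{\rm c1}$ and on the fitting window, so both would normally be varied together.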
Submitted 30 August, 2019;
originally announced September 2019.
-
Data Context Adaptation for Accurate Recommendation with Additional Information
Authors:
Hyunsik Jeon,
Bonhun Koo,
U Kang
Abstract:
Given a sparse rating matrix and an auxiliary matrix of users or items, how can we accurately predict missing ratings considering different data contexts of entities? Many previous studies have shown that utilizing additional information together with rating data helps to improve performance. However, existing methods are limited in that 1) they ignore the fact that the data contexts of rating and auxiliary matrices are different, 2) they have restricted capability of expressing independence information of users or items, and 3) they assume the relation between a user and an item is linear. We propose DaConA, a neural network based method for recommendation with a rating matrix and an auxiliary matrix. DaConA is designed with the following three main ideas. First, we propose a data context adaptation layer to extract pertinent features for different data contexts. Second, DaConA represents each entity with a latent interaction vector and a latent independence vector. Unlike in previous methods, neither vector is limited in size. Lastly, while previous matrix factorization based methods predict missing values through the inner product of latent vectors, DaConA learns a non-linear function of them via a neural network. We show that DaConA is a generalized algorithm including the standard matrix factorization and the collective matrix factorization as special cases. Through comprehensive experiments on real-world datasets, we show that DaConA provides state-of-the-art accuracy.
Submitted 22 August, 2019;
originally announced August 2019.
-
More unlabelled data or label more data? A study on semi-supervised laparoscopic image segmentation
Authors:
Yunguan Fu,
Maria R. Robu,
Bongjin Koo,
Crispin Schneider,
Stijn van Laarhoven,
Danail Stoyanov,
Brian Davidson,
Matthew J. Clarkson,
Yipeng Hu
Abstract:
Improving a semi-supervised image segmentation task has the option of adding more unlabelled images, labelling the unlabelled images or combining both, as neither image acquisition nor expert labelling can be considered trivial in most clinical applications. With a laparoscopic liver image segmentation application, we investigate the performance impact by altering the quantities of labelled and unlabelled training data, using a semi-supervised segmentation algorithm based on the mean teacher learning paradigm. We first report a significantly higher segmentation accuracy, compared with supervised learning. Interestingly, this comparison reveals that the training strategy adopted in the semi-supervised algorithm is also responsible for this observed improvement, in addition to the added unlabelled data. We then compare different combinations of labelled and unlabelled data set sizes for training semi-supervised segmentation networks, to provide a quantitative example of the practically useful trade-off between the two data planning strategies in this surgical guidance application.
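The mean teacher paradigm mentioned here is usually described as two coupled pieces: a teacher whose weights are an exponential moving average (EMA) of the student's, and a consistency loss that lets unlabelled images contribute by penalizing disagreement between student and teacher predictions. The sketch below shows only that general pattern; the abstract gives no implementation details, so the decay value, the squared-error form of the loss, and all names are assumptions.

```python
import numpy as np

def ema_update(teacher, student, decay=0.99):
    """Move each teacher parameter a small step toward the student's value."""
    return {name: decay * teacher[name] + (1.0 - decay) * student[name]
            for name in teacher}

def consistency_loss(student_pred, teacher_pred):
    """Mean squared difference between student and teacher predictions."""
    return float(np.mean((student_pred - teacher_pred) ** 2))

# Toy parameters standing in for network weights.
teacher = {"w": np.array([0.0, 0.0])}
student = {"w": np.array([1.0, 2.0])}
teacher = ema_update(teacher, student, decay=0.9)

# Toy per-pixel foreground probabilities on an unlabelled image.
loss_unlabelled = consistency_loss(np.array([0.2, 0.8]), np.array([0.4, 0.6]))
```

In training, the supervised segmentation loss on labelled images would be added to a weighted version of this consistency term, and the EMA update would run after every optimizer step.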
Submitted 20 August, 2019;
originally announced August 2019.
-
A case study of bilayered spin-$1/2$ square lattice compound [VO(HCOO)$_2\cdot$(H$_2$O)]
Authors:
S. Guchhait,
U. Arjun,
P. K. Anjana,
M. Sahoo,
A. Thirumurugan,
A. Madhi,
Y. Skourski,
B. Koo,
J. Sichelschmidt,
B. Schmidt,
M. Baenitz,
R. Nath
Abstract:
We present the synthesis and a detailed investigation of the structural and magnetic properties of polycrystalline [VO(HCOO)$_2\cdot$(H$_2$O)] by means of x-ray diffraction, magnetic susceptibility, high-field magnetization, heat capacity, and electron spin resonance measurements. It crystallizes in an orthorhombic structure with space group $Pcca$. It features distorted VO$_6$ octahedra connected via HCOO linkers (formate anions), forming a two-dimensional square lattice network with a bilayered structure. Analysis of magnetic susceptibility, high-field magnetization, and heat capacity data in terms of the frustrated square lattice model unambiguously establishes the quasi-two-dimensional nature of the compound with nearest-neighbour interaction $J_1/k_{\rm B} \simeq 11.7$~K and next-nearest-neighbour interaction $J_2/k_{\rm B} \simeq 0.02$~K. It undergoes a Néel antiferromagnetic ordering at $T_{\rm N} \simeq 1.1$~K. The ratio $θ_{\rm CW}/T_{\rm N} \simeq 10.9$ reflects excellent two-dimensionality of the spin-lattice in the compound. A strong in-plane anisotropy is inferred from the linear increase of $T_{\rm N}$ with magnetic field, consistent with the structural data.
Submitted 16 August, 2019;
originally announced August 2019.
-
A Benchmark on Tricks for Large-scale Image Retrieval
Authors:
Byungsoo Ko,
Minchul Shin,
Geonmo Gu,
HeeJae Jun,
Tae Kwan Lee,
Youngjoon Kim
Abstract:
Many studies have been performed on metric learning, which has become a key ingredient in top-performing methods of instance-level image retrieval. Meanwhile, less attention has been paid to pre-processing and post-processing tricks that can significantly boost performance. Furthermore, we found that most previous studies used small-scale datasets to simplify processing. Because the behavior of a feature representation in a deep learning model depends on both domain and data, it is important to understand how models behave in large-scale environments when a proper combination of retrieval tricks is used. In this paper, we extensively analyze the effect of well-known pre-processing and post-processing tricks, and their combination, for large-scale image retrieval. We found that proper use of these tricks can significantly improve model performance without necessitating complex architecture or introducing loss, as confirmed by achieving a competitive result on the Google Landmark Retrieval Challenge 2019.
Submitted 23 April, 2020; v1 submitted 27 July, 2019;
originally announced July 2019.