doi 10.13140/RG.2.2.13490.82883

How to Align Large Language Models for Teaching English? Designing and Developing LLM based-Chatbot for Teaching English Conversation in EFL, Findings and Limitations

Authors: Jaekwon Park, Jiyoung Bae, Unggi Lee, Taekyung Ahn, Sookbun Lee, Dohee Kim, Aram Choi, Yeil Jeong, Jewoong Moon, Hyeoncheol Kim

Abstract: This study investigates the design, development, and evaluation of a Large Language Model (LLM)-based chatbot for teaching English conversations in an English as a Foreign Language (EFL) context. Employing the Design and Development Research (DDR), we analyzed needs, established design principles, and iteratively refined a chatbot through experimenting various LLMs and alignment methods. Through both quantitative and qualitative evaluations, we identified the most effective LLM and its prompt combination to generate high-quality, contextually appropriate responses. Interviews with teachers provided insights into desirable system features, potential educational applications, and ethical considerations in the development and deployment of the chatbots. The design iterations yielded the importance of feedback mechanisms and customizable AI personas. Future research should explore adaptive feedback strategies, collaborative approaches with various stakeholders, and the integration of insights from human-computer interaction (HCI) and user experience (UX) design. This study contributes to the growing body of research on applying LLMs in language education, providing insights and recommendations for the design, development, and evaluation of LLM-based chatbots for EFL conversation practice. As the field evolves, ongoing research and collaboration among educators, AI engineers, and other stakeholders will be essential to harness the potential of these technologies to enhance language learning experiences.

Submitted 8 September, 2024; originally announced September 2024.

Comments: 56 pages

arXiv:2408.14916 [pdf, other]

Towards Real-world Event-guided Low-light Video Enhancement and Deblurring

Authors: Taewoo Kim, Jaeseok Jeong, Hoonhee Cho, Yuhwan Jeong, Kuk-Jin Yoon

Abstract: In low-light conditions, capturing videos with frame-based cameras often requires long exposure times, resulting in motion blur and reduced visibility. While frame-based motion deblurring and low-light enhancement have been studied, they still pose significant challenges. Event cameras have emerged as a promising solution for improving image quality in low-light environments and addressing motion blur. They provide two key advantages: capturing scene details well even in low light due to their high dynamic range, and effectively capturing motion information during long exposures due to their high temporal resolution. Despite efforts to tackle low-light enhancement and motion deblurring using event cameras separately, previous work has not addressed both simultaneously. To explore the joint task, we first establish real-world datasets for event-guided low-light enhancement and deblurring using a hybrid camera system based on beam splitters. Subsequently, we introduce an end-to-end framework to effectively handle these tasks. Our framework incorporates a module to efficiently leverage temporal information from events and frames. Furthermore, we propose a module to utilize cross-modal feature information to employ a low-pass filter for noise suppression while enhancing the main structural information. Our proposed method significantly outperforms existing approaches in addressing the joint task. Our project pages are available at https://github.com/intelpro/ELEDNet.

Submitted 27 August, 2024; originally announced August 2024.

Comments: Accepted in ECCV2024

arXiv:2408.12150 [pdf, other]

DeepHQ: Learned Hierarchical Quantizer for Progressive Deep Image Coding

Authors: Jooyoung Lee, Se Yoon Jeong, Munchurl Kim

Abstract: Unlike fixed- or variable-rate image coding, progressive image coding (PIC) aims to compress various qualities of images into a single bitstream, increasing the versatility of bitstream utilization and providing high compression efficiency compared to simulcast compression. Research on neural network (NN)-based PIC is in its early stages, mainly focusing on applying varying quantization step sizes to the transformed latent representations in a hierarchical manner. These approaches are designed to compress only the progressively added information as the quality improves, considering that a wider quantization interval for lower-quality compression includes multiple narrower sub-intervals for higher-quality compression. However, the existing methods are based on handcrafted quantization hierarchies, resulting in sub-optimal compression efficiency. In this paper, we propose an NN-based progressive coding method that firstly utilizes learned quantization step sizes via learning for each quantization layer. We also incorporate selective compression with which only the essential representation components are compressed for each quantization layer. We demonstrate that our method achieves significantly higher coding efficiency than the existing approaches with decreased decoding time and reduced model size.

Submitted 22 August, 2024; originally announced August 2024.

arXiv:2407.21035 [pdf, other]

Direct Unlearning Optimization for Robust and Safe Text-to-Image Models

Authors: Yong-Hyun Park, Sangdoo Yun, Jin-Hwa Kim, Junho Kim, Geonhui Jang, Yonghyun Jeong, Junghyo Jo, Gayoung Lee

Abstract: Recent advancements in text-to-image (T2I) models have greatly benefited from large-scale datasets, but they also pose significant risks due to the potential generation of unsafe content. To mitigate this issue, researchers have developed unlearning techniques to remove the model's ability to generate potentially harmful content. However, these methods are easily bypassed by adversarial attacks, making them unreliable for ensuring the safety of generated images. In this paper, we propose Direct Unlearning Optimization (DUO), a novel framework for removing Not Safe For Work (NSFW) content from T2I models while preserving their performance on unrelated topics. DUO employs a preference optimization approach using curated paired image data, ensuring that the model learns to remove unsafe visual concepts while retaining unrelated features. Furthermore, we introduce an output-preserving regularization term to maintain the model's generative capabilities on safe content. Extensive experiments demonstrate that DUO can robustly defend against various state-of-the-art red teaming methods without significant performance degradation on unrelated topics, as measured by FID and CLIP scores. Our work contributes to the development of safer and more reliable T2I models, paving the way for their responsible deployment in both closed-source and open-source scenarios.

Submitted 17 July, 2024; originally announced July 2024.

Comments: Extended abstract accepted in GenLaw 2024 workshop @ ICML2024

arXiv:2407.10703 [pdf, other]

Towards Robust Event-based Networks for Nighttime via Unpaired Day-to-Night Event Translation

Authors: Yuhwan Jeong, Hoonhee Cho, Kuk-Jin Yoon

Abstract: Event cameras with high dynamic range ensure scene capture even in low-light conditions. However, night events exhibit patterns different from those captured during the day. This difference causes performance degradation when applying night events to a model trained solely on day events. This limitation persists due to a lack of annotated night events. To overcome the limitation, we aim to alleviate data imbalance by translating annotated day data into night events. However, generating events from different modalities challenges reproducing their unique properties. Accordingly, we propose an unpaired event-to-event day-to-night translation model that effectively learns to map from one domain to another using Diffusion GAN. The proposed translation model analyzes events in spatio-temporal dimension with wavelet decomposition and disentangled convolution layers. We also propose a new temporal contrastive learning with a novel shuffling and sampling strategy to regularize temporal continuity. To validate the efficacy of the proposed methodology, we redesign metrics for evaluating events translated in an unpaired setting, aligning them with the event modality for the first time. Our framework shows the successful day-to-night event translation while preserving the characteristics of events. In addition, through our translation method, we facilitate event-based modes to learn about night events by translating annotated day events into night events. Our approach effectively mitigates the performance degradation of applying real night events to downstream tasks. The code is available at https://github.com/jeongyh98/UDNET.

Submitted 15 July, 2024; originally announced July 2024.

Comments: Accepted by ECCV 2024

arXiv:2407.05551 [pdf, other]

Read, Watch and Scream! Sound Generation from Text and Video

Authors: Yujin Jeong, Yunji Kim, Sanghyuk Chun, Jiyoung Lee

Abstract: Multimodal generative models have shown impressive advances with the help of powerful diffusion models. Despite the progress, generating sound solely from text poses challenges in ensuring comprehensive scene depiction and temporal alignment. Meanwhile, video-to-sound generation limits the flexibility to prioritize sound synthesis for specific objects within the scene. To tackle these challenges, we propose a novel video-and-text-to-sound generation method, called ReWaS, where video serves as a conditional control for a text-to-audio generation model. Our method estimates the structural information of audio (namely, energy) from the video while receiving key content cues from a user prompt. We employ a well-performing text-to-sound model to consolidate the video control, which is much more efficient for training multimodal diffusion models with massive triplet-paired (audio-video-text) data. In addition, by separating the generative components of audio, it becomes a more flexible system that allows users to freely adjust the energy, surrounding environment, and primary sound source according to their preferences. Experimental results demonstrate that our method shows superiority in terms of quality, controllability, and training efficiency. Our demo is available at https://naver-ai.github.io/rewas

Submitted 7 July, 2024; originally announced July 2024.

Comments: Project page: https://naver-ai.github.io/rewas

arXiv:2407.03231 [pdf]

doi 10.1021/acs.nanolett.4c01536

Dimensionality Engineering of Magnetic Anisotropy from Anomalous Hall Effect in Synthetic SrRuO3 Crystals

Authors: Seung Gyo Jeong, Seong Won Cho, Sehwan Song, Jin Young Oh, Do Gyeom Jeong, Gyeongtak Han, Hu Young Jeong, Ahmed Yousef Mohamed, Woo-suk Noh, Sungkyun Park, Jong Seok Lee, Suyoun Lee, Young-Min Kim, Deok-Yong Cho, Woo Seok Choi

Abstract: Magnetic anisotropy in atomically thin correlated heterostructures is essential for exploring quantum magnetic phases for next-generation spintronics. Whereas previous studies have mostly focused on van der Waals systems, here, we investigate the impact of dimensionality of epitaxially-grown correlated oxides down to the monolayer limit on structural, magnetic, and orbital anisotropies. By designing oxide superlattices with a correlated ferromagnetic SrRuO3 and nonmagnetic SrTiO3 layers, we observed modulated ferromagnetic behavior with the change of the SrRuO3 thickness. Especially, for three-unit-cell-thick layers, we observe a significant 1,500% improvement of coercive field in the anomalous Hall effect, which cannot be solely attributed to the dimensional crossover in ferromagnetism. The atomic-scale heterostructures further reveal the systematic modulation of anisotropy for the lattice structure and orbital hybridization, explaining the enhanced magnetic anisotropy. Our findings provide valuable insights into engineering the anisotropic hybridization of synthetic magnetic crystals, offering a tunable spin order for various applications.

Submitted 3 July, 2024; originally announced July 2024.

Comments: 23 pages

Journal ref: published 2024

arXiv:2406.12258 [pdf, other]

Advancing Cross-Domain Generalizability in Face Anti-Spoofing: Insights, Design, and Metrics

Authors: Hyojin Kim, Jiyoon Lee, Yonghyun Jeong, Haneol Jang, YoungJoon Yoo

Abstract: This paper presents a novel perspective for enhancing anti-spoofing performance in zero-shot data domain generalization. Unlike traditional image classification tasks, face anti-spoofing datasets display unique generalization characteristics, necessitating novel zero-shot data domain generalization. One step forward to the previous frame-wise spoofing prediction, we introduce a nuanced metric calculation that aggregates frame-level probabilities for a video-wise prediction, to tackle the gap between the reported frame-wise accuracy and instability in real-world use-case. This approach enables the quantification of bias and variance in model predictions, offering a more refined analysis of model generalization. Our investigation reveals that simply scaling up the backbone of models does not inherently improve the mentioned instability, leading us to propose an ensembled backbone method from a Bayesian perspective. The probabilistically ensembled backbone both improves model robustness measured from the proposed metric and spoofing accuracy, and also leverages the advantages of measuring uncertainty, allowing for enhanced sampling during training that contributes to model generalization across new datasets. We evaluate the proposed method from the benchmark OMIC dataset and also the public CelebA-Spoof and SiW-Mv2. Our final model outperforms existing state-of-the-art methods across the datasets, showcasing advancements in Bias, Variance, HTER, and AUC metrics.

Submitted 18 June, 2024; originally announced June 2024.

Comments: 10 pages with 4 figures, Accepted by CVPRW 2024

arXiv:2405.18623 [pdf]

I See You: Teacher Analytics with GPT-4 Vision-Powered Observational Assessment

Authors: Unggi Lee, Yeil Jeong, Junbo Koh, Gyuri Byun, Yunseo Lee, Hyunwoong Lee, Seunmin Eun, Jewoong Moon, Cheolil Lim, Hyeoncheol Kim

Abstract: This preliminary study explores the integration of GPT-4 Vision (GPT-4V) technology into teacher analytics, focusing on its applicability in observational assessment to enhance reflective teaching practice. This research is grounded in developing a Video-based Automatic Assessment System (VidAAS) empowered by GPT-4V. Our approach aims to revolutionize teachers' assessment of students' practices by leveraging Generative Artificial Intelligence (GenAI) to offer detailed insights into classroom dynamics. Our research methodology encompasses a comprehensive literature review, prototype development of the VidAAS, and usability testing with in-service teachers. The study findings provide future research avenues for VidAAS design, implementation, and integration in teacher analytics, underscoring the potential of GPT-4V to provide real-time, scalable feedback and a deeper understanding of the classroom.

Submitted 30 May, 2024; v1 submitted 28 May, 2024; originally announced May 2024.

Comments: 27 pages, 5 figures, 4 tables

arXiv:2405.13915 [pdf, other]

HeteGraph-Mamba: Heterogeneous Graph Learning via Selective State Space Model

Authors: Zhenyu Pan, Yoonsung Jeong, Xiaoda Liu, Han Liu

Abstract: We propose a heterogeneous graph mamba network (HGMN) as the first exploration in leveraging the selective state space models (SSSMs) for heterogeneous graph learning. Compared with the literature, our HGMN overcomes two major challenges: (i) capturing long-range dependencies among heterogeneous nodes and (ii) adapting SSSMs to heterogeneous graph data. Our key contribution is a general graph architecture that can solve heterogeneous nodes in real-world scenarios, followed an efficient flow. Methodologically, we introduce a two-level efficient tokenization approach that first captures long-range dependencies within identical node types, and subsequently across all node types. Empirically, we conduct comparisons between our framework and 19 state-of-the-art methods on the heterogeneous benchmarks. The extensive comparisons demonstrate that our framework outperforms other methods in both the accuracy and efficiency dimensions.

Submitted 22 May, 2024; originally announced May 2024.

arXiv:2405.08851 [pdf]

Enhanced Terahertz Spectroscopy of a Monolayer Transition Metal Dichalcogenide

Authors: Xin Jin, Vincenzo Aglieri, Young-Gyun Jeong, Atiye Pezeshki, Lilian Skokan, Mostafa Shagar, Yuechen Jia, Pablo Bianucci, Andreas Ruediger, Emanuele Orgiu, Andrea Toma, Luca Razzari

Abstract: Two-dimensional materials, including transition metal dichalcogenides, are attractive for a variety of applications in electronics as well as photonics and have recently been envisioned as an appealing platform for phonon polaritonics. However, their direct characterization in the terahertz spectral region, of interest for retrieving, e.g., their phonon response, represents a major challenge, due to the limited sensitivity of typical terahertz spectroscopic tools and the weak interaction of such long-wavelength radiation with sub-nanometer systems. In this work, by exploiting an ad-hoc engineered metallic surface enabling a ten-thousand-fold local absorption boost, we perform enhanced terahertz spectroscopy of a monolayer transition metal dichalcogenide (tungsten diselenide) and extract its dipole-active phonon resonance features. In addition, we use these data to obtain the monolayer effective permittivity around its phonon resonance. Via the direct terahertz characterization of the phonon response of such two-dimensional systems, this method opens the path to the rational design of phonon polariton devices exploiting monolayer transition metal dichalcogenides.

Submitted 14 May, 2024; originally announced May 2024.

arXiv:2405.06267 [pdf]

doi 10.1063/5.0216377

Characterization of a graphene-hBN superlattice field effect transistor

Authors: Won Beom Choi, Youngoh Son, Hangyeol Park, Yungi Jeong, Junhyeok Oh, K. Watanabe, T. Taniguchi, Joonho Jang

Abstract: Graphene provides a unique platform for hosting high quality 2D electron systems. Encapsulating graphene with hexagonal boron nitride (hBN) to shield it from noisy environments offers the potential to achieve ultrahigh performance nanodevices, such as photodiodes and transistors. However, the absence of a bandgap at the Dirac point presents challenges for using this system as a useful transistor. In this study, we investigated the functionality of hBN-aligned monolayer graphene as a field effect transistor (FET). By precisely aligning the hBN and graphene, bandgaps open at the first Dirac point and at the hole-doped induced Dirac point via an interfacial moiré potential. To characterize this as a submicrometer scale FET, we fabricated a global bottom gate to tune the density of a conducting channel and a local top gate to switch off this channel. This demonstrated that the system could be tuned to an optimal on/off ratio regime by separately controlling the gates. These findings provide a valuable reference point for the further development of FETs based on graphene heterostructures.

Submitted 12 July, 2024; v1 submitted 10 May, 2024; originally announced May 2024.

Journal ref: Appl. Phys. Lett. 125, 033503 (2024)

arXiv:2404.18370 [pdf, other]

Out-of-distribution generalization under random, dense distributional shifts

Authors: Yujin Jeong, Dominik Rothenhäusler

Abstract: Many existing approaches for estimating parameters in settings with distributional shifts operate under an invariance assumption. For example, under covariate shift, it is assumed that p(y|x) remains invariant. We refer to such distribution shifts as sparse, since they may be substantial but affect only a part of the data generating system. In contrast, in various real-world settings, shifts might be dense. More specifically, these dense distributional shifts may arise through numerous small and random changes in the population and environment. First, we will discuss empirical evidence for such random dense distributional shifts and explain why commonly used models for distribution shifts, including adversarial approaches, may not be appropriate under these conditions. Then, we will develop tools to infer parameters and make predictions for partially observed, shifted distributions. Finally, we will apply the framework to several real-world data sets and discuss diagnostics to evaluate the fit of the distributional uncertainty model.

Submitted 28 April, 2024; originally announced April 2024.

arXiv:2404.14644 [pdf, other]

Identifying sparse treatment effects

Authors: Yujin Jeong, Emily Fox, Ramesh Johari

Abstract: Based on technological advances in sensing modalities, randomized trials with primary outcomes represented as high-dimensional vectors have become increasingly prevalent. For example, these outcomes could be week-long time-series data from wearable devices or high-dimensional neuroimaging data, such as from functional magnetic resonance imaging. This paper focuses on randomized treatment studies with such high-dimensional outcomes characterized by sparse treatment effects, where interventions may influence a small number of dimensions, e.g., small temporal windows or specific brain regions. Conventional practices, such as using fixed, low-dimensional summaries of the outcomes, result in significantly reduced power for detecting treatment effects. To address this limitation, we propose a procedure that involves subset selection followed by inference. Specifically, given a potentially large set of outcome summaries, we identify the subset that captures treatment effects, which requires only one call to the Lasso, and subsequently conduct inference on the selected subset. Via theoretical analysis as well as simulations, we demonstrate that our method asymptotically selects the correct subset and increases statistical power.

Submitted 22 April, 2024; originally announced April 2024.

arXiv:2404.11810 [pdf, other]

Holographic Parallax Improves 3D Perceptual Realism

Authors: Dongyeon Kim, Seung-Woo Nam, Suyeon Choi, Jong-Mo Seo, Gordon Wetzstein, Yoonchan Jeong

Abstract: Holographic near-eye displays are a promising technology to solve long-standing challenges in virtual and augmented reality display systems. Over the last few years, many different computer-generated holography (CGH) algorithms have been proposed that are supervised by different types of target content, such as 2.5D RGB-depth maps, 3D focal stacks, and 4D light fields. It is unclear, however, what the perceptual implications are of the choice of algorithm and target content type. In this work, we build a perceptual testbed of a full-color, high-quality holographic near-eye display. Under natural viewing conditions, we examine the effects of various CGH supervision formats and conduct user studies to assess their perceptual impacts on 3D realism. Our results indicate that CGH algorithms designed for specific viewpoints exhibit noticeable deficiencies in achieving 3D realism. In contrast, holograms incorporating parallax cues consistently outperform other formats across different viewing conditions, including the center of the eyebox. This finding is particularly interesting and suggests that the inclusion of parallax cues in CGH rendering plays a crucial role in enhancing the overall quality of the holographic experience. This work represents an initial stride towards delivering a perceptually realistic 3D experience with holographic near-eye displays.

Submitted 17 April, 2024; originally announced April 2024.

Comments: 33 pages, 34 figures

arXiv:2404.01954 [pdf, other]

HyperCLOVA X Technical Report

Authors: Kang Min Yoo, Jaegeun Han, Sookyo In, Heewon Jeon, Jisu Jeong, Jaewook Kang, Hyunwook Kim, Kyung-Min Kim, Munhyong Kim, Sungju Kim, Donghyun Kwak, Hanock Kwak, Se Jung Kwon, Bado Lee, Dongsoo Lee, Gichang Lee, Jooho Lee, Baeseong Park, Seongjin Shin, Joonsang Yu, Seolki Baek, Sumin Byeon, Eungsup Cho, Dooseok Choe, Jeesung Han, et al. (371 additional authors not shown)

Abstract: We introduce HyperCLOVA X, a family of large language models (LLMs) tailored to the Korean language and culture, along with competitive capabilities in English, math, and coding. HyperCLOVA X was trained on a balanced mix of Korean, English, and code data, followed by instruction-tuning with high-quality human-annotated datasets while abiding by strict safety guidelines reflecting our commitment to responsible AI. The model is evaluated across various benchmarks, including comprehensive reasoning, knowledge, commonsense, factuality, coding, math, chatting, instruction-following, and harmlessness, in both Korean and English. HyperCLOVA X exhibits strong reasoning capabilities in Korean backed by a deep understanding of the language and cultural nuances. Further analysis of the inherent bilingual nature and its extension to multilingualism highlights the model's cross-lingual proficiency and strong generalization ability to untargeted languages, including machine translation between several language pairs and cross-lingual inference tasks. We believe that HyperCLOVA X can provide helpful guidance for regions or countries in developing their sovereign LLMs.

Submitted 13 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

Comments: 44 pages; updated authors list and fixed author names

arXiv:2404.00963 [pdf, other]

doi 10.1039/D4CP00517A

Inversion and Tunability of Van Hove Singularities in $A$V$_{3}$Sb$_{5}$ ($A$ = K, Rb, and Cs) kagome metals

Authors: Sangjun Sim, Min Yong Jeong, Hyunggeun Lee, Dong Hyun David Lee, Myung Joon Han

Abstract: To understand the alkali-metal-dependent material properties of recently discovered $A$V$_{3}$Sb$_{5}$ ($A$ = K, Rb, and Cs), we conducted a detailed electronic structure analysis based on first-principles density functional theory calculations. Contrary to the case of $A$ = K and Rb, the energetic positions of the low-lying Van Hove singularities are reversed in CsV$_{3}$Sb$_{5}$, and the characteristic higher-order Van Hove point gets closer to the Fermi level. We found that this notable difference can be attributed to the chemical effect, apart from structural differences. Due to their different orbital compositions, Van Hove points show qualitatively different responses to the structure changes. A previously unnoticed highest lying point can be lowered, locating close to or even below the other ones in response to a reasonable range of bi- and uni-axial strain. Our results can be useful in better understanding the material-dependent features reported in this family and in realizing experimental control of exotic quantum phases.

Submitted 1 April, 2024; originally announced April 2024.

Comments: Physical Chemistry Chemical Physics (PCCP) in press

arXiv:2403.10882 [pdf, other]

Optimizing Language Augmentation for Multilingual Large Language Models: A Case Study on Korean

Authors: ChangSu Choi, Yongbin Jeong, Seoyoon Park, InHo Won, HyeonSeok Lim, SangMin Kim, Yejee Kang, Chanhyuk Yoon, Jaewan Park, Yiseul Lee, HyeJin Lee, Younggyun Hahm, Hansaem Kim, KyungTae Lim

Abstract: Large language models (LLMs) use pretraining to predict the subsequent word; however, their expansion requires significant computing resources. Numerous big tech companies and research institutes have developed multilingual LLMs (MLLMs) to meet current demands, overlooking less-resourced languages (LRLs). This study proposed three strategies to enhance the performance of LRLs based on the publicly available MLLMs. First, the MLLM vocabularies of LRLs were expanded to enhance expressiveness. Second, bilingual data were used for pretraining to align the high- and less-resourced languages. Third, a high-quality small-scale instruction dataset was constructed and instruction-tuning was performed to augment the LRL. The experiments employed the Llama2 model and Korean was used as the LRL, which was quantitatively evaluated against other developed LLMs across eight tasks. Furthermore, a qualitative assessment was performed based on human evaluation and GPT4. Experimental results showed that our proposed Bllossom model exhibited superior performance in qualitative analyses compared to previously proposed Korean monolingual models.

Submitted 21 March, 2024; v1 submitted 16 March, 2024; originally announced March 2024.

arXiv:2403.06592 [pdf, other]

Exploiting Style Latent Flows for Generalizing Deepfake Video Detection

Authors: Jongwook Choi, Taehoon Kim, Yonghyun Jeong, Seungryul Baek, Jongwon Choi

Abstract: This paper presents a new approach for the detection of fake videos, based on the analysis of style latent vectors and their abnormal behavior in temporal changes in the generated videos. We discovered that the generated facial videos suffer from the temporal distinctiveness in the temporal changes of style latent vectors, which are inevitable during the generation of temporally stable videos with various facial expressions and geometric transformations. Our framework utilizes the StyleGRU module, trained by contrastive learning, to represent the dynamic properties of style latent vectors. Additionally, we introduce a style attention module that integrates StyleGRU-generated features with content-based features, enabling the detection of visual and temporal artifacts. We demonstrate our approach across various benchmark scenarios in deepfake detection, showing its superiority in cross-dataset and cross-manipulation scenarios. Through further analysis, we also validate the importance of using temporal changes of style latent vectors to improve the generality of deepfake video detection.

Submitted 20 May, 2024; v1 submitted 11 March, 2024; originally announced March 2024.

Comments: Preprint version, final version will be available at https://openaccess.thecvf.com The IEEE / CVF Computer Vision and Pattern Recognition Conference (CVPR) (2024) Published by: IEEE & CVF

arXiv:2402.17275 [pdf, other]

One-Shot Structure-Aware Stylized Image Synthesis

Authors: Hansam Cho, Jonghyun Lee, Seunggyu Chang, Yonghyun Jeong

Abstract: While GAN-based models have been successful in image stylization tasks, they often struggle with structure preservation while stylizing a wide range of input images. Recently, diffusion models have been adopted for image stylization but still lack the capability to maintain the original quality of input images. Building on this, we propose OSASIS: a novel one-shot stylization method that is robust in structure preservation. We show that OSASIS is able to effectively disentangle the semantics from the structure of an image, allowing it to control the level of content and style implemented to a given input. We apply OSASIS to various experimental settings, including stylization with out-of-domain reference images and stylization with text-driven manipulation. Results show that OSASIS outperforms other stylization methods, especially for input images that were rarely encountered during training, providing a promising solution to stylization via diffusion models.

Submitted 1 April, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

Comments: CVPR 2024

arXiv:2402.11353 [pdf, other]

doi 10.1145/3613904.3642420

Understanding the Impact of Long-Term Memory on Self-Disclosure with Large Language Model-Driven Chatbots for Public Health Intervention

Authors: Eunkyung Jo, Yuin Jeong, SoHyun Park, Daniel A. Epstein, Young-Ho Kim

Abstract: Recent large language models (LLMs) offer the potential to support public health monitoring by facilitating health disclosure through open-ended conversations but rarely preserve the knowledge gained about individuals across repeated interactions. Augmenting LLMs with long-term memory (LTM) presents an opportunity to improve engagement and self-disclosure, but we lack an understanding of how LTM impacts people's interaction with LLM-driven chatbots in public health interventions. We examine the case of CareCall -- an LLM-driven voice chatbot with LTM -- through the analysis of 1,252 call logs and interviews with nine users. We found that LTM enhanced health disclosure and fostered positive perceptions of the chatbot by offering familiarity. However, we also observed challenges in promoting self-disclosure through LTM, particularly around addressing chronic health conditions and privacy concerns. We discuss considerations for LTM integration in LLM-driven chatbots for public health monitoring, including carefully deciding what topics need to be remembered in light of public health goals.

Submitted 17 February, 2024; originally announced February 2024.

Comments: Accepted to ACM CHI 2024 as a full paper

ACM Class: H.5.2; I.2.7

Journal ref: In Proceedings of the CHI Conference on Human Factors in Computing Systems (CHI '24), May 11-16, 2024, Honolulu, HI, USA. ACM, New York, NY, USA

arXiv:2402.05095 [pdf, other]

Motile bacteria crossing liquid-liquid interfaces

Authors: Jiyong Cheon, Joowang Son, Sungbin Lim, Yundon Jeong, Jung-Hoon Park, Robert J. Mitchell, Jaeup U. Kim, Joonwoo Jeong

Abstract: Real-life bacteria often swim in complex fluids, but our understanding of the interactions between bacteria and complex surroundings is still evolving. In this work, rod-like \textit{Bacillus subtilis} swims in a quasi-2D environment with aqueous liquid-liquid interfaces, i.e., the isotropic-nematic coexistence phase of an aqueous chromonic liquid crystal. Focusing on the bacteria motion near and at the liquid-liquid interfaces, we collect and quantify bacterial trajectories ranging across the isotropic to the nematic phase. Despite its small magnitude, the interfacial tension of the order of 10 $\mathrm{μN/m}$ at the isotropic-nematic interface justifies our observations that bacteria swimming more perpendicular to the interface have a higher probability of crossing the interface. Our force-balance model, considering the interfacial tension, further predicts how the length and speed of the bacteria affect their crossing behaviors. We also find, as soon as the bacteria cross the interface and enter the nematic phase, they wiggle less, but faster, and that this occurs as the flagellar bundles aggregate within the nematic phase.

Submitted 12 April, 2024; v1 submitted 7 February, 2024; originally announced February 2024.

arXiv:2402.04625 [pdf, other]

Noise Map Guidance: Inversion with Spatial Context for Real Image Editing

Authors: Hansam Cho, Jonghyun Lee, Seoung Bum Kim, Tae-Hyun Oh, Yonghyun Jeong

Abstract: Text-guided diffusion models have become a popular tool in image synthesis, known for producing high-quality and diverse images. However, their application to editing real images often encounters hurdles primarily due to the text condition deteriorating the reconstruction quality and subsequently affecting editing fidelity. Null-text Inversion (NTI) has made strides in this area, but it fails to capture spatial context and requires computationally intensive per-timestep optimization. Addressing these challenges, we present Noise Map Guidance (NMG), an inversion method rich in a spatial context, tailored for real-image editing. Significantly, NMG achieves this without necessitating optimization, yet preserves the editing quality. Our empirical investigations highlight NMG's adaptability across various editing techniques and its robustness to variants of DDIM inversions.

Submitted 7 February, 2024; originally announced February 2024.

Comments: ICLR 2024

arXiv:2401.09081 [pdf, other]

Volcano Transition in a System of Generalized Kuramoto Oscillators with Random Frustrated Interactions

Authors: Seungjae Lee, Yeonsu Jeong, Seung-Woo Son, Katharina Krischer

Abstract: In a system of heterogeneous (Abelian) Kuramoto oscillators with random or `frustrated' interactions, transitions from states of incoherence to partial synchronization were observed. These so-called volcano transitions are characterized by a change in the shape of a local field distribution and were discussed in connection with an oscillator glass. In this paper, we consider a different class of oscillators, namely a system of (non-Abelian) SU(2)-Lohe oscillators that can also be defined on the 3-sphere, i.e., an oscillator is generalized to be defined as a unit vector in 4D Euclidean space. We demonstrate that such higher-dimensional Kuramoto models with reciprocal and nonreciprocal random interactions represented by a low-rank matrix exhibit a volcano transition as well. We determine the critical coupling strength at which a volcano-like transition occurs, employing an Ott-Antonsen ansatz. Numerical simulations provide additional validations of our analytical findings and reveal the differences in observable collective dynamics prior to and following the transition. Furthermore, we show that a system of unit 3-vector oscillators on the 2-sphere does not possess a volcano transition.

Submitted 17 January, 2024; originally announced January 2024.

arXiv:2401.09048 [pdf, other]

Compose and Conquer: Diffusion-Based 3D Depth Aware Composable Image Synthesis

Authors: Jonghyun Lee, Hansam Cho, Youngjoon Yoo, Seoung Bum Kim, Yonghyun Jeong

Abstract: Addressing the limitations of text as a source of accurate layout representation in text-conditional diffusion models, many works incorporate additional signals to condition certain attributes within a generated image. Although successful, previous works do not account for the specific localization of said attributes extended into the three dimensional plane. In this context, we present a conditional diffusion model that integrates control over three-dimensional object placement with disentangled representations of global stylistic semantics from multiple exemplar images. Specifically, we first introduce \textit{depth disentanglement training} to leverage the relative depth of objects as an estimator, allowing the model to identify the absolute positions of unseen objects through the use of synthetic image triplets. We also introduce \textit{soft guidance}, a method for imposing global semantics onto targeted regions without the use of any additional localization cues. Our integrated framework, \textsc{Compose and Conquer (CnC)}, unifies these techniques to localize multiple conditions in a disentangled manner. We demonstrate that our approach allows perception of objects at varying depths while offering a versatile framework for composing localized objects with different global semantics. Code: https://github.com/tomtom1103/compose-and-conquer/

Submitted 17 January, 2024; originally announced January 2024.

Comments: ICLR 2024

arXiv:2312.07315 [pdf, other]

NVS-Adapter: Plug-and-Play Novel View Synthesis from a Single Image

Authors: Yoonwoo Jeong, Jinwoo Lee, Chiheon Kim, Minsu Cho, Doyup Lee

Abstract: Transfer learning of large-scale Text-to-Image (T2I) models has recently shown impressive potential for Novel View Synthesis (NVS) of diverse objects from a single image. While previous methods typically train large models on multi-view datasets for NVS, fine-tuning the whole parameters of T2I models not only demands a high cost but also reduces the generalization capacity of T2I models in generating diverse images in a new domain. In this study, we propose an effective method, dubbed NVS-Adapter, which is a plug-and-play module for a T2I model, to synthesize novel multi-views of visual objects while fully exploiting the generalization capacity of T2I models. NVS-Adapter consists of two main components; view-consistency cross-attention learns the visual correspondences to align the local details of view features, and global semantic conditioning aligns the semantic structure of generated views with the reference view. Experimental results demonstrate that the NVS-Adapter can effectively synthesize geometrically consistent multi-views and also achieve high performance on benchmarks without full fine-tuning of T2I models. The code and data are publicly available at https://postech-cvlab.github.io/nvsadapter/.

Submitted 10 August, 2024; v1 submitted 12 December, 2023; originally announced December 2023.

Comments: [ECCV2024] Project Page: https://postech-cvlab.github.io/nvsadapter/

arXiv:2312.02340 [pdf, other]

Prompt neutrinos from the atmosphere to the forward region of LHC

Authors: Weidong Bai, Milind Diwan, Maria Vittoria Garzelli, Yu Seon Jeong, Mary Hall Reno

Abstract: We investigate the kinematical regions that are important for producing prompt neutrinos in the atmosphere and in the forward region of the LHC, as probed by different experiments. We illustrate the results as a function of the center-of-mass nucleon-nucleon collision energies and rapidities of neutrinos and of the parent heavy-flavoured hadrons. We find overlap in part of the kinematic space.

Submitted 4 December, 2023; originally announced December 2023.

Comments: 6 pages, 3 figures, talk at "The European Physical Society Conference on High Energy Physics (EPS-HEP2023)", 21-25 August 2023, Hamburg, Germany; submitted to PoS - Proceedings of Science

arXiv:2312.00459 [pdf, other]

A continuous-wave and pulsed X-band electron spin resonance spectrometer operating in ultra-high vacuum for the study of low dimensional spin ensembles

Authors: Franklin H. Cho, Juyoung Park, Soyoung Oh, Jisoo Yu, Yejin Jeong, Luciano Colazzo, Lukas Spree, Caroline Hommel, Arzhang Ardavan, Giovanni Boero, Fabio Donati

Abstract: We report the development of a continuous-wave and pulsed X-band electron spin resonance (ESR) spectrometer for the study of spins on ordered surfaces down to cryogenic temperatures. The spectrometer operates in ultra-high vacuum and utilizes a half-wavelength microstrip line resonator realized using epitaxially grown copper films on single crystal Al$_2$O$_3$ substrates. The one-dimensional microstrip line resonator exhibits a quality factor of more than 200 at room temperature, close to the upper limit determined by radiation losses. The surface characterizations of the copper strip of the resonator by atomic force microscope, low-energy electron diffraction, and scanning tunneling microscope show that the surface is atomically clean, flat, and single crystalline. Measuring the ESR spectrum at 15 K from a few nm thick molecular film of YPc$_2$, we find a continuous-wave ESR sensitivity of $2.6 \cdot 10^{11}~\text{spins}/\text{G} \cdot \text{Hz}^{1/2}$ indicating that a signal-to-noise ratio of $3.9~\text{G} \cdot \text{Hz}^{1/2}$ is expected from a monolayer of YPc$_2$ molecules. Advanced pulsed ESR experimental capabilities including dynamical decoupling and electron-nuclear double resonance are demonstrated using free radicals diluted in a glassy matrix.

Submitted 20 February, 2024; v1 submitted 1 December, 2023; originally announced December 2023.

Comments: 14 pages, 7 figures

arXiv:2311.11212 [pdf, other]

Can We Utilize Pre-trained Language Models within Causal Discovery Algorithms?

Authors: Chanhui Lee, Juhyeon Kim, Yongjun Jeong, Juhyun Lyu, Junghee Kim, Sangmin Lee, Sangjun Han, Hyeokjun Choe, Soyeon Park, Woohyung Lim, Sungbin Lim, Sanghack Lee

Abstract: Scaling laws have allowed Pre-trained Language Models (PLMs) into the field of causal reasoning. Causal reasoning of PLM relies solely on text-based descriptions, in contrast to causal discovery which aims to determine the causal relationships between variables utilizing data. Recently, there has been current research regarding a method that mimics causal discovery by aggregating the outcomes of repetitive causal reasoning, achieved through specifically designed prompts. It highlights the usefulness of PLMs in discovering cause and effect, which is often limited by a lack of data, especially when dealing with multiple variables. Conversely, the characteristics of PLMs which are that PLMs do not analyze data and they are highly dependent on prompt design leads to a crucial limitation for directly using PLMs in causal discovery. Accordingly, PLM-based causal reasoning deeply depends on the prompt design and carries out the risk of overconfidence and false predictions in determining causal relationships. In this paper, we empirically demonstrate the aforementioned limitations of PLM-based causal reasoning through experiments on physics-inspired synthetic data. Then, we propose a new framework that integrates prior knowledge obtained from PLM with a causal discovery algorithm. This is accomplished by initializing an adjacency matrix for causal discovery and incorporating regularization using prior knowledge. Our proposed framework not only demonstrates improved performance through the integration of PLM and causal discovery but also suggests how to leverage PLM-extracted prior knowledge with existing causal discovery algorithms.

Submitted 18 November, 2023; originally announced November 2023.

ACM Class: I.2

arXiv:2310.14205 [pdf]

doi 10.1186/s40580-023-00359-5

Machine-learning-assisted analysis of transition metal dichalcogenide thin-film growth

Authors: Hyuk Jin Kim, Minsu Chong, Tae Gyu Rhee, Yeong Gwang Khim, Min-Hyoung Jung, Young-Min Kim, Hu Young Jeong, Byoung Ki Choi, Young Jun Chang

Abstract: In situ reflective high-energy electron diffraction (RHEED) is widely used to monitor the surface crystalline state during thin-film growth by molecular beam epitaxy (MBE) and pulsed laser deposition. With the recent development of machine learning (ML), ML-assisted analysis of RHEED videos aids in interpreting the complete RHEED data of oxide thin films. The quantitative analysis of RHEED data allows us to characterize and categorize the growth modes step by step, and extract hidden knowledge of the epitaxial film growth process. In this study, we employed the ML-assisted RHEED analysis method to investigate the growth of 2D thin films of transition metal dichalcogenides (ReSe2) on graphene substrates by MBE. Principal component analysis (PCA) and K-means clustering were used to separate statistically important patterns and visualize the trend of pattern evolution without any notable loss of information. Using the modified PCA, we could monitor the diffraction intensity of solely the ReSe2 layers by filtering out the substrate contribution. These findings demonstrate that ML analysis can be successfully employed to examine and understand the film-growth dynamics of 2D materials. Further, the ML-based method can pave the way for the development of advanced real-time monitoring and autonomous material synthesis techniques.

Submitted 22 October, 2023; originally announced October 2023.

Comments: 21 pages, 4 figures

Journal ref: Nano Convergence 10, 10 (2023)

arXiv:2310.05667 [pdf]

doi 10.1038/s41467-024-50475-x

Interplay of valley, layer and band topology towards interacting quantum phases in moiré bilayer graphene

Authors: Yungi Jeong, Hangyeol Park, Taeho Kim, Kenji Watanabe, Takashi Taniguchi, Jeil Jung, Joonho Jang

Abstract: In Bernal-stacked bilayer graphene (BBG), the Landau levels give rise to an intimate connection between valley and layer degrees of freedom. Adding a moiré superlattice potential enriches the BBG physics with the formation of topological minibands - potentially leading to tunable exotic quantum transport. Here, we present magnetotransport measurements of a high-quality bilayer graphene-hexagonal boron nitride (hBN) heterostructure. The zero-degree alignment generates a strong moiré superlattice potential for the electrons in BBG and the resulting Landau fan diagram of longitudinal and Hall resistance displays a Hofstadter butterfly pattern with a high level of detail. We demonstrate that the intricate relationship between valley and layer degrees of freedom controls the topology of moiré-induced bands, significantly influencing the energetics of interacting quantum phases in the BBG superlattice. We further observe signatures of field-induced correlated insulators, helical edge states and clear quantizations of interaction-driven topological quantum phases, such as symmetry broken Chern insulators.

Submitted 1 August, 2024; v1 submitted 9 October, 2023; originally announced October 2023.

Journal ref: Nat Commun 15, 6351 (2024)

arXiv:2309.14668 [pdf]

Depolarized Holography with Polarization-multiplexing Metasurface

Authors: Seung-Woo Nam, Youngjin Kim, Dongyeon Kim, Yoonchan Jeong

Abstract: The evolution of computer-generated holography (CGH) algorithms has prompted significant improvements in the performances of holographic displays. Nonetheless, they start to encounter a limited degree of freedom in CGH optimization and physical constraints stemming from the coherent nature of holograms. To surpass the physical limitations, we consider polarization as a new degree of freedom by utilizing a novel optical platform called metasurface. Polarization-multiplexing metasurfaces enable incoherent-like behavior in holographic displays due to the mutual incoherence of orthogonal polarization states. We leverage this unique characteristic of a metasurface by integrating it into a holographic display and exploiting polarization diversity to bring an additional degree of freedom for CGH algorithms. To minimize the speckle noise while maximizing the image quality, we devise a fully differentiable optimization pipeline by taking into account the metasurface proxy model, thereby jointly optimizing spatial light modulator phase patterns and geometric parameters of metasurface nanostructures. We evaluate the metasurface-enabled depolarized holography through simulations and experiments, demonstrating its ability to reduce speckle noise and enhance image quality.

Submitted 26 September, 2023; originally announced September 2023.

Comments: 15 pages, 13 figures, to be published in SIGGRAPH Asia 2023

arXiv:2309.04509 [pdf, other]

The Power of Sound (TPoS): Audio Reactive Video Generation with Stable Diffusion

Authors: Yujin Jeong, Wonjeong Ryoo, Seunghyun Lee, Dabin Seo, Wonmin Byeon, Sangpil Kim, Jinkyu Kim

Abstract: In recent years, video generation has become a prominent generative tool and has drawn significant attention. However, there is little consideration in audio-to-video generation, though audio contains unique qualities like temporal semantics and magnitude. Hence, we propose The Power of Sound (TPoS) model to incorporate audio input that includes both changeable temporal semantics and magnitude. To generate video frames, TPoS utilizes a latent stable diffusion model with textual semantic information, which is then guided by the sequential audio embedding from our pretrained Audio Encoder. As a result, this method produces audio reactive video contents. We demonstrate the effectiveness of TPoS across various tasks and compare its results with current state-of-the-art techniques in the field of audio-to-video generation. More examples are available at https://ku-vai.github.io/TPoS/

Submitted 8 September, 2023; originally announced September 2023.

Comments: ICCV2023

arXiv:2309.02064 [pdf, other]

doi 10.1145/3583780.3615243

MvFS: Multi-view Feature Selection for Recommender System

Authors: Youngjune Lee, Yeongjong Jeong, Keunchan Park, SeongKu Kang

Abstract: Feature selection, which is a technique to select key features in recommender systems, has received increasing research attention. Recently, Adaptive Feature Selection (AdaFS) has shown remarkable performance by adaptively selecting features for each data instance, considering that the importance of a given feature field can vary significantly across data. However, this method still has limitations in that its selection process could be easily biased to major features that frequently occur. To address these problems, we propose Multi-view Feature Selection (MvFS), which selects informative features for each instance more effectively. Most importantly, MvFS employs a multi-view network consisting of multiple sub-networks, each of which learns to measure the feature importance of a part of data with different feature patterns. By doing so, MvFS mitigates the bias problem towards dominant patterns and promotes a more balanced feature selection process. Moreover, MvFS adopts an effective importance score modeling strategy which is applied independently to each field without incurring dependency among features. Experimental results on real-world datasets demonstrate the effectiveness of MvFS compared to state-of-the-art baselines.

Submitted 6 September, 2023; v1 submitted 5 September, 2023; originally announced September 2023.

Comments: CIKM 2023

arXiv:2308.04176 [pdf, other]

On Monotonic Aggregation for Open-domain QA

Authors: Sang-eun Han, Yeonseok Jeong, Seung-won Hwang, Kyungjae Lee

Abstract: Question answering (QA) is a critical task for speech-based retrieval from knowledge sources, by sifting only the answers without requiring to read supporting documents. Specifically, open-domain QA aims to answer user questions on unrestricted knowledge sources. Ideally, adding a source should not decrease the accuracy, but we find this property (denoted as "monotonicity") does not hold for current state-of-the-art methods. We identify the cause, and based on that we propose Judge-Specialist framework. Our framework consists of (1) specialist retrievers/readers to cover individual sources, and (2) judge, a dedicated language model to select the final answer. Our experiments show that our framework not only ensures monotonicity, but also outperforms state-of-the-art multi-source QA methods on Natural Questions. Additionally, we show that our models robustly preserve the monotonicity against noise from speech recognition. We publicly release our code and setting.

Submitted 8 August, 2023; originally announced August 2023.

Comments: INTERSPEECH 2023 Camera Ready

arXiv:2308.02808 [pdf, other]

Forward production of prompt neutrinos in the atmosphere and at high-energy colliders

Authors: Yu Seon Jeong, Weidong Bai, Milind Diwan, Maria Vittoria Garzelli, Karan Kumar, Mary Hall Reno

Abstract: The atmospheric neutrino flux at very high energies is dominated by prompt neutrinos, mostly contributed by the decays of charmed hadrons produced in the forward direction by cosmic ray interactions with air nuclei. Theoretical predictions of the prompt atmospheric neutrino flux have large uncertainties mainly related to charm hadron production. Prompt neutrinos can also be studied through high-energy colliders. In particular, two ongoing forward experiments and the proposed Forward Physics Facility at the LHC can detect forward prompt neutrinos. We will present the kinematic regions relevant to the prompt atmospheric neutrino flux in terms of collider kinematic variables, the collision energy $\sqrt{s}$ and the center-of-mass rapidity of charm hadrons $y$, and discuss implications of the forward experiments at the LHC on the theoretical predictions of the prompt atmospheric neutrino flux.

Submitted 5 August, 2023; originally announced August 2023.

Comments: 8 pages, 4 figures, a proceeding for ICRC 2023

arXiv:2308.00558 [pdf, other]

Gradient Scaling on Deep Spiking Neural Networks with Spike-Dependent Local Information

Authors: Seongsik Park, Jeonghee Jo, Jongkil Park, Yeonjoo Jeong, Jaewook Kim, Suyoun Lee, Joon Young Kwak, Inho Kim, Jong-Keuk Park, Kyeong Seok Lee, Gye Weon Hwang, Hyun Jae Jang

Abstract: Deep spiking neural networks (SNNs) are promising neural networks for their model capacity from deep neural network architecture and energy efficiency from SNNs' operations. To train deep SNNs, recently, spatio-temporal backpropagation (STBP) with surrogate gradient was proposed. Although deep SNNs have been successfully trained with STBP, they cannot fully utilize spike information. In this work, we proposed gradient scaling with local spike information, which is the relation between pre- and post-synaptic spikes. Considering the causality between spikes, we could enhance the training performance of deep SNNs. According to our experiments, we could achieve higher accuracy with lower spikes by adopting the gradient scaling on image classification tasks, such as CIFAR10 and CIFAR100.

Submitted 1 August, 2023; originally announced August 2023.

Comments: ICML-23 Localized Learning Workshop: Decentralized Model Updates via Non-Global Objectives

arXiv:2307.09241 [pdf, other]

Neutrino Cross Sections: Interface of shallow- and deep-inelastic scattering for collider neutrinos

Authors: Yu Seon Jeong, Mary Hall Reno

Abstract: Neutrino experiments in a Forward Physics Facility at the Large Hadron Collider can measure neutrino and antineutrino cross sections for energies up to a few TeV. For neutrino energies below 100 GeV, the inelastic cross section evaluations have contributions from weak structure functions at low momentum transfers and low hadronic final state invariant mass. To evaluate the size of these contributions to the neutrino cross section, we use a parametrization of the electron-proton structure function, adapted for neutrino scattering, augmented with a correction to account for the partial conservation of the axial vector current, and normalized to structure functions evaluated at next-to-leading order in QCD, with target mass corrections and heavy quark corrections. We compare our results with other approaches to account for this kinematic region in neutrino cross section for energies between 10--1000 GeV on isoscalar nucleon and iron targets.

Submitted 4 December, 2023; v1 submitted 18 July, 2023; originally announced July 2023.

Comments: 17 pages, 11 figures, to be published in PRD

arXiv:2306.17391 [pdf, other]

EyeBAG: Accurate Control of Eye Blink and Gaze Based on Data Augmentation Leveraging Style Mixing

Authors: Bryan S. Kim, Jeong Young Jeong, Wonjong Ryu

Abstract: Recent developments in generative models have enabled the generation of photo-realistic human face images, and downstream tasks utilizing face generation technology have advanced accordingly. However, models for downstream tasks are yet substandard at eye control (e.g. eye blink, gaze redirection). To overcome such eye control problems, we introduce a novel framework consisting of two distinct modules: a blink control module and a gaze redirection module. We also propose a novel data augmentation method to train each module, leveraging style mixing to obtain images with desired features. We show that our framework produces eye-controlled images of high quality, and demonstrate how it can be used to improve the performance of downstream tasks.

Submitted 29 June, 2023; originally announced June 2023.

arXiv:2306.16670 [pdf, other]

doi 10.1109/TCSVT.2023.3302858

End-to-End Learnable Multi-Scale Feature Compression for VCM

Authors: Yeongwoong Kim, Hyewon Jeong, Janghyun Yu, Younhee Kim, Jooyoung Lee, Se Yoon Jeong, Hui Yong Kim

Abstract: The proliferation of deep learning-based machine vision applications has given rise to a new type of compression, so called video coding for machine (VCM). VCM differs from traditional video coding in that it is optimized for machine vision performance instead of human visual quality. In the feature compression track of MPEG-VCM, multi-scale features extracted from images are subject to compression. Recent feature compression works have demonstrated that the versatile video coding (VVC) standard-based approach can achieve a BD-rate reduction of up to 96% against MPEG-VCM feature anchor. However, it is still sub-optimal as VVC was not designed for extracted features but for natural images. Moreover, the high encoding complexity of VVC makes it difficult to design a lightweight encoder without sacrificing performance. To address these challenges, we propose a novel multi-scale feature compression method that enables both the end-to-end optimization on the extracted features and the design of lightweight encoders. The proposed model combines a learnable compressor with a multi-scale feature fusion network so that the redundancy in the multi-scale features is effectively removed. Instead of simply cascading the fusion network and the compression network, we integrate the fusion and encoding processes in an interleaved way. Our model first encodes a larger-scale feature to obtain a latent representation and then fuses the latent with a smaller-scale feature. This process is successively performed until the smallest-scale feature is fused and then the encoded latent at the final stage is entropy-coded for transmission. The results show that our model outperforms previous approaches by at least 52% BD-rate reduction and has $\times5$ to $\times27$ times less encoding time for object detection...

Submitted 8 August, 2023; v1 submitted 29 June, 2023; originally announced June 2023.

Comments: 13 pages, accepted by IEEE Transactions on Circuits and Systems for Video Technology

arXiv:2306.11406 [pdf, other]

Stable and Consistent Prediction of 3D Characteristic Orientation via Invariant Residual Learning

Authors: Seungwook Kim, Chunghyun Park, Yoonwoo Jeong, Jaesik Park, Minsu Cho

Abstract: Learning to predict reliable characteristic orientations of 3D point clouds is an important yet challenging problem, as different point clouds of the same class may have largely varying appearances. In this work, we introduce a novel method to decouple the shape geometry and semantics of the input point cloud to achieve both stability and consistency. The proposed method integrates shape-geometry-based SO(3)-equivariant learning and shape-semantics-based SO(3)-invariant residual learning, where a final characteristic orientation is obtained by calibrating an SO(3)-equivariant orientation hypothesis using an SO(3)-invariant residual rotation. In experiments, the proposed method not only demonstrates superior stability and consistency but also exhibits state-of-the-art performances when applied to point cloud part segmentation, given randomly rotated inputs.

Submitted 20 June, 2023; originally announced June 2023.

Comments: Accepted to ICML 2023

arXiv:2306.10989 [pdf, other]

Scaling of Class-wise Training Losses for Post-hoc Calibration

Authors: Seungjin Jung, Seungmo Seo, Yonghyun Jeong, Jongwon Choi

Abstract: The class-wise training losses often diverge as a result of the various levels of intra-class and inter-class appearance variation, and we find that the diverging class-wise training losses cause the uncalibrated prediction with its reliability. To resolve the issue, we propose a new calibration method to synchronize the class-wise training losses. We design a new training loss to alleviate the variance of class-wise training losses by using multiple class-wise scaling factors. Since our framework can compensate the training losses of overfitted classes with those of under-fitted classes, the integrated training loss is preserved, preventing the performance drop even after the model calibration. Furthermore, our method can be easily employed in the post-hoc calibration methods, allowing us to use the pre-trained model as an initial model and reduce the additional computation for model calibration. We validate the proposed framework by employing it in the various post-hoc calibration methods, which generally improves calibration performance while preserving accuracy, and discover through the investigation that our approach performs well with unbalanced datasets and untuned hyperparameters.

Submitted 19 June, 2023; originally announced June 2023.

Comments: Published at ICML 2023. Camera ready version

arXiv:2305.10084 [pdf, other]

CWD30: A Comprehensive and Holistic Dataset for Crop Weed Recognition in Precision Agriculture

Authors: Talha Ilyas, Dewa Made Sri Arsa, Khubaib Ahmad, Yong Chae Jeong, Okjae Won, Jong Hoon Lee, Hyongsuk Kim

Abstract: The growing demand for precision agriculture necessitates efficient and accurate crop-weed recognition and classification systems. Current datasets often lack the sample size, diversity, and hierarchical structure needed to develop robust deep learning models for discriminating crops and weeds in agricultural fields. Moreover, the similar external structure and phenomics of crops and weeds complicate recognition tasks. To address these issues, we present the CWD30 dataset, a large-scale, diverse, holistic, and hierarchical dataset tailored for crop-weed recognition tasks in precision agriculture. CWD30 comprises over 219,770 high-resolution images of 20 weed species and 10 crop species, encompassing various growth stages, multiple viewing angles, and environmental conditions. The images were collected from diverse agricultural fields across different geographic locations and seasons, ensuring a representative dataset. The dataset's hierarchical taxonomy enables fine-grained classification and facilitates the development of more accurate, robust, and generalizable deep learning models. We conduct extensive baseline experiments to validate the efficacy of the CWD30 dataset. Our experiments reveal that the dataset poses significant challenges due to intra-class variations, inter-class similarities, and data imbalance. Additionally, we demonstrate that minor training modifications like using CWD30 pretrained backbones can significantly enhance model performance and reduce convergence time, saving training resources on several downstream tasks. These challenges provide valuable insights and opportunities for future research in crop-weed recognition. We believe that the CWD30 dataset will serve as a benchmark for evaluating crop-weed recognition algorithms, promoting advancements in precision agriculture, and fostering collaboration among researchers in the field.

Submitted 17 May, 2023; originally announced May 2023.

Comments: 15 pages, 14 figures, journal research article

arXiv:2305.09986 [pdf, other]

A robust multi-domain network for short-scanning amyloid PET reconstruction

Authors: Hyoung Suk Park, Young Jin Jeong, Kiwan Jeon

Abstract: This paper presents a robust multi-domain network designed to restore low-quality amyloid PET images acquired in a short period of time. The proposed method is trained on pairs of PET images from short (2 minutes) and standard (20 minutes) scanning times, sourced from multiple domains. Learning relevant image features between these domains with a single network is challenging. Our key contribution is the introduction of a mapping label, which enables effective learning of specific representations between different domains. The network, trained with various mapping labels, can efficiently correct amyloid PET datasets in multiple training domains and unseen domains, such as those obtained with new radiotracers, acquisition protocols, or PET scanners. Internal, temporal, and external validations demonstrate the effectiveness of the proposed method. Notably, for external validation datasets from unseen domains, the proposed method achieved comparable or superior results relative to methods trained with these datasets, in terms of quantitative metrics such as normalized root mean-square error and structure similarity index measure. Two nuclear medicine physicians evaluated the amyloid status as positive or negative for the external validation datasets, with accuracies of 0.970 and 0.930 for readers 1 and 2, respectively.

Submitted 17 May, 2023; originally announced May 2023.

Comments: 21 pages, 7 figures, 3 tables

MSC Class: 92C55; 68T05; 15A29; 65F22

arXiv:2305.04468 [pdf, other]

AnomalyBERT: Self-Supervised Transformer for Time Series Anomaly Detection using Data Degradation Scheme

Authors: Yungi Jeong, Eunseok Yang, Jung Hyun Ryu, Imseong Park, Myungjoo Kang

Abstract: Mechanical defects in real situations affect observation values and cause abnormalities in multivariate time series, such as sensor values or network data. To perceive abnormalities in such data, it is crucial to understand the temporal context and interrelation between variables simultaneously. The anomaly detection task for time series, especially for unlabeled data, has been a challenging problem, and we address it by applying a suitable data degradation scheme to self-supervised model training. We define four types of synthetic outliers and propose the degradation scheme in which a portion of input data is replaced with one of the synthetic outliers. Inspired by the self-attention mechanism, we design a Transformer-based architecture to recognize the temporal context and detect unnatural sequences with high efficiency. Our model converts multivariate data points into temporal representations with relative position bias and yields anomaly scores from these representations. Our method, AnomalyBERT, shows a great capability of detecting anomalies contained in complex time series and surpasses previous state-of-the-art methods on five real-world benchmarks. Our code is available at https://github.com/Jhryu30/AnomalyBERT.

Submitted 8 May, 2023; originally announced May 2023.

Comments: 11 pages, Presented at ICLR 2023 workshop on Machine Learning for IoT

arXiv:2304.03173 [pdf]

doi 10.1515/nanoph-2023-0125

Electrical addressing of exceptional points in compact plasmonic structures

Authors: Hoon Yeub Jeong, Yeonsoo Lim, Jungho Han, Soo-Chan An, Young Chul Jun

Abstract: Exceptional points (EPs) are degenerate singularities in a non-Hermitian system that can be induced by controlling the interaction between resonant photonic modes. EPs can enable unusual optical phenomena and significantly enhance the optical sensitivity under small perturbations. However, most studies thus far have been limited to static photonic structures. In this study, we propose and experimentally demonstrate electrically addressable EP in a plasmonic structure. Inspired by optical microcavity studies, we employ a localized spoof plasmon structure that supports circulating plasmonic modes in a compact single-resonator geometry. The plasmonic modes are perturbed by an angled metal line, and the interaction between the plasmonic modes is electrically controlled using a varactor. Continuous electrical tuning of the varactor capacitance facilitates simultaneous coalescence of the real and imaginary parts of the eigenfrequency, allowing the direct addressing of EPs. We first investigate the eigenmodes and their coupling in localized plasmonic structures using numerical simulations. We then present experimentally measured spectra that manifest the coalescence of the two resonant modes in both the resonance frequency and linewidth. Electrically addressable EPs in compact plasmonic structures may provide exciting opportunities for highly functional and tunable elements in integrated device platforms.

Submitted 2 April, 2023; originally announced April 2023.

Comments: 25 pages

Journal ref: Nanophotonics, 12, 2029 (2023)

arXiv:2304.01888 [pdf]

Quantitative perfusion and water transport time model from multi b-value diffusion magnetic resonance imaging validated against neutron capture microspheres

Authors: M. Liu, N. Saadat, Y. Jeong, S. Roth, M. Niekrasz, M. Giurcanu, T. Carroll, G. Christoforidis

Abstract: Intravoxel Incoherent Motion (IVIM) is a non-contrast magnetic resonance imaging diffusion-based scan that uses a multitude of b-values to measure various speeds of molecular perfusion and diffusion, sidestepping inaccuracy of arterial input functions or bolus kinetics in quantitative imaging. We test a new method of IVIM quantification and compare our values to reference standard neutron capture microspheres across normocapnia, CO2 induced hypercapnia, and middle cerebral artery occlusion in a controlled animal model. Perfusion quantification in ml/100g/min compared to microsphere perfusion uses the 3D gaussian probability distribution and defined water transport time as when 50% of the molecules remain in the tissue of interest. Perfusion, water transport time, and infarct volume was compared to reference standards. Simulations were studied to suppress non-specific cerebrospinal fluid (CSF). Linear regression analysis of quantitative perfusion returned correlation (slope = .55, intercept = 52.5, $R^2$= .64). Linear regression for water transport time asymmetry in infarcted tissue was excellent (slope = .59, intercept = .3, $R^2$ = .93). Strong linear agreement also was found for infarct volume (slope = 1.01, $R^2$= .79). Simulation of CSF suppression via inversion recovery returned blood signal reduced by 82% from combined T1 and T2 effects. Intra-physiologic state comparison of perfusion shows potential partial volume effects which require further study especially in disease states. The accuracy and sensitivity of IVIM provides evidence that observed signal changes reflect cytotoxic edema and tissue perfusion. Partial volume contamination of CSF may be better removed during post-processing rather than with inversion recovery to avoid artificial loss of blood signal.

Submitted 4 April, 2023; originally announced April 2023.

Comments: 7 pages, 1 table, 6 figures, 3 pages appendix

arXiv:2303.17007 [pdf]

doi 10.1103/PhysRevD.107.112012

Impact of cross-section uncertainties on supernova neutrino spectral parameter fitting in the Deep Underground Neutrino Experiment

Authors: DUNE Collaboration, A. Abed Abud, B. Abi, R. Acciarri, M. A. Acero, M. R. Adames, G. Adamov, M. Adamowski, D. Adams, M. Adinolfi, C. Adriano, A. Aduszkiewicz, J. Aguilar, Z. Ahmad, J. Ahmed, B. Aimard, F. Akbar, K. Allison, S. Alonso Monsalve, M. Alrashed, A. Alton, R. Alvarez, P. Amedo, J. Anderson, D. A. Andrade, et al. (1294 additional authors not shown)

Abstract: A primary goal of the upcoming Deep Underground Neutrino Experiment (DUNE) is to measure the $\mathcal{O}(10)$ MeV neutrinos produced by a Galactic core-collapse supernova if one should occur during the lifetime of the experiment. The liquid-argon-based detectors planned for DUNE are expected to be uniquely sensitive to the $ν_e$ component of the supernova flux, enabling a wide variety of physics and astrophysics measurements. A key requirement for a correct interpretation of these measurements is a good understanding of the energy-dependent total cross section $σ(E_ν)$ for charged-current $ν_e$ absorption on argon. In the context of a simulated extraction of supernova $ν_e$ spectral parameters from a toy analysis, we investigate the impact of $σ(E_ν)$ modeling uncertainties on DUNE's supernova neutrino physics sensitivity for the first time. We find that the currently large theoretical uncertainties on $σ(E_ν)$ must be substantially reduced before the $ν_e$ flux parameters can be extracted reliably: in the absence of external constraints, a measurement of the integrated neutrino luminosity with less than 10% bias with DUNE requires $σ(E_ν)$ to be known to about 5%. The neutrino spectral shape parameters can be known to better than 10% for a 20% uncertainty on the cross-section scale, although they will be sensitive to uncertainties on the shape of $σ(E_ν)$. A direct measurement of low-energy $ν_e$-argon scattering would be invaluable for improving the theoretical precision to the needed level.

Submitted 7 July, 2023; v1 submitted 29 March, 2023; originally announced March 2023.

Comments: 25 pages, 21 figures

Report number: FERMILAB-PUB-23-132-CSAID-LBNF-ND-T

Journal ref: Phys. Rev. D 107, 112012 (2023)

arXiv:2303.15314 [pdf]

Development of a thorium coating on an aluminium substrate by using electrodeposition method and alpha spectroscopy

Authors: Dal-Ho Moon, Vivek Chavan, Vasant Bhoraskar, Yeong Hoon Jeong, Jung Ho Park, Su-Jeong Suh, Seung-Woo Hong

Abstract: A thin coating of thorium on aluminium substrates with the areal density of 110 to 130 $μg/cm^2$ is developed over a circular area of 22 mm diameter by using the electrodeposition method. An electrodeposition system is fabricated to consist of three components; an anode made of a platinum mesh, a cylindrical-shape vessel to contain the thorium solution, and a cathode in the form of a circular aluminium plate. The aluminium plate is mounted horizontally, and the platinum mesh is connected to an axial rod of an electric motor, mounted vertically and normal to the plane of the aluminium. The electrolyte solution is prepared by dissolving a known-weight thorium nitrate powder in 0.8 M HNO3 and isopropanol. The system is operated either in constant voltage (CV) or constant current (CC) mode. Under the electric field between the anode and cathode, thorium ions were deposited on the aluminium substrate mounted on the cathode. In the CV mode at 320, 360, and 400 V and in the CC mode at 15 mA, thorium films were formed over a circular area of the aluminium substrate. The areal density of thorium coating was measured by detecting emitted alpha particles. The areal density of thorium varied from 80 to 130 $μg/cm^2$ by changing the deposition time from 10 to 60 min. The results from the CV mode and CC mode are compared, and the radial dependence in the measured areal density is discussed for different modes of the electric field. The developed thorium coatings are to be used in the in-house development of particle detectors, fast neutron converters, targets for thorium fission experiments, and other purposes.

Submitted 11 March, 2023; originally announced March 2023.

Comments: 11 pages, 5 figures, 1 table

arXiv:2303.09947 [pdf, ps, other]

Optimizing EV Chargers Location via Integer Programming

Authors: Seungmo Kim, Yeonho Jeong, Jae-Won Nam

Abstract: There is no question to the fact that electric vehicles (EVs) are the most viable solution to the climate change that the planet has long been combating. Along the same line, it is a salient subject to expand the availability of charging infrastructure, which quintessentially necessitates the optimization of the charger's locations. This paper proposes to formulate the optimal EV charger location problem into a facility location problem (FLP). As an effort to find an efficient method to solve the well-known nonpolynomial deterministic (NP)-hard problem, we present a comparative quantification among several representative solving techniques.

Submitted 17 March, 2023; originally announced March 2023.

Showing 1–50 of 254 results for author: Jeong, Y