Search | arXiv e-print repository

Lowering threshold of NaI(Tl) scintillator to 0.7 keV in the COSINE-100 experiment

Authors: G. H. Yu, N. Carlin, J. Y. Cho, J. J. Choi, S. Choi, A. C. Ezeribe, L. E. França, C. Ha, I. S. Hahn, S. J. Hollick, E. J. Jeon, H. W. Joo, W. G. Kang, M. Kauer, B. H. Kim, H. J. Kim, J. Kim, K. W. Kim, S. H. Kim, S. K. Kim, W. K. Kim, Y. D. Kim, Y. H. Kim, Y. J. Ko, D. H. Lee , et al. (34 additional authors not shown)

Abstract: COSINE-100 is a direct dark matter search experiment, with the primary goal of testing the annual modulation signal observed by DAMA/LIBRA, using the same target material, NaI(Tl). In previous analyses, we achieved the same 1 keV energy threshold used in the DAMA/LIBRA's analysis that reported an annual modulation signal with 11.6$σ$ significance. In this article, we report an improved analysis th… ▽ More COSINE-100 is a direct dark matter search experiment, with the primary goal of testing the annual modulation signal observed by DAMA/LIBRA, using the same target material, NaI(Tl). In previous analyses, we achieved the same 1 keV energy threshold used in the DAMA/LIBRA's analysis that reported an annual modulation signal with 11.6$σ$ significance. In this article, we report an improved analysis that lowered the threshold to 0.7 keV, thanks to the application of Multi-Layer Perception network and a new likelihood parameter with waveforms in the frequency domain. The lower threshold would enable a better comparison of COSINE-100 with new DAMA results with a 0.75 keV threshold and account for differences in quenching factors. Furthermore the lower threshold can enhance COSINE-100's sensitivity to sub-GeV dark matter searches. △ Less

Submitted 26 August, 2024; originally announced August 2024.

arXiv:2408.10593 [pdf, other]

An Efficient Sign Language Translation Using Spatial Configuration and Motion Dynamics with LLMs

Authors: Eui Jun Hwang, Sukmin Cho, Junmyeong Lee, Jong C. Park

Abstract: Gloss-free Sign Language Translation (SLT) converts sign videos directly into spoken language sentences without relying on glosses. Recently, Large Language Models (LLMs) have shown remarkable translation performance in gloss-free methods by harnessing their powerful natural language generation capabilities. However, these methods often rely on domain-specific fine-tuning of visual encoders to ach… ▽ More Gloss-free Sign Language Translation (SLT) converts sign videos directly into spoken language sentences without relying on glosses. Recently, Large Language Models (LLMs) have shown remarkable translation performance in gloss-free methods by harnessing their powerful natural language generation capabilities. However, these methods often rely on domain-specific fine-tuning of visual encoders to achieve optimal results. By contrast, this paper emphasizes the importance of capturing the spatial configurations and motion dynamics inherent in sign language. With this in mind, we introduce Spatial and Motion-based Sign Language Translation (SpaMo), a novel LLM-based SLT framework. The core idea of SpaMo is simple yet effective. We first extract spatial and motion features using off-the-shelf visual encoders and then input these features into an LLM with a language prompt. Additionally, we employ a visual-text alignment process as a warm-up before the SLT supervision. Our experiments demonstrate that SpaMo achieves state-of-the-art performance on two popular datasets, PHOENIX14T and How2Sign. △ Less

Submitted 20 August, 2024; originally announced August 2024.

Comments: Under Review

arXiv:2408.09806 [pdf, other]

Improved background modeling for dark matter search with COSINE-100

Authors: G. H. Yu, N. Carlin, J. Y. Cho, J. J. Choi, S. Choi, A. C. Ezeribe, L. E. Franca, C. Ha, I. S. Hahn, S. J. Hollick, E. J. Jeon, H. W. Joo, W. G. Kang, M. Kauer, B. H. Kim, H. J. Kim, J. Kim, K. W. Kim, S. H. Kim, S. K. Kim, W. K. Kim, Y. D. Kim, Y. H. Kim, Y. J. Ko, D. H. Lee , et al. (33 additional authors not shown)

Abstract: COSINE-100 aims to conclusively test the claimed dark matter annual modulation signal detected by DAMA/LIBRA collaboration. DAMA/LIBRA has released updated analysis results by lowering the energy threshold to 0.75 keV through various upgrades. They have consistently claimed to have observed the annual modulation. In COSINE-100, it is crucial to lower the energy threshold for a direct comparison wi… ▽ More COSINE-100 aims to conclusively test the claimed dark matter annual modulation signal detected by DAMA/LIBRA collaboration. DAMA/LIBRA has released updated analysis results by lowering the energy threshold to 0.75 keV through various upgrades. They have consistently claimed to have observed the annual modulation. In COSINE-100, it is crucial to lower the energy threshold for a direct comparison with DAMA/LIBRA, which also enhances the sensitivity of the search for low-mass dark matter, enabling COSINE-100 to explore this area. Therefore, it is essential to have a precise and quantitative understanding of the background spectrum across all energy ranges. This study expands the background modeling from 0.7 to 4000 keV using 2.82 years of COSINE-100 data. The modeling has been improved to describe the background spectrum across all energy ranges accurately. Assessments of the background spectrum are presented, considering the nonproportionality of NaI(Tl) crystals at both low and high energies and the characteristic X-rays produced by the interaction of external backgrounds with materials such as copper. Additionally, constraints on the fit parameters obtained from the alpha spectrum modeling fit are integrated into this model. These improvements are detailed in the paper. △ Less

Submitted 19 August, 2024; originally announced August 2024.

arXiv:2407.03627 [pdf, other]

DSLR: Document Refinement with Sentence-Level Re-ranking and Reconstruction to Enhance Retrieval-Augmented Generation

Authors: Taeho Hwang, Soyeong Jeong, Sukmin Cho, SeungYoon Han, Jong C. Park

Abstract: Recent advancements in Large Language Models (LLMs) have significantly improved their performance across various Natural Language Processing (NLP) tasks. However, LLMs still struggle with generating non-factual responses due to limitations in their parametric memory. Retrieval-Augmented Generation (RAG) systems address this issue by incorporating external knowledge with a retrieval module. Despite… ▽ More Recent advancements in Large Language Models (LLMs) have significantly improved their performance across various Natural Language Processing (NLP) tasks. However, LLMs still struggle with generating non-factual responses due to limitations in their parametric memory. Retrieval-Augmented Generation (RAG) systems address this issue by incorporating external knowledge with a retrieval module. Despite their successes, however, current RAG systems face challenges with retrieval failures and the limited ability of LLMs to filter out irrelevant information. Therefore, in this work, we propose DSLR (Document Refinement with Sentence-Level Re-ranking and Reconstruction), an unsupervised framework that decomposes retrieved documents into sentences, filters out irrelevant sentences, and reconstructs them again into coherent passages. We experimentally validate DSLR on multiple open-domain QA datasets and the results demonstrate that DSLR significantly enhances the RAG performance over conventional fixed-size passage. Furthermore, our DSLR enhances performance in specific, yet realistic scenarios without the need for additional training, providing an effective and efficient solution for refining retrieved documents in RAG systems. △ Less

Submitted 20 August, 2024; v1 submitted 4 July, 2024; originally announced July 2024.

Comments: 20 pages

Journal ref: KnowledgeNLP@ACL 2024

arXiv:2407.02854 [pdf, other]

Universal Gloss-level Representation for Gloss-free Sign Language Translation and Production

Authors: Eui Jun Hwang, Sukmin Cho, Huije Lee, Youngwoo Yoon, Jong C. Park

Abstract: Sign language, essential for the deaf and hard-of-hearing, presents unique challenges in translation and production due to its multimodal nature and the inherent ambiguity in mapping sign language motion to spoken language words. Previous methods often rely on gloss annotations, requiring time-intensive labor and specialized expertise in sign language. Gloss-free methods have emerged to address th… ▽ More Sign language, essential for the deaf and hard-of-hearing, presents unique challenges in translation and production due to its multimodal nature and the inherent ambiguity in mapping sign language motion to spoken language words. Previous methods often rely on gloss annotations, requiring time-intensive labor and specialized expertise in sign language. Gloss-free methods have emerged to address these limitations, but they often depend on external sign language data or dictionaries, failing to completely eliminate the need for gloss annotations. There is a clear demand for a comprehensive approach that can supplant gloss annotations and be utilized for both Sign Language Translation (SLT) and Sign Language Production (SLP). We introduce Universal Gloss-level Representation (UniGloR), a unified and self-supervised solution for both SLT and SLP, trained on multiple datasets including PHOENIX14T, How2Sign, and NIASL2021. Our results demonstrate UniGloR's effectiveness in the translation and production tasks. We further report an encouraging result for the Sign Language Recognition (SLR) on previously unseen data. Our study suggests that self-supervised learning can be made in a unified manner, paving the way for innovative and practical applications in future research. △ Less

Submitted 3 July, 2024; originally announced July 2024.

Comments: 14 pages, 5 figures

arXiv:2406.16013 [pdf, other]

Database-Augmented Query Representation for Information Retrieval

Authors: Soyeong Jeong, Jinheon Baek, Sukmin Cho, Sung Ju Hwang, Jong C. Park

Abstract: Information retrieval models that aim to search for the documents relevant to the given query have shown many successes, which have been applied to diverse tasks. However, the query provided by the user is oftentimes very short, which challenges the retrievers to correctly fetch relevant documents. To tackle this, existing studies have proposed expanding the query with a couple of additional (user… ▽ More Information retrieval models that aim to search for the documents relevant to the given query have shown many successes, which have been applied to diverse tasks. However, the query provided by the user is oftentimes very short, which challenges the retrievers to correctly fetch relevant documents. To tackle this, existing studies have proposed expanding the query with a couple of additional (user-related) features related to the query. Yet, they may be suboptimal to effectively augment the query, though there is plenty of information available to augment it in a relational database. Motivated by this, we present a novel retrieval framework called Database-Augmented Query representation (DAQu), which augments the original query with various (query-related) metadata across multiple tables. In addition, as the number of features in the metadata can be very large and there is no order among them, we encode them with our graph-based set encoding strategy, which considers hierarchies of features in the database without order. We validate DAQu in diverse retrieval scenarios that can incorporate metadata from the relational database, demonstrating that ours significantly enhances overall retrieval performance, compared to existing query augmentation methods. △ Less

Submitted 23 June, 2024; originally announced June 2024.

arXiv:2406.09719 [pdf, other]

Self-Knowledge Distillation for Learning Ambiguity

Authors: Hancheol Park, Soyeong Jeong, Sukmin Cho, Jong C. Park

Abstract: Recent language models have shown remarkable performance on natural language understanding (NLU) tasks. However, they are often sub-optimal when faced with ambiguous samples that can be interpreted in multiple ways, over-confidently predicting a single label without consideration for its correctness. To address this issue, we propose a novel self-knowledge distillation method that enables models t… ▽ More Recent language models have shown remarkable performance on natural language understanding (NLU) tasks. However, they are often sub-optimal when faced with ambiguous samples that can be interpreted in multiple ways, over-confidently predicting a single label without consideration for its correctness. To address this issue, we propose a novel self-knowledge distillation method that enables models to learn label distributions more accurately by leveraging knowledge distilled from their lower layers. This approach also includes a learning phase that re-calibrates the unnecessarily strengthened confidence for training samples judged as extremely ambiguous based on the distilled distribution knowledge. We validate our method on diverse NLU benchmark datasets and the experimental results demonstrate its effectiveness in producing better label distributions. Particularly, through the process of re-calibrating the confidence for highly ambiguous samples, the issue of over-confidence when predictions for unseen samples do not match with their ground-truth labels has been significantly alleviated. This has been shown to contribute to generating better distributions than the existing state-of-the-art method. Moreover, our method is more efficient in training the models compared to the existing method, as it does not involve additional training processes to refine label distributions. △ Less

Submitted 14 June, 2024; originally announced June 2024.

Comments: 9 pages, 5 figures

arXiv:2406.04064 [pdf, other]

Ask LLMs Directly, "What shapes your bias?": Measuring Social Bias in Large Language Models

Authors: Jisu Shin, Hoyun Song, Huije Lee, Soyeong Jeong, Jong C. Park

Abstract: Social bias is shaped by the accumulation of social perceptions towards targets across various demographic identities. To fully understand such social bias in large language models (LLMs), it is essential to consider the composite of social perceptions from diverse perspectives among identities. Previous studies have either evaluated biases in LLMs by indirectly assessing the presence of sentiment… ▽ More Social bias is shaped by the accumulation of social perceptions towards targets across various demographic identities. To fully understand such social bias in large language models (LLMs), it is essential to consider the composite of social perceptions from diverse perspectives among identities. Previous studies have either evaluated biases in LLMs by indirectly assessing the presence of sentiments towards demographic identities in the generated text or measuring the degree of alignment with given stereotypes. These methods have limitations in directly quantifying social biases at the level of distinct perspectives among identities. In this paper, we aim to investigate how social perceptions from various viewpoints contribute to the development of social bias in LLMs. To this end, we propose a novel strategy to intuitively quantify these social perceptions and suggest metrics that can evaluate the social biases within LLMs by aggregating diverse social perceptions. The experimental results show the quantitative demonstration of the social attitude in LLMs by examining social perception. The analysis we conducted shows that our proposed metrics capture the multi-dimensional aspects of social bias, enabling a fine-grained and comprehensive investigation of bias in LLMs. △ Less

Submitted 6 June, 2024; originally announced June 2024.

Comments: Findings of ACL 2024

arXiv:2404.13948 [pdf, other]

Typos that Broke the RAG's Back: Genetic Attack on RAG Pipeline by Simulating Documents in the Wild via Low-level Perturbations

Authors: Sukmin Cho, Soyeong Jeong, Jeongyeon Seo, Taeho Hwang, Jong C. Park

Abstract: The robustness of recent Large Language Models (LLMs) has become increasingly crucial as their applicability expands across various domains and real-world applications. Retrieval-Augmented Generation (RAG) is a promising solution for addressing the limitations of LLMs, yet existing studies on the robustness of RAG often overlook the interconnected relationships between RAG components or the potent… ▽ More The robustness of recent Large Language Models (LLMs) has become increasingly crucial as their applicability expands across various domains and real-world applications. Retrieval-Augmented Generation (RAG) is a promising solution for addressing the limitations of LLMs, yet existing studies on the robustness of RAG often overlook the interconnected relationships between RAG components or the potential threats prevalent in real-world databases, such as minor textual errors. In this work, we investigate two underexplored aspects when assessing the robustness of RAG: 1) vulnerability to noisy documents through low-level perturbations and 2) a holistic evaluation of RAG robustness. Furthermore, we introduce a novel attack method, the Genetic Attack on RAG (\textit{GARAG}), which targets these aspects. Specifically, GARAG is designed to reveal vulnerabilities within each component and test the overall system functionality against noisy documents. We validate RAG robustness by applying our \textit{GARAG} to standard QA datasets, incorporating diverse retrievers and LLMs. The experimental results show that GARAG consistently achieves high attack success rates. Also, it significantly devastates the performance of each component and their synergy, highlighting the substantial risk that minor textual inaccuracies pose in disrupting RAG systems in the real world. △ Less

Submitted 22 April, 2024; originally announced April 2024.

Comments: Under Review

arXiv:2403.14403 [pdf, other]

Adaptive-RAG: Learning to Adapt Retrieval-Augmented Large Language Models through Question Complexity

Authors: Soyeong Jeong, Jinheon Baek, Sukmin Cho, Sung Ju Hwang, Jong C. Park

Abstract: Retrieval-Augmented Large Language Models (LLMs), which incorporate the non-parametric knowledge from external knowledge bases into LLMs, have emerged as a promising approach to enhancing response accuracy in several tasks, such as Question-Answering (QA). However, even though there are various approaches dealing with queries of different complexities, they either handle simple queries with unnece… ▽ More Retrieval-Augmented Large Language Models (LLMs), which incorporate the non-parametric knowledge from external knowledge bases into LLMs, have emerged as a promising approach to enhancing response accuracy in several tasks, such as Question-Answering (QA). However, even though there are various approaches dealing with queries of different complexities, they either handle simple queries with unnecessary computational overhead or fail to adequately address complex multi-step queries; yet, not all user requests fall into only one of the simple or complex categories. In this work, we propose a novel adaptive QA framework, that can dynamically select the most suitable strategy for (retrieval-augmented) LLMs from the simplest to the most sophisticated ones based on the query complexity. Also, this selection process is operationalized with a classifier, which is a smaller LM trained to predict the complexity level of incoming queries with automatically collected labels, obtained from actual predicted outcomes of models and inherent inductive biases in datasets. This approach offers a balanced strategy, seamlessly adapting between the iterative and single-step retrieval-augmented LLMs, as well as the no-retrieval methods, in response to a range of query complexities. We validate our model on a set of open-domain QA datasets, covering multiple query complexities, and show that ours enhances the overall efficiency and accuracy of QA systems, compared to relevant baselines including the adaptive retrieval approaches. Code is available at: https://github.com/starsuzi/Adaptive-RAG. △ Less

Submitted 28 March, 2024; v1 submitted 21 March, 2024; originally announced March 2024.

Comments: NAACL 2024

arXiv:2401.07462 [pdf, other]

doi 10.1140/epjc/s10052-024-12770-1

Nonproportionality of NaI(Tl) Scintillation Detector for Dark Matter Search Experiments

Authors: S. M. Lee, G. Adhikari, N. Carlin, J. Y. Cho, J. J. Choi, S. Choi, A. C. Ezeribe, L. E. Fran. a, C. Ha, I. S. Hahn, S. J. Hollick, E. J. Jeon, H. W. Joo, W. G. Kang, M. Kauer, B. H. Kim, H. J. Kim, J. Kim, K. W. Kim, S. H. Kim, S. K. Kim, S. W. Kim, W. K. Kim, Y. D. Kim, Y. H. Kim , et al. (37 additional authors not shown)

Abstract: We present a comprehensive study of the nonproportionality of NaI(Tl) scintillation detectors within the context of dark matter search experiments. Our investigation, which integrates COSINE-100 data with supplementary $γ$ spectroscopy, measures light yields across diverse energy levels from full-energy $γ$ peaks produced by the decays of various isotopes. These $γ$ peaks of interest were produced… ▽ More We present a comprehensive study of the nonproportionality of NaI(Tl) scintillation detectors within the context of dark matter search experiments. Our investigation, which integrates COSINE-100 data with supplementary $γ$ spectroscopy, measures light yields across diverse energy levels from full-energy $γ$ peaks produced by the decays of various isotopes. These $γ$ peaks of interest were produced by decays supported by both long and short-lived isotopes. Analyzing peaks from decays supported only by short-lived isotopes presented a unique challenge due to their limited statistics and overlapping energies, which was overcome by long-term data collection and a time-dependent analysis. A key achievement is the direct measurement of the 0.87 keV light yield, resulting from the cascade following electron capture decay of $^{22}$Na from internal contamination. This measurement, previously accessible only indirectly, deepens our understanding of NaI(Tl) scintillator behavior in the region of interest for dark matter searches. This study holds substantial implications for background modeling and the interpretation of dark matter signals in NaI(Tl) experiments. △ Less

Submitted 10 May, 2024; v1 submitted 14 January, 2024; originally announced January 2024.

Comments: 12 pages, 7 figures

Journal ref: Eur. Phys. J. C 84 (2024) 484

arXiv:2311.05010 [pdf, other]

doi 10.1016/j.astropartphys.2024.102945

Alpha backgrounds in NaI(Tl) crystals of COSINE-100

Authors: G. Adhikari, N. Carlin, D. F. F. S. Cavalcante, J. Y. Cho, J. J. Choi, S. Choi, A. C. Ezeribe, L. E. Franca, C. Ha, I. S. Hahn, S. J. Hollick, E. J. Jeon, H. W. Joo, W. G. Kang, M. Kauer, B. H. Kim, H. J. Kim, J. Kim, K. W. Kim, S. H. Kim, S. K. Kim, S. W. Kim, W. K. Kim, Y. D. Kim, Y. H. Kim , et al. (38 additional authors not shown)

Abstract: COSINE-100 is a dark matter direct detection experiment with 106 kg NaI(Tl) as the target material. 210Pb and daughter isotopes are a dominant background in the WIMP region of interest and are detected via beta decay and alpha decay. Analysis of the alpha channel complements the background model as observed in the beta/gamma channel. We present the measurement of the quenching factors and Monte Ca… ▽ More COSINE-100 is a dark matter direct detection experiment with 106 kg NaI(Tl) as the target material. 210Pb and daughter isotopes are a dominant background in the WIMP region of interest and are detected via beta decay and alpha decay. Analysis of the alpha channel complements the background model as observed in the beta/gamma channel. We present the measurement of the quenching factors and Monte Carlo simulation results and activity quantification of the alpha decay components of the COSINE-100 NaI(Tl) crystals. The data strongly indicate that the alpha decays probabilistically undergo two possible quenching factors but require further investigation. The fitted results are consistent with independent measurements and improve the overall understanding of the COSINE-100 backgrounds. Furthermore, the half-life of 216Po has been measured to be 143.4 +/- 1.2 ms, which is consistent with and more precise than recent measurements. △ Less

Submitted 30 January, 2024; v1 submitted 8 November, 2023; originally announced November 2023.

arXiv:2310.17490 [pdf, other]

Improving Zero-shot Reader by Reducing Distractions from Irrelevant Documents in Open-Domain Question Answering

Authors: Sukmin Cho, Jeongyeon Seo, Soyeong Jeong, Jong C. Park

Abstract: Large language models (LLMs) enable zero-shot approaches in open-domain question answering (ODQA), yet with limited advancements as the reader is compared to the retriever. This study aims at the feasibility of a zero-shot reader that addresses the challenges of computational cost and the need for labeled data. We find that LLMs are distracted due to irrelevant documents in the retrieved set and t… ▽ More Large language models (LLMs) enable zero-shot approaches in open-domain question answering (ODQA), yet with limited advancements as the reader is compared to the retriever. This study aims at the feasibility of a zero-shot reader that addresses the challenges of computational cost and the need for labeled data. We find that LLMs are distracted due to irrelevant documents in the retrieved set and the overconfidence of the generated answers when they are exploited as zero-shot readers. To tackle these problems, we mitigate the impact of such documents via Distraction-aware Answer Selection (DAS) with a negation-based instruction and score adjustment for proper answer selection. Experimental results show that our approach successfully handles distraction across diverse scenarios, enhancing the performance of zero-shot readers. Furthermore, unlike supervised readers struggling with unseen data, zero-shot readers demonstrate outstanding transferability without any training. △ Less

Submitted 14 November, 2023; v1 submitted 26 October, 2023; originally announced October 2023.

Comments: Findings of EMNLP 2023 Camera Ready

arXiv:2310.13307 [pdf, other]

Test-Time Self-Adaptive Small Language Models for Question Answering

Authors: Soyeong Jeong, Jinheon Baek, Sukmin Cho, Sung Ju Hwang, Jong C. Park

Abstract: Recent instruction-finetuned large language models (LMs) have achieved notable performances in various tasks, such as question-answering (QA). However, despite their ability to memorize a vast amount of general knowledge across diverse tasks, they might be suboptimal on specific tasks due to their limited capacity to transfer and adapt knowledge to target tasks. Moreover, further finetuning LMs wi… ▽ More Recent instruction-finetuned large language models (LMs) have achieved notable performances in various tasks, such as question-answering (QA). However, despite their ability to memorize a vast amount of general knowledge across diverse tasks, they might be suboptimal on specific tasks due to their limited capacity to transfer and adapt knowledge to target tasks. Moreover, further finetuning LMs with labeled datasets is often infeasible due to their absence, but it is also questionable if we can transfer smaller LMs having limited knowledge only with unlabeled test data. In this work, we show and investigate the capabilities of smaller self-adaptive LMs, only with unlabeled test data. In particular, we first stochastically generate multiple answers, and then ensemble them while filtering out low-quality samples to mitigate noise from inaccurate labels. Our proposed self-adaption strategy demonstrates significant performance improvements on benchmark QA datasets with higher robustness across diverse prompts, enabling LMs to stay stable. Code is available at: https://github.com/starsuzi/T-SAS. △ Less

Submitted 20 October, 2023; originally announced October 2023.

Comments: EMNLP Findings 2023

arXiv:2310.12836 [pdf, other]

Knowledge-Augmented Language Model Verification

Authors: Jinheon Baek, Soyeong Jeong, Minki Kang, Jong C. Park, Sung Ju Hwang

Abstract: Recent Language Models (LMs) have shown impressive capabilities in generating texts with the knowledge internalized in parameters. Yet, LMs often generate the factually incorrect responses to the given queries, since their knowledge may be inaccurate, incomplete, and outdated. To address this problem, previous works propose to augment LMs with the knowledge retrieved from an external knowledge sou… ▽ More Recent Language Models (LMs) have shown impressive capabilities in generating texts with the knowledge internalized in parameters. Yet, LMs often generate the factually incorrect responses to the given queries, since their knowledge may be inaccurate, incomplete, and outdated. To address this problem, previous works propose to augment LMs with the knowledge retrieved from an external knowledge source. However, such approaches often show suboptimal text generation performance due to two reasons: 1) the model may fail to retrieve the knowledge relevant to the given query, or 2) the model may not faithfully reflect the retrieved knowledge in the generated text. To overcome these, we propose to verify the output and the knowledge of the knowledge-augmented LMs with a separate verifier, which is a small LM that is trained to detect those two types of errors through instruction-finetuning. Then, when the verifier recognizes an error, we can rectify it by either retrieving new knowledge or generating new text. Further, we use an ensemble of the outputs from different instructions with a single verifier to enhance the reliability of the verification processes. We validate the effectiveness of the proposed verification steps on multiple question answering benchmarks, whose results show that the proposed verifier effectively identifies retrieval and generation errors, allowing LMs to provide more factually correct outputs. Our code is available at https://github.com/JinheonBaek/KALMV. △ Less

Submitted 19 October, 2023; originally announced October 2023.

Comments: EMNLP 2023

arXiv:2309.12179 [pdf, other]

Autoregressive Sign Language Production: A Gloss-Free Approach with Discrete Representations

Authors: Eui Jun Hwang, Huije Lee, Jong C. Park

Abstract: Gloss-free Sign Language Production (SLP) offers a direct translation of spoken language sentences into sign language, bypassing the need for gloss intermediaries. This paper presents the Sign language Vector Quantization Network, a novel approach to SLP that leverages Vector Quantization to derive discrete representations from sign pose sequences. Our method, rooted in both manual and non-manual… ▽ More Gloss-free Sign Language Production (SLP) offers a direct translation of spoken language sentences into sign language, bypassing the need for gloss intermediaries. This paper presents the Sign language Vector Quantization Network, a novel approach to SLP that leverages Vector Quantization to derive discrete representations from sign pose sequences. Our method, rooted in both manual and non-manual elements of signing, supports advanced decoding methods and integrates latent-level alignment for enhanced linguistic coherence. Through comprehensive evaluations, we demonstrate superior performance of our method over prior SLP methods and highlight the reliability of Back-Translation and Fréchet Gesture Distance as evaluation metrics. △ Less

Submitted 8 June, 2024; v1 submitted 21 September, 2023; originally announced September 2023.

Comments: 5 pages, 3 figures, 6 tables

arXiv:2306.07061 [pdf, other]

Deep Model Compression Also Helps Models Capture Ambiguity

Authors: Hancheol Park, Jong C. Park

Abstract: Natural language understanding (NLU) tasks face a non-trivial amount of ambiguous samples where veracity of their labels is debatable among annotators. NLU models should thus account for such ambiguity, but they approximate the human opinion distributions quite poorly and tend to produce over-confident predictions. To address this problem, we must consider how to exactly capture the degree of rela… ▽ More Natural language understanding (NLU) tasks face a non-trivial amount of ambiguous samples where veracity of their labels is debatable among annotators. NLU models should thus account for such ambiguity, but they approximate the human opinion distributions quite poorly and tend to produce over-confident predictions. To address this problem, we must consider how to exactly capture the degree of relationship between each sample and its candidate classes. In this work, we propose a novel method with deep model compression and show how such relationship can be accounted for. We see that more reasonably represented relationships can be discovered in the lower layers and that validation accuracies are converging at these layers, which naturally leads to layer pruning. We also see that distilling the relationship knowledge from a lower layer helps models produce better distribution. Experimental results demonstrate that our method makes substantial improvement on quantifying ambiguity without gold distribution labels. As positive side-effects, our method is found to reduce the model size significantly and improve latency, both attractive aspects of NLU products. △ Less

Submitted 12 June, 2023; originally announced June 2023.

Comments: ACL 2023

arXiv:2306.04293 [pdf, other]

Phrase Retrieval for Open-Domain Conversational Question Answering with Conversational Dependency Modeling via Contrastive Learning

Authors: Soyeong Jeong, Jinheon Baek, Sung Ju Hwang, Jong C. Park

Abstract: Open-Domain Conversational Question Answering (ODConvQA) aims at answering questions through a multi-turn conversation based on a retriever-reader pipeline, which retrieves passages and then predicts answers with them. However, such a pipeline approach not only makes the reader vulnerable to the errors propagated from the retriever, but also demands additional effort to develop both the retriever… ▽ More Open-Domain Conversational Question Answering (ODConvQA) aims at answering questions through a multi-turn conversation based on a retriever-reader pipeline, which retrieves passages and then predicts answers with them. However, such a pipeline approach not only makes the reader vulnerable to the errors propagated from the retriever, but also demands additional effort to develop both the retriever and the reader, which further makes it slower since they are not runnable in parallel. In this work, we propose a method to directly predict answers with a phrase retrieval scheme for a sequence of words, reducing the conventional two distinct subtasks into a single one. Also, for the first time, we study its capability for ODConvQA tasks. However, simply adopting it is largely problematic, due to the dependencies between previous and current turns in a conversation. To address this problem, we further introduce a novel contrastive learning strategy, making sure to reflect previous turns when retrieving the phrase for the current context, by maximizing representational similarities of consecutive turns in a conversation while minimizing irrelevant conversational contexts. We validate our model on two ODConvQA datasets, whose experimental results show that it substantially outperforms the relevant baselines with the retriever-reader. Code is available at: https://github.com/starsuzi/PRO-ConvQA. △ Less

Submitted 7 June, 2023; originally announced June 2023.

Comments: Findings of ACL 2023

arXiv:2306.02955 [pdf, other]

A Simple and Flexible Modeling for Mental Disorder Detection by Learning from Clinical Questionnaires

Authors: Hoyun Song, Jisu Shin, Huije Lee, Jong C. Park

Abstract: Social media is one of the most highly sought resources for analyzing characteristics of the language by its users. In particular, many researchers utilized various linguistic features of mental health problems from social media. However, existing approaches to detecting mental disorders face critical challenges, such as the scarcity of high-quality data or the trade-off between addressing the com… ▽ More Social media is one of the most highly sought resources for analyzing characteristics of the language by its users. In particular, many researchers utilized various linguistic features of mental health problems from social media. However, existing approaches to detecting mental disorders face critical challenges, such as the scarcity of high-quality data or the trade-off between addressing the complexity of models and presenting interpretable results grounded in expert domain knowledge. To address these challenges, we design a simple but flexible model that preserves domain-based interpretability. We propose a novel approach that captures the semantic meanings directly from the text and compares them to symptom-related descriptions. Experimental results demonstrate that our model outperforms relevant baselines on various mental disorder detection tasks. Our detailed analysis shows that the proposed model is effective at leveraging domain knowledge, transferable to other mental disorders, and providing interpretable detection results. △ Less

Submitted 5 June, 2023; originally announced June 2023.

Comments: ACL 2023, 15 pages, 11 tables, 4 figures

arXiv:2305.13729 [pdf, other]

Discrete Prompt Optimization via Constrained Generation for Zero-shot Re-ranker

Authors: Sukmin Cho, Soyeong Jeong, Jeongyeon Seo, Jong C. Park

Abstract: Re-rankers, which order retrieved documents with respect to the relevance score on the given query, have gained attention for the information retrieval (IR) task. Rather than fine-tuning the pre-trained language model (PLM), the large-scale language model (LLM) is utilized as a zero-shot re-ranker with excellent results. While LLM is highly dependent on the prompts, the impact and the optimization… ▽ More Re-rankers, which order retrieved documents with respect to the relevance score on the given query, have gained attention for the information retrieval (IR) task. Rather than fine-tuning the pre-trained language model (PLM), the large-scale language model (LLM) is utilized as a zero-shot re-ranker with excellent results. While LLM is highly dependent on the prompts, the impact and the optimization of the prompts for the zero-shot re-ranker are not explored yet. Along with highlighting the impact of optimization on the zero-shot re-ranker, we propose a novel discrete prompt optimization method, Constrained Prompt generation (Co-Prompt), with the metric estimating the optimum for re-ranking. Co-Prompt guides the generated texts from PLM toward optimal prompts based on the metric without parameter update. The experimental results demonstrate that Co-Prompt leads to outstanding re-ranking performance against the baselines. Also, Co-Prompt generates more interpretable prompts for humans against other prompt optimization methods. △ Less

Submitted 23 May, 2023; originally announced May 2023.

Comments: Findings of ACL 2023 Camera Ready

arXiv:2302.05137 [pdf, other]

Realistic Conversational Question Answering with Answer Selection based on Calibrated Confidence and Uncertainty Measurement

Authors: Soyeong Jeong, Jinheon Baek, Sung Ju Hwang, Jong C. Park

Abstract: Conversational Question Answering (ConvQA) models aim at answering a question with its relevant paragraph and previous question-answer pairs that occurred during conversation multiple times. To apply such models to a real-world scenario, some existing work uses predicted answers, instead of unavailable ground-truth answers, as the conversation history for inference. However, since these models usu… ▽ More Conversational Question Answering (ConvQA) models aim at answering a question with its relevant paragraph and previous question-answer pairs that occurred during conversation multiple times. To apply such models to a real-world scenario, some existing work uses predicted answers, instead of unavailable ground-truth answers, as the conversation history for inference. However, since these models usually predict wrong answers, using all the predictions without filtering significantly hampers the model performance. To address this problem, we propose to filter out inaccurate answers in the conversation history based on their estimated confidences and uncertainties from the ConvQA model, without making any architectural changes. Moreover, to make the confidence and uncertainty values more reliable, we propose to further calibrate them, thereby smoothing the model predictions. We validate our models, Answer Selection-based realistic Conversation Question Answering, on two standard ConvQA datasets, and the results show that our models significantly outperform relevant baselines. Code is available at: https://github.com/starsuzi/AS-ConvQA. △ Less

Submitted 10 February, 2023; originally announced February 2023.

Comments: EACL 2023

arXiv:2208.06183 [pdf, other]

Non-Autoregressive Sign Language Production via Knowledge Distillation

Authors: Eui Jun Hwang, Jung Ho Kim, Suk Min Cho, Jong C. Park

Abstract: Sign Language Production (SLP) aims to translate expressions in spoken language into corresponding ones in sign language, such as skeleton-based sign poses or videos. Existing SLP models are either AutoRegressive (AR) or Non-Autoregressive (NAR). However, AR-SLP models suffer from regression to the mean and error propagation during decoding. NSLP-G, a NAR-based model, resolves these issues to some… ▽ More Sign Language Production (SLP) aims to translate expressions in spoken language into corresponding ones in sign language, such as skeleton-based sign poses or videos. Existing SLP models are either AutoRegressive (AR) or Non-Autoregressive (NAR). However, AR-SLP models suffer from regression to the mean and error propagation during decoding. NSLP-G, a NAR-based model, resolves these issues to some extent but engenders other problems. For example, it does not consider target sign lengths and suffers from false decoding initiation. We propose a novel NAR-SLP model via Knowledge Distillation (KD) to address these problems. First, we devise a length regulator to predict the end of the generated sign pose sequence. We then adopt KD, which distills spatial-linguistic features from a pre-trained pose encoder to alleviate false decoding initiation. Extensive experiments show that the proposed approach significantly outperforms existing SLP models in both Frechet Gesture Distance and Back-Translation evaluation. △ Less

Submitted 12 August, 2022; originally announced August 2022.

Comments: 10 pages, 4 figures, 3 tables, submitted to ECCV2023

arXiv:2208.00176 [pdf, other]

ELF22: A Context-based Counter Trolling Dataset to Combat Internet Trolls

Authors: Huije Lee, Young Ju NA, Hoyun Song, Jisu Shin, Jong C. Park

Abstract: Online trolls increase social costs and cause psychological damage to individuals. With the proliferation of automated accounts making use of bots for trolling, it is difficult for targeted individual users to handle the situation both quantitatively and qualitatively. To address this issue, we focus on automating the method to counter trolls, as counter responses to combat trolls encourage commun… ▽ More Online trolls increase social costs and cause psychological damage to individuals. With the proliferation of automated accounts making use of bots for trolling, it is difficult for targeted individual users to handle the situation both quantitatively and qualitatively. To address this issue, we focus on automating the method to counter trolls, as counter responses to combat trolls encourage community users to maintain ongoing discussion without compromising freedom of expression. For this purpose, we propose a novel dataset for automatic counter response generation. In particular, we constructed a pair-wise dataset that includes troll comments and counter responses with labeled response strategies, which enables models fine-tuned on our dataset to generate responses by varying counter responses according to the specified strategy. We conducted three tasks to assess the effectiveness of our dataset and evaluated the results through both automatic and human evaluation. In human evaluation, we demonstrate that the model fine-tuned on our dataset shows a significantly improved performance in strategy-controlled sentence generation. △ Less

Submitted 7 September, 2022; v1 submitted 30 July, 2022; originally announced August 2022.

Comments: Accepted for LREC 2022

arXiv:2203.07735 [pdf, other]

Augmenting Document Representations for Dense Retrieval with Interpolation and Perturbation

Authors: Soyeong Jeong, Jinheon Baek, Sukmin Cho, Sung Ju Hwang, Jong C. Park

Abstract: Dense retrieval models, which aim at retrieving the most relevant document for an input query on a dense representation space, have gained considerable attention for their remarkable success. Yet, dense models require a vast amount of labeled training data for notable performance, whereas it is often challenging to acquire query-document pairs annotated by humans. To tackle this problem, we propos… ▽ More Dense retrieval models, which aim at retrieving the most relevant document for an input query on a dense representation space, have gained considerable attention for their remarkable success. Yet, dense models require a vast amount of labeled training data for notable performance, whereas it is often challenging to acquire query-document pairs annotated by humans. To tackle this problem, we propose a simple but effective Document Augmentation for dense Retrieval (DAR) framework, which augments the representations of documents with their interpolation and perturbation. We validate the performance of DAR on retrieval tasks with two benchmark datasets, showing that the proposed DAR significantly outperforms relevant baselines on the dense retrieval of both the labeled and unlabeled documents. △ Less

Submitted 16 March, 2022; v1 submitted 15 March, 2022; originally announced March 2022.

Comments: ACL 2022

arXiv:2203.07361 [pdf, other]

Coherent elastic neutrino-nucleus scattering: Terrestrial and astrophysical applications

Authors: M. Abdullah, H. Abele, D. Akimov, G. Angloher, D. Aristizabal-Sierra, C. Augier, A. B. Balantekin, L. Balogh, P. S. Barbeau, L. Baudis, A. L. Baxter, C. Beaufort, G. Beaulieu, V. Belov, A. Bento, L. Berge, I. A. Bernardi, J. Billard, A. Bolozdynya, A. Bonhomme, G. Bres, J-. L. Bret, A. Broniatowski, A. Brossard, C. Buck , et al. (250 additional authors not shown)

Abstract: Coherent elastic neutrino-nucleus scattering (CE$ν$NS) is a process in which neutrinos scatter on a nucleus which acts as a single particle. Though the total cross section is large by neutrino standards, CE$ν$NS has long proven difficult to detect, since the deposited energy into the nucleus is $\sim$ keV. In 2017, the COHERENT collaboration announced the detection of CE$ν$NS using a stopped-pion… ▽ More Coherent elastic neutrino-nucleus scattering (CE$ν$NS) is a process in which neutrinos scatter on a nucleus which acts as a single particle. Though the total cross section is large by neutrino standards, CE$ν$NS has long proven difficult to detect, since the deposited energy into the nucleus is $\sim$ keV. In 2017, the COHERENT collaboration announced the detection of CE$ν$NS using a stopped-pion source with CsI detectors, followed up the detection of CE$ν$NS using an Ar target. The detection of CE$ν$NS has spawned a flurry of activities in high-energy physics, inspiring new constraints on beyond the Standard Model (BSM) physics, and new experimental methods. The CE$ν$NS process has important implications for not only high-energy physics, but also astrophysics, nuclear physics, and beyond. This whitepaper discusses the scientific importance of CE$ν$NS, highlighting how present experiments such as COHERENT are informing theory, and also how future experiments will provide a wealth of information across the aforementioned fields of physics. △ Less

Submitted 14 March, 2022; originally announced March 2022.

Comments: contribution to Snowmasss 2021. Contact authors: P. S. Barbeau, R. Strauss, L. E. Strigari

arXiv:2202.03978 [pdf]

doi 10.1002/mp.15960

Segmentation by Test-Time Optimization (TTO) for CBCT-based Adaptive Radiation Therapy

Authors: Xiao Liang, Jaehee Chun, Howard Morgan, Ti Bai, Dan Nguyen, Justin C. Park, Steve Jiang

Abstract: Online adaptive radiotherapy (ART) requires accurate and efficient auto-segmentation of target volumes and organs-at-risk (OARs) in mostly cone-beam computed tomography (CBCT) images. Propagating expert-drawn contours from the pre-treatment planning CT (pCT) through traditional or deep learning (DL) based deformable image registration (DIR) can achieve improved results in many situations. Typical… ▽ More Online adaptive radiotherapy (ART) requires accurate and efficient auto-segmentation of target volumes and organs-at-risk (OARs) in mostly cone-beam computed tomography (CBCT) images. Propagating expert-drawn contours from the pre-treatment planning CT (pCT) through traditional or deep learning (DL) based deformable image registration (DIR) can achieve improved results in many situations. Typical DL-based DIR models are population based, that is, trained with a dataset for a population of patients, so they may be affected by the generalizability problem. In this paper, we propose a method called test-time optimization (TTO) to refine a pre-trained DL-based DIR population model, first for each individual test patient, and then progressively for each fraction of online ART treatment. Our proposed method is less susceptible to the generalizability problem, and thus can improve overall performance of different DL-based DIR models by improving model accuracy, especially for outliers. Our experiments used data from 239 patients with head and neck squamous cell carcinoma to test the proposed method. Firstly, we trained a population model with 200 patients, and then applied TTO to the remaining 39 test patients by refining the trained population model to obtain 39 individualized models. We compared each of the individualized models with the population model in terms of segmentation accuracy. The number of patients with at least 0.05 DSC improvement or 2 mm HD95 improvement by TTO averaged over the 17 selected structures for the state-of-the-art architecture Voxelmorph is 10 out of 39 test patients. The average time for deriving the individualized model using TTO from the pre-trained population model is approximately four minutes. When adapting the individualized model to a later fraction of the same patient, the average time is reduced to about one minute and the accuracy is slightly improved. △ Less

Submitted 8 February, 2022; originally announced February 2022.

arXiv:2108.08016 [pdf, ps, other]

Low-Complexity Algorithm for Outage Optimal Resource Allocation in Energy Harvesting-Based UAV Identification Networks

Authors: Jae Cheol Park, Kyu-Min Kang, Junil Choi

Abstract: We study an unmanned aerial vehicle (UAV) identification network equipped with an energy harvesting (EH) technique. In the network, the UAVs harvest energy through radio frequency (RF) signals transmitted from ground control stations (GCSs) and then transmit their identification information to the ground receiver station (GRS). Specifically, we first derive a closed-form expression of the outage p… ▽ More We study an unmanned aerial vehicle (UAV) identification network equipped with an energy harvesting (EH) technique. In the network, the UAVs harvest energy through radio frequency (RF) signals transmitted from ground control stations (GCSs) and then transmit their identification information to the ground receiver station (GRS). Specifically, we first derive a closed-form expression of the outage probability to evaluate the network performance. Then we obtain the closed-form expression of the optimal time allocation when the bandwidth is equally allocated to the UAVs. We also propose a fast-converging algorithm for time and the bandwidth allocation, which is necessary for the UAV environment with high mobility, to optimize the outage performance of EH-based UAV identification network. Simulation results show that the proposed algorithm outperforms the conventional bisection algorithm and achieves near-optimal performance. △ Less

Submitted 21 August, 2021; v1 submitted 18 August, 2021; originally announced August 2021.

Comments: 5 pages, 4 figures, accepted to IEEE Communications Letters, Aug. 2021

arXiv:2107.01257 [pdf, other]

doi 10.1088/1361-6560/ac279e

Abdominal synthetic CT reconstruction with intensity projection prior for MRI-only adaptive radiotherapy

Authors: Sven Olberg, Jaehee Chun, Byong Su Choi, Inkyung Park, Hyun Kim, Taeho Kim, Jin Sung Kim, Olga Green, Justin C. Park

Abstract: An MRI-only adaptive radiotherapy (ART) workflow is desirable for managing interfractional changes in anatomy, but producing synthetic CT (sCT) data through paired data-driven deep learning (DL) for abdominal dose calculations remains a challenge due to the highly variable presence of intestinal gas. We present the preliminary dosimetric evaluation of our novel approach to sCT reconstruction that… ▽ More An MRI-only adaptive radiotherapy (ART) workflow is desirable for managing interfractional changes in anatomy, but producing synthetic CT (sCT) data through paired data-driven deep learning (DL) for abdominal dose calculations remains a challenge due to the highly variable presence of intestinal gas. We present the preliminary dosimetric evaluation of our novel approach to sCT reconstruction that is well suited to handling intestinal gas in abdominal MRI-only ART. We utilize a paired data DL approach enabled by the intensity projection prior, in which well-matching training pairs are created by propagating air from MRI to corresponding CT scans. Evaluations focus on two classes: patients with (1) little involvement of intestinal gas, and (2) notable differences in intestinal gas presence between corresponding scans. Comparisons between sCT-based plans and CT-based clinical plans for both classes are made at the first treatment fraction to highlight the dosimetric impact of the variable presence of intestinal gas. Class 1 patients ($n=13$) demonstrate differences in prescribed dose coverage of the PTV of $1.3 \pm 2.1\%$ between clinical plans and sCT-based plans. Mean DVH differences in all structures for Class 1 patients are found to be statistically insignificant. In Class 2 ($n=20$), target coverage is $13.3 \pm 11.0\%$ higher in the clinical plans and mean DVH differences are found to be statistically significant. Significant deviations in calculated doses arising from the variable presence of intestinal gas in corresponding CT and MRI scans may limit the effectiveness of adaptive dose escalation efforts. We have proposed a paired data-driven DL approach to sCT reconstruction for accurate dose calculations in abdominal ART enabled by the creation of a clinically unavailable training data set with well-matching representations of intestinal gas. △ Less

Submitted 2 July, 2021; originally announced July 2021.

arXiv:2105.00666 [pdf, other]

Unsupervised Document Expansion for Information Retrieval with Stochastic Text Generation

Authors: Soyeong Jeong, Jinheon Baek, ChaeHun Park, Jong C. Park

Abstract: One of the challenges in information retrieval (IR) is the vocabulary mismatch problem, which happens when the terms between queries and documents are lexically different but semantically similar. While recent work has proposed to expand the queries or documents by enriching their representations with additional relevant terms to address this challenge, they usually require a large volume of query… ▽ More One of the challenges in information retrieval (IR) is the vocabulary mismatch problem, which happens when the terms between queries and documents are lexically different but semantically similar. While recent work has proposed to expand the queries or documents by enriching their representations with additional relevant terms to address this challenge, they usually require a large volume of query-document pairs to train an expansion model. In this paper, we propose an Unsupervised Document Expansion with Generation (UDEG) framework with a pre-trained language model, which generates diverse supplementary sentences for the original document without using labels on query-document pairs for training. For generating sentences, we further stochastically perturb their embeddings to generate more diverse sentences for document expansion. We validate our framework on two standard IR benchmark datasets. The results show that our framework significantly outperforms relevant expansion baselines for IR. △ Less

Submitted 14 October, 2021; v1 submitted 3 May, 2021; originally announced May 2021.

Comments: SDP@NAACL2021

arXiv:2104.11401 [pdf]

doi 10.1002/mp.15352

Intentional Deep Overfit Learning (IDOL): A Novel Deep Learning Strategy for Adaptive Radiation Therapy

Authors: Jaehee Chun, Justin C. Park, Sven Olberg, You Zhang, Dan Nguyen, Jing Wang, Jin Sung Kim, Steve Jiang

Abstract: In this study, we propose a tailored DL framework for patient-specific performance that leverages the behavior of a model intentionally overfitted to a patient-specific training dataset augmented from the prior information available in an ART workflow - an approach we term Intentional Deep Overfit Learning (IDOL). Implementing the IDOL framework in any task in radiotherapy consists of two training… ▽ More In this study, we propose a tailored DL framework for patient-specific performance that leverages the behavior of a model intentionally overfitted to a patient-specific training dataset augmented from the prior information available in an ART workflow - an approach we term Intentional Deep Overfit Learning (IDOL). Implementing the IDOL framework in any task in radiotherapy consists of two training stages: 1) training a generalized model with a diverse training dataset of N patients, just as in the conventional DL approach, and 2) intentionally overfitting this general model to a small training dataset-specific the patient of interest (N+1) generated through perturbations and augmentations of the available task- and patient-specific prior information to establish a personalized IDOL model. The IDOL framework itself is task-agnostic and is thus widely applicable to many components of the ART workflow, three of which we use as a proof of concept here: the auto-contouring task on re-planning CTs for traditional ART, the MRI super-resolution (SR) task for MRI-guided ART, and the synthetic CT (sCT) reconstruction task for MRI-only ART. In the re-planning CT auto-contouring task, the accuracy measured by the Dice similarity coefficient improves from 0.847 with the general model to 0.935 by adopting the IDOL model. In the case of MRI SR, the mean absolute error (MAE) is improved by 40% using the IDOL framework over the conventional model. Finally, in the sCT reconstruction task, the MAE is reduced from 68 to 22 HU by utilizing the IDOL framework. △ Less

Submitted 22 April, 2021; originally announced April 2021.

arXiv:2101.10525 [pdf]

doi 10.1103/PhysRevB.101.235434

Evidence of shallow bandgap in ultra-thin 1T'-MoTe2 via infrared spectroscopy

Authors: Jin Cheol Park, Eilho Jung, Sangyun Lee, Jungseek Hwang, Young Hee Lee

Abstract: Although van der Waals (vdW) layered MoS2 shows the phase transformation from the semiconducting 2H-phase to the metallic 1T-phase through chemical lithium intercalation, vdW MoTe2 is thermodynamically reversible between the 2H- and 1T'-phases, and can be further transformed by energetics, laser irradiation, strain or pressure, and electrical doping. Here, thickness- and temperature-dependent opti… ▽ More Although van der Waals (vdW) layered MoS2 shows the phase transformation from the semiconducting 2H-phase to the metallic 1T-phase through chemical lithium intercalation, vdW MoTe2 is thermodynamically reversible between the 2H- and 1T'-phases, and can be further transformed by energetics, laser irradiation, strain or pressure, and electrical doping. Here, thickness- and temperature-dependent optical properties of 1T'-MoTe2 thin films grown by chemical vapor depsition are investigated via Fourier-transformed infrared spectroscopy. An optical gap of 28 +/- 2 meV in a 3-layer (or 2 nm) thick 1T'-MoTe2 is clearly observed at a low temperature region below 50K. No discernible optical bandgap is observed in samples thicker than ~4 nm. The observed thickness-dependent bandgap results agree with the measured dc resistivity data; the thickness-dependent 1T'-MoTe2 clearly demonstrates the metal-semiconductor transition at a crossover below the 2 nm-thick sample. △ Less

Submitted 25 January, 2021; originally announced January 2021.

Comments: 18 pages, 4 figures

Journal ref: Physical Review B 101, 235434 (2020)

arXiv:2008.12769 [pdf, other]

doi 10.1140/epjc/s10052-021-09007-w

Prospects for Beyond the Standard Model Physics Searches at the Deep Underground Neutrino Experiment

Authors: DUNE Collaboration, B. Abi, R. Acciarri, M. A. Acero, G. Adamov, D. Adams, M. Adinolfi, Z. Ahmad, J. Ahmed, T. Alion, S. Alonso Monsalve, C. Alt, J. Anderson, C. Andreopoulos, M. P. Andrews, F. Andrianala, S. Andringa, A. Ankowski, M. Antonova, S. Antusch, A. Aranda-Fernandez, A. Ariga, L. O. Arnold, M. A. Arroyave, J. Asaadi , et al. (953 additional authors not shown)

Abstract: The Deep Underground Neutrino Experiment (DUNE) will be a powerful tool for a variety of physics topics. The high-intensity proton beams provide a large neutrino flux, sampled by a near detector system consisting of a combination of capable precision detectors, and by the massive far detector system located deep underground. This configuration sets up DUNE as a machine for discovery, as it enables… ▽ More The Deep Underground Neutrino Experiment (DUNE) will be a powerful tool for a variety of physics topics. The high-intensity proton beams provide a large neutrino flux, sampled by a near detector system consisting of a combination of capable precision detectors, and by the massive far detector system located deep underground. This configuration sets up DUNE as a machine for discovery, as it enables opportunities not only to perform precision neutrino measurements that may uncover deviations from the present three-flavor mixing paradigm, but also to discover new particles and unveil new interactions and symmetries beyond those predicted in the Standard Model (SM). Of the many potential beyond the Standard Model (BSM) topics DUNE will probe, this paper presents a selection of studies quantifying DUNE's sensitivities to sterile neutrino mixing, heavy neutral leptons, non-standard interactions, CPT symmetry violation, Lorentz invariance violation, neutrino trident production, dark matter from both beam induced and cosmogenic sources, baryon number violation, and other new physics topics that complement those at high-energy colliders and significantly extend the present reach. △ Less

Submitted 23 April, 2021; v1 submitted 28 August, 2020; originally announced August 2020.

Comments: 54 pages, 40 figures, paper based on the DUNE Technical Design Report (arXiv:2002.03005)

Report number: FERMILAB-PUB-20-459-LBNF-ND

Journal ref: European Physical Journal C 81 (2021) 322

arXiv:1907.08311 [pdf, other]

doi 10.1088/1361-6633/ab9d12

White Paper on New Opportunities at the Next-Generation Neutrino Experiments (Part 1: BSM Neutrino Physics and Dark Matter)

Authors: C. A. Argüelles, A. J. Aurisano, B. Batell, J. Berger, M. Bishai, T. Boschi, N. Byrnes, A. Chatterjee, A. Chodos, T. Coan, Y. Cui, A. de Gouvêa, P. B. Denton, A. De Roeck, W. Flanagan, R. P. Gandrajula, A. Hatzikoutelis, M. Hostert, B. Jones, B. J. Kayser, K. J. Kelly, D. Kim, J. Kopp, A. Kubik, K. Lang , et al. (25 additional authors not shown)

Abstract: With the advent of a new generation of neutrino experiments which leverage high-intensity neutrino beams for precision measurements, it is timely to explore physics topics beyond the standard neutrino-related physics. Given that the realm of beyond the standard model (BSM) physics has been mostly sought at high-energy regimes at colliders, such as the LHC at CERN, the exploration of BSM physics in… ▽ More With the advent of a new generation of neutrino experiments which leverage high-intensity neutrino beams for precision measurements, it is timely to explore physics topics beyond the standard neutrino-related physics. Given that the realm of beyond the standard model (BSM) physics has been mostly sought at high-energy regimes at colliders, such as the LHC at CERN, the exploration of BSM physics in neutrino experiments will enable complementary measurements at the energy regimes that balance that of the LHC. This is in concert with new ideas for high-intensity beams for fixed target and beam-dump experiments world-wide, e.g., those at CERN. The combination of the high intensity proton beam facilities and massive detectors for precision neutrino oscillation parameter measurements and for CP violation phase measurements will help make BSM physics reachable even in low energy regimes in accelerator based experiments. Large mass detectors with highly precise tracking and energy measurements, excellent timing resolution, and low energy thresholds will enable searches for BSM phenomena from cosmogenic origin, as well. Therefore, it is conceivable that BSM topics in the next generation neutrino experiments could be the dominant physics topics in the foreseeable future, as the precision of the neutrino oscillation parameter and CPV measurements continues to improve. In this spirit, this white paper provides a review of the current landscape of BSM theory in neutrino experiments in two selected areas of the BSM topics - dark matter and neutrino related BSM - and summarizes the current results from existing neutrino experiments to set benchmarks for both theory and experiment. This paper then provides a review of upcoming neutrino experiments throughout the next 10 - 15 year time scale and their capabilities to set the foundation for potential reach in BSM physics in the two aforementioned themes. △ Less

Submitted 18 October, 2019; v1 submitted 18 July, 2019; originally announced July 2019.

Report number: Reports on Progress in Physics, Volume 83, Number 12

arXiv:1801.01675 [pdf]

Highly Efficient Carrier Multiplication in van der Waals layered Materials

Authors: Ji-Hee Kim, Matthew R. Bergren, Jin Cheol Park, Subash Adhikari, Michael Lorke, Thomas Fraunheim, Duk-Hyun Choe, Beom Kim, Hyunyong Choi, Tom Gregorkiewicz, Young Hee Lee

Abstract: Carrier multiplication (CM), a photo-physical process to generate multiple electron-hole pairs by exploiting excess energy of free carriers, is explored for efficient photovoltaic conversion of photons from the blue solar band, predominantly wasted as heat in standard solar cells. Current state-of-the-art approaches with nanomaterials have demonstrated improved CM but are not satisfactory due to h… ▽ More Carrier multiplication (CM), a photo-physical process to generate multiple electron-hole pairs by exploiting excess energy of free carriers, is explored for efficient photovoltaic conversion of photons from the blue solar band, predominantly wasted as heat in standard solar cells. Current state-of-the-art approaches with nanomaterials have demonstrated improved CM but are not satisfactory due to high energy loss and inherent difficulties with carrier extraction. Here, we report ultra-efficient CM in van der Waals (vdW) layered materials that commences at the energy conservation limit and proceeds with nearly 100% conversion efficiency. A small threshold energy, as low as twice the bandgap, was achieved, marking an onset of quantum yield with enhanced carrier generation. Strong Coulomb interactions between electrons confined within vdW layers allow rapid electron-electron scattering to prevail over electron-phonon scattering. Additionally, the presence of electron pockets spread over momentum space could also contribute to the high CM efficiency. Combining with high conductivity and optimal bandgap, these superior CM characteristics identify vdW materials for third-generation solar cell. △ Less

Submitted 5 January, 2018; originally announced January 2018.

Comments: 17 pages, 4 figures

arXiv:1611.06118 [pdf, other]

doi 10.1093/ptep/pty044

Physics Potentials with the Second Hyper-Kamiokande Detector in Korea

Authors: Hyper-Kamiokande proto-collaboration, :, K. Abe, Ke. Abe, S. H. Ahn, H. Aihara, A. Aimi, R. Akutsu, C. Andreopoulos, I. Anghel, L. H. V. Anthony, M. Antonova, Y. Ashida, V. Aushev, M. Barbi, G. J. Barker, G. Barr, P. Beltrame, V. Berardi, M. Bergevin, S. Berkman, L. Berns, T. Berry, S. Bhadra, D. Bravo-Bergu no , et al. (331 additional authors not shown)

Abstract: Hyper-Kamiokande consists of two identical water-Cherenkov detectors of total 520~kt with the first one in Japan at 295~km from the J-PARC neutrino beam with 2.5$^{\textrm{o}}$ Off-Axis Angles (OAAs), and the second one possibly in Korea in a later stage. Having the second detector in Korea would benefit almost all areas of neutrino oscillation physics mainly due to longer baselines. There are sev… ▽ More Hyper-Kamiokande consists of two identical water-Cherenkov detectors of total 520~kt with the first one in Japan at 295~km from the J-PARC neutrino beam with 2.5$^{\textrm{o}}$ Off-Axis Angles (OAAs), and the second one possibly in Korea in a later stage. Having the second detector in Korea would benefit almost all areas of neutrino oscillation physics mainly due to longer baselines. There are several candidate sites in Korea with baselines of 1,000$\sim$1,300~km and OAAs of 1$^{\textrm{o}}$$\sim$3$^{\textrm{o}}$. We conducted sensitivity studies on neutrino oscillation physics for a second detector, either in Japan (JD $\times$ 2) or Korea (JD + KD) and compared the results with a single detector in Japan. Leptonic CP violation sensitivity is improved especially when the CP is non-maximally violated. The larger matter effect at Korean candidate sites significantly enhances sensitivities to non-standard interactions of neutrinos and mass ordering determination. Current studies indicate the best sensitivity is obtained at Mt. Bisul (1,088~km baseline, $1.3^\circ$ OAA). Thanks to a larger (1,000~m) overburden than the first detector site, clear improvements to sensitivities for solar and supernova relic neutrino searches are expected. △ Less

Submitted 26 March, 2018; v1 submitted 18 November, 2016; originally announced November 2016.

Comments: 102 pages, 49 figures. Accepted by PTEP

Journal ref: Prog Theor Exp Phys (2018)

arXiv:1509.02377 [pdf]

ERK/p38 MAPK inhibition reduces radio-resistance to pulsed proton beam in breast cancer stem cells cells

Authors: Myung-Hwan Jung, Jeong Chan Park

Abstract: Recent studies have identified highly tumorigenic cells with stem cell-like characteristics in human cancers, termed cancer stem cells (CSCs). CSCs are resistant to conventional radiotherapy and chemotherapy owing to their high DNA repair ability and oncogene overexpression. However, the mechanisms regulating CSC radio-resistance, particularly proton beam resistance, remain unclear. We isolated CS… ▽ More Recent studies have identified highly tumorigenic cells with stem cell-like characteristics in human cancers, termed cancer stem cells (CSCs). CSCs are resistant to conventional radiotherapy and chemotherapy owing to their high DNA repair ability and oncogene overexpression. However, the mechanisms regulating CSC radio-resistance, particularly proton beam resistance, remain unclear. We isolated CSCs from the breast cancer cell lines MCF-7 and MDA-MB-231, which expressed the characteristic breast CSC membrane protein markers CD44+/CD24-/low, and irradiated the CSCs with pulsed proton beams. We confirmed that CSCs are resistant to pulsed proton beams and showed that treatment with p38 and ERK inhibitors reduced CSC radioresistance. Based on these results, BCSC radio-resistance can be reduced during proton beam therapy by co-treatment with ERK1/2 or p38 inhibitors, representing a novel approach for breast cancer therapy. △ Less

Submitted 14 July, 2015; originally announced September 2015.

arXiv:1507.04863 [pdf]

Study of the Effects of High-Energy Proton Beams on Escherichia Coli

Authors: Jeong Chan Park, Myung-Hwan Jung

Abstract: Antibiotic-resistant bacterial infection becomes one of the most serious risks to public health care today. However, discouragingly, the development of new antibiotics has been little progressed over the last decade. There is an urgent need of the alternative approaches to treat the antibiotic-resistant bacteria. The novel methods, which include photothermal therapy based on gold nano-materials an… ▽ More Antibiotic-resistant bacterial infection becomes one of the most serious risks to public health care today. However, discouragingly, the development of new antibiotics has been little progressed over the last decade. There is an urgent need of the alternative approaches to treat the antibiotic-resistant bacteria. The novel methods, which include photothermal therapy based on gold nano-materials and ionizing radiation such as X-rays and gamma rays, have been reported. Studies of the effects of high-energy proton radiation on bacteria are mainly focused on Bacillus species and its spores. The effect of proton beams on Escherichia coli (E. coli) has been limitedly reported. The Escherichia coli is an important biological tool to obtain the metabolic and genetic information and also a common model microorganism for studying toxicity and antimicrobial activity. In addition, E. coli is a common bacterium in the intestinal tract of mammals. Herein, the morphological and physiological changes of E. coli after proton irradiation were investigated. The diluted solutions of the cells were used for proton beam radiation. LB agar plates were used to count the number of colonies formed. The growing profile of the cells was monitored by optical density at 600 nm. The morphology of the irradiated cells was analyzed with optical microscope. Microarray analysis was performed to examine the gene expression changes between irradiated samples and control samples without irradiation. △ Less

Submitted 17 July, 2015; originally announced July 2015.

arXiv:1007.3684 [pdf, ps, other]

doi 10.1016/j.physa.2013.10.012

Generalization of Gibbs Entropy and Thermodynamic Relation

Authors: Jun Chul Park

Abstract: In this paper, we extend Gibbs's approach of quasi-equilibrium thermodynamic processes, and calculate the microscopic expression of entropy for general non-equilibrium thermodynamic processes. Also, we analyze the formal structure of thermodynamic relation in non-equilibrium thermodynamic processes. In this paper, we extend Gibbs's approach of quasi-equilibrium thermodynamic processes, and calculate the microscopic expression of entropy for general non-equilibrium thermodynamic processes. Also, we analyze the formal structure of thermodynamic relation in non-equilibrium thermodynamic processes. △ Less

Submitted 1 December, 2013; v1 submitted 21 July, 2010; originally announced July 2010.

Comments: Final version published in Physica A; typos are corrected (the scientific contents are not changed)

Journal ref: Physica A 395 (2014) 135

arXiv:0905.0897 [pdf, ps, other]

Representation of intermediate time-scale motions in stochastic modeling: Analysis on stochastic description of classical Hamiltonian dynamics in relation with measurement imperfection

Authors: Jun Chul Park

Abstract: It is a well established result that, in classical dynamical systems with sufficient time-scale separation, the fast chaotic degrees of freedom are well modeled by (Gaussian) white noise. In this paper, we present the stochastic dynamical description for intermediate time-scale motions with insufficient time-scale separation from the slow dynamical system. First, we analyze how the fast determin… ▽ More It is a well established result that, in classical dynamical systems with sufficient time-scale separation, the fast chaotic degrees of freedom are well modeled by (Gaussian) white noise. In this paper, we present the stochastic dynamical description for intermediate time-scale motions with insufficient time-scale separation from the slow dynamical system. First, we analyze how the fast deterministic dynamics can be viewed as stochastic dynamics under experimental observation by intrinsic errors of measurement. Then, we present how the stochastic dynamical description should be modified if intermediate time-scale motions exist: the time correlation of the noise ξis modified to <ξ(t)ξ(t')> = C(x,p)δ(t-t'), where C(x,p) is a smooth function of the slow coordinate (x,p), and generally the cumulants of ξexcept its average vary as a smooth function of the slow coordinates (x,p). The analysis given in this work actually shows that, regardless of the sufficiency of time-scale separation, any complex (chaotic and ergodic) dynamical system can be well described using Markov process, if we perfectly construct the deterministic part of (extended) stochastic dynamics. △ Less

Submitted 6 December, 2009; v1 submitted 6 May, 2009; originally announced May 2009.

Comments: 14 pages; one footnote has been added (footnote 7 in page 9), which gives more precise argument for the new slow dynamics related with x_f-dependence

arXiv:cmp-lg/9505027 [pdf, ps]

Quantifier Scope and Constituency

Authors: Jong C. Park

Abstract: Traditional approaches to quantifier scope typically need stipulation to exclude readings that are unavailable to human understanders. This paper shows that quantifier scope phenomena can be precisely characterized by a semantic representation constrained by surface constituency, if the distinction between referential and quantificational NPs is properly observed. A CCG implementation is describ… ▽ More Traditional approaches to quantifier scope typically need stipulation to exclude readings that are unavailable to human understanders. This paper shows that quantifier scope phenomena can be precisely characterized by a semantic representation constrained by surface constituency, if the distinction between referential and quantificational NPs is properly observed. A CCG implementation is described and compared to other approaches. △ Less

Submitted 11 May, 1995; originally announced May 1995.

Comments: 8 pages, compressed and uuencoded postscript file, ACL-95

Showing 1–40 of 40 results for author: Park, J C