-
ANHALTEN: Cross-Lingual Transfer for German Token-Level Reference-Free Hallucination Detection
Authors:
Janek Herrlein,
Chia-Chien Hung,
Goran Glavaš
Abstract:
Research on token-level reference-free hallucination detection has predominantly focused on English, primarily due to the scarcity of robust datasets in other languages. This has hindered systematic investigations into the effectiveness of cross-lingual transfer for this important NLP application. To address this gap, we introduce ANHALTEN, a new evaluation dataset that extends the English halluci…
▽ More
Research on token-level reference-free hallucination detection has predominantly focused on English, primarily due to the scarcity of robust datasets in other languages. This has hindered systematic investigations into the effectiveness of cross-lingual transfer for this important NLP application. To address this gap, we introduce ANHALTEN, a new evaluation dataset that extends the English hallucination detection dataset to German. To the best of our knowledge, this is the first work that explores cross-lingual transfer for token-level reference-free hallucination detection. ANHALTEN contains gold annotations in German that are parallel (i.e., directly comparable to the original English instances). We benchmark several prominent cross-lingual transfer approaches, demonstrating that larger context length leads to better hallucination detection in German, even without succeeding context. Importantly, we show that the sample-efficient few-shot transfer is the most effective approach in most setups. This highlights the practical benefits of minimal annotation effort in the target language for reference-free hallucination detection. Aiming to catalyze future research on cross-lingual token-level reference-free hallucination detection, we make ANHALTEN publicly available: https://github.com/janekh24/anhalten
△ Less
Submitted 18 July, 2024;
originally announced July 2024.
-
Reward Steering with Evolutionary Heuristics for Decoding-time Alignment
Authors:
Chia-Yu Hung,
Navonil Majumder,
Ambuj Mehrish,
Soujanya Poria
Abstract:
The widespread applicability and increasing omnipresence of LLMs have instigated a need to align LLM responses to user and stakeholder preferences. Many preference optimization approaches have been proposed that fine-tune LLM parameters to achieve good alignment. However, such parameter tuning is known to interfere with model performance on many tasks. Moreover, keeping up with shifting user prefe…
▽ More
The widespread applicability and increasing omnipresence of LLMs have instigated a need to align LLM responses to user and stakeholder preferences. Many preference optimization approaches have been proposed that fine-tune LLM parameters to achieve good alignment. However, such parameter tuning is known to interfere with model performance on many tasks. Moreover, keeping up with shifting user preferences is tricky in such a situation. Decoding-time alignment with reward model guidance solves these issues at the cost of increased inference time. However, most of such methods fail to strike the right balance between exploration and exploitation of reward -- often due to the conflated formulation of these two aspects - to give well-aligned responses. To remedy this we decouple these two aspects and implement them in an evolutionary fashion: exploration is enforced by decoding from mutated instructions and exploitation is represented as the periodic replacement of poorly-rewarded generations with well-rewarded ones. Empirical evidences indicate that this strategy outperforms many preference optimization and decode-time alignment approaches on two widely accepted alignment benchmarks AlpacaEval 2 and MT-Bench. Our implementation will be available at: https://darwin-alignment.github.io.
△ Less
Submitted 8 July, 2024; v1 submitted 21 June, 2024;
originally announced June 2024.
-
Collapse of a quantum vortex in an attractive two-dimensional Bose gas
Authors:
Sambit Banerjee,
Kai Zhou,
Shiva Kant Tiwari,
Hikaru Tamura,
Rongjie Li,
Panayotis Kevrekidis,
Simeon I. Mistakidis,
Valentin Walther,
Chen-Lung Hung
Abstract:
We experimentally and numerically study the collapse dynamics of a quantum vortex in a two-dimensional atomic superfluid following a fast interaction ramp from repulsion to attraction. We find the conditions and time scales for a superfluid vortex to radially converge into a quasi-stationary density profile, demonstrating the first spontaneous formation of a vortex soliton in an atomic Bose gas. W…
▽ More
We experimentally and numerically study the collapse dynamics of a quantum vortex in a two-dimensional atomic superfluid following a fast interaction ramp from repulsion to attraction. We find the conditions and time scales for a superfluid vortex to radially converge into a quasi-stationary density profile, demonstrating the first spontaneous formation of a vortex soliton in an atomic Bose gas. We record an emergent universal dynamics of an azimuthal modulational instability, which amplifies initial density perturbations and leads to the eventual splitting of a vortex soliton or direct fragmentation of a superfluid into disordered, but roughly circular arrays of Townes soliton-like wavepackets. Our study sets the stage for exploring universal out-of-equilibrium dynamics of vortex quantum matter quenched to attractive interactions.
△ Less
Submitted 2 June, 2024;
originally announced June 2024.
-
Synthesizing Programmatic Reinforcement Learning Policies with Large Language Model Guided Search
Authors:
Max Liu,
Chan-Hung Yu,
Wei-Hsu Lee,
Cheng-Wei Hung,
Yen-Chun Chen,
Shao-Hua Sun
Abstract:
Programmatic reinforcement learning (PRL) has been explored for representing policies through programs as a means to achieve interpretability and generalization. Despite promising outcomes, current state-of-the-art PRL methods are hindered by sample inefficiency, necessitating tens of millions of program-environment interactions. To tackle this challenge, we introduce a novel LLM-guided search fra…
▽ More
Programmatic reinforcement learning (PRL) has been explored for representing policies through programs as a means to achieve interpretability and generalization. Despite promising outcomes, current state-of-the-art PRL methods are hindered by sample inefficiency, necessitating tens of millions of program-environment interactions. To tackle this challenge, we introduce a novel LLM-guided search framework (LLM-GS). Our key insight is to leverage the programming expertise and common sense reasoning of LLMs to enhance the efficiency of assumption-free, random-guessing search methods. We address the challenge of LLMs' inability to generate precise and grammatically correct programs in domain-specific languages (DSLs) by proposing a Pythonic-DSL strategy - an LLM is instructed to initially generate Python codes and then convert them into DSL programs. To further optimize the LLM-generated programs, we develop a search algorithm named Scheduled Hill Climbing, designed to efficiently explore the programmatic search space to consistently improve the programs. Experimental results in the Karel domain demonstrate the superior effectiveness and efficiency of our LLM-GS framework. Extensive ablation studies further verify the critical role of our Pythonic-DSL strategy and Scheduled Hill Climbing algorithm.
△ Less
Submitted 26 May, 2024;
originally announced May 2024.
-
Grain boundary metastability controls irradiation resistance in nanocrystalline metals
Authors:
Osman El-Atwani,
Annie K. Barnett,
Enrique Martinez,
Jian Han,
Asher C. Leff,
Chang-Yu Hung,
James E. Nathaniel,
Sicong He,
Emily H. Mang,
Larissa M. Woryk,
Khalid Hattar,
Blas P. Uberuaga,
David J. Srolovitz,
Michael L. Falk,
Jaime Marian,
Mitra L. Taheri
Abstract:
Grain boundaries (GBs) in polycrystalline materials are powerful sinks for irradiation defects. While standard theories assume that the sink efficiency of a grain boundary is defined solely by its character before irradiation, recent evidence conclusively shows that the irradiation sink efficiency is a highly dynamic property controlled by the intrinsic metastability of GBs under far-from-equilibr…
▽ More
Grain boundaries (GBs) in polycrystalline materials are powerful sinks for irradiation defects. While standard theories assume that the sink efficiency of a grain boundary is defined solely by its character before irradiation, recent evidence conclusively shows that the irradiation sink efficiency is a highly dynamic property controlled by the intrinsic metastability of GBs under far-from-equilibrium irradiation conditions. In this paper, we reveal that the denuded (i.e., defect-free) zone, typically the signature of a strong sink, can collapse as irradiation damage accumulates. We propose a radiation damage evolution model that captures this behavior based on the emergence of a series of irradiation defect-enabled metastable GB microstate changes that dynamically alter the ability of the GB to absorb further damage. We show that these microstate changes control further defect absorption and give rise to the formation of a defect network that manifests itself as a net Nye-tensor signal detectable via lattice curvature experiments.
△ Less
Submitted 15 April, 2024;
originally announced April 2024.
-
Tango 2: Aligning Diffusion-based Text-to-Audio Generations through Direct Preference Optimization
Authors:
Navonil Majumder,
Chia-Yu Hung,
Deepanway Ghosal,
Wei-Ning Hsu,
Rada Mihalcea,
Soujanya Poria
Abstract:
Generative multimodal content is increasingly prevalent in much of the content creation arena, as it has the potential to allow artists and media personnel to create pre-production mockups by quickly bringing their ideas to life. The generation of audio from text prompts is an important aspect of such processes in the music and film industry. Many of the recent diffusion-based text-to-audio models…
▽ More
Generative multimodal content is increasingly prevalent in much of the content creation arena, as it has the potential to allow artists and media personnel to create pre-production mockups by quickly bringing their ideas to life. The generation of audio from text prompts is an important aspect of such processes in the music and film industry. Many of the recent diffusion-based text-to-audio models focus on training increasingly sophisticated diffusion models on a large set of datasets of prompt-audio pairs. These models do not explicitly focus on the presence of concepts or events and their temporal ordering in the output audio with respect to the input prompt. Our hypothesis is focusing on how these aspects of audio generation could improve audio generation performance in the presence of limited data. As such, in this work, using an existing text-to-audio model Tango, we synthetically create a preference dataset where each prompt has a winner audio output and some loser audio outputs for the diffusion model to learn from. The loser outputs, in theory, have some concepts from the prompt missing or in an incorrect order. We fine-tune the publicly available Tango text-to-audio model using diffusion-DPO (direct preference optimization) loss on our preference dataset and show that it leads to improved audio output over Tango and AudioLDM2, in terms of both automatic- and manual-evaluation metrics.
△ Less
Submitted 17 July, 2024; v1 submitted 15 April, 2024;
originally announced April 2024.
-
Single-image driven 3d viewpoint training data augmentation for effective wine label recognition
Authors:
Yueh-Cheng Huang,
Hsin-Yi Chen,
Cheng-Jui Hung,
Jen-Hui Chuang,
Jenq-Neng Hwang
Abstract:
Confronting the critical challenge of insufficient training data in the field of complex image recognition, this paper introduces a novel 3D viewpoint augmentation technique specifically tailored for wine label recognition. This method enhances deep learning model performance by generating visually realistic training samples from a single real-world wine label image, overcoming the challenges pose…
▽ More
Confronting the critical challenge of insufficient training data in the field of complex image recognition, this paper introduces a novel 3D viewpoint augmentation technique specifically tailored for wine label recognition. This method enhances deep learning model performance by generating visually realistic training samples from a single real-world wine label image, overcoming the challenges posed by the intricate combinations of text and logos. Classical Generative Adversarial Network (GAN) methods fall short in synthesizing such intricate content combination. Our proposed solution leverages time-tested computer vision and image processing strategies to expand our training dataset, thereby broadening the range of training samples for deep learning applications. This innovative approach to data augmentation circumvents the constraints of limited training resources. Using the augmented training images through batch-all triplet metric learning on a Vision Transformer (ViT) architecture, we can get the most discriminative embedding features for every wine label, enabling us to perform one-shot recognition of existing wine labels in the training classes or future newly collected wine labels unavailable in the training. Experimental results show a significant increase in recognition accuracy over conventional 2D data augmentation techniques.
△ Less
Submitted 12 April, 2024;
originally announced April 2024.
-
Deep Learning Approach to Forecasting COVID-19 Cases in Residential Buildings of Hong Kong Public Housing Estates: The Role of Environment and Sociodemographics
Authors:
E. Leung,
J. Guan,
KO. Kwok,
CT. Hung,
CC. Ching,
KC. Chong,
CHK. Yam,
T. Sun,
WH. Tsang,
EK. Yeoh,
A. Lee
Abstract:
Introduction: The current study investigates the complex association between COVID-19 and the studied districts' socioecology (e.g. internal and external built environment, sociodemographic profiles, etc.) to quantify their contributions to the early outbreaks and epidemic resurgence of COVID-19. Methods: We aligned the analytic model's architecture with the hierarchical structure of the resident'…
▽ More
Introduction: The current study investigates the complex association between COVID-19 and the studied districts' socioecology (e.g. internal and external built environment, sociodemographic profiles, etc.) to quantify their contributions to the early outbreaks and epidemic resurgence of COVID-19. Methods: We aligned the analytic model's architecture with the hierarchical structure of the resident's socioecology using a multi-headed hierarchical convolutional neural network to structure the vast array of hierarchically related predictive features representing buildings' internal and external built environments and residents' sociodemographic profiles as model input. COVID-19 cases accumulated in buildings across three adjacent districts in HK, both before and during HK's epidemic resurgence, were modeled. A forward-chaining validation was performed to examine the model's performance in forecasting COVID-19 cases over the 3-, 7-, and 14-day horizons during the two months subsequent to when the model for COVID-19 resurgence was built to align with the forecasting needs in an evolving pandemic. Results: Different sets of factors were found to be linked to the earlier waves of COVID-19 outbreaks compared to the epidemic resurgence of the pandemic. Sociodemographic factors such as work hours, monthly household income, employment types, and the number of non-working adults or children in household populations were of high importance to the studied buildings' COVID-19 case counts during the early waves of COVID-19. Factors constituting one's internal built environment, such as the number of distinct households in the buildings, the number of distinct households per floor, and the number of floors, corridors, and lifts, had the greatest unique contributions to the building-level COVID-19 case counts during epidemic resurgence.
△ Less
Submitted 23 March, 2024;
originally announced March 2024.
-
Analyzing the Variations in Emergency Department Boarding and Testing the Transferability of Forecasting Models across COVID-19 Pandemic Waves in Hong Kong: Hybrid CNN-LSTM approach to quantifying building-level socioecological risk
Authors:
Eman Leung,
Jingjing Guan,
Kin On Kwok,
CT Hung,
CC. Ching,
CK. Chung,
Hector Tsang,
EK Yeoh,
Albert Lee
Abstract:
Emergency department's (ED) boarding (defined as ED waiting time greater than four hours) has been linked to poor patient outcomes and health system performance. Yet, effective forecasting models is rare before COVID-19, lacking during the peri-COVID era. Here, a hybrid convolutional neural network (CNN)-Long short-term memory (LSTM) model was applied to public-domain data sourced from Hong Kong's…
▽ More
Emergency department's (ED) boarding (defined as ED waiting time greater than four hours) has been linked to poor patient outcomes and health system performance. Yet, effective forecasting models is rare before COVID-19, lacking during the peri-COVID era. Here, a hybrid convolutional neural network (CNN)-Long short-term memory (LSTM) model was applied to public-domain data sourced from Hong Kong's Hospital Authority, Department of Health, and Housing Authority. In addition, we sought to identify the phase of the COVID-19 pandemic that most significantly perturbed our complex adaptive healthcare system, thereby revealing a stable pattern of interconnectedness among its components, using deep transfer learning methodology.
Our result shows that 1) the greatest proportion of days with ED boarding was found between waves four and five; 2) the best-performing model for forecasting ED boarding was observed between waves four and five, which was based on features representing time-invariant residential buildings' built environment and sociodemographic profiles and the historical time series of ED boarding and case counts, compared to during the waves when best-performing forecasting is based on time-series features alone; and 3) when the model built from the period between waves four and five was applied to data from other waves via deep transfer learning, the transferred model enhanced the performance of indigenous models.
△ Less
Submitted 17 March, 2024;
originally announced March 2024.
-
Efficient Processing of Subsequent Densest Subgraph Query
Authors:
Chia-Yang Hung,
Chih-Ya Shen
Abstract:
Dense subgraph extraction is a fundamental problem in graph analysis and data mining, aimed at identifying cohesive and densely connected substructures within a given graph. It plays a crucial role in various domains, including social network analysis, biological network analysis, recommendation systems, and community detection. However, extracting a subgraph with the highest node similarity is a…
▽ More
Dense subgraph extraction is a fundamental problem in graph analysis and data mining, aimed at identifying cohesive and densely connected substructures within a given graph. It plays a crucial role in various domains, including social network analysis, biological network analysis, recommendation systems, and community detection. However, extracting a subgraph with the highest node similarity is a lack of exploration. To address this problem, we studied the Member Selection Problem and extended it with a dynamic constraint variant. By incorporating dynamic constraints, our algorithm can adapt to changing conditions or requirements, allowing for more flexible and personalized subgraph extraction. This approach enables the algorithm to provide tailored solutions that meet specific needs, even in scenarios where constraints may vary over time. We also provide the theoretical analysis to show that our algorithm is 1/3-approximation. Eventually, the experiments show that our algorithm is effective and efficient in tackling the member selection problem with dynamic constraints.
△ Less
Submitted 29 February, 2024;
originally announced February 2024.
-
Design, Construction, and Performance of the GEM based Radial Time Projection Chamber for the BONuS12 Experiment with CLAS12
Authors:
I. Albayrak,
S. Aune,
C. Ayerbe Gayoso,
P. Baron,
S. Bültmann,
G. Charles,
M. E. Christy,
G. Dodge,
N. Dzbenski,
R. Dupré,
K. Griffioen,
M. Hattawy,
Y. C. Hung,
N. Kalantarians,
S. Kuhn,
I. Mandjavidze,
A. Nadeeshani,
M. Ouillon,
P. Pandey,
D. Payette,
M. Pokhrel,
J. Poudel,
A. S. Tadepalli,
M. Vandenbroucke
Abstract:
A new radial time projection chamber based on Gas Electron Multiplier amplification layers was developed for the BONuS12 experiment in Hall B at Jefferson Lab. This device represents a significant evolutionary development over similar devices constructed for previous experiments, including cylindrical amplification layers constructed from single continuous GEM foils with less than 1\% dead area. P…
▽ More
A new radial time projection chamber based on Gas Electron Multiplier amplification layers was developed for the BONuS12 experiment in Hall B at Jefferson Lab. This device represents a significant evolutionary development over similar devices constructed for previous experiments, including cylindrical amplification layers constructed from single continuous GEM foils with less than 1\% dead area. Particular attention had been paid to producing excellent geometric uniformity of all electrodes, including the very thin metalized polyester film of the cylindrical cathode. This manuscript describes the design, construction, and performance of this new detector.
△ Less
Submitted 2 February, 2024;
originally announced February 2024.
-
Coronary CTA and Quantitative Cardiac CT Perfusion (CCTP) in Coronary Artery Disease
Authors:
Hao Wu,
Yingnan Song,
Ammar Hoori,
Ananya Subramaniam,
Juhwan Lee,
Justin Kim,
Tao Hu,
Sadeer Al-Kindi,
Wei-Ming Huang,
Chun-Ho Yun,
Chung-Lieh Hung,
Sanjay Rajagopalan,
David L. Wilson
Abstract:
We assessed the benefit of combining stress cardiac CT perfusion (CCTP) myocardial blood flow (MBF) with coronary CT angiography (CCTA) using our innovative CCTP software. By combining CCTA and CCTP, one can uniquely identify a flow limiting stenosis (obstructive-lesion + low-MBF) versus MVD (no-obstructive-lesion + low-MBF. We retrospectively evaluated 104 patients with suspected CAD, including 1…
▽ More
We assessed the benefit of combining stress cardiac CT perfusion (CCTP) myocardial blood flow (MBF) with coronary CT angiography (CCTA) using our innovative CCTP software. By combining CCTA and CCTP, one can uniquely identify a flow limiting stenosis (obstructive-lesion + low-MBF) versus MVD (no-obstructive-lesion + low-MBF. We retrospectively evaluated 104 patients with suspected CAD, including 18 with diabetes, who underwent CCTA+CCTP. Whole heart and territorial MBF was assessed using our automated pipeline for CCTP analysis that included beam hardening correction; temporal scan registration; automated segmentation; fast, accurate, robust MBF estimation; and visualization. Stenosis severity was scored using the CCTA coronary-artery-disease-reporting-and-data-system (CAD-RADS), with obstructive stenosis deemed as CAD-RADS>=3. We established a threshold MBF (MBF=199-mL/min-100g) for normal perfusion. In patients with CAD-RADS>=3, 28/37(76%) patients showed ischemia in the corresponding territory. Two patients with obstructive disease had normal perfusion, suggesting collaterals and/or a hemodynamically insignificant stenosis. Among diabetics, 10 of 18 (56%) demonstrated diffuse ischemia consistent with MVD. Among non-diabetics, only 6% had MVD. Sex-specific prevalence of MVD was 21%/24% (M/F). On a per-vessel basis (n=256), MBF showed a significant difference between territories with and without obstructive stenosis (165 +/- 61 mL/min-100g vs. 274 +/- 62 mL/min-100g, p <0.05). A significant and negative rank correlation (rho=-0.53, p<0.05) between territory MBF and CAD-RADS was seen. CCTA in conjunction with a new automated quantitative CCTP approach can augment the interpretation of CAD, enabling the distinction of ischemia due to obstructive lesions and MVD.
△ Less
Submitted 30 January, 2024;
originally announced January 2024.
-
Pericoronary adipose tissue feature analysis in CT calcium score images with comparison to coronary CTA
Authors:
Yingnan Song,
Hao Wu,
Juhwan Lee,
Justin Kim,
Ammar Hoori,
Tao Hu,
Vladislav Zimin,
Mohamed Makhlouf,
Sadeer Al-Kindi,
Sanjay Rajagopalan,
Chun-Ho Yun,
Chung-Lieh Hung,
David L. Wilson
Abstract:
We investigated the feasibility and advantages of using non-contrast CT calcium score (CTCS) images to assess pericoronary adipose tissue (PCAT) and its association with major adverse cardiovascular events (MACE). PCAT features from coronary CTA (CCTA) have been shown to be associated with cardiovascular risk but are potentially confounded by iodine. If PCAT in CTCS images can be similarly analyze…
▽ More
We investigated the feasibility and advantages of using non-contrast CT calcium score (CTCS) images to assess pericoronary adipose tissue (PCAT) and its association with major adverse cardiovascular events (MACE). PCAT features from coronary CTA (CCTA) have been shown to be associated with cardiovascular risk but are potentially confounded by iodine. If PCAT in CTCS images can be similarly analyzed, it would avoid this issue and enable its inclusion in formal risk assessment from readily available, low-cost CTCS images. To identify coronaries in CTCS images that have subtle visual evidence of vessels, we registered CTCS with paired CCTA images having coronary labels. We developed a novel axial-disk method giving regions for analyzing PCAT features in three main coronary arteries. We analyzed novel hand-crafted and radiomic features using univariate and multivariate logistic regression prediction of MACE and compared results against those from CCTA. Registration accuracy was sufficient to enable the identification of PCAT regions in CTCS images. Motion or beam hardening artifacts were often present in high-contrast CCTA but not CTCS. Mean HU and volume were increased in both CTCS and CCTA for MACE group. There were significant positive correlations between some CTCS and CCTA features, suggesting that similar characteristics were obtained. Using hand-crafted/radiomics from CTCS and CCTA, AUCs were 0.82/0.79 and 0.83/0.77 respectively, while Agatston gave AUC=0.73. Preliminarily, PCAT features can be assessed from three main coronary arteries in non-contrast CTCS images with performance characteristics that are at the very least comparable to CCTA.
△ Less
Submitted 27 January, 2024;
originally announced January 2024.
-
SoundShift: Exploring Sound Manipulations for Accessible Mixed-Reality Awareness
Authors:
Ruei-Che Chang,
Chia-Sheng Hung,
Bing-Yu Chen,
Dhruv Jain,
Anhong Guo
Abstract:
Mixed-reality (MR) soundscapes blend real-world sound with virtual audio from hearing devices, presenting intricate auditory information that is hard to discern and differentiate. This is particularly challenging for blind or visually impaired individuals, who rely on sounds and descriptions in their everyday lives. To understand how complex audio information is consumed, we analyzed online forum…
▽ More
Mixed-reality (MR) soundscapes blend real-world sound with virtual audio from hearing devices, presenting intricate auditory information that is hard to discern and differentiate. This is particularly challenging for blind or visually impaired individuals, who rely on sounds and descriptions in their everyday lives. To understand how complex audio information is consumed, we analyzed online forum posts within the blind community, identifying prevailing challenges, needs, and desired solutions. We synthesized the results and propose SoundShift for increasing MR sound awareness, which includes six sound manipulations: Transparency Shift, Envelope Shift, Position Shift, Style Shift, Time Shift, and Sound Append. To evaluate the effectiveness of SoundShift, we conducted a user study with 18 blind participants across three simulated MR scenarios, where participants identified specific sounds within intricate soundscapes. We found that SoundShift increased MR sound awareness and minimized cognitive load. Finally, we developed three real-world example applications to demonstrate the practicality of SoundShift.
△ Less
Submitted 26 May, 2024; v1 submitted 19 January, 2024;
originally announced January 2024.
-
Trapped atoms and superradiance on an integrated nanophotonic microring circuit
Authors:
Xinchao Zhou,
Hikaru Tamura,
Tzu-Han Chang,
Chen-Lung Hung
Abstract:
Interfacing cold atoms with integrated nanophotonic devices could offer new paradigms for engineering atom-light interactions and provide a potentially scalable route for quantum sensing, metrology, and quantum information processing. However, it remains a challenging task to efficiently trap a large ensemble of cold atoms on an integrated nanophotonic circuit. Here, we demonstrate direct loading…
▽ More
Interfacing cold atoms with integrated nanophotonic devices could offer new paradigms for engineering atom-light interactions and provide a potentially scalable route for quantum sensing, metrology, and quantum information processing. However, it remains a challenging task to efficiently trap a large ensemble of cold atoms on an integrated nanophotonic circuit. Here, we demonstrate direct loading of an ensemble of up to 70 atoms into an optical microtrap on a nanophotonic microring circuit. Efficient trap loading is achieved by employing degenerate Raman-sideband cooling in the microtrap, where a built-in spin-motion coupling arises directly from the vector light shift of the evanescent field potential on a microring. Atoms are cooled into the trap via optical pumping with a single free space beam. We have achieved a trap lifetime approaching 700ms under continuous cooling. We show that the trapped atoms display large cooperative coupling and superradiant decay into a whispering-gallery mode of the microring resonator, holding promise for explorations of new collective effects. Our technique can be extended to trapping a large ensemble of cold atoms on nanophotonic circuits for various quantum applications.
△ Less
Submitted 21 June, 2024; v1 submitted 21 December, 2023;
originally announced December 2023.
-
Walking a Tightrope -- Evaluating Large Language Models in High-Risk Domains
Authors:
Chia-Chien Hung,
Wiem Ben Rim,
Lindsay Frost,
Lars Bruckner,
Carolin Lawrence
Abstract:
High-risk domains pose unique challenges that require language models to provide accurate and safe responses. Despite the great success of large language models (LLMs), such as ChatGPT and its variants, their performance in high-risk domains remains unclear. Our study delves into an in-depth analysis of the performance of instruction-tuned LLMs, focusing on factual accuracy and safety adherence. T…
▽ More
High-risk domains pose unique challenges that require language models to provide accurate and safe responses. Despite the great success of large language models (LLMs), such as ChatGPT and its variants, their performance in high-risk domains remains unclear. Our study delves into an in-depth analysis of the performance of instruction-tuned LLMs, focusing on factual accuracy and safety adherence. To comprehensively assess the capabilities of LLMs, we conduct experiments on six NLP datasets including question answering and summarization tasks within two high-risk domains: legal and medical. Further qualitative analysis highlights the existing limitations inherent in current LLMs when evaluating in high-risk domains. This underscores the essential nature of not only improving LLM capabilities but also prioritizing the refinement of domain-specific metrics, and embracing a more human-centric approach to enhance safety and factual reliability. Our findings advance the field toward the concerns of properly evaluating LLMs in high-risk domains, aiming to steer the adaptability of LLMs in fulfilling societal obligations and aligning with forthcoming regulations, such as the EU AI Act.
△ Less
Submitted 25 November, 2023;
originally announced November 2023.
-
Explicit Change Relation Learning for Change Detection in VHR Remote Sensing Images
Authors:
Dalong Zheng,
Zebin Wu,
Jia Liu,
Chih-Cheng Hung,
Zhihui Wei
Abstract:
Change detection has always been a concerned task in the interpretation of remote sensing images. It is essentially a unique binary classification task with two inputs, and there is a change relationship between these two inputs. At present, the mining of change relationship features is usually implicit in the network architectures that contain single-branch or two-branch encoders. However, due to…
▽ More
Change detection has always been a concerned task in the interpretation of remote sensing images. It is essentially a unique binary classification task with two inputs, and there is a change relationship between these two inputs. At present, the mining of change relationship features is usually implicit in the network architectures that contain single-branch or two-branch encoders. However, due to the lack of artificial prior design for change relationship features, these networks cannot learn enough change semantic information and lose more accurate change detection performance. So we propose a network architecture NAME for the explicit mining of change relation features. In our opinion, the change features of change detection should be divided into pre-changed image features, post-changed image features and change relation features. In order to fully mine these three kinds of change features, we propose the triple branch network combining the transformer and convolutional neural network (CNN) to extract and fuse these change features from two perspectives of global information and local information, respectively. In addition, we design the continuous change relation (CCR) branch to further obtain the continuous and detail change relation features to improve the change discrimination capability of the model. The experimental results show that our network performs better, in terms of F1, IoU, and OA, than those of the existing advanced networks for change detection on four public very high-resolution (VHR) remote sensing datasets. Our source code is available at https://github.com/DalongZ/NAME.
△ Less
Submitted 14 November, 2023;
originally announced November 2023.
-
Linking Surface Facts to Large-Scale Knowledge Graphs
Authors:
Gorjan Radevski,
Kiril Gashteovski,
Chia-Chien Hung,
Carolin Lawrence,
Goran Glavaš
Abstract:
Open Information Extraction (OIE) methods extract facts from natural language text in the form of ("subject"; "relation"; "object") triples. These facts are, however, merely surface forms, the ambiguity of which impedes their downstream usage; e.g., the surface phrase "Michael Jordan" may refer to either the former basketball player or the university professor. Knowledge Graphs (KGs), on the other…
▽ More
Open Information Extraction (OIE) methods extract facts from natural language text in the form of ("subject"; "relation"; "object") triples. These facts are, however, merely surface forms, the ambiguity of which impedes their downstream usage; e.g., the surface phrase "Michael Jordan" may refer to either the former basketball player or the university professor. Knowledge Graphs (KGs), on the other hand, contain facts in a canonical (i.e., unambiguous) form, but their coverage is limited by a static schema (i.e., a fixed set of entities and predicates). To bridge this gap, we need the best of both worlds: (i) high coverage of free-text OIEs, and (ii) semantic precision (i.e., monosemy) of KGs. In order to achieve this goal, we propose a new benchmark with novel evaluation protocols that can, for example, measure fact linking performance on a granular triple slot level, while also measuring if a system has the ability to recognize that a surface form has no match in the existing KG. Our extensive evaluation of several baselines show that detection of out-of-KG entities and predicates is more difficult than accurate linking to existing ones, thus calling for more research efforts on this difficult task. We publicly release all resources (data, benchmark and code) on https://github.com/nec-research/fact-linking.
△ Less
Submitted 23 October, 2023;
originally announced October 2023.
-
Who Wrote it and Why? Prompting Large-Language Models for Authorship Verification
Authors:
Chia-Yu Hung,
Zhiqiang Hu,
Yujia Hu,
Roy Ka-Wei Lee
Abstract:
Authorship verification (AV) is a fundamental task in natural language processing (NLP) and computational linguistics, with applications in forensic analysis, plagiarism detection, and identification of deceptive content. Existing AV techniques, including traditional stylometric and deep learning approaches, face limitations in terms of data requirements and lack of explainability. To address thes…
▽ More
Authorship verification (AV) is a fundamental task in natural language processing (NLP) and computational linguistics, with applications in forensic analysis, plagiarism detection, and identification of deceptive content. Existing AV techniques, including traditional stylometric and deep learning approaches, face limitations in terms of data requirements and lack of explainability. To address these limitations, this paper proposes PromptAV, a novel technique that leverages Large-Language Models (LLMs) for AV by providing step-by-step stylometric explanation prompts. PromptAV outperforms state-of-the-art baselines, operates effectively with limited training data, and enhances interpretability through intuitive explanations, showcasing its potential as an effective and interpretable solution for the AV task.
△ Less
Submitted 12 October, 2023;
originally announced October 2023.
-
A Novel Method of Fuzzy Topic Modeling based on Transformer Processing
Authors:
Ching-Hsun Tseng,
Shin-Jye Lee,
Po-Wei Cheng,
Chien Lee,
Chih-Chieh Hung
Abstract:
Topic modeling is admittedly a convenient way to monitor markets trend. Conventionally, Latent Dirichlet Allocation, LDA, is considered a must-do model to gain this type of information. By given the merit of deducing keyword with token conditional probability in LDA, we can know the most possible or essential topic. However, the results are not intuitive because the given topics cannot wholly fit…
▽ More
Topic modeling is admittedly a convenient way to monitor markets trend. Conventionally, Latent Dirichlet Allocation, LDA, is considered a must-do model to gain this type of information. By given the merit of deducing keyword with token conditional probability in LDA, we can know the most possible or essential topic. However, the results are not intuitive because the given topics cannot wholly fit human knowledge. LDA offers the first possible relevant keywords, which also brings out another problem of whether the connection is reliable based on the statistic possibility. It is also hard to decide the topic number manually in advance. As the booming trend of using fuzzy membership to cluster and using transformers to embed words, this work presents the fuzzy topic modeling based on soft clustering and document embedding from state-of-the-art transformer-based model. In our practical application in a press release monitoring, the fuzzy topic modeling gives a more natural result than the traditional output from LDA.
△ Less
Submitted 18 September, 2023;
originally announced September 2023.
-
Quantum Simulation of the Bosonic Kitaev Chain
Authors:
J. H. Busnaina,
Z. Shi,
A. McDonald,
D. Dubyna,
I. Nsanzineza,
Jimmy S. C. Hung,
C. W. Sandbo Chang,
A. A. Clerk,
C. M. Wilson
Abstract:
Superconducting quantum circuits are a natural platform for quantum simulations of a wide variety of important lattice models describing topological phenomena, spanning condensed matter and high-energy physics. One such model is the bosonic analogue of the well-known fermionic Kitaev chain, a 1D tight-binding model with both nearest-neighbor hopping and pairing terms. Despite being fully Hermitian…
▽ More
Superconducting quantum circuits are a natural platform for quantum simulations of a wide variety of important lattice models describing topological phenomena, spanning condensed matter and high-energy physics. One such model is the bosonic analogue of the well-known fermionic Kitaev chain, a 1D tight-binding model with both nearest-neighbor hopping and pairing terms. Despite being fully Hermitian, the bosonic Kitaev chain exhibits a number of striking features associated with non-Hermitian systems, including chiral transport and a dramatic sensitivity to boundary conditions known as the non-Hermitian skin effect. Here, using a multimode superconducting parametric cavity, we implement the bosonic Kitaev chain in synthetic dimensions. The lattice sites are mapped to frequency modes of the cavity, and the $\textit{in situ}$ tunable complex hopping and pairing terms are created by parametric pumping at the mode-difference and mode-sum frequencies, respectively. We experimentally demonstrate important precursors of nontrivial topology and the non-Hermitian skin effect in the bosonic Kitaev chain, including chiral transport, quadrature wavefunction localization, and sensitivity to boundary conditions. Our experiment is an important first step towards exploring genuine many-body non-Hermitian quantum dynamics.
△ Less
Submitted 12 September, 2023;
originally announced September 2023.
-
Demonstrating a long-coherence dual-rail erasure qubit using tunable transmons
Authors:
Harry Levine,
Arbel Haim,
Jimmy S. C. Hung,
Nasser Alidoust,
Mahmoud Kalaee,
Laura DeLorenzo,
E. Alex Wollack,
Patricio Arrangoiz-Arriola,
Amirhossein Khalajhedayati,
Rohan Sanil,
Hesam Moradinejad,
Yotam Vaknin,
Aleksander Kubica,
David Hover,
Shahriar Aghaeimeibodi,
Joshua Ari Alcid,
Christopher Baek,
James Barnett,
Kaustubh Bawdekar,
Przemyslaw Bienias,
Hugh Carson,
Cliff Chen,
Li Chen,
Harut Chinkezian,
Eric M. Chisholm
, et al. (88 additional authors not shown)
Abstract:
Quantum error correction with erasure qubits promises significant advantages over standard error correction due to favorable thresholds for erasure errors. To realize this advantage in practice requires a qubit for which nearly all errors are such erasure errors, and the ability to check for erasure errors without dephasing the qubit. We demonstrate that a "dual-rail qubit" consisting of a pair of…
▽ More
Quantum error correction with erasure qubits promises significant advantages over standard error correction due to favorable thresholds for erasure errors. To realize this advantage in practice requires a qubit for which nearly all errors are such erasure errors, and the ability to check for erasure errors without dephasing the qubit. We demonstrate that a "dual-rail qubit" consisting of a pair of resonantly coupled transmons can form a highly coherent erasure qubit, where transmon $T_1$ errors are converted into erasure errors and residual dephasing is strongly suppressed, leading to millisecond-scale coherence within the qubit subspace. We show that single-qubit gates are limited primarily by erasure errors, with erasure probability $p_\text{erasure} = 2.19(2)\times 10^{-3}$ per gate while the residual errors are $\sim 40$ times lower. We further demonstrate mid-circuit detection of erasure errors while introducing $< 0.1\%$ dephasing error per check. Finally, we show that the suppression of transmon noise allows this dual-rail qubit to preserve high coherence over a broad tunable operating range, offering an improved capacity to avoid frequency collisions. This work establishes transmon-based dual-rail qubits as an attractive building block for hardware-efficient quantum error correction.
△ Less
Submitted 20 March, 2024; v1 submitted 17 July, 2023;
originally announced July 2023.
-
A Simple Embedding Method for Scalar Hyperbolic Conservation Laws on Implicit Surfaces
Authors:
Chun Kit Hung,
Shingyu Leung
Abstract:
We have developed a new embedding method for solving scalar hyperbolic conservation laws on surfaces. The approach represents the interface implicitly by a signed distance function following the typical level set method and some embedding methods. Instead of solving the equation explicitly on the surface, we introduce a modified partial differential equation in a small neighborhood of the interfac…
▽ More
We have developed a new embedding method for solving scalar hyperbolic conservation laws on surfaces. The approach represents the interface implicitly by a signed distance function following the typical level set method and some embedding methods. Instead of solving the equation explicitly on the surface, we introduce a modified partial differential equation in a small neighborhood of the interface. This embedding equation is developed based on a push-forward operator that can extend any tangential flux vectors from the surface to a neighboring level surface. This operator is easy to compute and involves only the level set function and the corresponding Hessian. The resulting solution is constant in the normal direction of the interface. To demonstrate the accuracy and effectiveness of our method, we provide some two- and three-dimensional examples.
△ Less
Submitted 14 July, 2023;
originally announced July 2023.
-
Cardiac CT perfusion imaging of pericoronary adipose tissue (PCAT) highlights potential confounds in coronary CTA
Authors:
Hao Wu,
Yingnan Song,
Ammar Hoori,
Ananya Subramaniam,
Juhwan Lee,
Justin Kim,
Tao Hu,
Sadeer Al-Kindi,
Wei-Ming Huang,
Chun-Ho Yun,
Chung-Lieh Hung,
Sanjay Rajagopalan,
David L. Wilson
Abstract:
Features of pericoronary adipose tissue (PCAT) assessed from coronary computed tomography angiography (CCTA) are associated with inflammation and cardiovascular risk. As PCAT is vascularly connected with coronary vasculature, the presence of iodine is a potential confounding factor on PCAT HU and textures that has not been adequately investigated. Use dynamic cardiac CT perfusion (CCTP) to inform…
▽ More
Features of pericoronary adipose tissue (PCAT) assessed from coronary computed tomography angiography (CCTA) are associated with inflammation and cardiovascular risk. As PCAT is vascularly connected with coronary vasculature, the presence of iodine is a potential confounding factor on PCAT HU and textures that has not been adequately investigated. Use dynamic cardiac CT perfusion (CCTP) to inform contrast determinants of PCAT assessment. From CCTP, we analyzed HU dynamics of territory-specific PCAT, myocardium, and other adipose depots in patients with coronary artery disease. HU, blood flow, and radiomics were assessed over time. Changes from peak aorta time, Pa, chosen to model the time of CCTA, were obtained. HU in PCAT increased more than in other adipose depots. The estimated blood flow in PCAT was ~23% of that in the contiguous myocardium. Comparing PCAT distal and proximal to a significant stenosis, we found less enhancement and longer time-to-peak distally. Two-second offsets [before, after] Pa resulted in [ 4-HU, 3-HU] differences in PCAT. Due to changes in HU, the apparent PCAT volume reduced ~15% from the first scan (P1) to Pa using a conventional fat window. Comparing radiomic features over time, 78% of features changed >10% relative to P1. CCTP elucidates blood flow in PCAT and enables analysis of PCAT features over time. PCAT assessments (HU, apparent volume, and radiomics) are sensitive to acquisition timing and the presence of obstructive stenosis, which may confound the interpretation of PCAT in CCTA images. Data normalization may be in order.
△ Less
Submitted 27 June, 2023;
originally announced June 2023.
-
TADA: Efficient Task-Agnostic Domain Adaptation for Transformers
Authors:
Chia-Chien Hung,
Lukas Lange,
Jannik Strötgen
Abstract:
Intermediate training of pre-trained transformer-based language models on domain-specific data leads to substantial gains for downstream tasks. To increase efficiency and prevent catastrophic forgetting alleviated from full domain-adaptive pre-training, approaches such as adapters have been developed. However, these require additional parameters for each layer, and are criticized for their limited…
▽ More
Intermediate training of pre-trained transformer-based language models on domain-specific data leads to substantial gains for downstream tasks. To increase efficiency and prevent catastrophic forgetting alleviated from full domain-adaptive pre-training, approaches such as adapters have been developed. However, these require additional parameters for each layer, and are criticized for their limited expressiveness. In this work, we introduce TADA, a novel task-agnostic domain adaptation method which is modular, parameter-efficient, and thus, data-efficient. Within TADA, we retrain the embeddings to learn domain-aware input representations and tokenizers for the transformer encoder, while freezing all other parameters of the model. Then, task-specific fine-tuning is performed. We further conduct experiments with meta-embeddings and newly introduced meta-tokenizers, resulting in one model per task in multi-domain use cases. Our broad evaluation in 4 downstream tasks for 14 domains across single- and multi-domain setups and high- and low-resource scenarios reveals that TADA is an effective and efficient alternative to full domain-adaptive pre-training and adapters for domain adaptation, while not introducing additional parameters or complex training steps.
△ Less
Submitted 22 May, 2023;
originally announced May 2023.
-
Temporal Convolution Network Based Onset Detection and Query by Humming System Design
Authors:
Yu Cheng Hung,
Jian-Jiun Ding
Abstract:
Onsets are a key factor to split audio into several notes. In this paper, we ensemble multiple temporal convolution network (TCN) based model and utilize a restricted frequency range spectrogram to achieve more robust onset detection. Different from the present onset detection of QBH system which is only available in a clean scenario, our proposal of onset detection and speech enhancement can prev…
▽ More
Onsets are a key factor to split audio into several notes. In this paper, we ensemble multiple temporal convolution network (TCN) based model and utilize a restricted frequency range spectrogram to achieve more robust onset detection. Different from the present onset detection of QBH system which is only available in a clean scenario, our proposal of onset detection and speech enhancement can prevent noise from affecting onset detection function (ODF). Compared to the CNN model which exploits spatial features of the spectrogram, the TCN model exploits both spatial and temporal features of the spectrogram. As the usage of QBH in noisy scenarios, we apply the TCN-based speech enhancement as a preprocessor of QBH. With the combinations of TCN-based speech enhancement and onset detection, simulations show that the proposal can enable the QBH system in both noisy and clean circumstances with short response time.
△ Less
Submitted 7 June, 2023; v1 submitted 8 May, 2023;
originally announced May 2023.
-
Pitch Estimation by Denoising Preprocessor and Hybrid Estimation Model
Authors:
Yu Cheng Hung,
Ping Hung Chen,
Jian Jiun Ding
Abstract:
Pitch estimation is to estimate the fundamental frequency and the midi number and plays a critical role in music signal analysis and vocal signal processing. In this work, we proposed a new architecture based on a learning-based enhancement preprocessor and a combination of several traditional and deep learning pitch estimation methods to achieve better pitch estimation performance in both noisy a…
▽ More
Pitch estimation is to estimate the fundamental frequency and the midi number and plays a critical role in music signal analysis and vocal signal processing. In this work, we proposed a new architecture based on a learning-based enhancement preprocessor and a combination of several traditional and deep learning pitch estimation methods to achieve better pitch estimation performance in both noisy and clean scenarios. We test 17 different types of noise and 4 SNRdb noise levels. The results show that the proposed pitch estimation can perform better in both noisy and clean scenarios with short response time.
△ Less
Submitted 6 May, 2023;
originally announced May 2023.
-
Observation of self-oscillating supersonic flow across an acoustic horizon in two dimensions
Authors:
Hikaru Tamura,
Sergei Khlebnikov,
Cheng-An Chen,
Chen-Lung Hung
Abstract:
Understanding the dynamics and stability of transonic flows in quantum fluids, especially for those beyond one spatial dimension, is an outstanding challenge, with applications ranging from nonlinear optics and condensed matter to analogue gravity. One intriguing possibility is that a system with a spatially bounded supersonic flow may evolve into a self-oscillating state that periodically emits s…
▽ More
Understanding the dynamics and stability of transonic flows in quantum fluids, especially for those beyond one spatial dimension, is an outstanding challenge, with applications ranging from nonlinear optics and condensed matter to analogue gravity. One intriguing possibility is that a system with a spatially bounded supersonic flow may evolve into a self-oscillating state that periodically emits solitons, in a process originating from the well-known Landau instability. Here, we report observation of self-oscillating supersonic flows in a two-dimensional atomic superfluid. By imposing a local particle sink with strong loss, we induce a convergent radial flow forming an acoustic analogue of a black-hole horizon and an inner horizon around the sink. The observed superflow appears to be modulated by quasi-periodic bursts of superluminal signals. We measure their frequencies and find agreement with numerical simulations of soliton oscillation frequencies within the black-hole horizon. The presented experiment demonstrates a new method for creating supersonic flows in atomic superfluids, which may find applications in quantum simulations of curved spacetime, supersonic turbulence, and self-oscillating dynamics in dissipative many-body systems.
△ Less
Submitted 15 January, 2024; v1 submitted 20 April, 2023;
originally announced April 2023.
-
Nanophotonic cavity cooling of a single atom
Authors:
Chenwei Lv,
Ming Zhu,
Sambit Banerjee,
Chen-Lung Hung
Abstract:
We investigate external and internal dynamics of a two-level atom strongly coupled to a weakly pumped nanophotonic cavity. We calculate the dipole force, friction force, and stochastic force due to the cavity pump field, and show that a three-dimensional cooling region exists near the surface of a cavity. Using a two-color evanescent field trap as an example, we perform three-dimensional Monte-Car…
▽ More
We investigate external and internal dynamics of a two-level atom strongly coupled to a weakly pumped nanophotonic cavity. We calculate the dipole force, friction force, and stochastic force due to the cavity pump field, and show that a three-dimensional cooling region exists near the surface of a cavity. Using a two-color evanescent field trap as an example, we perform three-dimensional Monte-Carlo simulations to demonstrate efficient loading of single atoms into a trap by momentum diffusion, and the stability of cavity cooling near the trap center. Our analyses show that cavity cooling can be a promising method for directly loading cold atoms from free-space into a surface micro-trap. We further discuss the impact of pump intensity on atom trapping and loading efficiency.
△ Less
Submitted 10 April, 2023;
originally announced April 2023.
-
BPCE: A Prototype for Co-Evolution between Business Process Variants through Configurable Process Model
Authors:
Linyue Liu,
Xi Guo,
Chun Ouyang,
Patrick C. K. Hung,
Hong-Yu Zhang,
Keqing He,
Chen Mo,
Zaiwen Feng
Abstract:
With the continuous development of business process management technology, the increasing business process models are usually owned by large enterprises. In large enterprises, different stakeholders may modify the same business process model. In order to better manage the changeability of processes, they adopt configurable business process models to manage process variants. However, the process va…
▽ More
With the continuous development of business process management technology, the increasing business process models are usually owned by large enterprises. In large enterprises, different stakeholders may modify the same business process model. In order to better manage the changeability of processes, they adopt configurable business process models to manage process variants. However, the process variants will vary with the change in enterprise business demands. Therefore, it is necessary to explore the co-evolution of the process variants so as to effectively manage the business process family. To this end, a novel framework for co-evolution between business process variants through a configurable process model is proposed in this work. First, the mapping relationship between process variants and configurable models is standardized in this study. A series of change operations and change propagation operations between process variants and configurable models are further defined for achieving propagation. Then, an overall algorithm is proposed for achieving co-evolution of process variants. Next, a prototype is developed for managing change synchronization between process variants and configurable process models. Finally, the effectiveness and efficiency of our proposed process change propagation method are verified based on experiments on two business process datasets. The experimental results show that our approach implements the co-evolution of process variants with high accuracy and efficiency.
△ Less
Submitted 30 March, 2023;
originally announced March 2023.
-
Production of Porous Glass-foam Materials from Photovoltaic Panel Waste Glass
Authors:
Bui Khac Thach,
Le Nhat Tan,
Do Quang Minh,
Ly Cam Hung,
Phan Dinh Tuan
Abstract:
The Solar energy production is growing quickly for the global demand of renewa-ble one, decrease the dependence on fossil fuels. However, disposing of used pho-tovoltaic (PV) panels will be a serious environmental challenge in the future dec-ades since the solar panels would eventually become a source of hazardous waste. The potential of waste solar panel glass to generate porous glass material wi…
▽ More
The Solar energy production is growing quickly for the global demand of renewa-ble one, decrease the dependence on fossil fuels. However, disposing of used pho-tovoltaic (PV) panels will be a serious environmental challenge in the future dec-ades since the solar panels would eventually become a source of hazardous waste. The potential of waste solar panel glass to generate porous glass material with the addition of CaCO3 and water glass was assessed in this study. The porous glass firing temperature range, from 830°C - 910°C, was determined using a simu-lation of heating microscope technique. The created samples have the smallest volumetric density of 0.25 g/cm3 and the largest water absorption of 303.08 wt.%. This indicates that the image analysis of samples during the heating process could be used to identify the firing temperature for better foaming, which was favorably indicated by specific physicochemical parameters. The created glass-foam mate-rials with an apparent porosity up to 81.49% could be used as a water-retaining medium in hydroponic and aquaponic systems
△ Less
Submitted 25 March, 2023;
originally announced March 2023.
-
Leveraging Scene Embeddings for Gradient-Based Motion Planning in Latent Space
Authors:
Jun Yamada,
Chia-Man Hung,
Jack Collins,
Ioannis Havoutis,
Ingmar Posner
Abstract:
Motion planning framed as optimisation in structured latent spaces has recently emerged as competitive with traditional methods in terms of planning success while significantly outperforming them in terms of computational speed. However, the real-world applicability of recent work in this domain remains limited by the need to express obstacle information directly in state-space, involving simple g…
▽ More
Motion planning framed as optimisation in structured latent spaces has recently emerged as competitive with traditional methods in terms of planning success while significantly outperforming them in terms of computational speed. However, the real-world applicability of recent work in this domain remains limited by the need to express obstacle information directly in state-space, involving simple geometric primitives. In this work we address this challenge by leveraging learned scene embeddings together with a generative model of the robot manipulator to drive the optimisation process. In addition, we introduce an approach for efficient collision checking which directly regularises the optimisation undertaken for planning. Using simulated as well as real-world experiments, we demonstrate that our approach, AMP-LS, is able to successfully plan in novel, complex scenes while outperforming traditional planning baselines in terms of computation speed by an order of magnitude. We show that the resulting system is fast enough to enable closed-loop planning in real-world dynamic scenes.
△ Less
Submitted 6 March, 2023;
originally announced March 2023.
-
Quantum defects from single surface exhibit strong mutual interactions
Authors:
Chih-Chiao Hung,
Tim Kohler,
Kevin D. Osborn
Abstract:
Two-level system (TLS) defects constitute a major decoherence source of quantum information science, but they are generally less understood at material interfaces than in deposited films. Here we study surface TLSs at the metal-air interface, by probing them using a quasi-uniform field within vacuum-gap (VG) capacitors of resonators. The VG capacitor has a nano-gap which creates an order-of-magnit…
▽ More
Two-level system (TLS) defects constitute a major decoherence source of quantum information science, but they are generally less understood at material interfaces than in deposited films. Here we study surface TLSs at the metal-air interface, by probing them using a quasi-uniform field within vacuum-gap (VG) capacitors of resonators. The VG capacitor has a nano-gap which creates an order-of-magnitude larger contribution from the metal-air interface than typical resonators used in circuit QED. We measure three phenomena and find qualitative agreement with an interacting TLS model, where near-resonant TLSs experience substantial frequency jitter from the state switching of far-detuned low-frequency TLSs. First, we find that the loss in all of our VG resonators is weakly or logarithmically power dependent, in contrast to data from deposited dielectric films. Second, we add a saturation tone with power $P_{in}$ to a transmission measurement and obtain the TLS Rabi frequency $Ω_{0}$. These data show a substantially weaker $P_{in}$ dependence of $Ω_{0}$ than the prediction from the standard non-interacting TLS model. Lastly, we increase the temperature and find an increased TLS jitter rate and dephasing rate from power-dependent loss and phase noise measurements, respectively. We also anneal samples, which lowers the low-frequency TLS density and jitter rate, but the single-photon loss is found to be unchanged. The results are qualitatively consistent with a fast-switching interacting-TLS model and they contrast the standard model of TLSs which describes TLSs independently.
△ Less
Submitted 11 December, 2023; v1 submitted 1 February, 2023;
originally announced February 2023.
-
An Unpaired Cross-modality Segmentation Framework Using Data Augmentation and Hybrid Convolutional Networks for Segmenting Vestibular Schwannoma and Cochlea
Authors:
Yuzhou Zhuang,
Hong Liu,
Enmin Song,
Coskun Cetinkaya,
Chih-Cheng Hung
Abstract:
The crossMoDA challenge aims to automatically segment the vestibular schwannoma (VS) tumor and cochlea regions of unlabeled high-resolution T2 scans by leveraging labeled contrast-enhanced T1 scans. The 2022 edition extends the segmentation task by including multi-institutional scans. In this work, we proposed an unpaired cross-modality segmentation framework using data augmentation and hybrid con…
▽ More
The crossMoDA challenge aims to automatically segment the vestibular schwannoma (VS) tumor and cochlea regions of unlabeled high-resolution T2 scans by leveraging labeled contrast-enhanced T1 scans. The 2022 edition extends the segmentation task by including multi-institutional scans. In this work, we proposed an unpaired cross-modality segmentation framework using data augmentation and hybrid convolutional networks. Considering heterogeneous distributions and various image sizes for multi-institutional scans, we apply the min-max normalization for scaling the intensities of all scans between -1 and 1, and use the voxel size resampling and center cropping to obtain fixed-size sub-volumes for training. We adopt two data augmentation methods for effectively learning the semantic information and generating realistic target domain scans: generative and online data augmentation. For generative data augmentation, we use CUT and CycleGAN to generate two groups of realistic T2 volumes with different details and appearances for supervised segmentation training. For online data augmentation, we design a random tumor signal reducing method for simulating the heterogeneity of VS tumor signals. Furthermore, we utilize an advanced hybrid convolutional network with multi-dimensional convolutions to adaptively learn sparse inter-slice information and dense intra-slice information for accurate volumetric segmentation of VS tumor and cochlea regions in anisotropic scans. On the crossMoDA2022 validation dataset, our method produces promising results and achieves the mean DSC values of 72.47% and 76.48% and ASSD values of 3.42 mm and 0.53 mm for VS tumor and cochlea regions, respectively.
△ Less
Submitted 27 November, 2022;
originally announced November 2022.
-
Observation of self-patterned defect formation in atomic superfluids -- from ring dark solitons to vortex dipole necklaces
Authors:
Hikaru Tamura,
Cheng-An Chen,
Chen-Lung Hung
Abstract:
Unveiling nonequilibrium dynamics of solitonic and topological defect structures in a multidimensional nonlinear medium is a current frontier across diverse fields. One of the quintessential objects is a ring dark soliton (RDS), whose dynamics are expected to display remarkable interplay between symmetry and self-patterned topological defect formation from a transverse (snake) instability, but it…
▽ More
Unveiling nonequilibrium dynamics of solitonic and topological defect structures in a multidimensional nonlinear medium is a current frontier across diverse fields. One of the quintessential objects is a ring dark soliton (RDS), whose dynamics are expected to display remarkable interplay between symmetry and self-patterned topological defect formation from a transverse (snake) instability, but it has thus far evaded full experimental observations. Here, we report an experimental realization of RDS generation in a two-dimensional atomic superfluid trapped in a circular box. By quenching the confining box potential, we observe an RDS emitted from the edge and its peculiar signature in the radial motion. As an RDS evolves, we observe transverse modulations at discrete azimuthal angles, which clearly result in a patterned formation of a circular vortex dipole array. Through collisions of the vortex dipoles with the box trap, we observe vortex unbinding, vortex pinning to the edge, and emission of rarefaction pulses. Our box-quench protocol opens a new way to study multidimensional dark solitons, structured formation of topological defects, and potentially the dynamics of ordered quantum vortex matter.
△ Less
Submitted 9 September, 2023; v1 submitted 15 November, 2022;
originally announced November 2022.
-
Population and Technological Growth: Evidence from Roe v. Wade
Authors:
John T. H. Wong,
Matthias Hei Man,
Alex Li Cheuk Hung
Abstract:
We exploit the heterogeneous impact of the Roe v. Wade ruling by the US Supreme Court, which ruled most abortion restrictions unconstitutional. Our identifying assumption is that states which had not liberalized their abortion laws prior to Roe would experience a negative birth shock of greater proportion than states which had undergone pre-Roe reforms. We estimate the difference-in-difference in…
▽ More
We exploit the heterogeneous impact of the Roe v. Wade ruling by the US Supreme Court, which ruled most abortion restrictions unconstitutional. Our identifying assumption is that states which had not liberalized their abortion laws prior to Roe would experience a negative birth shock of greater proportion than states which had undergone pre-Roe reforms. We estimate the difference-in-difference in births and use estimated births as an exogenous treatment variable to predict patents per capita. Our results show that one standard deviation increase in cohort starting population increases per capita patents by 0.24 standard deviation. These results suggest that at the margins, increasing fertility can increase patent production. Insofar as patent production is a sufficient proxy for technological growth, increasing births has a positive impact on technological growth. This paper and its results do not pertain to the issue of abortion itself.
△ Less
Submitted 1 November, 2022;
originally announced November 2022.
-
Reaching Through Latent Space: From Joint Statistics to Path Planning in Manipulation
Authors:
Chia-Man Hung,
Shaohong Zhong,
Walter Goodwin,
Oiwi Parker Jones,
Martin Engelcke,
Ioannis Havoutis,
Ingmar Posner
Abstract:
We present a novel approach to path planning for robotic manipulators, in which paths are produced via iterative optimisation in the latent space of a generative model of robot poses. Constraints are incorporated through the use of constraint satisfaction classifiers operating on the same space. Optimisation leverages gradients through our learned models that provide a simple way to combine goal r…
▽ More
We present a novel approach to path planning for robotic manipulators, in which paths are produced via iterative optimisation in the latent space of a generative model of robot poses. Constraints are incorporated through the use of constraint satisfaction classifiers operating on the same space. Optimisation leverages gradients through our learned models that provide a simple way to combine goal reaching objectives with constraint satisfaction, even in the presence of otherwise non-differentiable constraints. Our models are trained in a task-agnostic manner on randomly sampled robot poses. In baseline comparisons against a number of widely used planners, we achieve commensurate performance in terms of task success, planning time and path length, performing successful path planning with obstacle avoidance on a real 7-DoF robot arm.
△ Less
Submitted 21 October, 2022;
originally announced October 2022.
-
Can Demographic Factors Improve Text Classification? Revisiting Demographic Adaptation in the Age of Transformers
Authors:
Chia-Chien Hung,
Anne Lauscher,
Dirk Hovy,
Simone Paolo Ponzetto,
Goran Glavaš
Abstract:
Demographic factors (e.g., gender or age) shape our language. Previous work showed that incorporating demographic factors can consistently improve performance for various NLP tasks with traditional NLP models. In this work, we investigate whether these previous findings still hold with state-of-the-art pretrained Transformer-based language models (PLMs). We use three common specialization methods…
▽ More
Demographic factors (e.g., gender or age) shape our language. Previous work showed that incorporating demographic factors can consistently improve performance for various NLP tasks with traditional NLP models. In this work, we investigate whether these previous findings still hold with state-of-the-art pretrained Transformer-based language models (PLMs). We use three common specialization methods proven effective for incorporating external knowledge into pretrained Transformers (e.g., domain-specific or geographic knowledge). We adapt the language representations for the demographic dimensions of gender and age, using continuous language modeling and dynamic multi-task learning for adaptation, where we couple language modeling objectives with the prediction of demographic classes. Our results, when employing a multilingual PLM, show substantial gains in task performance across four languages (English, German, French, and Danish), which is consistent with the results of previous work. However, controlling for confounding factors - primarily domain and language proficiency of Transformer-based PLMs - shows that downstream performance gains from our demographic adaptation do not actually stem from demographic knowledge. Our results indicate that demographic specialization of PLMs, while holding promise for positive societal impact, still represents an unsolved problem for (modern) NLP.
△ Less
Submitted 9 May, 2023; v1 submitted 13 October, 2022;
originally announced October 2022.
-
On the Limitations of Sociodemographic Adaptation with Transformers
Authors:
Chia-Chien Hung,
Anne Lauscher,
Dirk Hovy,
Simone Paolo Ponzetto,
Goran Glavaš
Abstract:
Sociodemographic factors (e.g., gender or age) shape our language. Previous work showed that incorporating specific sociodemographic factors can consistently improve performance for various NLP tasks in traditional NLP models. We investigate whether these previous findings still hold with state-of-the-art pretrained Transformers. We use three common specialization methods proven effective for inco…
▽ More
Sociodemographic factors (e.g., gender or age) shape our language. Previous work showed that incorporating specific sociodemographic factors can consistently improve performance for various NLP tasks in traditional NLP models. We investigate whether these previous findings still hold with state-of-the-art pretrained Transformers. We use three common specialization methods proven effective for incorporating external knowledge into pretrained Transformers (e.g., domain-specific or geographic knowledge). We adapt the language representations for the sociodemographic dimensions of gender and age, using continuous language modeling and dynamic multi-task learning for adaptation, where we couple language modeling with the prediction of a sociodemographic class. Our results when employing a multilingual model show substantial performance gains across four languages (English, German, French, and Danish). These findings are in line with the results of previous work and hold promise for successful sociodemographic specialization. However, controlling for confounding factors like domain and language shows that, while sociodemographic adaptation does improve downstream performance, the gains do not always solely stem from sociodemographic knowledge. Our results indicate that sociodemographic specialization, while very important, is still an unresolved problem in NLP.
△ Less
Submitted 1 August, 2022;
originally announced August 2022.
-
Intra-agent speech permits zero-shot task acquisition
Authors:
Chen Yan,
Federico Carnevale,
Petko Georgiev,
Adam Santoro,
Aurelia Guy,
Alistair Muldal,
Chia-Chun Hung,
Josh Abramson,
Timothy Lillicrap,
Gregory Wayne
Abstract:
Human language learners are exposed to a trickle of informative, context-sensitive language, but a flood of raw sensory data. Through both social language use and internal processes of rehearsal and practice, language learners are able to build high-level, semantic representations that explain their perceptions. Here, we take inspiration from such processes of "inner speech" in humans (Vygotsky, 1…
▽ More
Human language learners are exposed to a trickle of informative, context-sensitive language, but a flood of raw sensory data. Through both social language use and internal processes of rehearsal and practice, language learners are able to build high-level, semantic representations that explain their perceptions. Here, we take inspiration from such processes of "inner speech" in humans (Vygotsky, 1934) to better understand the role of intra-agent speech in embodied behavior. First, we formally pose intra-agent speech as a semi-supervised problem and develop two algorithms that enable visually grounded captioning with little labeled language data. We then experimentally compute scaling curves over different amounts of labeled data and compare the data efficiency against a supervised learning baseline. Finally, we incorporate intra-agent speech into an embodied, mobile manipulator agent operating in a 3D virtual world, and show that with as few as 150 additional image captions, intra-agent speech endows the agent with the ability to manipulate and answer questions about a new object without any related task-directed experience (zero-shot). Taken together, our experiments suggest that modelling intra-agent speech is effective in enabling embodied agents to learn new tasks efficiently and without direct interaction experience.
△ Less
Submitted 7 June, 2022;
originally announced June 2022.
-
ZusammenQA: Data Augmentation with Specialized Models for Cross-lingual Open-retrieval Question Answering System
Authors:
Chia-Chien Hung,
Tommaso Green,
Robert Litschko,
Tornike Tsereteli,
Sotaro Takeshita,
Marco Bombieri,
Goran Glavaš,
Simone Paolo Ponzetto
Abstract:
This paper introduces our proposed system for the MIA Shared Task on Cross-lingual Open-retrieval Question Answering (COQA). In this challenging scenario, given an input question the system has to gather evidence documents from a multilingual pool and generate from them an answer in the language of the question. We devised several approaches combining different model variants for three main compon…
▽ More
This paper introduces our proposed system for the MIA Shared Task on Cross-lingual Open-retrieval Question Answering (COQA). In this challenging scenario, given an input question the system has to gather evidence documents from a multilingual pool and generate from them an answer in the language of the question. We devised several approaches combining different model variants for three main components: Data Augmentation, Passage Retrieval, and Answer Generation. For passage retrieval, we evaluated the monolingual BM25 ranker against the ensemble of re-rankers based on multilingual pretrained language models (PLMs) and also variants of the shared task baseline, re-training it from scratch using a recently introduced contrastive loss that maintains a strong gradient signal throughout training by means of mixed negative samples. For answer generation, we focused on language- and domain-specialization by means of continued language model (LM) pretraining of existing multilingual encoders. Additionally, for both passage retrieval and answer generation, we augmented the training data provided by the task organizers with automatically generated question-answer pairs created from Wikipedia passages to mitigate the issue of data scarcity, particularly for the low-resource languages for which no training data were provided. Our results show that language- and domain-specialization as well as data augmentation help, especially for low-resource languages.
△ Less
Submitted 30 May, 2022;
originally announced May 2022.
-
Multi2WOZ: A Robust Multilingual Dataset and Conversational Pretraining for Task-Oriented Dialog
Authors:
Chia-Chien Hung,
Anne Lauscher,
Ivan Vulić,
Simone Paolo Ponzetto,
Goran Glavaš
Abstract:
Research on (multi-domain) task-oriented dialog (TOD) has predominantly focused on the English language, primarily due to the shortage of robust TOD datasets in other languages, preventing the systematic investigation of cross-lingual transfer for this crucial NLP application area. In this work, we introduce Multi2WOZ, a new multilingual multi-domain TOD dataset, derived from the well-established…
▽ More
Research on (multi-domain) task-oriented dialog (TOD) has predominantly focused on the English language, primarily due to the shortage of robust TOD datasets in other languages, preventing the systematic investigation of cross-lingual transfer for this crucial NLP application area. In this work, we introduce Multi2WOZ, a new multilingual multi-domain TOD dataset, derived from the well-established English dataset MultiWOZ, that spans four typologically diverse languages: Chinese, German, Arabic, and Russian. In contrast to concurrent efforts, Multi2WOZ contains gold-standard dialogs in target languages that are directly comparable with development and test portions of the English dataset, enabling reliable and comparative estimates of cross-lingual transfer performance for TOD. We then introduce a new framework for multilingual conversational specialization of pretrained language models (PrLMs) that aims to facilitate cross-lingual transfer for arbitrary downstream TOD tasks. Using such conversational PrLMs specialized for concrete target languages, we systematically benchmark a number of zero-shot and few-shot cross-lingual transfer approaches on two standard TOD tasks: Dialog State Tracking and Response Retrieval. Our experiments show that, in most setups, the best performance entails the combination of (I) conversational specialization in the target language and (ii) few-shot transfer for the concrete TOD task. Most importantly, we show that our conversational specialization in the target language allows for an exceptionally sample-efficient few-shot transfer for downstream TOD tasks.
△ Less
Submitted 20 May, 2022;
originally announced May 2022.
-
Miutsu: NTU's TaskBot for the Alexa Prize
Authors:
Yen-Ting Lin,
Hui-Chi Kuo,
Ze-Song Xu,
Ssu Chiu,
Chieh-Chi Hung,
Yi-Cheng Chen,
Chao-Wei Huang,
Yun-Nung Chen
Abstract:
This paper introduces Miutsu, National Taiwan University's Alexa Prize TaskBot, which is designed to assist users in completing tasks requiring multiple steps and decisions in two different domains -- home improvement and cooking. We overview our system design and architectural goals, and detail the proposed core elements, including question answering, task retrieval, social chatting, and various…
▽ More
This paper introduces Miutsu, National Taiwan University's Alexa Prize TaskBot, which is designed to assist users in completing tasks requiring multiple steps and decisions in two different domains -- home improvement and cooking. We overview our system design and architectural goals, and detail the proposed core elements, including question answering, task retrieval, social chatting, and various conversational modules. A dialogue flow is proposed to provide a robust and engaging conversation when handling complex tasks. We discuss the faced challenges during the competition and potential future work.
△ Less
Submitted 16 May, 2022;
originally announced May 2022.
-
Emergence of Intergranular Tunneling Dominated Negative Magnetoresistance in Helimagnetic Manganese Phosphide Nanorod Thin Films
Authors:
B. Muchharla,
R. P. Madhogaria,
D. DeTellem,
C. M. Hung,
A. Chanda,
A. T. Duong,
P. T. Huy,
M. T. Trinh,
S. Cho,
S. Witanachchi,
M. H. Phan
Abstract:
Helical magnets are emerging as a novel class of materials for spintronics and sensor applications; however, research on their charge and spin transport properties in a thin film form is less explored. Herein, we report the temperature and magnetic field dependent charge transport properties of a highly crystalline MnP nanorod thin film over a wide temperature range (2-350 K). The MnP nanorod film…
▽ More
Helical magnets are emerging as a novel class of materials for spintronics and sensor applications; however, research on their charge and spin transport properties in a thin film form is less explored. Herein, we report the temperature and magnetic field dependent charge transport properties of a highly crystalline MnP nanorod thin film over a wide temperature range (2-350 K). The MnP nanorod films of 100 nm thickness were grown on Si substrates at 500 oC using molecular beam epitaxy. The temperature dependent resistivity data exhibits a metallic behavior over the entire measured temperature range. However, large negative magnetoresistance of up to 12% is observed below 50 K at which the system enters a stable helical (screw) magnetic state. In this temperature regime, the MR(H,T) dependence seems to show a magnetic field manipulated phase coexistence. The observed magnetoresistance is dominantly governed by the intergranular spin dependent tunneling mechanism. These findings pinpoint a correlation between the transport and magnetism in this helimagnetic system.
△ Less
Submitted 16 February, 2022;
originally announced February 2022.
-
Exchange Bias and Interface-related Effects in Two-dimensional van der Waals Magnetic Heterostructures: Open Questions and Perspectives
Authors:
Manh-Huong Phan,
Vijaysankar Kalappattil,
Valery Ortiz Jimenez,
Yen Thi Hai Pham,
Nivarthana W. Y. A. Y. Mudiyanselage,
Derick Detellem,
Chang-Ming Hung,
Amit Chanda,
Tatiana Eggers
Abstract:
The exchange bias (EB) effect is known as a fundamentally and technologically important magnetic property of a magnetic bilayer film. It is manifested as a horizontal shift in a magnetic hysteresis loop of a film subject to cooling in the presence of a magnetic field. The EB effect in van der Waals (vdW) heterostructures offers a novel approach for tuning the magnetic properties of the newly disco…
▽ More
The exchange bias (EB) effect is known as a fundamentally and technologically important magnetic property of a magnetic bilayer film. It is manifested as a horizontal shift in a magnetic hysteresis loop of a film subject to cooling in the presence of a magnetic field. The EB effect in van der Waals (vdW) heterostructures offers a novel approach for tuning the magnetic properties of the newly discovered single-layer magnets, as well as adds a new impetus to magnetic vdW heterostructures. Indeed, intriguing EB effects have recently been reported in a variety of low-dimensional vdW magnetic systems ranging from a weakly interlayer-coupled vdW magnet (e.g., Fe3GeTe2) to a bilayer composed of two different magnetic vdW materials (e.g., Fe3GeTe2/CrCl3, Fe3GeTe2/FePS3, Fe3GeTe2/MnPS3), to bilayers of two different vdW defective magnets (e.g., VSe2/MoS2), or to metallic ferromagnet/vdW defective magnet interfaces (e.g., Fe/MoS2). Despite their huge potential in spintronic device applications, the physical origins of the observed EB effects have remained elusive to researchers. We present here a critical review of the EB effect and associated phenomena such as magnetic proximity (MP) in various vdW heterostructure systems and propose approaches to addressing some of the emerging fundamental questions.
△ Less
Submitted 3 January, 2022;
originally announced January 2022.
-
A Lightweight and Accurate Spatial-Temporal Transformer for Traffic Forecasting
Authors:
Guanyao Li,
Shuhan Zhong,
S. -H. Gary Chan,
Ruiyuan Li,
Chih-Chieh Hung,
Wen-Chih Peng
Abstract:
We study the forecasting problem for traffic with dynamic, possibly periodical, and joint spatial-temporal dependency between regions. Given the aggregated inflow and outflow traffic of regions in a city from time slots 0 to t-1, we predict the traffic at time t at any region. Prior arts in the area often consider the spatial and temporal dependencies in a decoupled manner or are rather computatio…
▽ More
We study the forecasting problem for traffic with dynamic, possibly periodical, and joint spatial-temporal dependency between regions. Given the aggregated inflow and outflow traffic of regions in a city from time slots 0 to t-1, we predict the traffic at time t at any region. Prior arts in the area often consider the spatial and temporal dependencies in a decoupled manner or are rather computationally intensive in training with a large number of hyper-parameters to tune. We propose ST-TIS, a novel, lightweight, and accurate Spatial-Temporal Transformer with information fusion and region sampling for traffic forecasting. ST-TIS extends the canonical Transformer with information fusion and region sampling. The information fusion module captures the complex spatial-temporal dependency between regions. The region sampling module is to improve the efficiency and prediction accuracy, cutting the computation complexity for dependency learning from $O(n^2)$ to $O(n\sqrt{n})$, where n is the number of regions. With far fewer parameters than state-of-the-art models, the offline training of our model is significantly faster in terms of tuning and computation (with a reduction of up to $90\%$ on training time and network parameters). Notwithstanding such training efficiency, extensive experiments show that ST-TIS is substantially more accurate in online prediction than state-of-the-art approaches (with an average improvement of up to $9.5\%$ on RMSE, and $12.4\%$ on MAPE).
△ Less
Submitted 3 May, 2022; v1 submitted 30 December, 2021;
originally announced January 2022.
-
MnP films with desired magnetic, magnetocaloric and thermoelectric properties for a perspective magneto-thermo-electric cooling device
Authors:
C. M. Hung,
R. P. Madhogaria,
B. Muchharla,
E. M. Clements,
A. T. Duong,
R. Das,
P. T. Huy,
S. L. Cho,
S. Witanachchi,
H. Srikanth,
Manh-Huong Phan
Abstract:
A perspective magneto-thermo-electric cooling device (MTECD) comprising a central magnetocaloric (MC) material (e.g., Gd) sandwiched by two thermoelectric (TE) materials (e.g., MnP) is proposed. The presence of the TE materials in the MTECD guides the heat flow direction and enhances heat pulsation. In this case, the usage of a ferromagnetic TE material that combines large TE with small MC propert…
▽ More
A perspective magneto-thermo-electric cooling device (MTECD) comprising a central magnetocaloric (MC) material (e.g., Gd) sandwiched by two thermoelectric (TE) materials (e.g., MnP) is proposed. The presence of the TE materials in the MTECD guides the heat flow direction and enhances heat pulsation. In this case, the usage of a ferromagnetic TE material that combines large TE with small MC properties within a similar temperature region can enhance the magnetic flux density and heat exchange efficiency. Here, we show that MnP nanorod-structured films with desired magnetic, MC and TE properties are very promising for use in MTECDs. The films were grown on Si substrates at 300, 400 and 500°C using molecular beam epitaxy. The 400 oC sample shows a desired TE and MC combination. A large power factor of 24.06 μW m-1 K-2 is achieved at room temperature. In this temperature region, the film exhibits a small MC effect (-deltaSM ~0.64 J/kg K and deltaTad ~0.3 K at m0H = 2 T) but ferromagnetism that gives rise to the enhanced MC effect of the central MC material. These properties could enable the MTECD to operate at high frequency.
△ Less
Submitted 8 December, 2021;
originally announced December 2021.
-
Coupling single atoms to a nanophotonic whispering-gallery-mode resonator via optical guiding
Authors:
Xinchao Zhou,
Hikaru Tamura,
Tzu-Han Chang,
Chen-Lung Hung
Abstract:
We demonstrate an efficient optical guiding technique for coupling cold atoms in the near field of a planar nanophotonic circuit, and realize large atom-photon coupling to a whispering-gallery mode in a microring resonator with a single-atom cooperativity $C\gtrsim 8$. The guiding potential is created by diffracted light on a nanophotonic waveguide that smoothly connects to a dipole trap in the fa…
▽ More
We demonstrate an efficient optical guiding technique for coupling cold atoms in the near field of a planar nanophotonic circuit, and realize large atom-photon coupling to a whispering-gallery mode in a microring resonator with a single-atom cooperativity $C\gtrsim 8$. The guiding potential is created by diffracted light on a nanophotonic waveguide that smoothly connects to a dipole trap in the far field for atom guiding with subwavelength precision. We observe atom-induced transparency for light coupled to a microring, characterize the atom-photon coupling rate, extract guided atom flux, and demonstrate on-chip photon routing by single atoms. Our demonstration promises new applications with cold atoms on a nanophotonic circuit for chiral quantum optics and quantum technologies.
△ Less
Submitted 9 March, 2023; v1 submitted 1 November, 2021;
originally announced November 2021.
-
Experimentally revealing anomalously large dipoles in a quantum-circuit dielectric
Authors:
Liuqi Yu,
Shlomi Matityahu,
Yaniv J. Rosen,
Chih-Chiao Hung,
Andrii Maksymov,
Alexander L. Burin,
Moshe Schechter,
Kevin D. Osborn
Abstract:
Quantum two-level systems (TLSs) intrinsic to glasses induce decoherence in many modern quantum devices, such as superconducting qubits. Although the low-temperature physics of these TLSs is usually well-explained by a phenomenological standard tunneling model of independent TLSs, the nature of these TLSs, as well as their behavior out of equilibrium and at high energies above 1 K, remain inconclu…
▽ More
Quantum two-level systems (TLSs) intrinsic to glasses induce decoherence in many modern quantum devices, such as superconducting qubits. Although the low-temperature physics of these TLSs is usually well-explained by a phenomenological standard tunneling model of independent TLSs, the nature of these TLSs, as well as their behavior out of equilibrium and at high energies above 1 K, remain inconclusive. Here we measure the non-equilibrium dielectric loss of TLSs in amorphous silicon using a superconducting resonator, where energies of TLSs are varied in time using a swept electric field. Our results show the existence of two distinct ensembles of TLSs, interacting weakly and strongly with phonons, where the latter also possesses anomalously large electric dipole moment. These results may shed new light on the low temperature characteristics of amorphous solids, and hold implications to experiments and applications in quantum devices using time-varying electric fields.
△ Less
Submitted 28 July, 2022; v1 submitted 20 October, 2021;
originally announced October 2021.
-
DS-TOD: Efficient Domain Specialization for Task Oriented Dialog
Authors:
Chia-Chien Hung,
Anne Lauscher,
Simone Paolo Ponzetto,
Goran Glavaš
Abstract:
Recent work has shown that self-supervised dialog-specific pretraining on large conversational datasets yields substantial gains over traditional language modeling (LM) pretraining in downstream task-oriented dialog (TOD). These approaches, however, exploit general dialogic corpora (e.g., Reddit) and thus presumably fail to reliably embed domain-specific knowledge useful for concrete downstream TO…
▽ More
Recent work has shown that self-supervised dialog-specific pretraining on large conversational datasets yields substantial gains over traditional language modeling (LM) pretraining in downstream task-oriented dialog (TOD). These approaches, however, exploit general dialogic corpora (e.g., Reddit) and thus presumably fail to reliably embed domain-specific knowledge useful for concrete downstream TOD domains. In this work, we investigate the effects of domain specialization of pretrained language models (PLMs) for TOD. Within our DS-TOD framework, we first automatically extract salient domain-specific terms, and then use them to construct DomainCC and DomainReddit -- resources that we leverage for domain-specific pretraining, based on (i) masked language modeling (MLM) and (ii) response selection (RS) objectives, respectively. We further propose a resource-efficient and modular domain specialization by means of domain adapters -- additional parameter-light layers in which we encode the domain knowledge. Our experiments with prominent TOD tasks -- dialog state tracking (DST) and response retrieval (RR) -- encompassing five domains from the MultiWOZ benchmark demonstrate the effectiveness of DS-TOD. Moreover, we show that the light-weight adapter-based specialization (1) performs comparably to full fine-tuning in single domain setups and (2) is particularly suitable for multi-domain specialization, where besides advantageous computational footprint, it can offer better TOD performance.
△ Less
Submitted 20 May, 2022; v1 submitted 15 October, 2021;
originally announced October 2021.