Search | arXiv e-print repository

AUTOHALLUSION: Automatic Generation of Hallucination Benchmarks for Vision-Language Models

Authors: Xiyang Wu, Tianrui Guan, Dianqi Li, Shuaiyi Huang, Xiaoyu Liu, Xijun Wang, Ruiqi Xian, Abhinav Shrivastava, Furong Huang, Jordan Lee Boyd-Graber, Tianyi Zhou, Dinesh Manocha

Abstract: Large vision-language models (LVLMs) hallucinate: certain context cues in an image may trigger the language module's overconfident and incorrect reasoning on abnormal or hypothetical objects. Though a few benchmarks have been developed to investigate LVLM hallucinations, they mainly rely on hand-crafted corner cases whose fail patterns may hardly generalize, and finetuning on them could undermine… ▽ More Large vision-language models (LVLMs) hallucinate: certain context cues in an image may trigger the language module's overconfident and incorrect reasoning on abnormal or hypothetical objects. Though a few benchmarks have been developed to investigate LVLM hallucinations, they mainly rely on hand-crafted corner cases whose fail patterns may hardly generalize, and finetuning on them could undermine their validity. These motivate us to develop the first automatic benchmark generation approach, AUTOHALLUSION, that harnesses a few principal strategies to create diverse hallucination examples. It probes the language modules in LVLMs for context cues and uses them to synthesize images by: (1) adding objects abnormal to the context cues; (2) for two co-occurring objects, keeping one and excluding the other; or (3) removing objects closely tied to the context cues. It then generates image-based questions whose ground-truth answers contradict the language module's prior. A model has to overcome contextual biases and distractions to reach correct answers, while incorrect or inconsistent answers indicate hallucinations. AUTOHALLUSION enables us to create new benchmarks at the minimum cost and thus overcomes the fragility of hand-crafted benchmarks. It also reveals common failure patterns and reasons, providing key insights to detect, avoid, or control hallucinations. Comprehensive evaluations of top-tier LVLMs, e.g., GPT-4V(ision), Gemini Pro Vision, Claude 3, and LLaVA-1.5, show a 97.7% and 98.7% success rate of hallucination induction on synthetic and real-world datasets of AUTOHALLUSION, paving the way for a long battle against hallucinations. △ Less

Submitted 16 June, 2024; originally announced June 2024.

arXiv:2405.04034 [pdf, other]

Differentially Private Post-Processing for Fair Regression

Authors: Ruicheng Xian, Qiaobo Li, Gautam Kamath, Han Zhao

Abstract: This paper describes a differentially private post-processing algorithm for learning fair regressors satisfying statistical parity, addressing privacy concerns of machine learning models trained on sensitive data, as well as fairness concerns of their potential to propagate historical biases. Our algorithm can be applied to post-process any given regressor to improve fairness by remapping its outp… ▽ More This paper describes a differentially private post-processing algorithm for learning fair regressors satisfying statistical parity, addressing privacy concerns of machine learning models trained on sensitive data, as well as fairness concerns of their potential to propagate historical biases. Our algorithm can be applied to post-process any given regressor to improve fairness by remapping its outputs. It consists of three steps: first, the output distributions are estimated privately via histogram density estimation and the Laplace mechanism, then their Wasserstein barycenter is computed, and the optimal transports to the barycenter are used for post-processing to satisfy fairness. We analyze the sample complexity of our algorithm and provide fairness guarantee, revealing a trade-off between the statistical bias and variance induced from the choice of the number of bins in the histogram, in which using less bins always favors fairness at the expense of error. △ Less

Submitted 7 May, 2024; originally announced May 2024.

Comments: ICML 2024. Code is at https://github.com/rxian/fair-regression

arXiv:2405.04025 [pdf, other]

Optimal Group Fair Classifiers from Linear Post-Processing

Authors: Ruicheng Xian, Han Zhao

Abstract: We propose a post-processing algorithm for fair classification that mitigates model bias under a unified family of group fairness criteria covering statistical parity, equal opportunity, and equalized odds, applicable to multi-class problems and both attribute-aware and attribute-blind settings. It achieves fairness by re-calibrating the output score of the given base model with a "fairness cost"… ▽ More We propose a post-processing algorithm for fair classification that mitigates model bias under a unified family of group fairness criteria covering statistical parity, equal opportunity, and equalized odds, applicable to multi-class problems and both attribute-aware and attribute-blind settings. It achieves fairness by re-calibrating the output score of the given base model with a "fairness cost" -- a linear combination of the (predicted) group memberships. Our algorithm is based on a representation result showing that the optimal fair classifier can be expressed as a linear post-processing of the loss function and the group predictor, derived via using these as sufficient statistics to reformulate the fair classification problem as a linear program. The parameters of the post-processor are estimated by solving the empirical LP. Experiments on benchmark datasets show the efficiency and effectiveness of our algorithm at reducing disparity compared to existing algorithms, including in-processing, especially on larger problems. △ Less

Submitted 7 May, 2024; originally announced May 2024.

Comments: Code is at https://github.com/rxian/fair-classification

arXiv:2404.03187 [pdf, other]

AGL-NET: Aerial-Ground Cross-Modal Global Localization with Varying Scales

Authors: Tianrui Guan, Ruiqi Xian, Xijun Wang, Xiyang Wu, Mohamed Elnoor, Daeun Song, Dinesh Manocha

Abstract: We present AGL-NET, a novel learning-based method for global localization using LiDAR point clouds and satellite maps. AGL-NET tackles two critical challenges: bridging the representation gap between image and points modalities for robust feature matching, and handling inherent scale discrepancies between global view and local view. To address these challenges, AGL-NET leverages a unified network… ▽ More We present AGL-NET, a novel learning-based method for global localization using LiDAR point clouds and satellite maps. AGL-NET tackles two critical challenges: bridging the representation gap between image and points modalities for robust feature matching, and handling inherent scale discrepancies between global view and local view. To address these challenges, AGL-NET leverages a unified network architecture with a novel two-stage matching design. The first stage extracts informative neural features directly from raw sensor data and performs initial feature matching. The second stage refines this matching process by extracting informative skeleton features and incorporating a novel scale alignment step to rectify scale variations between LiDAR and map data. Furthermore, a novel scale and skeleton loss function guides the network toward learning scale-invariant feature representations, eliminating the need for pre-processing satellite maps. This significantly improves real-world applicability in scenarios with unknown map scales. To facilitate rigorous performance evaluation, we introduce a meticulously designed dataset within the CARLA simulator specifically tailored for metric localization training and assessment. The code and dataset will be made publicly available. △ Less

Submitted 4 April, 2024; originally announced April 2024.

arXiv:2402.10932 [pdf]

Roadmap on Data-Centric Materials Science

Authors: Stefan Bauer, Peter Benner, Tristan Bereau, Volker Blum, Mario Boley, Christian Carbogno, C. Richard A. Catlow, Gerhard Dehm, Sebastian Eibl, Ralph Ernstorfer, Ádám Fekete, Lucas Foppa, Peter Fratzl, Christoph Freysoldt, Baptiste Gault, Luca M. Ghiringhelli, Sajal K. Giri, Anton Gladyshev, Pawan Goyal, Jason Hattrick-Simpers, Lara Kabalan, Petr Karpov, Mohammad S. Khorrami, Christoph Koch, Sebastian Kokott , et al. (36 additional authors not shown)

Abstract: Science is and always has been based on data, but the terms "data-centric" and the "4th paradigm of" materials research indicate a radical change in how information is retrieved, handled and research is performed. It signifies a transformative shift towards managing vast data collections, digital repositories, and innovative data analytics methods. The integration of Artificial Intelligence (AI) a… ▽ More Science is and always has been based on data, but the terms "data-centric" and the "4th paradigm of" materials research indicate a radical change in how information is retrieved, handled and research is performed. It signifies a transformative shift towards managing vast data collections, digital repositories, and innovative data analytics methods. The integration of Artificial Intelligence (AI) and its subset Machine Learning (ML), has become pivotal in addressing all these challenges. This Roadmap on Data-Centric Materials Science explores fundamental concepts and methodologies, illustrating diverse applications in electronic-structure theory, soft matter theory, microstructure research, and experimental techniques like photoemission, atom probe tomography, and electron microscopy. While the roadmap delves into specific areas within the broad interdisciplinary field of materials science, the provided examples elucidate key concepts applicable to a wider range of topics. The discussed instances offer insights into addressing the multifaceted challenges encountered in contemporary materials research. △ Less

Submitted 1 May, 2024; v1 submitted 1 February, 2024; originally announced February 2024.

Comments: Review, outlook, roadmap, perspective

arXiv:2402.10527 [pdf, other]

Zero-shot sampling of adversarial entities in biomedical question answering

Authors: R. Patrick Xian, Alex J. Lee, Vincent Wang, Qiming Cui, Russell Ro, Reza Abbasi-Asl

Abstract: The increasing depth of parametric domain knowledge in large language models (LLMs) is fueling their rapid deployment in real-world applications. In high-stakes and knowledge-intensive tasks, understanding model vulnerabilities is essential for quantifying the trustworthiness of model predictions and regulating their use. The recent discovery of named entities as adversarial examples in natural la… ▽ More The increasing depth of parametric domain knowledge in large language models (LLMs) is fueling their rapid deployment in real-world applications. In high-stakes and knowledge-intensive tasks, understanding model vulnerabilities is essential for quantifying the trustworthiness of model predictions and regulating their use. The recent discovery of named entities as adversarial examples in natural language processing tasks raises questions about their potential guises in other settings. Here, we propose a powerscaled distance-weighted sampling scheme in embedding space to discover diverse adversarial entities as distractors. We demonstrate its advantage over random sampling in adversarial question answering on biomedical topics. Our approach enables the exploration of different regions on the attack surface, which reveals two regimes of adversarial entities that markedly differ in their characteristics. Moreover, we show that the attacks successfully manipulate token-wise Shapley value explanations, which become deceptive in the adversarial setting. Our investigations illustrate the brittleness of domain knowledge in LLMs and reveal a shortcoming of standard evaluations for high-capacity models. △ Less

Submitted 16 February, 2024; originally announced February 2024.

Comments: 20 pages incl. appendix, under review

arXiv:2402.10340 [pdf, other]

Highlighting the Safety Concerns of Deploying LLMs/VLMs in Robotics

Authors: Xiyang Wu, Souradip Chakraborty, Ruiqi Xian, Jing Liang, Tianrui Guan, Fuxiao Liu, Brian M. Sadler, Dinesh Manocha, Amrit Singh Bedi

Abstract: In this paper, we highlight the critical issues of robustness and safety associated with integrating large language models (LLMs) and vision-language models (VLMs) into robotics applications. Recent works focus on using LLMs and VLMs to improve the performance of robotics tasks, such as manipulation and navigation. Despite these improvements, analyzing the safety of such systems remains underexplo… ▽ More In this paper, we highlight the critical issues of robustness and safety associated with integrating large language models (LLMs) and vision-language models (VLMs) into robotics applications. Recent works focus on using LLMs and VLMs to improve the performance of robotics tasks, such as manipulation and navigation. Despite these improvements, analyzing the safety of such systems remains underexplored yet extremely critical. LLMs and VLMs are highly susceptible to adversarial inputs, prompting a significant inquiry into the safety of robotic systems. This concern is important because robotics operate in the physical world where erroneous actions can result in severe consequences. This paper explores this issue thoroughly, presenting a mathematical formulation of potential attacks on LLM/VLM-based robotic systems and offering experimental evidence of the safety challenges. Our empirical findings highlight a significant vulnerability: simple modifications to the input can drastically reduce system effectiveness. Specifically, our results demonstrate an average performance deterioration of 19.4% under minor input prompt modifications and a more alarming 29.1% under slight perceptual changes. These findings underscore the urgent need for robust countermeasures to ensure the safe and reliable deployment of advanced LLM/VLM-based robotic systems. △ Less

Submitted 16 June, 2024; v1 submitted 15 February, 2024; originally announced February 2024.

arXiv:2310.14566 [pdf, other]

HallusionBench: An Advanced Diagnostic Suite for Entangled Language Hallucination and Visual Illusion in Large Vision-Language Models

Authors: Tianrui Guan, Fuxiao Liu, Xiyang Wu, Ruiqi Xian, Zongxia Li, Xiaoyu Liu, Xijun Wang, Lichang Chen, Furong Huang, Yaser Yacoob, Dinesh Manocha, Tianyi Zhou

Abstract: We introduce HallusionBench, a comprehensive benchmark designed for the evaluation of image-context reasoning. This benchmark presents significant challenges to advanced large visual-language models (LVLMs), such as GPT-4V(Vision), Gemini Pro Vision, Claude 3, and LLaVA-1.5, by emphasizing nuanced understanding and interpretation of visual data. The benchmark comprises 346 images paired with 1129… ▽ More We introduce HallusionBench, a comprehensive benchmark designed for the evaluation of image-context reasoning. This benchmark presents significant challenges to advanced large visual-language models (LVLMs), such as GPT-4V(Vision), Gemini Pro Vision, Claude 3, and LLaVA-1.5, by emphasizing nuanced understanding and interpretation of visual data. The benchmark comprises 346 images paired with 1129 questions, all meticulously crafted by human experts. We introduce a novel structure for these visual questions designed to establish control groups. This structure enables us to conduct a quantitative analysis of the models' response tendencies, logical consistency, and various failure modes. In our evaluation on HallusionBench, we benchmarked 15 different models, highlighting a 31.42% question-pair accuracy achieved by the state-of-the-art GPT-4V. Notably, all other evaluated models achieve accuracy below 16%. Moreover, our analysis not only highlights the observed failure modes, including language hallucination and visual illusion, but also deepens an understanding of these pitfalls. Our comprehensive case studies within HallusionBench shed light on the challenges of hallucination and illusion in LVLMs. Based on these insights, we suggest potential pathways for their future improvement. The benchmark and codebase can be accessed at https://github.com/tianyi-lab/HallusionBench. △ Less

Submitted 25 March, 2024; v1 submitted 23 October, 2023; originally announced October 2023.

Comments: Accepted to CVPR 2024

arXiv:2308.13985 [pdf, other]

Revisiting Scalarization in Multi-Task Learning: A Theoretical Perspective

Authors: Yuzheng Hu, Ruicheng Xian, Qilong Wu, Qiuling Fan, Lang Yin, Han Zhao

Abstract: Linear scalarization, i.e., combining all loss functions by a weighted sum, has been the default choice in the literature of multi-task learning (MTL) since its inception. In recent years, there is a surge of interest in developing Specialized Multi-Task Optimizers (SMTOs) that treat MTL as a multi-objective optimization problem. However, it remains open whether there is a fundamental advantage of… ▽ More Linear scalarization, i.e., combining all loss functions by a weighted sum, has been the default choice in the literature of multi-task learning (MTL) since its inception. In recent years, there is a surge of interest in developing Specialized Multi-Task Optimizers (SMTOs) that treat MTL as a multi-objective optimization problem. However, it remains open whether there is a fundamental advantage of SMTOs over scalarization. In fact, heated debates exist in the community comparing these two types of algorithms, mostly from an empirical perspective. To approach the above question, in this paper, we revisit scalarization from a theoretical perspective. We focus on linear MTL models and study whether scalarization is capable of fully exploring the Pareto front. Our findings reveal that, in contrast to recent works that claimed empirical advantages of scalarization, scalarization is inherently incapable of full exploration, especially for those Pareto optimal solutions that strike the balanced trade-offs between multiple tasks. More concretely, when the model is under-parametrized, we reveal a multi-surface structure of the feasible region and identify necessary and sufficient conditions for full exploration. This leads to the conclusion that scalarization is in general incapable of tracing out the Pareto front. Our theoretical results partially answer the open questions in Xin et al. (2021), and provide a more intuitive explanation on why scalarization fails beyond non-convexity. We additionally perform experiments on a real-world dataset using both scalarization and state-of-the-art SMTOs. The experimental results not only corroborate our theoretical findings, but also unveil the potential of SMTOs in finding balanced solutions, which cannot be achieved by scalarization. △ Less

Submitted 22 September, 2023; v1 submitted 26 August, 2023; originally announced August 2023.

Comments: Accepted at NeurIPS 2023

arXiv:2306.12272 [pdf, other]

From structure mining to unsupervised exploration of atomic octahedral networks

Authors: R. Patrick Xian, Ryan J. Morelock, Ido Hadar, Charles B. Musgrave, Christopher Sutton

Abstract: Networks of atom-centered coordination octahedra commonly occur in inorganic and hybrid solid-state materials. Characterizing their spatial arrangements and characteristics is crucial for relating structures to properties for many materials families. The traditional method using case-by-case inspection becomes prohibitive for discovering trends and similarities in large datasets. Here, we operatio… ▽ More Networks of atom-centered coordination octahedra commonly occur in inorganic and hybrid solid-state materials. Characterizing their spatial arrangements and characteristics is crucial for relating structures to properties for many materials families. The traditional method using case-by-case inspection becomes prohibitive for discovering trends and similarities in large datasets. Here, we operationalize chemical intuition to automate the geometric parsing, quantification, and classification of coordination octahedral networks. We find axis-resolved tilting trends in ABO$_{3}$ perovskite polymorphs, which assist in detecting oxidation state changes. Moreover, we develop a scale-invariant encoding scheme to represent these networks, which, combined with human-assisted unsupervised machine learning, allows us to taxonomize the inorganic framework polytypes in hybrid iodoplumbates (A$_x$Pb$_y$I$_z$). Consequently, we uncover a violation of Pauling's third rule and the design principles underpinning their topological diversity. Our results offer a glimpse into the vast design space of atomic octahedral networks and inform high-throughput, targeted screening of specific structure types. △ Less

Submitted 21 June, 2023; originally announced June 2023.

Comments: 56 pages

arXiv:2305.12437 [pdf, other]

SCP: Soft Conditional Prompt Learning for Aerial Video Action Recognition

Authors: Xijun Wang, Ruiqi Xian, Tianrui Guan, Fuxiao Liu, Dinesh Manocha

Abstract: We present a new learning approach, Soft Conditional Prompt Learning (SCP), which leverages the strengths of prompt learning for aerial video action recognition. Our approach is designed to predict the action of each agent by helping the models focus on the descriptions or instructions associated with actions in the input videos for aerial/robot visual perception. Our formulation supports various… ▽ More We present a new learning approach, Soft Conditional Prompt Learning (SCP), which leverages the strengths of prompt learning for aerial video action recognition. Our approach is designed to predict the action of each agent by helping the models focus on the descriptions or instructions associated with actions in the input videos for aerial/robot visual perception. Our formulation supports various prompts, including learnable prompts, auxiliary visual information, and large vision models to improve the recognition performance. We present a soft conditional prompt method that learns to dynamically generate prompts from a pool of prompt experts under different video inputs. By sharing the same objective with the task, our proposed SCP can optimize prompts that guide the model's predictions while explicitly learning input-invariant (prompt experts pool) and input-specific (data-dependent) prompt knowledge. In practice, we observe a 3.17-10.2% accuracy improvement on the aerial video datasets (Okutama, NECDrone), which consist of scenes with single-agent and multi-agent actions. We further evaluate our approach on ground camera videos to verify the effectiveness and generalization and achieve a 1.0-3.6% improvement on dataset SSV2. We integrate our method into the ROS2 as well. △ Less

Submitted 28 August, 2024; v1 submitted 21 May, 2023; originally announced May 2023.

Comments: IROS2024

arXiv:2304.06866 [pdf, other]

PMI Sampler: Patch Similarity Guided Frame Selection for Aerial Action Recognition

Authors: Ruiqi Xian, Xijun Wang, Divya Kothandaraman, Dinesh Manocha

Abstract: We present a new algorithm for selection of informative frames in video action recognition. Our approach is designed for aerial videos captured using a moving camera where human actors occupy a small spatial resolution of video frames. Our algorithm utilizes the motion bias within aerial videos, which enables the selection of motion-salient frames. We introduce the concept of patch mutual informat… ▽ More We present a new algorithm for selection of informative frames in video action recognition. Our approach is designed for aerial videos captured using a moving camera where human actors occupy a small spatial resolution of video frames. Our algorithm utilizes the motion bias within aerial videos, which enables the selection of motion-salient frames. We introduce the concept of patch mutual information (PMI) score to quantify the motion bias between adjacent frames, by measuring the similarity of patches. We use this score to assess the amount of discriminative motion information contained in one frame relative to another. We present an adaptive frame selection strategy using shifted leaky ReLu and cumulative distribution function, which ensures that the sampled frames comprehensively cover all the essential segments with high motion salience. Our approach can be integrated with any action recognition model to enhance its accuracy. In practice, our method achieves a relative improvement of 2.2 - 13.8% in top-1 accuracy on UAV-Human, 6.8% on NEC Drone, and 9.0% on Diving48 datasets. △ Less

Submitted 15 November, 2023; v1 submitted 13 April, 2023; originally announced April 2023.

arXiv:2303.02575 [pdf, other]

MITFAS: Mutual Information based Temporal Feature Alignment and Sampling for Aerial Video Action Recognition

Authors: Ruiqi Xian, Xijun Wang, Dinesh Manocha

Abstract: We present a novel approach for action recognition in UAV videos. Our formulation is designed to handle occlusion and viewpoint changes caused by the movement of a UAV. We use the concept of mutual information to compute and align the regions corresponding to human action or motion in the temporal domain. This enables our recognition model to learn from the key features associated with the motion.… ▽ More We present a novel approach for action recognition in UAV videos. Our formulation is designed to handle occlusion and viewpoint changes caused by the movement of a UAV. We use the concept of mutual information to compute and align the regions corresponding to human action or motion in the temporal domain. This enables our recognition model to learn from the key features associated with the motion. We also propose a novel frame sampling method that uses joint mutual information to acquire the most informative frame sequence in UAV videos. We have integrated our approach with X3D and evaluated the performance on multiple datasets. In practice, we achieve 18.9% improvement in Top-1 accuracy over current state-of-the-art methods on UAV-Human(Li et al., 2021), 7.3% improvement on Drone-Action(Perera et al., 2019), and 7.16% improvement on NEC Drones(Choi et al., 2020). △ Less

Submitted 15 November, 2023; v1 submitted 4 March, 2023; originally announced March 2023.

arXiv:2303.01589 [pdf, other]

doi 10.1109/ICRA48891.2023.10160564

AZTR: Aerial Video Action Recognition with Auto Zoom and Temporal Reasoning

Authors: Xijun Wang, Ruiqi Xian, Tianrui Guan, Celso M. de Melo, Stephen M. Nogar, Aniket Bera, Dinesh Manocha

Abstract: We propose a novel approach for aerial video action recognition. Our method is designed for videos captured using UAVs and can run on edge or mobile devices. We present a learning-based approach that uses customized auto zoom to automatically identify the human target and scale it appropriately. This makes it easier to extract the key features and reduces the computational overhead. We also presen… ▽ More We propose a novel approach for aerial video action recognition. Our method is designed for videos captured using UAVs and can run on edge or mobile devices. We present a learning-based approach that uses customized auto zoom to automatically identify the human target and scale it appropriately. This makes it easier to extract the key features and reduces the computational overhead. We also present an efficient temporal reasoning algorithm to capture the action information along the spatial and temporal domains within a controllable computational cost. Our approach has been implemented and evaluated both on the desktop with high-end GPUs and on the low power Robotics RB5 Platform for robots and drones. In practice, we achieve 6.1-7.4% improvement over SOTA in Top-1 accuracy on the RoCoG-v2 dataset, 8.3-10.4% improvement on the UAV-Human dataset and 3.2% improvement on the Drone Action dataset. △ Less

Submitted 2 March, 2023; originally announced March 2023.

Comments: Accepted for publication at ICRA 2023

arXiv:2212.10764 [pdf, other]

Learning List-Level Domain-Invariant Representations for Ranking

Authors: Ruicheng Xian, Honglei Zhuang, Zhen Qin, Hamed Zamani, Jing Lu, Ji Ma, Kai Hui, Han Zhao, Xuanhui Wang, Michael Bendersky

Abstract: Domain adaptation aims to transfer the knowledge learned on (data-rich) source domains to (low-resource) target domains, and a popular method is invariant representation learning, which matches and aligns the data distributions on the feature space. Although this method is studied extensively and applied on classification and regression problems, its adoption on ranking problems is sporadic, and t… ▽ More Domain adaptation aims to transfer the knowledge learned on (data-rich) source domains to (low-resource) target domains, and a popular method is invariant representation learning, which matches and aligns the data distributions on the feature space. Although this method is studied extensively and applied on classification and regression problems, its adoption on ranking problems is sporadic, and the few existing implementations lack theoretical justifications. This paper revisits invariant representation learning for ranking. Upon reviewing prior work, we found that they implement what we call item-level alignment, which aligns the distributions of the items being ranked from all lists in aggregate but ignores their list structure. However, the list structure should be leveraged, because it is intrinsic to ranking problems where the data and the metrics are defined and computed on lists, not the items by themselves. To close this discrepancy, we propose list-level alignment -- learning domain-invariant representations at the higher level of lists. The benefits are twofold: it leads to the first domain adaptation generalization bound for ranking, in turn providing theoretical support for the proposed method, and it achieves better empirical transfer performance for unsupervised domain adaptation on ranking tasks, including passage reranking. △ Less

Submitted 31 October, 2023; v1 submitted 20 December, 2022; originally announced December 2022.

Comments: NeurIPS 2023. Comparison to v1: revised presentation and proof of Corollary 4.9

arXiv:2211.07574

An Introduction to PM2.5s, their Importance, and a Cluster Methodology to Analyze their Meteorological Dynamics

Authors: Rickie Xian, Dylan Jones

Abstract: The influence of human activity own the earth's atmospheric composition has never been more pronounced. Anthropogenic pollution is in fact the largest effector of the observed evolving atmospheric composition (Wallace, 2006). PM2.5 is a class of particulate matter pollutants of notable interest due to their significant driving of chemical, atmospheric change, their wide-scale, global circulations,… ▽ More The influence of human activity own the earth's atmospheric composition has never been more pronounced. Anthropogenic pollution is in fact the largest effector of the observed evolving atmospheric composition (Wallace, 2006). PM2.5 is a class of particulate matter pollutants of notable interest due to their significant driving of chemical, atmospheric change, their wide-scale, global circulations, and their malignant effects on human health; with a diameter of less than 2.5 microns; PM2.5s derive from combustion of organic materials, including fossil fuel combustion (Wallace, 2006) and forest fires (Newman, 2007). The gases released in these combustion reactions then condense in the atmosphere, undergoing gas to particle conversion, resulting in the atmospheric presence of PM2.5s. Particulate matter (PM) pollutants are harmful to human health in all diameter scales; increasing in recent years global morbidity and mortality (Araujo, 2011). The health risks of PM2.5 in particular are troubling due to their small size, which facilities their permeability in the respiratory system and ready diffusion into the bloodstream, inducing pathologies like ischaemic heart disease, respiratory infections, and lung cancers to name a few (Araujo, 2011). Once PM2.5 manifest in the atmosphere, they circulate on a larger scale due to atmospheric circulation patterns. Though government-enacted air quality measures have reduced the average PM2.5 levels in North America, pollution episodes still cause localized, acute PM2.5 exposure. The purpose of this project was to analyze PM2.5 mean concentration across America to identify and quantify any pollution episodes, as well as try to explain their dynamics using large scale, meteorological processes. △ Less

Submitted 16 December, 2022; v1 submitted 11 November, 2022; originally announced November 2022.

Comments: Withdrawn because it was submitted without consent of the co-author

MSC Class: I.2.7 ACM Class: A.1.1

arXiv:2211.06020

Current Topics, Methods, and Challenges in the Modelling of Intrinsically Disordered Protein Dynamics

Authors: Rickie Xian, Sarah Rauscher

Abstract: The paradigm that the primary amino acid sequence prescribes structure and thus function has for a long time been central to the understanding of protein science. Though the theory is supported by the behaviour of most structured proteins, it loses much of its applicability when discussing intrinsically disordered proteins (IDPs). These peculiar proteins, whose tertiary structure constantly interc… ▽ More The paradigm that the primary amino acid sequence prescribes structure and thus function has for a long time been central to the understanding of protein science. Though the theory is supported by the behaviour of most structured proteins, it loses much of its applicability when discussing intrinsically disordered proteins (IDPs). These peculiar proteins, whose tertiary structure constantly interconverts between a series of energetically favourable conformations, are the root of many current, pressing scientific mechanisms. Many biological processes that are still yet to be elucidated--the mechanisms of of protein folding, ligand binding, and general protein dynamics--involve IDPs. Because most dynamic protein events are on such short time scales, using experimental methods to observe their action often times doesn't yield useful data. As well, the data resulting from scientific techniques developed for structured, "static proteins" must be presented in conjunction with data from methods tailored specifically to IDPs in order to have significance. A method that models IDPs with shocking accuracy is computer simulation, particularly Molecular Dynamics (MD) simulations. With computational power only recently increasing enough to encompass the timescale needed for protein dynamics, MD simulations are still fairly novel in their implementations. This paper will discuss and consolidate the current methods, problems, and solutions of using MD simulation to model IDPs. Which simulation parameters can be altered to more precisely describe observed biological behaviour? How can one accurately use MD simulation to answer questions that, when using experimental methods, have no answer? How can the data resulting from MD simulation be analyzed and quantified to support the conclusions being drawn? △ Less

Submitted 16 December, 2022; v1 submitted 11 November, 2022; originally announced November 2022.

Comments: Withdrawn because it was submitted without consent of the co-author

MSC Class: I.2.7 ACM Class: A.1.1

arXiv:2211.04615 [pdf, other]

Observation of multi-directional energy transfer in a hybrid plasmonic-excitonic nanostructure

Authors: Tommaso Pincelli, Thomas Vasileiadis, Shuo Dong, Samuel Beaulieu, Maciej Dendzik, Daniela Zahn, Sang-Eun Lee, Hélène Seiler, Yinpeng Qi, R. Patrick Xian, Julian Maklar, Emerson Coy, Niclas S. Müller, Yu Okamura, Stephanie Reich, Martin Wolf, Laurenz Rettig, Ralph Ernstorfer

Abstract: Hybrid plasmonic devices involve a nanostructured metal supporting localized surface plasmons to amplify light-matter interaction, and a non-plasmonic material to functionalize charge excitations. Application-relevant epitaxial heterostructures, however, give rise to ballistic ultrafast dynamics that challenge the conventional semiclassical understanding of unidirectional nanometal-to-substrate en… ▽ More Hybrid plasmonic devices involve a nanostructured metal supporting localized surface plasmons to amplify light-matter interaction, and a non-plasmonic material to functionalize charge excitations. Application-relevant epitaxial heterostructures, however, give rise to ballistic ultrafast dynamics that challenge the conventional semiclassical understanding of unidirectional nanometal-to-substrate energy transfer. We study epitaxial Au nanoislands on WSe$_2$ with time- and angle-resolved photoemission spectroscopy and femtosecond electron diffraction: this combination of techniques resolves material, energy and momentum of charge-carriers and phonons excited in the heterostructure. We observe a strong non-linear plasmon-exciton interaction that transfers the energy of sub-bandgap photons very efficiently to the semiconductor, leaving the metal cold until non-radiative exciton recombination heats the nanoparticles on hundreds of femtoseconds timescales. Our results resolve a multi-directional energy exchange on timescales shorter than the electronic thermalization of the nanometal. Electron-phonon coupling and diffusive charge-transfer determine the subsequent energy flow. This complex dynamics opens perspectives for optoelectronic and photocatalytic applications, while providing a constraining experimental testbed for state-of-the-art modelling. △ Less

Submitted 29 November, 2022; v1 submitted 8 November, 2022; originally announced November 2022.

arXiv:2211.04514 [pdf]

The Concurrent Use of Medical Imaging Modalities and Innovative Treatments to Combat Retinitis Pigmentosa

Authors: Rickie Xian

Abstract: Retinitis pigmentosa (RP), one of the leading causes of vision loss and blindness globally, is a progressive retinal disease involving the degradation of photoreceptors (7) and/or retinal pigment epithelial cells (14). Affecting approximately 1 in 4000 people, RP is caused by a series of genetic mutations; each specific mutation presents a specific pathological pattern in the patient, with the sam… ▽ More Retinitis pigmentosa (RP), one of the leading causes of vision loss and blindness globally, is a progressive retinal disease involving the degradation of photoreceptors (7) and/or retinal pigment epithelial cells (14). Affecting approximately 1 in 4000 people, RP is caused by a series of genetic mutations; each specific mutation presents a specific pathological pattern in the patient, with the same mutation even presenting in different phenotypes in different patients (14). RP generally starts with peripheral vision loss, attacking the rods first, causing nyctalopia or night blindness (22). In later stages of the disease, the cones start to atrophy, further narrowing the field of vision and obscuring central vision (22). Luckily, with recent advances in medical imaging techniques and novel therapeutic treatments, both early detection and the overall prognosis of RP in patients have improved dramatically in the past few decades. This review will trace RP's physiological causes, how it affects retinal and ocular physiology, the techniques through which we can diagnose and image it, and the various treatments developed to try to combat it. The medical imaging techniques to be discussed include but are not limited to adaptive optics (AO), OCT including SD-OCT and OCTA, fundus autofluorescence (FAF) and its associated fluorescence lifetime imaging ophthalmoscopy (FLIO), colour Doppler flow imaging (CDFI), microperimetry, and MRI. The treatments to be discussed include stem cell therapy, gene therapy, cell transplantation, pharmacological therapy, and artificial retinal implants. Throughout this review, it will be made evident of not just the severity and diversity through which RP can present, but also the advanced made in medical imaging and innovative treatments designed to combat this pathology. △ Less

Submitted 8 November, 2022; originally announced November 2022.

Comments: 39 pages, 23 figures

MSC Class: I.2.7 ACM Class: A.1.1

arXiv:2211.01528 [pdf, other]

Fair and Optimal Classification via Post-Processing

Authors: Ruicheng Xian, Lang Yin, Han Zhao

Abstract: To mitigate the bias exhibited by machine learning models, fairness criteria can be integrated into the training process to ensure fair treatment across all demographics, but it often comes at the expense of model performance. Understanding such tradeoffs, therefore, underlies the design of fair algorithms. To this end, this paper provides a complete characterization of the inherent tradeoff of de… ▽ More To mitigate the bias exhibited by machine learning models, fairness criteria can be integrated into the training process to ensure fair treatment across all demographics, but it often comes at the expense of model performance. Understanding such tradeoffs, therefore, underlies the design of fair algorithms. To this end, this paper provides a complete characterization of the inherent tradeoff of demographic parity on classification problems, under the most general multi-group, multi-class, and noisy setting. Specifically, we show that the minimum error rate achievable by randomized and attribute-aware fair classifiers is given by the optimal value of a Wasserstein-barycenter problem. On the practical side, our findings lead to a simple post-processing algorithm that derives fair classifiers from score functions, which yields the optimal fair classifier when the score is Bayes optimal. We provide suboptimality analysis and sample complexity for our algorithm, and demonstrate its effectiveness on benchmark datasets. △ Less

Submitted 5 June, 2023; v1 submitted 2 November, 2022; originally announced November 2022.

Comments: ICML 2023. Code is at https://github.com/rxian/fair-classification. Comparison to v2: corrected proof of Theorem 4.4

arXiv:2210.04557 [pdf]

A Comparative Study of Disordered and Ordered Protein Folding Dynamics Using Computational Simulation

Authors: Rickie Xian

Abstract: Folding protein dynamics has been an area of high interest for quite some time, especially given the increased focus on the field of Biophysics. Because folding dynamics occur on such short time scales, empirical techniques developed for more "static" protein events, such as X-ray crystallography, nuclear magnetic resonance, and green fluorescent protein (GFP) labelling, aren't as applicable. Inst… ▽ More Folding protein dynamics has been an area of high interest for quite some time, especially given the increased focus on the field of Biophysics. Because folding dynamics occur on such short time scales, empirical techniques developed for more "static" protein events, such as X-ray crystallography, nuclear magnetic resonance, and green fluorescent protein (GFP) labelling, aren't as applicable. Instead, computational methods must often be used to simulate these short lived yet highly dynamic events. One such computational method that is proven to provide much valuable insight into protein folding dynamics is Molecular Dynamics Simulation (MD Simulation). This simulation method is both highly computationally demanding, yet highly accurate in its modelling of a proteins physical behaviour. Besides MD Simulation, simulations in general are quite applicable in the context of these protein events. For example, the simple Gillespie algorithm, a computational technique which can be executed on almost any personal computer, provides quite the robust view into protein dynamics given its computational simplicity. This paper will compare the results of two simulations, an MD simulation of a disordered, six-residue, carcinogenic protein fragment, and a Gillespie algorithm based simulation of an ordered folding protein: the mathematically identical nature of the Gillespie algorithm time series of the asymptotically stochastic hyperbolic tangent dynamics for the wild type predicting the exact behaviour of the carcinogenic protein system time series will show the computational power simulations provide for analyzing both disordered and ordered protein systems. △ Less

Submitted 10 October, 2022; originally announced October 2022.

Comments: 13 pages, draft 1, 8 figures

arXiv:2204.06824 [pdf, other]

doi 10.1038/s41586-023-05814-1

Orbital-resolved Observation of Singlet Fission

Authors: Alexander Neef, Samuel Beaulieu, Sebastian Hammer, Shuo Dong, Julian Maklar, Tommaso Pincelli, R. Patrick Xian, Martin Wolf, Laurenz Rettig, Jens Pflaum, Ralph Ernstorfer

Abstract: Singlet fission may boost photovoltaic efficiency [by transforming a singlet exciton into two triplet excitons and thereby doubling the number of excited charge carriers. The primary step of singlet fission is the ultrafast creation of the correlated triplet pair. While several mechanisms have been proposed to explain this step, none has emerged as a consensus. The challenge lies in tracking the t… ▽ More Singlet fission may boost photovoltaic efficiency [by transforming a singlet exciton into two triplet excitons and thereby doubling the number of excited charge carriers. The primary step of singlet fission is the ultrafast creation of the correlated triplet pair. While several mechanisms have been proposed to explain this step, none has emerged as a consensus. The challenge lies in tracking the transient excitonic states. Here we use time- and angle-resolved photoemission spectroscopy to observe the primary step of singlet fission in crystalline pentacene. Our results suggest a charge-transfer mediated mechanism with a hybridization of Frenkel and charge-transfer states in the lowest bright singlet exciton. We gained intimate knowledge about the localization and the orbital character of the exciton wave functions recorded in momentum maps. This allowed us to directly compare the localization of singlet and bitriplet excitons and decompose energetically overlapping states based on their orbital character. Orbital- and localization- resolved many-body dynamics promise deep insights into the mechanics governing molecular systems and topological materials. △ Less

Submitted 28 February, 2023; v1 submitted 14 April, 2022; originally announced April 2022.

Comments: 24 pages, 4 main figures, 9 supplementary figures

arXiv:2108.07099 [pdf, other]

doi 10.1103/PhysRevB.105.075417

Excited-state band structure mapping

Authors: M. Puppin, C. W. Nicholson, C. Monney, Y. Deng, R. P. Xian, J. Feldl, S. Dong, A. Dominguez, H. Hübener, A. Rubio, M. Wolf, L. Rettig, R. Ernstorfer

Abstract: Angle-resolved photoelectron spectroscopy is an extremely powerful probe of materials to access the occupied electronic structure with energy and momentum resolution. However, it remains blind to those dynamic states above the Fermi level that determine technologically relevant transport properties. In this work, we extend band structure mapping into the unoccupied states and across the entire Bri… ▽ More Angle-resolved photoelectron spectroscopy is an extremely powerful probe of materials to access the occupied electronic structure with energy and momentum resolution. However, it remains blind to those dynamic states above the Fermi level that determine technologically relevant transport properties. In this work, we extend band structure mapping into the unoccupied states and across the entire Brillouin zone by using a state-of-the-art high repetition rate, extreme ultraviolet fem- tosecond light source to probe optically excited samples. The wide-ranging applicability and power of this approach are demonstrated by measurements on the 2D semiconductor WSe2, where the energy-momentum dispersion of valence and conduction bands are observed in a single experiment. This provides a direct momentum-resolved view not only on the complete out-of-equilibrium band gap but also on its renormalization induced by electron-hole interaction and screening. Our work establishes a new benchmark for measuring the band structure of materials, with direct access to the energy-momentum dispersion of the excited-state spectral function. △ Less

Submitted 16 August, 2021; originally announced August 2021.

arXiv:2108.06803 [pdf, other]

Observation of ultrafast interfacial Meitner-Auger energy transfer in a van der Waals heterostructure

Authors: Shuo Dong, Samuel Beaulieu, Malte Selig, Philipp Rosenzweig, Dominik Christiansen, Tommaso Pincelli, Maciej Dendzik, Jonas D. Ziegler, Julian Maklar, R. Patrick Xian, Alexander Neef, Avaise Mohammed, Armin Schulz, Mona Stadler, Michael Jetter, Peter Michler, Takashi Taniguchi, Kenji Watanabe, Hidenori Takagi, Ulrich Starke, Alexey Chernikov, Martin Wolf, Hiro Nakamura, Andreas Knorr, Laurenz Rettig , et al. (1 additional authors not shown)

Abstract: Atomically thin layered van der Waals heterostructures feature exotic and emergent optoelectronic properties. With growing interest in these novel quantum materials, the microscopic understanding of fundamental interfacial coupling mechanisms is of capital importance. Here, using multidimensional photoemission spectroscopy, we provide a layer- and momentum-resolved view on ultrafast interlayer ele… ▽ More Atomically thin layered van der Waals heterostructures feature exotic and emergent optoelectronic properties. With growing interest in these novel quantum materials, the microscopic understanding of fundamental interfacial coupling mechanisms is of capital importance. Here, using multidimensional photoemission spectroscopy, we provide a layer- and momentum-resolved view on ultrafast interlayer electron and energy transfer in a monolayer-WSe$_2$/graphene heterostructure. Depending on the nature of the optically prepared state, we find the different dominating transfer mechanisms: while electron injection from graphene to WSe$_2$ is observed after photoexcitation of quasi-free hot carriers in the graphene layer, we establish an interfacial Meitner-Auger energy transfer process following the excitation of excitons in WSe$_2$. By analysing the time-energy-momentum distributions of excited-state carriers with a rate-equation model, we distinguish these two types of interfacial dynamics and identify the ultrafast conversion of excitons in WSe$_2$ to valence band transitions in graphene. Microscopic calculations find interfacial dipole-monopole coupling underlying the Meitner-Auger energy transfer to dominate over conventional Förster- and Dexter-type interactions, in agreement with the experimental observations. The energy transfer mechanism revealed here might enable new hot-carrier-based device concepts with van der Waals heterostructures. △ Less

Submitted 29 May, 2022; v1 submitted 15 August, 2021; originally announced August 2021.

Comments: 28 pages, 4 figures

arXiv:2102.05604 [pdf, other]

Scalable multicomponent spectral analysis for high-throughput data annotation

Authors: Rui Patrick Xian, Ralph Ernstorfer, Philipp Michael Pelz

Abstract: Orchestrating parametric fitting of multicomponent spectra at scale is an essential yet underappreciated task in high-throughput quantification of materials and chemical composition. To automate the annotation process for spectroscopic and diffraction data collected in counts of hundreds to thousands, we present a systematic approach compatible with high-performance computing infrastructures using… ▽ More Orchestrating parametric fitting of multicomponent spectra at scale is an essential yet underappreciated task in high-throughput quantification of materials and chemical composition. To automate the annotation process for spectroscopic and diffraction data collected in counts of hundreds to thousands, we present a systematic approach compatible with high-performance computing infrastructures using the MapReduce model and task-based parallelization. We implement the approach in software and demonstrate linear computational scaling with respect to spectral components using multidimensional experimental materials characterization datasets from photoemission spectroscopy and powder electron diffraction as benchmarks. Our approach enables efficient generation of high-quality data annotation and online spectral analysis and is applicable to a variety of analytical techniques in materials science and chemistry as a building block for closed-loop experimental systems. △ Less

Submitted 29 March, 2021; v1 submitted 10 February, 2021; originally announced February 2021.

arXiv:2102.03024 [pdf]

doi 10.1063/5.0049111

Machine Learning on Neutron and X-Ray Scattering

Authors: Zhantao Chen, Nina Andrejevic, Nathan Drucker, Thanh Nguyen, R Patrick Xian, Tess Smidt, Yao Wang, Ralph Ernstorfer, Alan Tennant, Maria Chan, Mingda Li

Abstract: Neutron and X-ray scattering represent two state-of-the-art materials characterization techniques that measure materials' structural and dynamical properties with high precision. These techniques play critical roles in understanding a wide variety of materials systems, from catalysis to polymers, nanomaterials to macromolecules, and energy materials to quantum materials. In recent years, neutron a… ▽ More Neutron and X-ray scattering represent two state-of-the-art materials characterization techniques that measure materials' structural and dynamical properties with high precision. These techniques play critical roles in understanding a wide variety of materials systems, from catalysis to polymers, nanomaterials to macromolecules, and energy materials to quantum materials. In recent years, neutron and X-ray scattering have received a significant boost due to the development and increased application of machine learning to materials problems. This article reviews the recent progress in applying machine learning techniques to augment various neutron and X-ray scattering techniques. We highlight the integration of machine learning methods into the typical workflow of scattering experiments. We focus on scattering problems that faced challenge with traditional methods but addressable using machine learning, such as leveraging the knowledge of simple materials to model more complicated systems, learning with limited data or incomplete labels, identifying meaningful spectra and materials' representations for learning tasks, mitigating spectral noise, and many others. We present an outlook on a few emerging roles machine learning may play in broad types of scattering and spectroscopic problems in the foreseeable future. △ Less

Submitted 5 February, 2021; originally announced February 2021.

Comments: 56 pages, 12 figures. Feedback most welcome

Journal ref: Chem. Phys. Rev. 2, 031301 (2021)

arXiv:2012.15328 [pdf, other]

doi 10.1002/ntls.10010

Direct measurement of key exciton properties: energy, dynamics and spatial distribution of the wave function

Authors: Shuo Dong, Michele Puppin, Tommaso Pincelli, Samuel Beaulieu, Dominik Christiansen, Hannes Hubener, Christopher W. Nicholson, R. Patrick Xian, Maciej Dendzik, Yunpei Deng, Yoav William Windsor, Malte Selig, Ermin Malic, Angel Rubio, Andreas Knorr, Martin Wolf, Laurenz Rettig, Ralph Ernstorfer

Abstract: Excitons, Coulomb-bound electron-hole pairs, are the fundamental excitations governing the optoelectronic properties of semiconductors. While optical signatures of excitons have been studied extensively, experimental access to the excitonic wave function itself has been elusive. Using multidimensional photoemission spectroscopy, we present a momentum-, energy- and time-resolved perspective on exci… ▽ More Excitons, Coulomb-bound electron-hole pairs, are the fundamental excitations governing the optoelectronic properties of semiconductors. While optical signatures of excitons have been studied extensively, experimental access to the excitonic wave function itself has been elusive. Using multidimensional photoemission spectroscopy, we present a momentum-, energy- and time-resolved perspective on excitons in the layered semiconductor WSe$_2$. By tuning the excitation wavelength, we determine the energy-momentum signature of bright exciton formation and its difference from conventional single-particle excited states. The multidimensional data allows to retrieve fundamental exciton properties like the binding energy and the exciton-lattice coupling and to reconstruct the real-space excitonic distribution function via Fourier transform. All quantities are in excellent agreement with microscopic calculations. Our approach provides a full characterization of the exciton properties and is applicable to bright and dark excitons in semiconducting materials, heterostructures and devices. △ Less

Submitted 4 May, 2021; v1 submitted 30 December, 2020; originally announced December 2020.

Report number: e10010

Journal ref: Natural Sciences (2021)

arXiv:2008.05829 [pdf, other]

doi 10.1063/5.0024493

A quantitative comparison of time-of-flight momentum microscopes and hemispherical analyzers for time- and angle-resolved photoemission spectroscopy experiments

Authors: J. Maklar, S. Dong, S. Beaulieu, T. Pincelli, M. Dendzik, Y. W. Windsor, R. P. Xian, M. Wolf, R. Ernstorfer, L. Rettig

Abstract: Time-of-flight-based momentum microscopy has a growing presence in photoemission studies, as it enables parallel energy- and momentum-resolved acquisition of the full photoelectron distribution. Here, we report table-top extreme ultraviolet (XUV) time- and angle-resolved photoemission spectroscopy (trARPES) featuring both a hemispherical analyzer and a momentum microscope within the same setup. We… ▽ More Time-of-flight-based momentum microscopy has a growing presence in photoemission studies, as it enables parallel energy- and momentum-resolved acquisition of the full photoelectron distribution. Here, we report table-top extreme ultraviolet (XUV) time- and angle-resolved photoemission spectroscopy (trARPES) featuring both a hemispherical analyzer and a momentum microscope within the same setup. We present a systematic comparison of the two detection schemes and quantify experimentally relevant parameters, including pump- and probe-induced space-charge effects, detection efficiency, photoelectron count rates, and depth of focus. We highlight the advantages and limitations of both instruments based on exemplary trARPES measurements of bulk WSe2. Our analysis demonstrates the complementary nature of the two spectrometers for time-resolved ARPES experiments. Their combination in a single experimental apparatus allows us to address a broad range of scientific questions with trARPES. △ Less

Submitted 14 December, 2020; v1 submitted 13 August, 2020; originally announced August 2020.

Comments: 19 pages, 9 figures. The following article has been submitted to Review of Scientific Instruments / AIP Publishing. After it is published, it will be found at https://aip.scitation.org/journal/rsi

arXiv:2005.10210 [pdf, other]

doi 10.1038/s43588-022-00382-2

A machine learning route between band mapping and band structure

Authors: Rui Patrick Xian, Vincent Stimper, Marios Zacharias, Maciej Dendzik, Shuo Dong, Samuel Beaulieu, Bernhard Schölkopf, Martin Wolf, Laurenz Rettig, Christian Carbogno, Stefan Bauer, Ralph Ernstorfer

Abstract: Electronic band structure (BS) and crystal structure are the two complementary identifiers of solid state materials. While convenient instruments and reconstruction algorithms have made large, empirical, crystal structure databases possible, extracting quasiparticle dispersion (closely related to BS) from photoemission band mapping data is currently limited by the available computational methods.… ▽ More Electronic band structure (BS) and crystal structure are the two complementary identifiers of solid state materials. While convenient instruments and reconstruction algorithms have made large, empirical, crystal structure databases possible, extracting quasiparticle dispersion (closely related to BS) from photoemission band mapping data is currently limited by the available computational methods. To cope with the growing size and scale of photoemission data, we develop a pipeline including probabilistic machine learning and the associated data processing, optimization and evaluation methods for band structure reconstruction, leveraging theoretical calculations. The pipeline reconstructs all 14 valence bands of a semiconductor and shows excellent performance on benchmarks and other materials datasets. The reconstruction uncovers previously inaccessible momentum-space structural information on both global and local scales, while realizing a path towards integration with materials science databases. Our approach illustrates the potential of combining machine learning and domain knowledge for scalable feature extraction in multidimensional data. △ Less

Submitted 15 November, 2022; v1 submitted 20 May, 2020; originally announced May 2020.

arXiv:2003.12925 [pdf, other]

doi 10.1103/PhysRevLett.125.096401

Observation of an excitonic Mott transition through ultrafast core-$\textit{cum}$-conduction photoemission spectroscopy

Authors: Maciej Dendzik, R. Patrick Xian, Enrico Perfetto, Davide Sangalli, Dmytro Kutnyakhov, Shuo Dong, Samuel Beaulieu, Tommaso Pincelli, Federico Pressacco, Davide Curcio, Steinn Ymir Agustsson, Michael Heber, Jasper Hauer, Wilfried Wurth, Günter Brenner, Yves Acremann, Philip Hofmann, Martin Wolf, Andrea Marini, Gianluca Stefanucci, Laurenz Rettig, Ralph Ernstorfer

Abstract: Time-resolved soft-X-ray photoemission spectroscopy is used to simultaneously measure the ultrafast dynamics of core-level spectral functions and excited states upon excitation of excitons in WSe$_2$. We present a many-body approximation for the Green's function, which excellently describes the transient core-hole spectral function. The relative dynamics of excited-state signal and core levels rev… ▽ More Time-resolved soft-X-ray photoemission spectroscopy is used to simultaneously measure the ultrafast dynamics of core-level spectral functions and excited states upon excitation of excitons in WSe$_2$. We present a many-body approximation for the Green's function, which excellently describes the transient core-hole spectral function. The relative dynamics of excited-state signal and core levels reveals a delayed core-hole renormalization due to screening by excited quasi-free carriers, revealing an excitonic Mott transition. These findings establish time-resolved core-level photoelectron spectroscopy as a sensitive probe of subtle electronic many-body interactions and an ultrafast electronic phase transition. △ Less

Submitted 28 March, 2020; originally announced March 2020.

Comments: 6 pages, 4 figures

Journal ref: Phys. Rev. Lett. 125, 096401 (2020)

arXiv:2003.04059 [pdf, other]

Ultrafast Dynamical Lifshitz Transition

Authors: Samuel Beaulieu, Shuo Dong, Nicolas Tancogne-Dejean, Maciej Dendzik, Tommaso Pincelli, Julian Maklar, R. Patrick Xian, Michael A. Sentef, Martin Wolf, Angel Rubio, Laurenz Rettig, Ralph Ernstorfer

Abstract: Fermi surface is at the heart of our understanding of metals and strongly correlated many-body systems. An abrupt change in the Fermi surface topology, also called Lifshitz transition, can lead to the emergence of fascinating phenomena like colossal magnetoresistance and superconductivity. While Lifshitz transitions have been demonstrated for a broad range of materials by equilibrium tuning of mac… ▽ More Fermi surface is at the heart of our understanding of metals and strongly correlated many-body systems. An abrupt change in the Fermi surface topology, also called Lifshitz transition, can lead to the emergence of fascinating phenomena like colossal magnetoresistance and superconductivity. While Lifshitz transitions have been demonstrated for a broad range of materials by equilibrium tuning of macroscopic parameters such as strain, doping, pressure and temperature, a non-equilibrium dynamical route toward ultrafast modification of the Fermi surface topology has not been experimentally demonstrated. Combining time-resolved multidimensional photoemission spectroscopy with state-of-the-art TDDFT+$U$ simulations, we introduce a novel scheme for driving an ultrafast Lifshitz transition in the correlated type-II Weyl semimetal T$\mathrm{_{d}}$-MoTe$_{2}$. We demonstrate that this non-equilibrium topological electronic transition finds its microscopic origin in the dynamical modification of the effective electronic correlations. These results shed light on a novel ultrafast scheme for controlling the Fermi surface topology in correlated quantum materials. △ Less

Submitted 15 February, 2021; v1 submitted 9 March, 2020; originally announced March 2020.

arXiv:1910.06956 [pdf, ps, other]

Neural tangent kernels, transportation mappings, and universal approximation

Authors: Ziwei Ji, Matus Telgarsky, Ruicheng Xian

Abstract: This paper establishes rates of universal approximation for the shallow neural tangent kernel (NTK): network weights are only allowed microscopic changes from random initialization, which entails that activations are mostly unchanged, and the network is nearly equivalent to its linearization. Concretely, the paper has two main contributions: a generic scheme to approximate functions with the NTK b… ▽ More This paper establishes rates of universal approximation for the shallow neural tangent kernel (NTK): network weights are only allowed microscopic changes from random initialization, which entails that activations are mostly unchanged, and the network is nearly equivalent to its linearization. Concretely, the paper has two main contributions: a generic scheme to approximate functions with the NTK by sampling from transport mappings between the initial weights and their desired values, and the construction of transport mappings via Fourier transforms. Regarding the first contribution, the proof scheme provides another perspective on how the NTK regime arises from rescaling: redundancy in the weights due to resampling allows individual weights to be scaled down. Regarding the second contribution, the most notable transport mapping asserts that roughly $1 / δ^{10d}$ nodes are sufficient to approximate continuous functions, where $δ$ depends on the continuity properties of the target function. By contrast, nearly the same proof yields a bound of $1 / δ^{2d}$ for shallow ReLU networks; this gap suggests a tantalizing direction for future work, separating shallow ReLU networks and their linearization. △ Less

Submitted 14 February, 2020; v1 submitted 15 October, 2019; originally announced October 2019.

arXiv:1909.07714 [pdf, other]

doi 10.1038/s41597-020-00769-8

An open-source, end-to-end workflow for multidimensional photoemission spectroscopy

Authors: Rui Patrick Xian, Yves Acremann, Steinn Ymir Agustsson, Maciej Dendzik, Kevin Bühlmann, Davide Curcio, Dmytro Kutnyakhov, Frederico Pressacco, Michael Heber, Shuo Dong, Tommaso Pincelli, Jure Demsar, Wilfried Wurth, Philip Hofmann, Martin Wolf, Markus Scheidgen, Laurenz Rettig, Ralph Ernstorfer

Abstract: Characterization of the electronic band structure of solid state materials is routinely performed using photoemission spectroscopy. Recent advancements in short-wavelength light sources and electron detectors give rise to multidimensional photoemission spectroscopy, allowing parallel measurements of the electron spectral function simultaneously in energy, two momentum components and additional phy… ▽ More Characterization of the electronic band structure of solid state materials is routinely performed using photoemission spectroscopy. Recent advancements in short-wavelength light sources and electron detectors give rise to multidimensional photoemission spectroscopy, allowing parallel measurements of the electron spectral function simultaneously in energy, two momentum components and additional physical parameters with single-event detection capability. Efficient processing of the photoelectron event streams at a rate of up to tens of megabytes per second will enable rapid band mapping for materials characterization. We describe an open-source workflow that allows user interaction with billion-count single-electron events in photoemission band mapping experiments, compatible with beamlines at $3^{\text{rd}}$ and $4^{\text{th}}$ generation light sources and table-top laser-based setups. The workflow offers an end-to-end recipe from distributed operations on single-event data to structured formats for downstream scientific tasks and storage to materials science database integration. Both the workflow and processed data can be archived for reuse, providing the infrastructure for documenting the provenance and lineage of photoemission data for future high-throughput experiments. △ Less

Submitted 14 November, 2020; v1 submitted 17 September, 2019; originally announced September 2019.

arXiv:1909.00248 [pdf, other]

doi 10.1103/PhysRevLett.124.206402

Evidence of large polarons in photoemission band mapping of the perovskite semiconductor CsPbBr$_3$

Authors: M. Puppin, S. Polishchuk, N. Colonna, A. Crepaldi, D. N. Dirin, O. Nazarenko, R. De Gennaro, G. Gatti, S. Roth, T. Barillot, L. Poletto, R. P. Xian, L. Rettig, M. Wolf, R. Ernstorfer, M. V. Kovalenko, N. Marzari, M. Grioni, M. Chergui

Abstract: Lead-halide perovskite (LHP) semiconductors are emergent optoelectronic materials with outstanding transport properties which are not yet fully understood. We find signatures of large polaron formation in the electronic structure of the inorganic LHP CsPbBr$_3$ by means of angle-resolved photoelectron spectroscopy. The experimental valence band dispersion shows a hole effective mass… ▽ More Lead-halide perovskite (LHP) semiconductors are emergent optoelectronic materials with outstanding transport properties which are not yet fully understood. We find signatures of large polaron formation in the electronic structure of the inorganic LHP CsPbBr$_3$ by means of angle-resolved photoelectron spectroscopy. The experimental valence band dispersion shows a hole effective mass $0.26\pm0.02\,\,m_e$, 50% heavier than the bare mass $m_0 =0.17 m_e$ predicted by density functional theory. Calculations of electron-phonon coupling indicate that phonon dressing of the carriers mainly occurs via distortions of the Pb-Br bond with a Fröhlich coupling parameter $α=1.82$. A good agreement with our experimental data is obtained within the Feynmann polaron model, validating a viable theorical method to predict the carrier effective mass of LHPs ab-initio. △ Less

Submitted 31 August, 2019; originally announced September 2019.

Journal ref: Phys. Rev. Lett. 124, 206402 (2020)

arXiv:1906.12155 [pdf, other]

doi 10.1063/1.5118777

Time- and momentum-resolved photoemission studies using time-of-flight momentum microscopy at a free-electron laser

Authors: Dmytro Kutnyakhov, Rui Patrick Xian, Maciej Dendzik, Michael Heber, Federico Pressacco, Steinn Ymir Agustsson, Lukas Wenthaus, Holger Meyer, Sven Gieschen, Giuseppe Mercurio, Adrian Benz, Kevin Bühlman, Simon Däster, Rafael Gort, Davide Curcio, Klara Volckaert, Marco Bianchi, Charlotte Sanders, Jill Atsuko Miwa, Søren Ulstrup, Andreas Oelsner, Christian Tusche, Ying-Jiun Chen, Dmitrii Vasilyev, Katerina Medjanik , et al. (16 additional authors not shown)

Abstract: Time-resolved photoemission with ultrafast pump and probe pulses is an emerging technique with wide application potential. Real-time recording of non-equilibrium electronic processes, transient states in chemical reactions or the interplay of electronic and structural dynamics offers fascinating opportunities for future research. Combining valence-band and core-level spectroscopy with photoelectro… ▽ More Time-resolved photoemission with ultrafast pump and probe pulses is an emerging technique with wide application potential. Real-time recording of non-equilibrium electronic processes, transient states in chemical reactions or the interplay of electronic and structural dynamics offers fascinating opportunities for future research. Combining valence-band and core-level spectroscopy with photoelectron diffraction for electronic, chemical and structural analysis requires few 10 fs soft X-ray pulses with some 10 meV spectral resolution, which are currently available at high repetition rate free-electron lasers. The PG2 beamline at FLASH (DESY, Hamburg) provides a high pulse rate of 5000 pulses/s, 60 fs pulse duration and 40 meV bandwidth in an energy range of 25-830 eV with a photon beam size down to 50 microns in diameter. We have constructed and optimized a versatile setup commissioned at FLASH/PG2 that combines FEL capabilities together with a multidimensional recording scheme for photoemission studies. We use a full-field imaging momentum microscope with time-of-flight energy recording as the detector for mapping of 3D band structures in ($k_x$, $k_y$, $E$) parameter space with unprecedented efficiency. Our instrument can image full surface Brillouin zones with up to 7 Å $^{-1}$ diameter in a binding-energy range of several eV, resolving about $2.5\times10^5$ data voxels. As an example, we present results for the ultrafast excited state dynamics in the model van der Waals semiconductor WSe$_2$. △ Less

Submitted 18 September, 2019; v1 submitted 28 June, 2019; originally announced June 2019.

Journal ref: Review of Scientific Instruments 91, 013109 (2020)

arXiv:1906.11355 [pdf, other]

doi 10.1109/ACCESS.2019.2952899

Multidimensional Contrast Limited Adaptive Histogram Equalization

Authors: Vincent Stimper, Stefan Bauer, Ralph Ernstorfer, Bernhard Schölkopf, R. Patrick Xian

Abstract: Contrast enhancement is an important preprocessing technique for improving the performance of downstream tasks in image processing and computer vision. Among the existing approaches based on nonlinear histogram transformations, contrast limited adaptive histogram equalization (CLAHE) is a popular choice for dealing with 2D images obtained in natural and scientific settings. The recent hardware upg… ▽ More Contrast enhancement is an important preprocessing technique for improving the performance of downstream tasks in image processing and computer vision. Among the existing approaches based on nonlinear histogram transformations, contrast limited adaptive histogram equalization (CLAHE) is a popular choice for dealing with 2D images obtained in natural and scientific settings. The recent hardware upgrade in data acquisition systems results in significant increase in data complexity, including their sizes and dimensions. Measurements of densely sampled data higher than three dimensions, usually composed of 3D data as a function of external parameters, are becoming commonplace in various applications in the natural sciences and engineering. The initial understanding of these complex multidimensional datasets often requires human intervention through visual examination, which may be hampered by the varying levels of contrast permeating through the dimensions. We show both qualitatively and quantitatively that using our multidimensional extension of CLAHE (MCLAHE) simultaneously on all dimensions of the datasets allows better visualization and discernment of multidimensional image features, as demonstrated using cases from 4D photoemission spectroscopy and fluorescence microscopy. Our implementation of multidimensional CLAHE in Tensorflow is publicly accessible and supports parallelization with multiple CPUs and various other hardware accelerators, including GPUs. △ Less

Submitted 9 November, 2019; v1 submitted 26 June, 2019; originally announced June 2019.

Journal ref: IEEE Access 7, 165437 (2019)

arXiv:1906.07709

Approximation power of random neural networks

Authors: Bolton Bailey, Ziwei Ji, Matus Telgarsky, Ruicheng Xian

Abstract: This paper investigates the approximation power of three types of random neural networks: (a) infinite width networks, with weights following an arbitrary distribution; (b) finite width networks obtained by subsampling the preceding infinite width networks; (c) finite width networks obtained by starting with standard Gaussian initialization, and then adding a vanishingly small correction to the we… ▽ More This paper investigates the approximation power of three types of random neural networks: (a) infinite width networks, with weights following an arbitrary distribution; (b) finite width networks obtained by subsampling the preceding infinite width networks; (c) finite width networks obtained by starting with standard Gaussian initialization, and then adding a vanishingly small correction to the weights. The primary result is a fully quantified bound on the rate of approximation of general general continuous functions: in all three cases, a function $f$ can be approximated with complexity $\|f\|_1 (d/δ)^{\mathcal{O}(d)}$, where $δ$ depends on continuity properties of $f$ and the complexity measure depends on the weight magnitudes and/or cardinalities. Along the way, a variety of ancillary results are developed: an exact construction of Gaussian densities with infinite width networks, an elementary stand-alone proof scheme for approximation via convolutions of radial basis functions, subsampling rates for infinite width networks, and depth separation for corrected networks. △ Less

Submitted 17 October, 2019; v1 submitted 18 June, 2019; originally announced June 2019.

Comments: This submission constitutes a poor approach to the problem, and has no scientific purpose. A superior (different) approach (and stronger final result, also treating the NTK) has appeared in arXiv:1910.06956 ; please see that work instead

arXiv:1901.07064 [pdf, other]

Predictive Indexing

Authors: Joy Arulraj, Ran Xian, Lin Ma, Andrew Pavlo

Abstract: There has been considerable research on automated index tuning in database management systems (DBMSs). But the majority of these solutions tune the index configuration by retrospectively making computationally expensive physical design changes all at once. Such changes degrade the DBMS's performance during the process, and have reduced utility during subsequent query processing due to the delay be… ▽ More There has been considerable research on automated index tuning in database management systems (DBMSs). But the majority of these solutions tune the index configuration by retrospectively making computationally expensive physical design changes all at once. Such changes degrade the DBMS's performance during the process, and have reduced utility during subsequent query processing due to the delay between a workload shift and the associated change. A better approach is to generate small changes that tune the physical design over time, forecast the utility of these changes, and apply them ahead of time to maximize their impact. This paper presents predictive indexing that continuously improves a database's physical design using lightweight physical design changes. It uses a machine learning model to forecast the utility of these changes, and continuously refines the index configuration of the database to handle evolving workloads. We introduce a lightweight hybrid scan operator with which a DBMS can make use of partially-built indexes for query processing. Our evaluation shows that predictive indexing improves the throughput of a DBMS by 3.5--5.2x compared to other state-of-the-art indexing approaches. We demonstrate that predictive indexing works seamlessly with other lightweight automated physical design tuning methods. △ Less

Submitted 21 January, 2019; originally announced January 2019.

Comments: 12 pages

ACM Class: H.2.2; H.2.4

arXiv:1901.00312 [pdf, other]

doi 10.1016/j.ultramic.2019.04.004

Symmetry-guided nonrigid registration: the case for distortion correction in multidimensional photoemission spectroscopy

Authors: Rui Patrick Xian, Laurenz Rettig, Ralph Ernstorfer

Abstract: Image symmetrization is an effective strategy to correct symmetry distortion in experimental data for which symmetry is essential in the subsequent analysis. In the process, a coordinate transform, the symmetrization transform, is required to undo the distortion. The transform may be determined by image registration (i.e. alignment) with symmetry constraints imposed in the registration target and… ▽ More Image symmetrization is an effective strategy to correct symmetry distortion in experimental data for which symmetry is essential in the subsequent analysis. In the process, a coordinate transform, the symmetrization transform, is required to undo the distortion. The transform may be determined by image registration (i.e. alignment) with symmetry constraints imposed in the registration target and in the iterative parameter tuning, which we call symmetry-guided registration. An example use case of image symmetrization is found in electronic band structure mapping by multidimensional photoemission spectroscopy, which employs a 3D time-of-flight detector to measure electrons sorted into the momentum ($k_x$, $k_y$) and energy ($E$) coordinates. In reality, imperfect instrument design, sample geometry and experimental settings cause distortion of the photoelectron trajectories and, therefore, the symmetry in the measured band structure, which hinders the full understanding and use of the volumetric datasets. We demonstrate that symmetry-guided registration can correct the symmetry distortion in the momentum-resolved photoemission patterns. Using proposed symmetry metrics, we show quantitatively that the iterative approach to symmetrization outperforms its non-iterative counterpart in the restored symmetry of the outcome while preserving the average shape of the photoemission pattern. Our approach is generalizable to distortion corrections in different types of symmetries and should also find applications in other experimental methods that produce images with similar features. △ Less

Submitted 7 April, 2019; v1 submitted 2 January, 2019; originally announced January 2019.

Journal ref: Ultramicroscopy 202, 133 (2019)

arXiv:0902.3713 [pdf, ps, other]

doi 10.1364/OL.35.001166

Arbitrary-order lensless ghost imaging with thermal light

Authors: Xi-Hao Chen, Ivan N. Agafonov, Kai-Hong Luo, Qian Liu, Rui Xian, Maria V. Chekhova, Ling-An Wu

Abstract: Arbitrary Nth-order ($N\geq2$) lensless ghost imaging with thermal light has been performed for the first time by only recording the intensities in two optical paths. It is shown that the image visibility can be dramatically enhanced as the order N increases. It is also found that longer integration times are required for higher-order correlation measurements as N increases, due to the increased… ▽ More Arbitrary Nth-order ($N\geq2$) lensless ghost imaging with thermal light has been performed for the first time by only recording the intensities in two optical paths. It is shown that the image visibility can be dramatically enhanced as the order N increases. It is also found that longer integration times are required for higher-order correlation measurements as N increases, due to the increased fluctuations of higher-order intensity correlation functions. △ Less

Submitted 4 December, 2009; v1 submitted 21 February, 2009; originally announced February 2009.

Comments: Updated version; some more detailed explanations provided

Showing 1–40 of 40 results for author: Xian, R