Search | arXiv e-print repository

Knowledge Navigator: LLM-guided Browsing Framework for Exploratory Search in Scientific Literature

Authors: Uri Katz, Mosh Levy, Yoav Goldberg

Abstract: The exponential growth of scientific literature necessitates advanced tools for effective knowledge exploration. We present Knowledge Navigator, a system designed to enhance exploratory search abilities by organizing and structuring the retrieved documents from broad topical queries into a navigable, two-level hierarchy of named and descriptive scientific topics and subtopics. This structured orga… ▽ More The exponential growth of scientific literature necessitates advanced tools for effective knowledge exploration. We present Knowledge Navigator, a system designed to enhance exploratory search abilities by organizing and structuring the retrieved documents from broad topical queries into a navigable, two-level hierarchy of named and descriptive scientific topics and subtopics. This structured organization provides an overall view of the research themes in a domain, while also enabling iterative search and deeper knowledge discovery within specific subtopics by allowing users to refine their focus and retrieve additional relevant documents. Knowledge Navigator combines LLM capabilities with cluster-based methods to enable an effective browsing method. We demonstrate our approach's effectiveness through automatic and manual evaluations on two novel benchmarks, CLUSTREC-COVID and SCITOC. Our code, prompts, and benchmarks are made publicly available. △ Less

Submitted 28 August, 2024; originally announced August 2024.

arXiv:2407.15849 [pdf, other]

WayEx: Waypoint Exploration using a Single Demonstration

Authors: Mara Levy, Nirat Saini, Abhinav Shrivastava

Abstract: We propose WayEx, a new method for learning complex goal-conditioned robotics tasks from a single demonstration. Our approach distinguishes itself from existing imitation learning methods by demanding fewer expert examples and eliminating the need for information about the actions taken during the demonstration. This is accomplished by introducing a new reward function and employing a knowledge ex… ▽ More We propose WayEx, a new method for learning complex goal-conditioned robotics tasks from a single demonstration. Our approach distinguishes itself from existing imitation learning methods by demanding fewer expert examples and eliminating the need for information about the actions taken during the demonstration. This is accomplished by introducing a new reward function and employing a knowledge expansion technique. We demonstrate the effectiveness of WayEx, our waypoint exploration strategy, across six diverse tasks, showcasing its applicability in various environments. Notably, our method significantly reduces training time by 50% as compared to traditional reinforcement learning methods. WayEx obtains a higher reward than existing imitation learning methods given only a single demonstration. Furthermore, we demonstrate its success in tackling complex environments where standard approaches fall short. More information is available at: https://waypoint-ex.github.io. △ Less

Submitted 22 July, 2024; originally announced July 2024.

Comments: ICRA 2024

arXiv:2407.07092 [pdf, other]

V-VIPE: Variational View Invariant Pose Embedding

Authors: Mara Levy, Abhinav Shrivastava

Abstract: Learning to represent three dimensional (3D) human pose given a two dimensional (2D) image of a person, is a challenging problem. In order to make the problem less ambiguous it has become common practice to estimate 3D pose in the camera coordinate space. However, this makes the task of comparing two 3D poses difficult. In this paper, we address this challenge by separating the problem of estimati… ▽ More Learning to represent three dimensional (3D) human pose given a two dimensional (2D) image of a person, is a challenging problem. In order to make the problem less ambiguous it has become common practice to estimate 3D pose in the camera coordinate space. However, this makes the task of comparing two 3D poses difficult. In this paper, we address this challenge by separating the problem of estimating 3D pose from 2D images into two steps. We use a variational autoencoder (VAE) to find an embedding that represents 3D poses in canonical coordinate space. We refer to this embedding as variational view-invariant pose embedding V-VIPE. Using V-VIPE we can encode 2D and 3D poses and use the embedding for downstream tasks, like retrieval and classification. We can estimate 3D poses from these embeddings using the decoder as well as generate unseen 3D poses. The variability of our encoding allows it to generalize well to unseen camera views when mapping from 2D space. To the best of our knowledge, V-VIPE is the only representation to offer this diversity of applications. Code and more information can be found at https://v-vipe.github.io/. △ Less

Submitted 9 July, 2024; originally announced July 2024.

Comments: CVPR 2024 - RHOBIN Workshop

arXiv:2406.13301 [pdf, other]

ARDuP: Active Region Video Diffusion for Universal Policies

Authors: Shuaiyi Huang, Mara Levy, Zhenyu Jiang, Anima Anandkumar, Yuke Zhu, Linxi Fan, De-An Huang, Abhinav Shrivastava

Abstract: Sequential decision-making can be formulated as a text-conditioned video generation problem, where a video planner, guided by a text-defined goal, generates future frames visualizing planned actions, from which control actions are subsequently derived. In this work, we introduce Active Region Video Diffusion for Universal Policies (ARDuP), a novel framework for video-based policy learning that emp… ▽ More Sequential decision-making can be formulated as a text-conditioned video generation problem, where a video planner, guided by a text-defined goal, generates future frames visualizing planned actions, from which control actions are subsequently derived. In this work, we introduce Active Region Video Diffusion for Universal Policies (ARDuP), a novel framework for video-based policy learning that emphasizes the generation of active regions, i.e. potential interaction areas, enhancing the conditional policy's focus on interactive areas critical for task execution. This innovative framework integrates active region conditioning with latent diffusion models for video planning and employs latent representations for direct action decoding during inverse dynamic modeling. By utilizing motion cues in videos for automatic active region discovery, our method eliminates the need for manual annotations of active regions. We validate ARDuP's efficacy via extensive experiments on simulator CLIPort and the real-world dataset BridgeData v2, achieving notable improvements in success rates and generating convincingly realistic video plans. △ Less

Submitted 19 June, 2024; originally announced June 2024.

arXiv:2406.03855 [pdf, other]

Performance of large language models in numerical vs. semantic medical knowledge: Benchmarking on evidence-based Q&As

Authors: Eden Avnat, Michal Levy, Daniel Herstain, Elia Yanko, Daniel Ben Joya, Michal Tzuchman Katz, Dafna Eshel, Sahar Laros, Yael Dagan, Shahar Barami, Joseph Mermelstein, Shahar Ovadia, Noam Shomron, Varda Shalev, Raja-Elie E. Abdulnour

Abstract: Clinical problem-solving requires processing of semantic medical knowledge such as illness scripts and numerical medical knowledge of diagnostic tests for evidence-based decision-making. As large language models (LLMs) show promising results in many aspects of language-based clinical practice, their ability to generate non-language evidence-based answers to clinical questions is inherently limited… ▽ More Clinical problem-solving requires processing of semantic medical knowledge such as illness scripts and numerical medical knowledge of diagnostic tests for evidence-based decision-making. As large language models (LLMs) show promising results in many aspects of language-based clinical practice, their ability to generate non-language evidence-based answers to clinical questions is inherently limited by tokenization. Therefore, we evaluated LLMs' performance on two question types: numeric (correlating findings) and semantic (differentiating entities) while examining differences within and between LLMs in medical aspects and comparing their performance to humans. To generate straightforward multi-choice questions and answers (QAs) based on evidence-based medicine (EBM), we used a comprehensive medical knowledge graph (encompassed data from more than 50,00 peer-reviewed articles) and created the "EBMQA". EBMQA contains 105,000 QAs labeled with medical and non-medical topics and classified into numerical or semantic questions. We benchmarked this dataset using more than 24,500 QAs on two state-of-the-art LLMs: Chat-GPT4 and Claude3-Opus. We evaluated the LLMs accuracy on semantic and numerical question types and according to sub-labeled topics. For validation, six medical experts were tested on 100 numerical EBMQA questions. We found that both LLMs excelled more in semantic than numerical QAs, with Claude3 surpassing GPT4 in numerical QAs. However, both LLMs showed inter and intra gaps in different medical aspects and remained inferior to humans. Thus, their medical advice should be addressed carefully. △ Less

Submitted 24 July, 2024; v1 submitted 6 June, 2024; originally announced June 2024.

arXiv:2405.18065 [pdf, other]

EffoVPR: Effective Foundation Model Utilization for Visual Place Recognition

Authors: Issar Tzachor, Boaz Lerner, Matan Levy, Michael Green, Tal Berkovitz Shalev, Gavriel Habib, Dvir Samuel, Noam Korngut Zailer, Or Shimshi, Nir Darshan, Rami Ben-Ari

Abstract: The task of Visual Place Recognition (VPR) is to predict the location of a query image from a database of geo-tagged images. Recent studies in VPR have highlighted the significant advantage of employing pre-trained foundation models like DINOv2 for the VPR task. However, these models are often deemed inadequate for VPR without further fine-tuning on task-specific data. In this paper, we propose a… ▽ More The task of Visual Place Recognition (VPR) is to predict the location of a query image from a database of geo-tagged images. Recent studies in VPR have highlighted the significant advantage of employing pre-trained foundation models like DINOv2 for the VPR task. However, these models are often deemed inadequate for VPR without further fine-tuning on task-specific data. In this paper, we propose a simple yet powerful approach to better exploit the potential of a foundation model for VPR. We first demonstrate that features extracted from self-attention layers can serve as a powerful re-ranker for VPR. Utilizing these features in a zero-shot manner, our method surpasses previous zero-shot methods and achieves competitive results compared to supervised methods across multiple datasets. Subsequently, we demonstrate that a single-stage method leveraging internal ViT layers for pooling can generate global features that achieve state-of-the-art results, even when reduced to a dimensionality as low as 128D. Nevertheless, incorporating our local foundation features for re-ranking, expands this gap. Our approach further demonstrates remarkable robustness and generalization, achieving state-of-the-art results, with a significant gap, in challenging scenarios, involving occlusion, day-night variations, and seasonal changes. △ Less

Submitted 28 May, 2024; originally announced May 2024.

arXiv:2405.18025 [pdf, other]

Unveiling the Power of Diffusion Features For Personalized Segmentation and Retrieval

Authors: Dvir Samuel, Rami Ben-Ari, Matan Levy, Nir Darshan, Gal Chechik

Abstract: Personalized retrieval and segmentation aim to locate specific instances within a dataset based on an input image and a short description of the reference instance. While supervised methods are effective, they require extensive labeled data for training. Recently, self-supervised foundation models have been introduced to these tasks showing comparable results to supervised methods. However, a sign… ▽ More Personalized retrieval and segmentation aim to locate specific instances within a dataset based on an input image and a short description of the reference instance. While supervised methods are effective, they require extensive labeled data for training. Recently, self-supervised foundation models have been introduced to these tasks showing comparable results to supervised methods. However, a significant flaw in these models is evident: they struggle to locate a desired instance when other instances within the same class are presented. In this paper, we explore text-to-image diffusion models for these tasks. Specifically, we propose a novel approach called PDM for Personalized Features Diffusion Matching, that leverages intermediate features of pre-trained text-to-image models for personalization tasks without any additional training. PDM demonstrates superior performance on popular retrieval and segmentation benchmarks, outperforming even supervised methods. We also highlight notable shortcomings in current instance and segmentation datasets and propose new benchmarks for these tasks. △ Less

Submitted 28 May, 2024; originally announced May 2024.

arXiv:2405.06860 [pdf, ps, other]

A Generalization of the Erdős-Kac Theorem

Authors: Matthew Levy, Joseph Squillace

Abstract: Given a natural number $n$, let $ω\left(n\right)$ denote the number of distinct prime factors of $n$, let $Z$ denote a standard normal variable, and let $P_{n}$ denote the uniform distribution on $\left\{ 1,\ldots,n\right\} $. The Erdős-Kac Theorem states that if $N\left(n\right)$ is a uniformly distributed variable on $\lbrace 1,\ldots,n \rbrace$, then $ω\left(N\left(n\right)\right)$ is asymptoti… ▽ More Given a natural number $n$, let $ω\left(n\right)$ denote the number of distinct prime factors of $n$, let $Z$ denote a standard normal variable, and let $P_{n}$ denote the uniform distribution on $\left\{ 1,\ldots,n\right\} $. The Erdős-Kac Theorem states that if $N\left(n\right)$ is a uniformly distributed variable on $\lbrace 1,\ldots,n \rbrace$, then $ω\left(N\left(n\right)\right)$ is asymptotically normally distributed as $n\to \infty$ with both mean and variance equal to $\log \log n$. The contribution of this paper is a generalization of the Erdős-Kac Theorem to a larger class of random variables by considering perturbations of the uniform probability mass $1/n$ in the following sense. Denote by $\mathbb{P}_{n}$ a probability distribution on $\left\{ 1,\ldots,n\right\} $ given by $\mathbb{P}_{n}\left(i\right)=1/n+\varepsilon_{i,n}$. We provide sufficient conditions on $\varepsilon_{i,n}$ so that the number of distinct prime factors of a $\mathbb{P}_{n}$-distributed random variable is asymptotically normally distributed, as $n\to \infty$, with both mean and variance equal to $\log \log n$. Our main result is applied to prove that the number of distinct prime factors of a positive integer with the Harmonic$\left(n\right)$ distribution also tends to the normal distribution, as $n\to \infty$. In addition, we explore sequences of distributions on the natural numbers such that $ω(n)$ is normally distributed in the limit. In addition, one of our theorems and its corollaries generalize a result from the literature involving the limit of $Zeta\left(s\right)$ distributions as the parameter $s \to 1$. △ Less

Submitted 10 May, 2024; originally announced May 2024.

Comments: Supersedes arXiv:2011.00152v1

MSC Class: 60F05; 11N37

arXiv:2405.03745 [pdf, ps, other]

White dwarf eccentricity fluctuation and dissipation by AGB convection

Authors: Yair Cohen, Sivan Ginzburg, Maya Levy, Tal Bar Shalom, Yoav Siman Tov

Abstract: Millisecond pulsars with white dwarf companions have typical eccentricities $e\sim 10^{-6}-10^{-3}$. The eccentricities of helium white dwarfs are explained well by applying the fluctuation-dissipation theorem to convective eddies in their red giant progenitors. We extend this theory to more massive carbon-oxygen (CO) white dwarfs with asymptotic giant branch (AGB) progenitors. Due to the radiatio… ▽ More Millisecond pulsars with white dwarf companions have typical eccentricities $e\sim 10^{-6}-10^{-3}$. The eccentricities of helium white dwarfs are explained well by applying the fluctuation-dissipation theorem to convective eddies in their red giant progenitors. We extend this theory to more massive carbon-oxygen (CO) white dwarfs with asymptotic giant branch (AGB) progenitors. Due to the radiation pressure in AGB stars, the dominant factor in determining the remnant white dwarf's eccentricity is the critical residual hydrogen envelope mass $m_{\rm env}$ required to inflate the star to giant proportions. Using a suite of MESA stellar evolution simulations with $Δm_{\rm c}=10^{-3}\,{\rm M}_\odot$ core-mass intervals, we resolved the AGB thermal pulses and found that the critical $m_{\rm env}\propto m_{\rm c}^{-6}$. This steep dependence causes the $e(m_{\rm c})$ relation to turn over, such that $e\sim 3\times 10^{-3}$ almost independently of the remnant CO white dwarf's mass $m_{\rm c}$. Nearly all of the measured eccentricities lie below this robust theoretical limit, indicating that the eccentricity is damped during the common-envelope inspiral that follows the unstable Roche-lobe overflow of the AGB star. Specifically, we focused on white dwarfs with median masses $m_{\rm c}>0.6\,{\rm M}_\odot$. These massive white dwarfs begin their inspiral with practically identical orbital periods and eccentricities, eliminating any dependence on the initial conditions. For this sub-sample, we find an empirical relation $e\propto P^{3/2}$ between the final period and eccentricity that is much tighter than previous studies - motivating theoretical work on the eccentricity evolution during the common envelope phase. △ Less

Submitted 6 May, 2024; originally announced May 2024.

Comments: submitted to MNRAS, comments welcome

arXiv:2404.11901 [pdf, other]

Deep and Dynamic Metabolic and Structural Imaging in Living Tissues

Authors: Kunzan Liu, Honghao Cao, Kasey Shashaty, Li-Yu Yu, Sarah Spitz, Francesca Michela Pramotton, Zhengpeng Wan, Ellen L. Kan, Erin N. Tevonian, Manuel Levy, Eva Lendaro, Roger D. Kamm, Linda G. Griffith, Fan Wang, Tong Qiu, Sixian You

Abstract: Label-free imaging through two-photon autofluorescence (2PAF) of NAD(P)H allows for non-destructive and high-resolution visualization of cellular activities in living systems. However, its application to thick tissues and organoids has been restricted by its limited penetration depth within 300 $μ$m, largely due to tissue scattering at the typical excitation wavelength (~750 nm) required for NAD(P… ▽ More Label-free imaging through two-photon autofluorescence (2PAF) of NAD(P)H allows for non-destructive and high-resolution visualization of cellular activities in living systems. However, its application to thick tissues and organoids has been restricted by its limited penetration depth within 300 $μ$m, largely due to tissue scattering at the typical excitation wavelength (~750 nm) required for NAD(P)H. Here, we demonstrate that the imaging depth for NAD(P)H can be extended to over 700 $μ$m in living engineered human multicellular microtissues by adopting multimode fiber (MMF)-based low-repetition-rate high-peak-power three-photon (3P) excitation of NAD(P)H at 1100 nm. This is achieved by having over 0.5 MW peak power at the band of 1100$\pm$25 nm through adaptively modulating multimodal nonlinear pulse propagation with a compact fiber shaper. Moreover, the 8-fold increase in pulse energy at 1100 nm enables faster imaging of monocyte behaviors in the living multicellular models. These results represent a significant advance for deep and dynamic metabolic and structural imaging of intact living biosystems. The modular design (MMF with a slip-on fiber shaper) is anticipated to allow wide adoption of this methodology for demanding in vivo and in vitro imaging applications, including cancer research, autoimmune diseases, and tissue engineering. △ Less

Submitted 18 April, 2024; originally announced April 2024.

Comments: 20 pages, 5 figures, under review in Science Advances

arXiv:2402.14848 [pdf, other]

Same Task, More Tokens: the Impact of Input Length on the Reasoning Performance of Large Language Models

Authors: Mosh Levy, Alon Jacoby, Yoav Goldberg

Abstract: This paper explores the impact of extending input lengths on the capabilities of Large Language Models (LLMs). Despite LLMs advancements in recent times, their performance consistency across different input lengths is not well understood. We investigate this aspect by introducing a novel QA reasoning framework, specifically designed to assess the impact of input length. We isolate the effect of in… ▽ More This paper explores the impact of extending input lengths on the capabilities of Large Language Models (LLMs). Despite LLMs advancements in recent times, their performance consistency across different input lengths is not well understood. We investigate this aspect by introducing a novel QA reasoning framework, specifically designed to assess the impact of input length. We isolate the effect of input length using multiple versions of the same sample, each being extended with padding of different lengths, types and locations. Our findings show a notable degradation in LLMs' reasoning performance at much shorter input lengths than their technical maximum. We show that the degradation trend appears in every version of our dataset, although at different intensities. Additionally, our study reveals that the traditional metric of next word prediction correlates negatively with performance of LLMs' on our reasoning dataset. We analyse our results and identify failure modes that can serve as useful guides for future research, potentially informing strategies to address the limitations observed in LLMs. △ Less

Submitted 10 July, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

Comments: Accepted to ACL 2024

arXiv:2402.09352 [pdf, other]

Sign of the $hZZ$ coupling and implication for new physics

Authors: Dipankar Das, Anirban Kundu, Miguel Levy, Anugrah M. Prasad, Ipsita Saha, Agnivo Sarkar

Abstract: The magnitudes of the couplings of the scalar resonance at 125 GeV with the SM particles are found to be consistent with those of the SM Higgs boson. However, the signs are not experimentally determined in most of the cases, a prime example being that with the $Z$-boson pair. In other words, $κ_Z^h$, the ratio of the couplings of the actual 125 GeV resonance with $ZZ$ and that of the SM Higgs boso… ▽ More The magnitudes of the couplings of the scalar resonance at 125 GeV with the SM particles are found to be consistent with those of the SM Higgs boson. However, the signs are not experimentally determined in most of the cases, a prime example being that with the $Z$-boson pair. In other words, $κ_Z^h$, the ratio of the couplings of the actual 125 GeV resonance with $ZZ$ and that of the SM Higgs boson with the same, is consistent with both $+1$ and $-1$, the latter being the `wrong-sign'. We argue that the wrong-sign $hZZ$ coupling will necessitate the intervention of new physics below $\mathcal{O}\left(620\right)$ GeV to safeguard the underlying theory from unitarity violation. The strength of the new nonstandard couplings can be derived from the unitarity sum rules, which are comparable to the SM-Higgs couplings in magnitude. Thus the strong limits from the direct searches at the LHC can help us rule out the existence of such nonstandard particles with unusually large couplings thereby disfavoring the possibility of a wrong-sign $hZZ$ coupling. △ Less

Submitted 14 February, 2024; originally announced February 2024.

Comments: 8 pages, 3 figures

Report number: HRI-RECAPP-2024-01

arXiv:2312.10660 [pdf, other]

Cryogenic hybrid magnonic circuits based on spalled YIG thin films

Authors: Jing Xu, Connor Horn, Yu Jiang, Xinhao Li, Daniel Rosenmann, Xu Han, Miguel Levy, Supratik Guha, Xufeng Zhang

Abstract: Yttrium iron garnet (YIG) magnonics has sparked extensive research interests toward harnessing magnons (quasiparticles of collective spin excitation) for signal processing. In particular, YIG magnonics-based hybrid systems exhibit great potentials for quantum information science because of their wide frequency tunability and excellent compatibility with other platforms. However, the broad applicat… ▽ More Yttrium iron garnet (YIG) magnonics has sparked extensive research interests toward harnessing magnons (quasiparticles of collective spin excitation) for signal processing. In particular, YIG magnonics-based hybrid systems exhibit great potentials for quantum information science because of their wide frequency tunability and excellent compatibility with other platforms. However, the broad application and scalability of thin-film YIG devices in the quantum regime has been severely limited due to the substantial microwave loss in the host substrate for YIG, gadolinium gallium garnet (GGG), at cryogenic temperatures. In this study, we demonstrate that substrate-free YIG thin films can be obtained by introducing the controlled spalling and layer transfer technology to YIG/GGG samples. Our approach is validated by measuring a hybrid device consisting of a superconducting resonator and a spalled YIG film, which gives a strong coupling feature indicating the good coherence of our system. This advancement paves the way for enhanced on-chip integration and the scalability of YIG-based quantum devices. △ Less

Submitted 19 December, 2023; v1 submitted 17 December, 2023; originally announced December 2023.

Comments: 10 pages, 8 figures

arXiv:2311.07389 [pdf, other]

Transpose Attack: Stealing Datasets with Bidirectional Training

Authors: Guy Amit, Mosh Levy, Yisroel Mirsky

Abstract: Deep neural networks are normally executed in the forward direction. However, in this work, we identify a vulnerability that enables models to be trained in both directions and on different tasks. Adversaries can exploit this capability to hide rogue models within seemingly legitimate models. In addition, in this work we show that neural networks can be taught to systematically memorize and retrie… ▽ More Deep neural networks are normally executed in the forward direction. However, in this work, we identify a vulnerability that enables models to be trained in both directions and on different tasks. Adversaries can exploit this capability to hide rogue models within seemingly legitimate models. In addition, in this work we show that neural networks can be taught to systematically memorize and retrieve specific samples from datasets. Together, these findings expose a novel method in which adversaries can exfiltrate datasets from protected learning environments under the guise of legitimate models. We focus on the data exfiltration attack and show that modern architectures can be used to secretly exfiltrate tens of thousands of samples with high fidelity, high enough to compromise data privacy and even train new models. Moreover, to mitigate this threat we propose a novel approach for detecting infected models. △ Less

Submitted 17 May, 2024; v1 submitted 13 November, 2023; originally announced November 2023.

Comments: NDSS24 paper, Transpose Attack, Transposed Model. NDSS version: https://www.ndss-symposium.org/ndss-paper/transpose-attack-stealing-datasets-with-bidirectional-training/

arXiv:2311.00613 [pdf, other]

Controllable Music Production with Diffusion Models and Guidance Gradients

Authors: Mark Levy, Bruno Di Giorgi, Floris Weers, Angelos Katharopoulos, Tom Nickson

Abstract: We demonstrate how conditional generation from diffusion models can be used to tackle a variety of realistic tasks in the production of music in 44.1kHz stereo audio with sampling-time guidance. The scenarios we consider include continuation, inpainting and regeneration of musical audio, the creation of smooth transitions between two different music tracks, and the transfer of desired stylistic ch… ▽ More We demonstrate how conditional generation from diffusion models can be used to tackle a variety of realistic tasks in the production of music in 44.1kHz stereo audio with sampling-time guidance. The scenarios we consider include continuation, inpainting and regeneration of musical audio, the creation of smooth transitions between two different music tracks, and the transfer of desired stylistic characteristics to existing audio clips. We achieve this by applying guidance at sampling time in a simple framework that supports both reconstruction and classification losses, or any combination of the two. This approach ensures that generated audio can match its surrounding context, or conform to a class distribution or latent representation specified relative to any suitable pre-trained classifier or embedding model. Audio samples are available at https://machinelearning.apple.com/research/controllable-music △ Less

Submitted 5 December, 2023; v1 submitted 1 November, 2023; originally announced November 2023.

arXiv:2310.18360 [pdf, other]

Guiding LLM to Fool Itself: Automatically Manipulating Machine Reading Comprehension Shortcut Triggers

Authors: Mosh Levy, Shauli Ravfogel, Yoav Goldberg

Abstract: Recent applications of LLMs in Machine Reading Comprehension (MRC) systems have shown impressive results, but the use of shortcuts, mechanisms triggered by features spuriously correlated to the true label, has emerged as a potential threat to their reliability. We analyze the problem from two angles: LLMs as editors, guided to edit text to mislead LLMs; and LLMs as readers, who answer questions ba… ▽ More Recent applications of LLMs in Machine Reading Comprehension (MRC) systems have shown impressive results, but the use of shortcuts, mechanisms triggered by features spuriously correlated to the true label, has emerged as a potential threat to their reliability. We analyze the problem from two angles: LLMs as editors, guided to edit text to mislead LLMs; and LLMs as readers, who answer questions based on the edited text. We introduce a framework that guides an editor to add potential shortcuts-triggers to samples. Using GPT4 as the editor, we find it can successfully edit trigger shortcut in samples that fool LLMs. Analysing LLMs as readers, we observe that even capable LLMs can be deceived using shortcut knowledge. Strikingly, we discover that GPT4 can be deceived by its own edits (15% drop in F1). Our findings highlight inherent vulnerabilities of LLMs to shortcut manipulations. We publish ShortcutQA, a curated dataset generated by our framework for future research. △ Less

Submitted 24 October, 2023; originally announced October 2023.

Comments: Accepted to EMNLP 2023 Findings

arXiv:2310.01974 [pdf, other]

Optical excitation of multiple standing spin modes in 3D optomagnonic nanocavities

Authors: Daria O. Ignatyeva, Denis M. Krichevsky, Dolendra Karki, Anton Kolosvetov, Polina E. Zimnyakova, Alexander N. Shaposhnikov, Vladimir N. Berzhansky, Miguel Levy, Alexander I. Chernov, Vladimir I. Belotelov

Abstract: We report the first experimental observation of multiple standing spin modes in 3D optomagnonic nanocavity formed by nanometer-sized iron-garnet nanocylinder. We show that launching of standing spin modes is achieved due to a high confinement of the optically generated effective magnetic field caused by the localized optical resonance. Quantization and spin-wave mode inhomogeneity is achieved in e… ▽ More We report the first experimental observation of multiple standing spin modes in 3D optomagnonic nanocavity formed by nanometer-sized iron-garnet nanocylinder. We show that launching of standing spin modes is achieved due to a high confinement of the optically generated effective magnetic field caused by the localized optical resonance. Quantization and spin-wave mode inhomogeneity is achieved in each of the three spatial dimensions. The presented approach opens new horizons of 3D optomagnonics by combining nanophotonic and magnonic functionalities within a single nanocavity. △ Less

Submitted 3 October, 2023; originally announced October 2023.

arXiv:2309.15901 [pdf, ps, other]

A Modular $SU(5)$ Littlest Seesaw

Authors: Ivo de Medeiros Varzielas, Steve F. King, Miguel Levy

Abstract: We extend the littlest modular seesaw to a Grand Unified scenario based on $SU(5)$ endowed with three modular $S_4$ symmetries. We leverage symmetry protected zeroes in the leptonic and down quark sectors to suppress deviations to the littlest modular seesaw predictions, but not contributions to the quark mixing. The model is supplemented by two weighton fields, such that the hierarchical nature o… ▽ More We extend the littlest modular seesaw to a Grand Unified scenario based on $SU(5)$ endowed with three modular $S_4$ symmetries. We leverage symmetry protected zeroes in the leptonic and down quark sectors to suppress deviations to the littlest modular seesaw predictions, but not contributions to the quark mixing. The model is supplemented by two weighton fields, such that the hierarchical nature of the charged-lepton masses, as well as the quark masses and mixing, stem from the content and symmetries of the model, rather than a hierarchical nature of the Yukawa coefficients. △ Less

Submitted 27 September, 2023; originally announced September 2023.

Comments: 19 pages

arXiv:2307.14410 [pdf, other]

Quarks at the modular $S_4$ cusp

Authors: I. de Medeiros Varzielas, M. Levy, J. T. Penedo, S. T. Petcov

Abstract: We analyse the possibility of describing quark masses, mixing and CP violation in $S'_4$ modular flavour models without flavons. We focus on the case where the closeness of the modulus to the point of residual $\mathbb{Z}^{ST}_3$ symmetry (the cusp) plays a role in generating quark mass hierarchies and discuss the role modular form normalisations play in such constructions. We find that fitting qu… ▽ More We analyse the possibility of describing quark masses, mixing and CP violation in $S'_4$ modular flavour models without flavons. We focus on the case where the closeness of the modulus to the point of residual $\mathbb{Z}^{ST}_3$ symmetry (the cusp) plays a role in generating quark mass hierarchies and discuss the role modular form normalisations play in such constructions. We find that fitting quark data requires explicit CP breaking, unless a second modulus is introduced. △ Less

Submitted 26 July, 2023; originally announced July 2023.

Comments: 45 pages, 1 figure, 10 tables

Report number: SISSA 11/2023/FISI, CFTP/23-002

arXiv:2307.04108 [pdf, other]

Asynchronous Proportional Response Dynamics in Markets with Adversarial Scheduling

Authors: Yoav Kolumbus, Menahem Levy, Noam Nisan

Abstract: We study Proportional Response Dynamics (PRD) in linear Fisher markets where participants act asynchronously. We model this scenario as a sequential process in which in every step, an adversary selects a subset of the players that will update their bids, subject to liveness constraints. We show that if every bidder individually uses the PRD update rule whenever they are included in the group of bi… ▽ More We study Proportional Response Dynamics (PRD) in linear Fisher markets where participants act asynchronously. We model this scenario as a sequential process in which in every step, an adversary selects a subset of the players that will update their bids, subject to liveness constraints. We show that if every bidder individually uses the PRD update rule whenever they are included in the group of bidders selected by the adversary, then (in the generic case) the entire dynamic converges to a competitive equilibrium of the market. Our proof technique uncovers further properties of linear Fisher markets, such as the uniqueness of the equilibrium for generic parameters and the convergence of associated best-response dynamics and no-swap regret dynamics under certain conditions. △ Less

Submitted 15 January, 2024; v1 submitted 9 July, 2023; originally announced July 2023.

arXiv:2306.05244 [pdf, other]

Spectral-temporal-spatial customization via modulating multimodal nonlinear pulse propagation

Authors: Tong Qiu, Honghao Cao, Kunzan Liu, Li-Yu Yu, Manuel Levy, Eva Lendaro, Fan Wang, Sixian You

Abstract: Multimode fibers (MMFs) have recently reemerged as attractive avenues for nonlinear effects due to their high-dimensional spatiotemporal nonlinear dynamics and scalability for high power. High-brightness MMF sources with effective control of the nonlinear processes would offer new possibilities for a wide range of applications from high-power fiber lasers, to bioimaging and chemical sensing, and t… ▽ More Multimode fibers (MMFs) have recently reemerged as attractive avenues for nonlinear effects due to their high-dimensional spatiotemporal nonlinear dynamics and scalability for high power. High-brightness MMF sources with effective control of the nonlinear processes would offer new possibilities for a wide range of applications from high-power fiber lasers, to bioimaging and chemical sensing, and to novel physics phenomena. Here we present a simple yet effective way of controlling nonlinear effects at high peak power levels: by leveraging not only the spatial but also the temporal degrees of freedom of the multimodal nonlinear pulse propagation in step-index MMFs using a programmable fiber shaper. This method represents the first method that enables modulation and optimization of multimodal nonlinear pulse propagation, achieving high tunability and broadband high peak power. Its potential as a nonlinear imaging source is further demonstrated by applying the MMF source to multiphoton microscopy, where widely tunable two-photon and three-photon imaging is achieved with adaptive optimization. These demonstrations highlight the effectiveness of directly modulating multimodal nonlinear pulse propagation to enhance the high-dimensional customization and optimize the high spectral brightness of MMF output. These advancements provide new possibilities for technology advances in nonlinear optics, bioimaging, spectroscopy, optical computing, and material processing. △ Less

Submitted 5 December, 2023; v1 submitted 8 June, 2023; originally announced June 2023.

Comments: 36 pages, 19 figures

arXiv:2305.20062 [pdf, other]

Chatting Makes Perfect: Chat-based Image Retrieval

Authors: Matan Levy, Rami Ben-Ari, Nir Darshan, Dani Lischinski

Abstract: Chats emerge as an effective user-friendly approach for information retrieval, and are successfully employed in many domains, such as customer service, healthcare, and finance. However, existing image retrieval approaches typically address the case of a single query-to-image round, and the use of chats for image retrieval has been mostly overlooked. In this work, we introduce ChatIR: a chat-based… ▽ More Chats emerge as an effective user-friendly approach for information retrieval, and are successfully employed in many domains, such as customer service, healthcare, and finance. However, existing image retrieval approaches typically address the case of a single query-to-image round, and the use of chats for image retrieval has been mostly overlooked. In this work, we introduce ChatIR: a chat-based image retrieval system that engages in a conversation with the user to elicit information, in addition to an initial query, in order to clarify the user's search intent. Motivated by the capabilities of today's foundation models, we leverage Large Language Models to generate follow-up questions to an initial image description. These questions form a dialog with the user in order to retrieve the desired image from a large corpus. In this study, we explore the capabilities of such a system tested on a large dataset and reveal that engaging in a dialog yields significant gains in image retrieval. We start by building an evaluation pipeline from an existing manually generated dataset and explore different modules and training strategies for ChatIR. Our comparison includes strong baselines derived from related applications trained with Reinforcement Learning. Our system is capable of retrieving the target image from a pool of 50K images with over 78% success rate after 5 dialogue rounds, compared to 75% when questions are asked by humans, and 64% for a single shot text-to-image retrieval. Extensive evaluations reveal the strong capabilities and examine the limitations of CharIR under different settings. Project repository is available at https://github.com/levymsn/ChatIR. △ Less

Submitted 5 October, 2023; v1 submitted 31 May, 2023; originally announced May 2023.

Comments: Camera Ready version for NeurIPS 2023

arXiv:2305.14763 [pdf, other]

Clever Hans or Neural Theory of Mind? Stress Testing Social Reasoning in Large Language Models

Authors: Natalie Shapira, Mosh Levy, Seyed Hossein Alavi, Xuhui Zhou, Yejin Choi, Yoav Goldberg, Maarten Sap, Vered Shwartz

Abstract: The escalating debate on AI's capabilities warrants developing reliable metrics to assess machine "intelligence". Recently, many anecdotal examples were used to suggest that newer large language models (LLMs) like ChatGPT and GPT-4 exhibit Neural Theory-of-Mind (N-ToM); however, prior work reached conflicting conclusions regarding those abilities. We investigate the extent of LLMs' N-ToM through a… ▽ More The escalating debate on AI's capabilities warrants developing reliable metrics to assess machine "intelligence". Recently, many anecdotal examples were used to suggest that newer large language models (LLMs) like ChatGPT and GPT-4 exhibit Neural Theory-of-Mind (N-ToM); however, prior work reached conflicting conclusions regarding those abilities. We investigate the extent of LLMs' N-ToM through an extensive evaluation on 6 tasks and find that while LLMs exhibit certain N-ToM abilities, this behavior is far from being robust. We further examine the factors impacting performance on N-ToM tasks and discover that LLMs struggle with adversarial examples, indicating reliance on shallow heuristics rather than robust ToM abilities. We caution against drawing conclusions from anecdotal examples, limited benchmark testing, and using human-designed psychological tests to evaluate models. △ Less

Submitted 24 May, 2023; originally announced May 2023.

arXiv:2303.09429 [pdf, other]

Data Roaming and Quality Assessment for Composed Image Retrieval

Authors: Matan Levy, Rami Ben-Ari, Nir Darshan, Dani Lischinski

Abstract: The task of Composed Image Retrieval (CoIR) involves queries that combine image and text modalities, allowing users to express their intent more effectively. However, current CoIR datasets are orders of magnitude smaller compared to other vision and language (V&L) datasets. Additionally, some of these datasets have noticeable issues, such as queries containing redundant modalities. To address thes… ▽ More The task of Composed Image Retrieval (CoIR) involves queries that combine image and text modalities, allowing users to express their intent more effectively. However, current CoIR datasets are orders of magnitude smaller compared to other vision and language (V&L) datasets. Additionally, some of these datasets have noticeable issues, such as queries containing redundant modalities. To address these shortcomings, we introduce the Large Scale Composed Image Retrieval (LaSCo) dataset, a new CoIR dataset which is ten times larger than existing ones. Pre-training on our LaSCo, shows a noteworthy improvement in performance, even in zero-shot. Furthermore, we propose a new approach for analyzing CoIR datasets and methods, which detects modality redundancy or necessity, in queries. We also introduce a new CoIR baseline, the Cross-Attention driven Shift Encoder (CASE). This baseline allows for early fusion of modalities using a cross-attention module and employs an additional auxiliary task during training. Our experiments demonstrate that this new baseline outperforms the current state-of-the-art methods on established benchmarks like FashionIQ and CIRR. △ Less

Submitted 20 December, 2023; v1 submitted 16 March, 2023; originally announced March 2023.

Comments: Camera Ready version for AAAI 2024

arXiv:2301.00231 [pdf, other]

doi 10.1103/PhysRevD.107.055035

Democratic three Higgs-doublet models: the custodial limit and wrong-sign Yukawa

Authors: Dipankar Das, Miguel Levy, Palash B. Pal, Anugrah M. Prasad, Ipsita Saha, Ayushi Srivastava

Abstract: We study two novel aspects of democratic 3HDMs -- the custodial limit and the possibility of wrong-sign Yukawa couplings. In the custodial limit, the democratic 3HDMs can easily negotiate the constraints from the electroweak $T$-parameter. We also uncover the possibility of having wrong-sign Yukawa couplings in democratic 3HDMs, as in the case of 2HDMs. We show that a democratic 3HDM encompasses a… ▽ More We study two novel aspects of democratic 3HDMs -- the custodial limit and the possibility of wrong-sign Yukawa couplings. In the custodial limit, the democratic 3HDMs can easily negotiate the constraints from the electroweak $T$-parameter. We also uncover the possibility of having wrong-sign Yukawa couplings in democratic 3HDMs, as in the case of 2HDMs. We show that a democratic 3HDM encompasses all the wrong-sign possibilities entertained by 2HDMs, and has considerably more leeway in the wrong-sign limit as compared to the 2HDM case. Our study underscores the importance of reporting analysis in the kappa-formalism without any implicit assumptions on the signs of the kappas. △ Less

Submitted 2 March, 2023; v1 submitted 31 December, 2022; originally announced January 2023.

Comments: 22 pages, 3 captioned figures. Version accepted for publication in Phys. Rev. D

arXiv:2211.15700 [pdf, other]

doi 10.1140/epjc/s10052-023-11654-0

Revisiting the Universal Texture Zero of Flavour: a Markov Chain Monte Carlo Analysis

Authors: Jordan Bernigaud, Ivo de Medeiros Varzielas, Miguel Levy, Jim Talbert

Abstract: We revisit the phenomenological predictions of the Universal Texture Zero (UTZ) model of flavour originally presented in arXiv:1710.01741, and update them in light of both improved experimental constraints and numerical analysis techniques. In particular, we have developed an in-house Markov Chain Monte Carlo (MCMC) algorithm to exhaustively explore the UTZ's viable parameter space, considering bo… ▽ More We revisit the phenomenological predictions of the Universal Texture Zero (UTZ) model of flavour originally presented in arXiv:1710.01741, and update them in light of both improved experimental constraints and numerical analysis techniques. In particular, we have developed an in-house Markov Chain Monte Carlo (MCMC) algorithm to exhaustively explore the UTZ's viable parameter space, considering both leading- and next-to-leading contributions in the model's effective operator product expansion. We also extract -- for the first time -- reliable UTZ predictions for the (poorly constrained) leptonic CP-violating phases, and ratio observables that characterize neutrino masses probed by (e.g.) oscillation, $β$-decay, and cosmological processes. We therefore dramatically improve on the proof-in-principle phenomenological analysis originally presented in arXiv:1710.01741, and ultimately show that the UTZ remains a minimal, viable, and appealing theory of flavour. Our results also further demonstrate the potential of robustly examining multi-parameter flavour models with MCMC routines. △ Less

Submitted 21 August, 2023; v1 submitted 28 November, 2022; originally announced November 2022.

Comments: 33 pages, 6 tables, and 6 figures. Paper dedicated to Prof. Graham G. Ross. Matches published version

Journal ref: Eur.Phys.J.C 83 (2023) 6, 479

arXiv:2211.00654 [pdf, other]

doi 10.1007/JHEP02(2023)143

Littlest Modular Seesaw

Authors: Ivo de Medeiros Varzielas, Steve F. King, Miguel Levy

Abstract: We present the first complete model of the Littlest Modular Seesaw, based on two right-handed neutrinos, within the framework of multiple modular symmetries, justifying the use of multiple moduli fields which take their values at 3 specific stabilizers of $Γ_4 \simeq S_4$, including a new phenomenological possibility. Using a semi-analytical approach, we perform a $χ^2$ analysis of each case and s… ▽ More We present the first complete model of the Littlest Modular Seesaw, based on two right-handed neutrinos, within the framework of multiple modular symmetries, justifying the use of multiple moduli fields which take their values at 3 specific stabilizers of $Γ_4 \simeq S_4$, including a new phenomenological possibility. Using a semi-analytical approach, we perform a $χ^2$ analysis of each case and show that good agreement with neutrino oscillation data is obtained, including predictive relations between the leptonic mixing angles and the ratio of light neutrino masses, which non-trivially agree with the experimental values. It is noteworthy that in this very predictive setup, the models fit the global fits of the experimental data remarkably well, both with and without the Super-Kamiokande atmospheric data, for both choices of stabilizers. By extending the model to include a weighton and the double cover group $Γ'_4 \simeq S'_4$, we are able to also account for the hierarchy of the charged leptons using modular symmetries, without altering the neutrino predictions. △ Less

Submitted 17 November, 2022; v1 submitted 1 November, 2022; originally announced November 2022.

Comments: 19 pages, 2 Figures. v2: minor improvements to the presentation

arXiv:2208.12782 [pdf, other]

Mel Spectrogram Inversion with Stable Pitch

Authors: Bruno Di Giorgi, Mark Levy, Richard Sharp

Abstract: Vocoders are models capable of transforming a low-dimensional spectral representation of an audio signal, typically the mel spectrogram, to a waveform. Modern speech generation pipelines use a vocoder as their final component. Recent vocoder models developed for speech achieve a high degree of realism, such that it is natural to wonder how they would perform on music signals. Compared to speech, t… ▽ More Vocoders are models capable of transforming a low-dimensional spectral representation of an audio signal, typically the mel spectrogram, to a waveform. Modern speech generation pipelines use a vocoder as their final component. Recent vocoder models developed for speech achieve a high degree of realism, such that it is natural to wonder how they would perform on music signals. Compared to speech, the heterogeneity and structure of the musical sound texture offers new challenges. In this work we focus on one specific artifact that some vocoder models designed for speech tend to exhibit when applied to music: the perceived instability of pitch when synthesizing sustained notes. We argue that the characteristic sound of this artifact is due to the lack of horizontal phase coherence, which is often the result of using a time-domain target space with a model that is invariant to time-shifts, such as a convolutional neural network. We propose a new vocoder model that is specifically designed for music. Key to improving the pitch stability is the choice of a shift-invariant target space that consists of the magnitude spectrum and the phase gradient. We discuss the reasons that inspired us to re-formulate the vocoder task, outline a working example, and evaluate it on musical signals. Our method results in 60% and 10% improved reconstruction of sustained notes and chords with respect to existing models, using a novel harmonic error metric. △ Less

Submitted 26 August, 2022; originally announced August 2022.

Comments: 7 pages, 5 figures, Proceedings of the 23st International Society for Music Information Retrieval Conference, ISMIR 2022

arXiv:2208.10878 [pdf, other]

Transferability Ranking of Adversarial Examples

Authors: Mosh Levy, Guy Amit, Yuval Elovici, Yisroel Mirsky

Abstract: Adversarial transferability in black-box scenarios presents a unique challenge: while attackers can employ surrogate models to craft adversarial examples, they lack assurance on whether these examples will successfully compromise the target model. Until now, the prevalent method to ascertain success has been trial and error-testing crafted samples directly on the victim model. This approach, howev… ▽ More Adversarial transferability in black-box scenarios presents a unique challenge: while attackers can employ surrogate models to craft adversarial examples, they lack assurance on whether these examples will successfully compromise the target model. Until now, the prevalent method to ascertain success has been trial and error-testing crafted samples directly on the victim model. This approach, however, risks detection with every attempt, forcing attackers to either perfect their first try or face exposure. Our paper introduces a ranking strategy that refines the transfer attack process, enabling the attacker to estimate the likelihood of success without repeated trials on the victim's system. By leveraging a set of diverse surrogate models, our method can predict transferability of adversarial examples. This strategy can be used to either select the best sample to use in an attack or the best perturbation to apply to a specific sample. Using our strategy, we were able to raise the transferability of adversarial examples from a mere 20% - akin to random selection-up to near upper-bound levels, with some scenarios even witnessing a 100% success rate. This substantial improvement not only sheds light on the shared susceptibilities across diverse architectures but also demonstrates that attackers can forego the detectable trial-and-error tactics raising increasing the threat of surrogate-based attacks. △ Less

Submitted 18 April, 2024; v1 submitted 23 August, 2022; originally announced August 2022.

arXiv:2207.08169 [pdf, other]

Ethnic Representation Analysis of Commercial Movie Posters

Authors: Dima Kagan, Mor Levy, Michael Fire, Galit Fuhrmann Alpert

Abstract: In the last decades, global awareness towards the importance of diverse representation has been increasing. Lack of diversity and discrimination toward minorities did not skip the film industry. Here, we examine ethnic bias in the film industry through commercial posters, the industry's primary advertisement medium for decades. Movie posters are designed to establish the viewer's initial impressio… ▽ More In the last decades, global awareness towards the importance of diverse representation has been increasing. Lack of diversity and discrimination toward minorities did not skip the film industry. Here, we examine ethnic bias in the film industry through commercial posters, the industry's primary advertisement medium for decades. Movie posters are designed to establish the viewer's initial impression. We developed a novel approach for evaluating ethnic bias in the film industry by analyzing nearly 125,000 posters using state-of-the-art deep learning models. Our analysis shows that while ethnic biases still exist, there is a trend of reduction of bias, as seen by several parameters. Particularly in English-speaking movies, the ethnic distribution of characters on posters from the last couple of years is reaching numbers that are approaching the actual ethnic composition of US population. An automatic approach to monitor ethnic diversity in the film industry, potentially integrated with financial value, may be of significant use for producers and policymakers. △ Less

Submitted 17 July, 2022; originally announced July 2022.

arXiv:2207.03855 [pdf, other]

doi 10.1146/annurev-physchem-062422-013259

Predictive Power of the Exact Constraints and Appropriate Norms in Density Functional Theory

Authors: Aaron D. Kaplan, Mel Levy, John P. Perdew

Abstract: Ground-state Kohn-Sham density functional theory provides, in principle, the exact ground-state energy and electronic spin-densities of real interacting electrons in a static external potential. In practice, the exact density functional for the exchange-correlation (xc) energy must be approximated in a computationally efficient way. About twenty mathematical properties of the exact xc functional a… ▽ More Ground-state Kohn-Sham density functional theory provides, in principle, the exact ground-state energy and electronic spin-densities of real interacting electrons in a static external potential. In practice, the exact density functional for the exchange-correlation (xc) energy must be approximated in a computationally efficient way. About twenty mathematical properties of the exact xc functional are known. In this work, we review and discuss these known constraints on the xc energy and hole. By analyzing a sequence of increasingly sophisticated density functional approximations (DFAs), we argue that: (1) the satisfaction of more exact constraints and appropriate norms makes a functional more predictive over the immense space of many-electron systems; (2) fitting to bonded systems yields an interpolative DFA that may not extrapolate well to systems unlike those in the fitting set. We discuss how the class of well-described systems has grown along with constraint satisfaction, and the possibilities for future functional development. △ Less

Submitted 8 July, 2022; originally announced July 2022.

Comments: When citing this paper, please use the following: Kaplan AD, Levy M, Perdew JP. 2023. Predictive power of the exact constraints and approximate norms in density functional theory. Annu. Rev. Phys. Chem. 74. Submitted. DOI: 10.1146/annurev-physchem-062422-013259

arXiv:2206.11372 [pdf, other]

doi 10.1063/5.0105684

Can the Hartree-Fock kinetic energy exceed the exact kinetic energy?

Authors: Steven Crisostomo, Mel Levy, Kieron Burke

Abstract: The Hartree-Fock (HF) approximation has been an important tool for quantum-chemical calculations since its earliest appearance in the late 1920s, and remains the starting point of most single-reference methods in use today. Intuition suggests that the HF kinetic energy should not exceed the exact kinetic energy, but no proof of this conjecture exists, despite a near century of development. Beginni… ▽ More The Hartree-Fock (HF) approximation has been an important tool for quantum-chemical calculations since its earliest appearance in the late 1920s, and remains the starting point of most single-reference methods in use today. Intuition suggests that the HF kinetic energy should not exceed the exact kinetic energy, but no proof of this conjecture exists, despite a near century of development. Beginning from a generalized virial theorem derived from scaling considerations, we derive a general expression for the kinetic energy difference that applies to all systems. For any atom or ion this trivially reduces to the well-known result that the total energy is the negative of the kinetic energy and since correlation energies are never positive, proves the conjecture in this case. Similar considerations apply to molecules at their equilibrium bond lengths. We use highly precise calculations on Hooke's atom (two electrons in a parabolic well) to test the conjecture in a non-trivial case, and to parameterize the difference between density-functional and HF quantities, but find no violations of the conjecture. △ Less

Submitted 22 February, 2023; v1 submitted 22 June, 2022; originally announced June 2022.

Comments: 7 pages, 6 figures; final edits

arXiv:2201.08661 [pdf, other]

The Security of Deep Learning Defences for Medical Imaging

Authors: Moshe Levy, Guy Amit, Yuval Elovici, Yisroel Mirsky

Abstract: Deep learning has shown great promise in the domain of medical image analysis. Medical professionals and healthcare providers have been adopting the technology to speed up and enhance their work. These systems use deep neural networks (DNN) which are vulnerable to adversarial samples; images with imperceivable changes that can alter the model's prediction. Researchers have proposed defences which… ▽ More Deep learning has shown great promise in the domain of medical image analysis. Medical professionals and healthcare providers have been adopting the technology to speed up and enhance their work. These systems use deep neural networks (DNN) which are vulnerable to adversarial samples; images with imperceivable changes that can alter the model's prediction. Researchers have proposed defences which either make a DNN more robust or detect the adversarial samples before they do harm. However, none of these works consider an informed attacker which can adapt to the defence mechanism. We show that an informed attacker can evade five of the current state of the art defences while successfully fooling the victim's deep learning model, rendering these defences useless. We then suggest better alternatives for securing healthcare DNNs from such attacks: (1) harden the system's security and (2) use digital signatures. △ Less

Submitted 21 January, 2022; originally announced January 2022.

arXiv:2111.14792 [pdf, other]

Classification-Regression for Chart Comprehension

Authors: Matan Levy, Rami Ben-Ari, Dani Lischinski

Abstract: Chart question answering (CQA) is a task used for assessing chart comprehension, which is fundamentally different from understanding natural images. CQA requires analyzing the relationships between the textual and the visual components of a chart, in order to answer general questions or infer numerical values. Most existing CQA datasets and models are based on simplifying assumptions that often en… ▽ More Chart question answering (CQA) is a task used for assessing chart comprehension, which is fundamentally different from understanding natural images. CQA requires analyzing the relationships between the textual and the visual components of a chart, in order to answer general questions or infer numerical values. Most existing CQA datasets and models are based on simplifying assumptions that often enable surpassing human performance. In this work, we address this outcome and propose a new model that jointly learns classification and regression. Our language-vision setup uses co-attention transformers to capture the complex real-world interactions between the question and the textual elements. We validate our design with extensive experiments on the realistic PlotQA dataset, outperforming previous approaches by a large margin, while showing competitive performance on FigureQA. Our model is particularly well suited for realistic questions with out-of-vocabulary answers that require regression. △ Less

Submitted 11 July, 2022; v1 submitted 29 November, 2021; originally announced November 2021.

Comments: ECCV 2022

arXiv:2110.01372 [pdf, other]

doi 10.1016/j.apnum.2024.03.014

Legendre Expansions of Products of Functions with Applications to Nonlinear Partial Differential Equations

Authors: Rabia Djellouli, David Klein, Matthew Levy

Abstract: Given the Fourier-Legendre expansions of $f$ and $g$, and mild conditions on $f$ and $g$, we derive the Fourier-Legendre expansion of their product in terms of their corresponding Fourier-Legendre coefficients. In this way, expansions of whole number powers of $f$ may be obtained. We establish upper bounds on rates of convergence. We then employ these expansions to solve semi-analytically a class… ▽ More Given the Fourier-Legendre expansions of $f$ and $g$, and mild conditions on $f$ and $g$, we derive the Fourier-Legendre expansion of their product in terms of their corresponding Fourier-Legendre coefficients. In this way, expansions of whole number powers of $f$ may be obtained. We establish upper bounds on rates of convergence. We then employ these expansions to solve semi-analytically a class of nonlinear PDEs with a polynomial nonlinearity of degree 2. The obtained numerical results illustrate the efficiency and performance accuracy of this Fourier-Legendre based solution methodology for solving an important class of nonlinear PDEs. △ Less

Submitted 24 March, 2024; v1 submitted 18 September, 2021; originally announced October 2021.

Comments: 38 pages, 25 figures

MSC Class: 42C10; 41A25; 65L06; 65N35; 40-08

Journal ref: Applied Numerical Mathematics, Vol. 201, p. 301-321 (2024)

arXiv:2108.07056 [pdf, other]

Magneto-optics of the 2D iron-garnet nanocylinder array with localized and lattice modes

Authors: Polina E. Zimnyakova, Daria O. Ignatyeva, Dolendra Karki, Andrey A. Voronov, Alexander N. Shaposhnikov, Vladimir N. Berzhansky, Miguel Levy, Vladimir I. Belotelov

Abstract: We experimentally show the enhancement of the Faraday and transverse magneto-optical Kerr effects in the two-dimensional arrays of nanocylinders made of bismuth-substituted iron-garnet and supporting both localized and lattice modes. Simultaneous excitation of these modes makes it possible to increase the Faraday rotation by 3 times and TMOKE by an order of magnitude compared to the smooth magneti… ▽ More We experimentally show the enhancement of the Faraday and transverse magneto-optical Kerr effects in the two-dimensional arrays of nanocylinders made of bismuth-substituted iron-garnet and supporting both localized and lattice modes. Simultaneous excitation of these modes makes it possible to increase the Faraday rotation by 3 times and TMOKE by an order of magnitude compared to the smooth magnetic film of the equal effective thickness. Both magneto-optical effects are enhanced in wide spectral and angular ranges making the nanocylinder array magnetic dielectric structures promising for applications with short and tightly-focused laser pulses. △ Less

Submitted 17 August, 2021; v1 submitted 16 August, 2021; originally announced August 2021.

arXiv:2108.04479 [pdf, other]

Scalable Reverse Image Search Engine for NASAWorldview

Authors: Abhigya Sodani, Michael Levy, Anirudh Koul, Meher Anand Kasam, Siddha Ganju

Abstract: Researchers often spend weeks sifting through decades of unlabeled satellite imagery(on NASA Worldview) in order to develop datasets on which they can start conducting research. We developed an interactive, scalable and fast image similarity search engine (which can take one or more images as the query image) that automatically sifts through the unlabeled dataset reducing dataset generation time f… ▽ More Researchers often spend weeks sifting through decades of unlabeled satellite imagery(on NASA Worldview) in order to develop datasets on which they can start conducting research. We developed an interactive, scalable and fast image similarity search engine (which can take one or more images as the query image) that automatically sifts through the unlabeled dataset reducing dataset generation time from weeks to minutes. In this work, we describe key components of the end to end pipeline. Our similarity search system was created to be able to identify similar images from a potentially petabyte scale database that are similar to an input image, and for this we had to break down each query image into its features, which were generated by a classification layer stripped CNN trained in a supervised manner. To store and search these features efficiently, we had to make several scalability improvements. To improve the speed, reduce the storage, and shrink memory requirements for embedding search, we add a fully connected layer to our CNN make all images into a 128 length vector before entering the classification layers. This helped us compress the size of our image features from 2048 (for ResNet, which was initially tried as our featurizer) to 128 for our new custom model. Additionally, we utilize existing approximate nearest neighbor search libraries to significantly speed up embedding search. Our system currently searches over our entire database of images at 5 seconds per query on a single virtual machine in the cloud. In the future, we would like to incorporate a SimCLR based featurizing model which could be trained without any labelling by a human (since the classification aspect of the model is irrelevant to this use case). △ Less

Submitted 10 August, 2021; originally announced August 2021.

Comments: 7 pages, Published at COSPAR 2021, 6 figures

arXiv:2107.08227 [pdf, other]

doi 10.1140/epjc/s10052-021-09681-w

Exploring multi-Higgs models with softly broken large discrete symmetry groups

Authors: Ivo de Medeiros Varzielas, Igor P. Ivanov, Miguel Levy

Abstract: We develop methods to study the scalar sector of multi-Higgs models with large discrete symmetry groups that are softly broken. While in the exact symmetry limit, the model has very few parameters and can be studied analytically, proliferation of quadratic couplings in the most general softly broken case makes the analysis cumbersome. We identify two sets of soft breaking terms which play differen… ▽ More We develop methods to study the scalar sector of multi-Higgs models with large discrete symmetry groups that are softly broken. While in the exact symmetry limit, the model has very few parameters and can be studied analytically, proliferation of quadratic couplings in the most general softly broken case makes the analysis cumbersome. We identify two sets of soft breaking terms which play different roles: those which preserve the symmetric vacuum expectation value alignment, and the remaining terms which shift it. Focusing on alignment preserving terms, we check which structural features of the symmetric parent model are conserved and which are modified. We find remarkable examples of structural features which are inherited from the parent symmetric model and which persist even when no exact symmetry is left. The general procedure is illustrated with the example of the three-Higgs-doublet model with the softly broken symmetry group $Σ(36)$. △ Less

Submitted 21 October, 2021; v1 submitted 17 July, 2021; originally announced July 2021.

Comments: 21 pages, 2 figures; v2: minor updates, extra appendix and references, matches the published version

Journal ref: Eur. Phys. J. C81, 918 (2021)

arXiv:2107.03756 [pdf, other]

doi 10.1140/epjc/s10052-022-10125-2

Diluting quark flavor hierarchies using dihedral symmetry

Authors: Ayushi Srivastava, Miguel Levy, Dipankar Das

Abstract: We present a $D_4$ flavored extension of the SM which provides an intuitive reasoning for the masses and mixing patterns in the quark sector. In our model, the Cabibbo mixing angle stems purely from the scalar sector dynamics. In fact, the orders of magnitude of the CKM matrix elements are readily obtained from the hierarchical nature of the vacuum expectation values. Moreover, we also show that t… ▽ More We present a $D_4$ flavored extension of the SM which provides an intuitive reasoning for the masses and mixing patterns in the quark sector. In our model, the Cabibbo mixing angle stems purely from the scalar sector dynamics. In fact, the orders of magnitude of the CKM matrix elements are readily obtained from the hierarchical nature of the vacuum expectation values. Moreover, we also show that the smallness of the off-Cabibbo elements in the CKM matrix is strongly connected to the heaviness of the third generation of quarks. △ Less

Submitted 14 February, 2022; v1 submitted 8 July, 2021; originally announced July 2021.

Comments: 13 pages, 3 figures

arXiv:2106.09714 [pdf, other]

No-frills Dynamic Planning using Static Planners

Authors: Mara Levy, Vasista Ayyagari, Abhinav Shrivastava

Abstract: In this paper, we address the task of interacting with dynamic environments where the changes in the environment are independent of the agent. We study this through the context of trapping a moving ball with a UR5 robotic arm. Our key contribution is an approach to utilize a static planner for dynamic tasks using a Dynamic Planning add-on; that is, if we can successfully solve a task with a static… ▽ More In this paper, we address the task of interacting with dynamic environments where the changes in the environment are independent of the agent. We study this through the context of trapping a moving ball with a UR5 robotic arm. Our key contribution is an approach to utilize a static planner for dynamic tasks using a Dynamic Planning add-on; that is, if we can successfully solve a task with a static target, then our approach can solve the same task when the target is moving. Our approach has three key components: an off-the-shelf static planner, a trajectory forecasting network, and a network to predict robot's estimated time of arrival at any location. We demonstrate the generalization of our approach across environments. More information and videos at https://mlevy2525.github.io/DynamicAddOn. △ Less

Submitted 17 June, 2021; originally announced June 2021.

Comments: ICRA 2021

arXiv:2105.12163 [pdf]

Optimizing the location of vaccination sites to stop a zoonotic epidemic

Authors: Ricardo Castillo-Neyra, Bhaswar Bhattacharya, Aris Saxena, Brinkley Raynor, Elvis Diaz, Gian Franco Condori, Maria Rieders, Michael Z. Levy

Abstract: The mainstay of canine rabies control is fixed point mass dog vaccination campaigns (MDVC). However, in some regions, ideal vaccination coverage in dogs is not obtained due to low participation in the MDVC. Travel distance to the vaccination sites has been identified as an important barrier to participation. We aim to increase MDVC participation by optimally placing fixed point vaccination locatio… ▽ More The mainstay of canine rabies control is fixed point mass dog vaccination campaigns (MDVC). However, in some regions, ideal vaccination coverage in dogs is not obtained due to low participation in the MDVC. Travel distance to the vaccination sites has been identified as an important barrier to participation. We aim to increase MDVC participation by optimally placing fixed point vaccination locations to minimize walking distance to the nearest vaccination location. We quantified participation probability based on walking distance to the nearest vaccination point using a Poisson regression model. The regression was fit with survey data collected from 2016-2019. We then used a computational recursive interchange technique to solve the facility location problem to find a set of optimal placements of fixed point vaccination locations. Finally, we compared predicted participation of optimally placed vaccination sites to historical participation data from surveys collected from 2016-2019. We identified the p-median algorithm to solve the facility location problem as ideal for fixed point vaccination placement. We found a predicted increase in MDVC participation if vaccination locations are placed optimally. We also found a more even vaccination coverage with optimized vaccination sites; however, the workload in some optimized locations increased significantly. We developed a data-driven computational algorithm to combat an ongoing rabies epidemic by optimally using limited resources to maximize vaccination coverage. The main positive effects we expect if this algorithm is to be implemented would be increased overall vaccination coverage and increased spatial evenness of coverage. A potential negative effect could be the presence of long waiting lines as participation increases. △ Less

Submitted 25 May, 2021; originally announced May 2021.

Comments: 21 pages, 2 tables, 5 figures

arXiv:2104.08146 [pdf, other]

doi 10.1103/PhysRevD.104.075033

Prospects of light charged scalars in a three Higgs doublet model with $Z_3$ symmetry

Authors: Manimala Chakraborti, Dipankar Das, Miguel Levy, Samadrita Mukherjee, Ipsita Saha

Abstract: The stringent constraints from the direct searches for exotic scalars at the LHC as well as indirect bounds from flavor physics measurements have imposed severe restrictions on the parameter space of new physics models featuring extended Higgs sectors. In the Type-II 2HDM, this implies a lower bound on the charged Higgs masses of $\cal O$(600 GeV). In this work we analyze the phenomenology of a Z3… ▽ More The stringent constraints from the direct searches for exotic scalars at the LHC as well as indirect bounds from flavor physics measurements have imposed severe restrictions on the parameter space of new physics models featuring extended Higgs sectors. In the Type-II 2HDM, this implies a lower bound on the charged Higgs masses of $\cal O$(600 GeV). In this work we analyze the phenomenology of a Z3HDM in the alignment limit focusing on the impact of flavor physics constraints on its parameter space. We show that the couplings of the two charged Higgs bosons in this model feature an additional suppression factor compared to Type-II 2HDM. This gives rise to a significant relaxation of the flavor physics constraints in this model, allowing the charged Higgs masses to be as low as $\cal O$(200 GeV). We also consider the constraints coming from precision electroweak observables and the observed diphoton decay rate of the 125 GeV Higgs boson at the LHC. The bounds coming from the direct searches of nonstandard Higgs bosons at the LHC, particularly those from resonance searches in the ditau channel, prove to be very effective in constraining this scenario further. △ Less

Submitted 4 October, 2021; v1 submitted 16 April, 2021; originally announced April 2021.

Comments: 22 pages, 7 figures, Discussions on unitarity, stability and perturbativity of the scalar potential included, matches accepted PRD version

Report number: IPMU21-0026

arXiv:2102.02282 [pdf, other]

Downbeat Tracking with Tempo-Invariant Convolutional Neural Networks

Authors: Bruno Di Giorgi, Matthias Mauch, Mark Levy

Abstract: The human ability to track musical downbeats is robust to changes in tempo, and it extends to tempi never previously encountered. We propose a deterministic time-warping operation that enables this skill in a convolutional neural network (CNN) by allowing the network to learn rhythmic patterns independently of tempo. Unlike conventional deep learning approaches, which learn rhythmic patterns at th… ▽ More The human ability to track musical downbeats is robust to changes in tempo, and it extends to tempi never previously encountered. We propose a deterministic time-warping operation that enables this skill in a convolutional neural network (CNN) by allowing the network to learn rhythmic patterns independently of tempo. Unlike conventional deep learning approaches, which learn rhythmic patterns at the tempi present in the training dataset, the patterns learned in our model are tempo-invariant, leading to better tempo generalisation and more efficient usage of the network capacity. We test the generalisation property on a synthetic dataset created by rendering the Groove MIDI Dataset using FluidSynth, split into a training set containing the original performances and a test set containing tempo-scaled versions rendered with different SoundFonts (test-time augmentation). The proposed model generalises nearly perfectly to unseen tempi (F-measure of 0.89 on both training and test sets), whereas a comparable conventional CNN achieves similar accuracy only for the training set (0.89) and drops to 0.54 on the test set. The generalisation advantage of the proposed model extends to real music, as shown by results on the GTZAN and Ballroom datasets. △ Less

Submitted 3 February, 2021; originally announced February 2021.

Comments: 7 pages, 5 figures, Proceedings of the 21st International Society for Music Information Retrieval Conference, ISMIR 2020

Journal ref: Proceedings of the 21st International Society for Music Information Retrieval Conference (2020) 216-222

arXiv:2102.00179 [pdf, other]

Matching Representations of Explainable Artificial Intelligence and Eye Gaze for Human-Machine Interaction

Authors: Tiffany Hwu, Mia Levy, Steven Skorheim, David Huber

Abstract: Rapid non-verbal communication of task-based stimuli is a challenge in human-machine teaming, particularly in closed-loop interactions such as driving. To achieve this, we must understand the representations of information for both the human and machine, and determine a basis for bridging these representations. Techniques of explainable artificial intelligence (XAI) such as layer-wise relevance pr… ▽ More Rapid non-verbal communication of task-based stimuli is a challenge in human-machine teaming, particularly in closed-loop interactions such as driving. To achieve this, we must understand the representations of information for both the human and machine, and determine a basis for bridging these representations. Techniques of explainable artificial intelligence (XAI) such as layer-wise relevance propagation (LRP) provide visual heatmap explanations for high-dimensional machine learning techniques such as deep neural networks. On the side of human cognition, visual attention is driven by the bottom-up and top-down processing of sensory input related to the current task. Since both XAI and human cognition should focus on task-related stimuli, there may be overlaps between their representations of visual attention, potentially providing a means of nonverbal communication between the human and machine. In this work, we examine the correlations between LRP heatmap explanations of a neural network trained to predict driving behavior and eye gaze heatmaps of human drivers. The analysis is used to determine the feasibility of using such a technique for enhancing driving performance. We find that LRP heatmaps show increasing levels of similarity with eye gaze according to the task specificity of the neural network. We then propose how these findings may assist humans by visually directing attention towards relevant areas. To our knowledge, our work provides the first known analysis of LRP and eye gaze for driving tasks. △ Less

Submitted 30 January, 2021; originally announced February 2021.

arXiv:2012.03988 [pdf, other]

doi 10.1007/JHEP12(2021)176

Warm Inflation, Neutrinos and Dark matter: a minimal extension of the Standard Model

Authors: Miguel Levy, João G. Rosa, Luis B. Ventura

Abstract: We show that warm inflation can be realized within a minimal extension of the Standard Model with three right-handed neutrinos, three complex scalars and a gauged lepton/B-L U(1) symmetry. This simple model can address all the shortcomings of the Standard Model that are not related to fine-tuning, within general relativity, with distinctive experimental signatures that can be probed in the near fu… ▽ More We show that warm inflation can be realized within a minimal extension of the Standard Model with three right-handed neutrinos, three complex scalars and a gauged lepton/B-L U(1) symmetry. This simple model can address all the shortcomings of the Standard Model that are not related to fine-tuning, within general relativity, with distinctive experimental signatures that can be probed in the near future. The inflaton field emerges from the collective breaking of the U(1) symmetry, and interacts with two of the right-handed neutrinos, sustaining a high-temperature radiation bath during inflation. The discrete interchange symmetry of the model protects the scalar potential against large thermal corrections and leads to a stable inflaton remnant at late times which can account for dark matter. Consistency of the model and agreement with Cosmic Microwave Background observations naturally yield light neutrino masses below 0.1 eV, while thermal leptogenesis occurs naturally after a smooth exit from inflation into the radiation era. △ Less

Submitted 7 December, 2020; originally announced December 2020.

Comments: 43 pages (30 main + 13 appendices), 8 figures. Comments are welcome

arXiv:2012.02296 [pdf, other]

doi 10.1038/s41467-021-26529-9

Generative Capacity of Probabilistic Protein Sequence Models

Authors: Francisco McGee, Quentin Novinger, Ronald M. Levy, Vincenzo Carnevale, Allan Haldane

Abstract: Potts models and variational autoencoders (VAEs) have recently gained popularity as generative protein sequence models (GPSMs) to explore fitness landscapes and predict the effect of mutations. Despite encouraging results, quantitative characterization and comparison of GPSM-generated probability distributions is still lacking. It is currently unclear whether GPSMs can faithfully reproduce the com… ▽ More Potts models and variational autoencoders (VAEs) have recently gained popularity as generative protein sequence models (GPSMs) to explore fitness landscapes and predict the effect of mutations. Despite encouraging results, quantitative characterization and comparison of GPSM-generated probability distributions is still lacking. It is currently unclear whether GPSMs can faithfully reproduce the complex multi-residue mutation patterns observed in natural sequences arising due to epistasis. We develop a set of sequence statistics to assess the "generative capacity" of three GPSMs of recent interest: the pairwise Potts Hamiltonian, the VAE, and the site-independent model, using natural and synthetic datasets. We show that the generative capacity of the Potts Hamiltonian model is the largest, in that the higher order mutational statistics generated by the model agree with those observed for natural sequences. In contrast, we show that the VAE's generative capacity lies between the pairwise Potts and site-independent models. Importantly, our work measures GPSM generative capacity in terms of higher-order sequence covariation statistics which we have developed, and provides a new framework for evaluating and interpreting GPSM accuracy that emphasizes the role of epistasis. △ Less

Submitted 15 March, 2021; v1 submitted 3 December, 2020; originally announced December 2020.

arXiv:2009.05283 [pdf, other]

Fair and accurate age prediction using distribution aware data curation and augmentation

Authors: Yushi Cao, David Berend, Palina Tolmach, Guy Amit, Moshe Levy, Yang Liu, Asaf Shabtai, Yuval Elovici

Abstract: Deep learning-based facial recognition systems have experienced increased media attention due to exhibiting unfair behavior. Large enterprises, such as IBM, shut down their facial recognition and age prediction systems as a consequence. Age prediction is an especially difficult application with the issue of fairness remaining an open research problem (e.g., predicting age for different ethnicity e… ▽ More Deep learning-based facial recognition systems have experienced increased media attention due to exhibiting unfair behavior. Large enterprises, such as IBM, shut down their facial recognition and age prediction systems as a consequence. Age prediction is an especially difficult application with the issue of fairness remaining an open research problem (e.g., predicting age for different ethnicity equally accurate). One of the main causes of unfair behavior in age prediction methods lies in the distribution and diversity of the training data. In this work, we present two novel approaches for dataset curation and data augmentation in order to increase fairness through balanced feature curation and increase diversity through distribution aware augmentation. To achieve this, we introduce out-of-distribution detection to the facial recognition domain which is used to select the data most relevant to the deep neural network's (DNN) task when balancing the data among age, ethnicity, and gender. Our approach shows promising results. Our best-trained DNN model outperformed all academic and industrial baselines in terms of fairness by up to 4.92 times and also enhanced the DNN's ability to generalize outperforming Amazon AWS and Microsoft Azure public cloud systems by 31.88% and 10.95%, respectively. △ Less

Submitted 16 November, 2021; v1 submitted 11 September, 2020; originally announced September 2020.

Comments: Preprint, accepted at WACV'22

arXiv:2008.06856 [pdf, other]

FOOD: Fast Out-Of-Distribution Detector

Authors: Guy Amit, Moshe Levy, Ishai Rosenberg, Asaf Shabtai, Yuval Elovici

Abstract: Deep neural networks (DNNs) perform well at classifying inputs associated with the classes they have been trained on, which are known as in distribution inputs. However, out-of-distribution (OOD) inputs pose a great challenge to DNNs and consequently represent a major risk when DNNs are implemented in safety-critical systems. Extensive research has been performed in the domain of OOD detection. Ho… ▽ More Deep neural networks (DNNs) perform well at classifying inputs associated with the classes they have been trained on, which are known as in distribution inputs. However, out-of-distribution (OOD) inputs pose a great challenge to DNNs and consequently represent a major risk when DNNs are implemented in safety-critical systems. Extensive research has been performed in the domain of OOD detection. However, current state-of-the-art methods for OOD detection suffer from at least one of the following limitations: (1) increased inference time - this limits existing methods' applicability to many real-world applications, and (2) the need for OOD training data - such data can be difficult to acquire and may not be representative enough, thus limiting the ability of the OOD detector to generalize. In this paper, we propose FOOD -- Fast Out-Of-Distribution detector -- an extended DNN classifier capable of efficiently detecting OOD samples with minimal inference time overhead. Our architecture features a DNN with a final Gaussian layer combined with the log likelihood ratio statistical test and an additional output neuron for OOD detection. Instead of using real OOD data, we use a novel method to craft artificial OOD samples from in-distribution data, which are used to train our OOD detector neuron. We evaluate FOOD's detection performance on the SVHN, CIFAR-10, and CIFAR-100 datasets. Our results demonstrate that in addition to achieving state-of-the-art performance, FOOD is fast and applicable to real-world applications. △ Less

Submitted 23 February, 2021; v1 submitted 16 August, 2020; originally announced August 2020.

Comments: Guy Amit and Moshe Levy contributed equally to this paper Updated version

arXiv:2008.06536 [pdf, other]

Making Distributed Mobile Applications SAFE: Enforcing User Privacy Policies on Untrusted Applications with Secure Application Flow Enforcement

Authors: Adriana Szekeres, Irene Zhang, Katelin Bailey, Isaac Ackerman, Haichen Shen, Franziska Roesner, Dan R. K. Ports, Arvind Krishnamurthy, Henry M. Levy

Abstract: Today's mobile devices sense, collect, and store huge amounts of personal information, which users share with family and friends through a wide range of applications. Once users give applications access to their data, they must implicitly trust that the apps correctly maintain data privacy. As we know from both experience and all-too-frequent press articles, that trust is often misplaced. While us… ▽ More Today's mobile devices sense, collect, and store huge amounts of personal information, which users share with family and friends through a wide range of applications. Once users give applications access to their data, they must implicitly trust that the apps correctly maintain data privacy. As we know from both experience and all-too-frequent press articles, that trust is often misplaced. While users do not trust applications, they do trust their mobile devices and operating systems. Unfortunately, sharing applications are not limited to mobile clients but must also run on cloud services to share data between users. In this paper, we leverage the trust that users have in their mobile OSes to vet cloud services. To do so, we define a new Secure Application Flow Enforcement (SAFE) framework, which requires cloud services to attest to a system stack that will enforce policies provided by the mobile OS for user data. We implement a mobile OS that enforces SAFE policies on unmodified mobile apps and two systems for enforcing policies on untrusted cloud services. Using these prototypes, we demonstrate that it is possible to enforce existing user privacy policies on unmodified applications. △ Less

Submitted 14 August, 2020; originally announced August 2020.

arXiv:2008.05329 [pdf, other]

doi 10.1007/JHEP11(2020)085

Symmetries and stabilisers in modular invariant flavour models

Authors: Ivo de Medeiros Varzielas, Miguel Levy, Ye-Ling Zhou

Abstract: The idea of modular invariance provides a novel explanation of flavour mixing. Within the context of finite modular symmetries $Γ_N$ and for a given element $γ\in Γ_N$, we present an algorithm for finding stabilisers (specific values for moduli fields $τ_γ$ which remain unchanged under the action associated to $γ$). We then employ this algorithm to find all stabilisers for each element of finite m… ▽ More The idea of modular invariance provides a novel explanation of flavour mixing. Within the context of finite modular symmetries $Γ_N$ and for a given element $γ\in Γ_N$, we present an algorithm for finding stabilisers (specific values for moduli fields $τ_γ$ which remain unchanged under the action associated to $γ$). We then employ this algorithm to find all stabilisers for each element of finite modular groups for $N=2$ to $5$, namely, $Γ_2\simeq S_3$, $Γ_3\simeq A_4$, $Γ_4\simeq S_4$ and $Γ_5\simeq A_5$. These stabilisers then leave preserved a specific cyclic subgroup of $Γ_N$. This is of interest to build models of fermionic mixing where each fermionic sector preserves a separate residual symmetry. △ Less

Submitted 16 November, 2020; v1 submitted 12 August, 2020; originally announced August 2020.

Comments: 18 pages, 5 figures, 4 tables, accepted for publication in JHEP

Showing 1–50 of 125 results for author: Levy, M