Showing 1–32 of 32 results for author: Levy, M

Searching in archive cs.
  1. arXiv:2407.07092

    cs.CV cs.AI

    V-VIPE: Variational View Invariant Pose Embedding

    Authors: Mara Levy, Abhinav Shrivastava

    Abstract: Learning to represent three-dimensional (3D) human pose given a two-dimensional (2D) image of a person is a challenging problem. To make the problem less ambiguous, it has become common practice to estimate 3D pose in the camera coordinate space. However, this makes the task of comparing two 3D poses difficult. In this paper, we address this challenge by separating the problem of estimati…

    Submitted 9 July, 2024; originally announced July 2024.

    Comments: CVPR 2024 - RHOBIN Workshop

  2. arXiv:2406.13301

    cs.CV cs.RO

    ARDuP: Active Region Video Diffusion for Universal Policies

    Authors: Shuaiyi Huang, Mara Levy, Zhenyu Jiang, Anima Anandkumar, Yuke Zhu, Linxi Fan, De-An Huang, Abhinav Shrivastava

    Abstract: Sequential decision-making can be formulated as a text-conditioned video generation problem, where a video planner, guided by a text-defined goal, generates future frames visualizing planned actions, from which control actions are subsequently derived. In this work, we introduce Active Region Video Diffusion for Universal Policies (ARDuP), a novel framework for video-based policy learning that emp…

    Submitted 19 June, 2024; originally announced June 2024.

  3. arXiv:2406.03855

    cs.CL

    Performance of large language models in numerical vs. semantic medical knowledge: Benchmarking on evidence-based Q&As

    Authors: Eden Avnat, Michal Levy, Daniel Herstain, Elia Yanko, Daniel Ben Joya, Michal Tzuchman Katz, Dafna Eshel, Sahar Laros, Yael Dagan, Shahar Barami, Joseph Mermelstein, Shahar Ovadia, Noam Shomron, Varda Shalev, Raja-Elie E. Abdulnour

    Abstract: Clinical problem-solving requires processing of semantic medical knowledge such as illness scripts and numerical medical knowledge of diagnostic tests for evidence-based decision-making. As large language models (LLMs) show promising results in many aspects of language-based clinical practice, their ability to generate non-language evidence-based answers to clinical questions is inherently limited…

    Submitted 1 July, 2024; v1 submitted 6 June, 2024; originally announced June 2024.

  4. arXiv:2405.18065

    cs.CV cs.AI

    EffoVPR: Effective Foundation Model Utilization for Visual Place Recognition

    Authors: Issar Tzachor, Boaz Lerner, Matan Levy, Michael Green, Tal Berkovitz Shalev, Gavriel Habib, Dvir Samuel, Noam Korngut Zailer, Or Shimshi, Nir Darshan, Rami Ben-Ari

    Abstract: The task of Visual Place Recognition (VPR) is to predict the location of a query image from a database of geo-tagged images. Recent studies in VPR have highlighted the significant advantage of employing pre-trained foundation models like DINOv2 for the VPR task. However, these models are often deemed inadequate for VPR without further fine-tuning on task-specific data. In this paper, we propose a…

    Submitted 28 May, 2024; originally announced May 2024.

  5. arXiv:2405.18025

    cs.CV cs.AI

    Unveiling the Power of Diffusion Features For Personalized Segmentation and Retrieval

    Authors: Dvir Samuel, Rami Ben-Ari, Matan Levy, Nir Darshan, Gal Chechik

    Abstract: Personalized retrieval and segmentation aim to locate specific instances within a dataset based on an input image and a short description of the reference instance. While supervised methods are effective, they require extensive labeled data for training. Recently, self-supervised foundation models have been introduced to these tasks, showing comparable results to supervised methods. However, a sign…

    Submitted 28 May, 2024; originally announced May 2024.

  6. arXiv:2402.14848

    cs.CL cs.AI

    Same Task, More Tokens: the Impact of Input Length on the Reasoning Performance of Large Language Models

    Authors: Mosh Levy, Alon Jacoby, Yoav Goldberg

    Abstract: This paper explores the impact of extending input lengths on the capabilities of Large Language Models (LLMs). Despite LLMs' advancements in recent times, their performance consistency across different input lengths is not well understood. We investigate this aspect by introducing a novel QA reasoning framework, specifically designed to assess the impact of input length. We isolate the effect of in…

    Submitted 10 July, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

    Comments: Accepted to ACL 2024

  7. arXiv:2311.07389

    cs.LG cs.CR

    Transpose Attack: Stealing Datasets with Bidirectional Training

    Authors: Guy Amit, Mosh Levy, Yisroel Mirsky

    Abstract: Deep neural networks are normally executed in the forward direction. However, in this work, we identify a vulnerability that enables models to be trained in both directions and on different tasks. Adversaries can exploit this capability to hide rogue models within seemingly legitimate models. In addition, in this work we show that neural networks can be taught to systematically memorize and retrie…

    Submitted 17 May, 2024; v1 submitted 13 November, 2023; originally announced November 2023.

    Comments: NDSS24 paper, Transpose Attack, Transposed Model. NDSS version: https://www.ndss-symposium.org/ndss-paper/transpose-attack-stealing-datasets-with-bidirectional-training/

  8. arXiv:2311.00613

    cs.SD cs.LG eess.AS

    Controllable Music Production with Diffusion Models and Guidance Gradients

    Authors: Mark Levy, Bruno Di Giorgi, Floris Weers, Angelos Katharopoulos, Tom Nickson

    Abstract: We demonstrate how conditional generation from diffusion models can be used to tackle a variety of realistic tasks in the production of music in 44.1 kHz stereo audio with sampling-time guidance. The scenarios we consider include continuation, inpainting and regeneration of musical audio, the creation of smooth transitions between two different music tracks, and the transfer of desired stylistic ch…

    Submitted 5 December, 2023; v1 submitted 1 November, 2023; originally announced November 2023.

  9. arXiv:2310.18360

    cs.CL cs.AI

    Guiding LLM to Fool Itself: Automatically Manipulating Machine Reading Comprehension Shortcut Triggers

    Authors: Mosh Levy, Shauli Ravfogel, Yoav Goldberg

    Abstract: Recent applications of LLMs in Machine Reading Comprehension (MRC) systems have shown impressive results, but the use of shortcuts, mechanisms triggered by features spuriously correlated to the true label, has emerged as a potential threat to their reliability. We analyze the problem from two angles: LLMs as editors, guided to edit text to mislead LLMs; and LLMs as readers, who answer questions ba…

    Submitted 24 October, 2023; originally announced October 2023.

    Comments: Accepted to EMNLP 2023 Findings

  10. arXiv:2307.04108

    cs.GT cs.MA econ.TH math.DS

    Asynchronous Proportional Response Dynamics in Markets with Adversarial Scheduling

    Authors: Yoav Kolumbus, Menahem Levy, Noam Nisan

    Abstract: We study Proportional Response Dynamics (PRD) in linear Fisher markets where participants act asynchronously. We model this scenario as a sequential process in which in every step, an adversary selects a subset of the players that will update their bids, subject to liveness constraints. We show that if every bidder individually uses the PRD update rule whenever they are included in the group of bi…

    Submitted 15 January, 2024; v1 submitted 9 July, 2023; originally announced July 2023.

  11. arXiv:2305.20062

    cs.CV

    Chatting Makes Perfect: Chat-based Image Retrieval

    Authors: Matan Levy, Rami Ben-Ari, Nir Darshan, Dani Lischinski

    Abstract: Chats emerge as an effective user-friendly approach for information retrieval, and are successfully employed in many domains, such as customer service, healthcare, and finance. However, existing image retrieval approaches typically address the case of a single query-to-image round, and the use of chats for image retrieval has been mostly overlooked. In this work, we introduce ChatIR: a chat-based…

    Submitted 5 October, 2023; v1 submitted 31 May, 2023; originally announced May 2023.

    Comments: Camera Ready version for NeurIPS 2023

  12. arXiv:2305.14763

    cs.CL

    Clever Hans or Neural Theory of Mind? Stress Testing Social Reasoning in Large Language Models

    Authors: Natalie Shapira, Mosh Levy, Seyed Hossein Alavi, Xuhui Zhou, Yejin Choi, Yoav Goldberg, Maarten Sap, Vered Shwartz

    Abstract: The escalating debate on AI's capabilities warrants developing reliable metrics to assess machine "intelligence". Recently, many anecdotal examples were used to suggest that newer large language models (LLMs) like ChatGPT and GPT-4 exhibit Neural Theory-of-Mind (N-ToM); however, prior work reached conflicting conclusions regarding those abilities. We investigate the extent of LLMs' N-ToM through a…

    Submitted 24 May, 2023; originally announced May 2023.

  13. arXiv:2303.09429

    cs.CV

    Data Roaming and Quality Assessment for Composed Image Retrieval

    Authors: Matan Levy, Rami Ben-Ari, Nir Darshan, Dani Lischinski

    Abstract: The task of Composed Image Retrieval (CoIR) involves queries that combine image and text modalities, allowing users to express their intent more effectively. However, current CoIR datasets are orders of magnitude smaller compared to other vision and language (V&L) datasets. Additionally, some of these datasets have noticeable issues, such as queries containing redundant modalities. To address thes…

    Submitted 20 December, 2023; v1 submitted 16 March, 2023; originally announced March 2023.

    Comments: Camera Ready version for AAAI 2024

  14. arXiv:2208.12782

    cs.SD cs.LG cs.MM eess.AS

    Mel Spectrogram Inversion with Stable Pitch

    Authors: Bruno Di Giorgi, Mark Levy, Richard Sharp

    Abstract: Vocoders are models capable of transforming a low-dimensional spectral representation of an audio signal, typically the mel spectrogram, to a waveform. Modern speech generation pipelines use a vocoder as their final component. Recent vocoder models developed for speech achieve a high degree of realism, such that it is natural to wonder how they would perform on music signals. Compared to speech, t…

    Submitted 26 August, 2022; originally announced August 2022.

    Comments: 7 pages, 5 figures, Proceedings of the 23rd International Society for Music Information Retrieval Conference, ISMIR 2022

  15. arXiv:2208.10878

    cs.LG cs.CR

    Transferability Ranking of Adversarial Examples

    Authors: Mosh Levy, Guy Amit, Yuval Elovici, Yisroel Mirsky

    Abstract: Adversarial transferability in black-box scenarios presents a unique challenge: while attackers can employ surrogate models to craft adversarial examples, they lack assurance on whether these examples will successfully compromise the target model. Until now, the prevalent method to ascertain success has been trial and error: testing crafted samples directly on the victim model. This approach, howev…

    Submitted 18 April, 2024; v1 submitted 23 August, 2022; originally announced August 2022.

  16. arXiv:2207.08169

    cs.CY

    Ethnic Representation Analysis of Commercial Movie Posters

    Authors: Dima Kagan, Mor Levy, Michael Fire, Galit Fuhrmann Alpert

    Abstract: In the last decades, global awareness towards the importance of diverse representation has been increasing. Lack of diversity and discrimination toward minorities did not skip the film industry. Here, we examine ethnic bias in the film industry through commercial posters, the industry's primary advertisement medium for decades. Movie posters are designed to establish the viewer's initial impressio…

    Submitted 17 July, 2022; originally announced July 2022.

  17. arXiv:2201.08661

    cs.CR cs.LG eess.IV

    The Security of Deep Learning Defences for Medical Imaging

    Authors: Moshe Levy, Guy Amit, Yuval Elovici, Yisroel Mirsky

    Abstract: Deep learning has shown great promise in the domain of medical image analysis. Medical professionals and healthcare providers have been adopting the technology to speed up and enhance their work. These systems use deep neural networks (DNN) which are vulnerable to adversarial samples: images with imperceivable changes that can alter the model's prediction. Researchers have proposed defences which…

    Submitted 21 January, 2022; originally announced January 2022.

  18. arXiv:2111.14792

    cs.CV

    Classification-Regression for Chart Comprehension

    Authors: Matan Levy, Rami Ben-Ari, Dani Lischinski

    Abstract: Chart question answering (CQA) is a task used for assessing chart comprehension, which is fundamentally different from understanding natural images. CQA requires analyzing the relationships between the textual and the visual components of a chart, in order to answer general questions or infer numerical values. Most existing CQA datasets and models are based on simplifying assumptions that often en…

    Submitted 11 July, 2022; v1 submitted 29 November, 2021; originally announced November 2021.

    Comments: ECCV 2022

  19. arXiv:2108.04479

    cs.CV cs.AI

    Scalable Reverse Image Search Engine for NASAWorldview

    Authors: Abhigya Sodani, Michael Levy, Anirudh Koul, Meher Anand Kasam, Siddha Ganju

    Abstract: Researchers often spend weeks sifting through decades of unlabeled satellite imagery (on NASA Worldview) in order to develop datasets on which they can start conducting research. We developed an interactive, scalable and fast image similarity search engine (which can take one or more images as the query image) that automatically sifts through the unlabeled dataset reducing dataset generation time f…

    Submitted 10 August, 2021; originally announced August 2021.

    Comments: 7 pages, Published at COSPAR 2021, 6 figures

  20. arXiv:2106.09714

    cs.RO

    No-frills Dynamic Planning using Static Planners

    Authors: Mara Levy, Vasista Ayyagari, Abhinav Shrivastava

    Abstract: In this paper, we address the task of interacting with dynamic environments where the changes in the environment are independent of the agent. We study this through the context of trapping a moving ball with a UR5 robotic arm. Our key contribution is an approach to utilize a static planner for dynamic tasks using a Dynamic Planning add-on; that is, if we can successfully solve a task with a static…

    Submitted 17 June, 2021; originally announced June 2021.

    Comments: ICRA 2021

  21. arXiv:2102.02282

    cs.SD cs.LG cs.MM eess.AS

    Downbeat Tracking with Tempo-Invariant Convolutional Neural Networks

    Authors: Bruno Di Giorgi, Matthias Mauch, Mark Levy

    Abstract: The human ability to track musical downbeats is robust to changes in tempo, and it extends to tempi never previously encountered. We propose a deterministic time-warping operation that enables this skill in a convolutional neural network (CNN) by allowing the network to learn rhythmic patterns independently of tempo. Unlike conventional deep learning approaches, which learn rhythmic patterns at th…

    Submitted 3 February, 2021; originally announced February 2021.

    Comments: 7 pages, 5 figures, Proceedings of the 21st International Society for Music Information Retrieval Conference, ISMIR 2020

    Journal ref: Proceedings of the 21st International Society for Music Information Retrieval Conference (2020) 216-222

  22. arXiv:2102.00179

    cs.HC cs.AI

    Matching Representations of Explainable Artificial Intelligence and Eye Gaze for Human-Machine Interaction

    Authors: Tiffany Hwu, Mia Levy, Steven Skorheim, David Huber

    Abstract: Rapid non-verbal communication of task-based stimuli is a challenge in human-machine teaming, particularly in closed-loop interactions such as driving. To achieve this, we must understand the representations of information for both the human and machine, and determine a basis for bridging these representations. Techniques of explainable artificial intelligence (XAI) such as layer-wise relevance pr…

    Submitted 30 January, 2021; originally announced February 2021.

  23. arXiv:2012.02296

    cs.LG physics.data-an q-bio.QM

    Generative Capacity of Probabilistic Protein Sequence Models

    Authors: Francisco McGee, Quentin Novinger, Ronald M. Levy, Vincenzo Carnevale, Allan Haldane

    Abstract: Potts models and variational autoencoders (VAEs) have recently gained popularity as generative protein sequence models (GPSMs) to explore fitness landscapes and predict the effect of mutations. Despite encouraging results, quantitative characterization and comparison of GPSM-generated probability distributions is still lacking. It is currently unclear whether GPSMs can faithfully reproduce the com…

    Submitted 15 March, 2021; v1 submitted 3 December, 2020; originally announced December 2020.

  24. arXiv:2009.05283

    cs.CV cs.AI cs.LG

    Fair and accurate age prediction using distribution aware data curation and augmentation

    Authors: Yushi Cao, David Berend, Palina Tolmach, Guy Amit, Moshe Levy, Yang Liu, Asaf Shabtai, Yuval Elovici

    Abstract: Deep learning-based facial recognition systems have experienced increased media attention due to exhibiting unfair behavior. Large enterprises, such as IBM, shut down their facial recognition and age prediction systems as a consequence. Age prediction is an especially difficult application with the issue of fairness remaining an open research problem (e.g., predicting age for different ethnicity e…

    Submitted 16 November, 2021; v1 submitted 11 September, 2020; originally announced September 2020.

    Comments: Preprint, accepted at WACV'22

  25. arXiv:2008.06856

    cs.LG stat.ML

    FOOD: Fast Out-Of-Distribution Detector

    Authors: Guy Amit, Moshe Levy, Ishai Rosenberg, Asaf Shabtai, Yuval Elovici

    Abstract: Deep neural networks (DNNs) perform well at classifying inputs associated with the classes they have been trained on, which are known as in-distribution inputs. However, out-of-distribution (OOD) inputs pose a great challenge to DNNs and consequently represent a major risk when DNNs are implemented in safety-critical systems. Extensive research has been performed in the domain of OOD detection. Ho…

    Submitted 23 February, 2021; v1 submitted 16 August, 2020; originally announced August 2020.

    Comments: Guy Amit and Moshe Levy contributed equally to this paper. Updated version.

  26. arXiv:2008.06536

    cs.CR cs.OS

    Making Distributed Mobile Applications SAFE: Enforcing User Privacy Policies on Untrusted Applications with Secure Application Flow Enforcement

    Authors: Adriana Szekeres, Irene Zhang, Katelin Bailey, Isaac Ackerman, Haichen Shen, Franziska Roesner, Dan R. K. Ports, Arvind Krishnamurthy, Henry M. Levy

    Abstract: Today's mobile devices sense, collect, and store huge amounts of personal information, which users share with family and friends through a wide range of applications. Once users give applications access to their data, they must implicitly trust that the apps correctly maintain data privacy. As we know from both experience and all-too-frequent press articles, that trust is often misplaced. While us…

    Submitted 14 August, 2020; originally announced August 2020.

  27. arXiv:2003.09260

    q-bio.QM cs.CV eess.IV eess.SP

    Accuracy of MRI Classification Algorithms in a Tertiary Memory Center Clinical Routine Cohort

    Authors: Alexandre Morin, Jorge Samper-González, Anne Bertrand, Sebastian Stroer, Didier Dormont, Aline Mendes, Pierrick Coupé, Jamila Ahdidan, Marcel Lévy, Dalila Samri, Harald Hampel, Bruno Dubois, Marc Teichmann, Stéphane Epelbaum, Olivier Colliot

    Abstract: BACKGROUND: Automated volumetry software (AVS) has recently become widely available to neuroradiologists. MRI volumetry with AVS may support the diagnosis of dementias by identifying regional atrophy. Moreover, automatic classifiers using machine learning techniques have recently emerged as promising approaches to assist diagnosis. However, the performance of both AVS and automatic classifiers has…

    Submitted 19 March, 2020; originally announced March 2020.

    Journal ref: Journal of Alzheimer's Disease, IOS Press, 2020, pp.1-10

  28. arXiv:2002.02176

    cs.LG stat.ML

    GIM: Gaussian Isolation Machines

    Authors: Guy Amit, Ishai Rosenberg, Moshe Levy, Ron Bitton, Asaf Shabtai, Yuval Elovici

    Abstract: In many cases, neural network classifiers are likely to be exposed to input data that is outside of their training distribution data. Samples from outside the distribution may be classified as an existing class with high probability by softmax-based classifiers; such incorrect classifications affect the performance of the classifiers and the applications/systems that depend on them. Previous resea…

    Submitted 21 March, 2020; v1 submitted 6 February, 2020; originally announced February 2020.

    Comments: Submitted to IJCNN2020 conference

  29. arXiv:1804.05371

    cs.IT

    Mutually Uncorrelated Codes for DNA Storage

    Authors: Maya Levy, Eitan Yaakobi

    Abstract: Mutually Uncorrelated (MU) codes are a class of codes in which no proper prefix of one codeword is a suffix of another codeword. These codes were originally studied for synchronization purposes and recently, Yazdi et al. showed their applicability to enable random access in DNA storage. In this work we follow the research of Yazdi et al. and study MU codes along with their extensions to correct er…

    Submitted 15 April, 2018; originally announced April 2018.

  30. arXiv:1603.07458

    cs.LO

    The method "Model Elimination" of D.W.Loveland explained

    Authors: Michel Lévy

    Abstract: We present concisely the method "Model Elimination" of D.W.Loveland. Especially, we explain and prove the correctness of the lemmas generated by this method.

    Submitted 29 June, 2016; v1 submitted 24 March, 2016; originally announced March 2016.

  31. arXiv:1502.05417

    physics.soc-ph cs.SD

    The Evolution of Popular Music: USA 1960-2010

    Authors: Matthias Mauch, Robert M. MacCallum, Mark Levy, Armand M. Leroi

    Abstract: In modern societies, cultural change seems ceaseless. The flux of fashion is especially obvious for popular music. While much has been written about the origin and evolution of pop, most claims about its history are anecdotal rather than scientific in nature. To rectify this we investigate the US Billboard Hot 100 between 1960 and 2010. Using Music Information Retrieval (MIR) and text-mining tools…

    Submitted 17 February, 2015; originally announced February 2015.

    Comments: MS: 13 pages, 6 figures; SI: 15 pages, 7 figures

    Journal ref: R. Soc. open sci. 2015 2 150081

  32. arXiv:0906.3202

    cs.CY physics.soc-ph

    Distance Is Not Dead: Social Interaction and Geographical Distance in the Internet Era

    Authors: Jacob Goldenberg, Moshe Levy

    Abstract: The Internet revolution has made long-distance communication dramatically faster, easier, and cheaper than ever before. This, it has been argued, has decreased the importance of geographic proximity in social interactions, transforming our world into a global village with a borderless society. We argue for the opposite: while technology has undoubtedly increased the overall level of communicatio…

    Submitted 19 October, 2009; v1 submitted 17 June, 2009; originally announced June 2009.