Zum Hauptinhalt springen

Showing 1–16 of 16 results for author: Frank, N

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.04981  [pdf, other

    cs.LG stat.ML

    The Price of Implicit Bias in Adversarially Robust Generalization

    Authors: Nikolaos Tsilivis, Natalie Frank, Nathan Srebro, Julia Kempe

    Abstract: We study the implicit bias of optimization in robust empirical risk minimization (robust ERM) and its connection with robust generalization. In classification settings under adversarial perturbations with linear models, we study what type of regularization should ideally be applied for a given perturbation set to improve (robust) generalization. We then show that the implicit bias of optimization… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

  2. arXiv:2404.17358  [pdf, ps, other

    cs.LG math.ST stat.ML

    Adversarial Consistency and the Uniqueness of the Adversarial Bayes Classifier

    Authors: Natalie S. Frank

    Abstract: Adversarial training is a common technique for learning robust classifiers. Prior work showed that convex surrogate losses are not statistically consistent in the adversarial context -- or in other words, a minimizing sequence of the adversarial surrogate risk will not necessarily minimize the adversarial classification error. We connect the consistency of adversarial surrogate losses to propertie… ▽ More

    Submitted 15 May, 2024; v1 submitted 26 April, 2024; originally announced April 2024.

    Comments: 18 pages, v2: fixed typos

  3. arXiv:2404.16956  [pdf, other

    cs.LG math.ST stat.ML

    A Notion of Uniqueness for the Adversarial Bayes Classifier

    Authors: Natalie S. Frank

    Abstract: We propose a new notion of uniqueness for the adversarial Bayes classifier in the setting of binary classification. Analyzing this concept produces a simple procedure for computing all adversarial Bayes classifiers for a well-motivated family of one dimensional data distributions. This characterization is then leveraged to show that as the perturbation radius increases, certain the regularity of a… ▽ More

    Submitted 17 May, 2024; v1 submitted 25 April, 2024; originally announced April 2024.

    Comments: 49 pages, 7 figures v2: fixed typos, notation errors, and a mistake in example 7

  4. SimCol3D -- 3D Reconstruction during Colonoscopy Challenge

    Authors: Anita Rau, Sophia Bano, Yueming Jin, Pablo Azagra, Javier Morlana, Rawen Kader, Edward Sanderson, Bogdan J. Matuszewski, Jae Young Lee, Dong-Jae Lee, Erez Posner, Netanel Frank, Varshini Elangovan, Sista Raviteja, Zhengwen Li, Jiquan Liu, Seenivasan Lalithkumar, Mobarakol Islam, Hongliang Ren, Laurence B. Lovat, José M. M. Montiel, Danail Stoyanov

    Abstract: Colorectal cancer is one of the most common cancers in the world. While colonoscopy is an effective screening technique, navigating an endoscope through the colon to detect polyps is challenging. A 3D map of the observed surfaces could enhance the identification of unscreened colon tissue and serve as a training platform. However, reconstructing the colon from video footage remains difficult. Lear… ▽ More

    Submitted 2 July, 2024; v1 submitted 20 July, 2023; originally announced July 2023.

    MSC Class: I.4.5

    Journal ref: Medical Image Analysis 96 (2024): 103195

  5. arXiv:2306.04269  [pdf, other

    cs.CV cs.HC cs.LG

    ColNav: Real-Time Colon Navigation for Colonoscopy

    Authors: Netanel Frank, Erez Posner, Emmanuelle Muhlethaler, Adi Zholkover, Moshe Bouhnik

    Abstract: Colorectal cancer screening through colonoscopy continues to be the dominant global standard, as it allows identifying pre-cancerous or adenomatous lesions and provides the ability to remove them during the procedure itself. Nevertheless, failure by the endoscopist to identify such lesions increases the likelihood of lesion progression to subsequent colorectal cancer. Ultimately, colonoscopy remai… ▽ More

    Submitted 7 June, 2023; originally announced June 2023.

  6. arXiv:2305.14585  [pdf, other

    cs.LG

    Faithful and Efficient Explanations for Neural Networks via Neural Tangent Kernel Surrogate Models

    Authors: Andrew Engel, Zhichao Wang, Natalie S. Frank, Ioana Dumitriu, Sutanay Choudhury, Anand Sarwate, Tony Chiang

    Abstract: A recent trend in explainable AI research has focused on surrogate modeling, where neural networks are approximated as simpler ML algorithms such as kernel machines. A second trend has been to utilize kernel functions in various explain-by-example or data attribution tasks. In this work, we combine these two trends to analyze approximate empirical neural tangent kernels (eNTK) for data attribution… ▽ More

    Submitted 11 March, 2024; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: 9 pages, 2 figures, 3 tables Updated 3/11/2024 various additions/clarifications after ICLR review. Accepted as a Spotlight paper at ICLR 2024

  7. arXiv:2305.09956  [pdf, ps, other

    cs.LG math.ST

    The Adversarial Consistency of Surrogate Risks for Binary Classification

    Authors: Natalie Frank, Jonathan Niles-Weed

    Abstract: We study the consistency of surrogate risks for robust binary classification. It is common to learn robust classifiers by adversarial training, which seeks to minimize the expected $0$-$1$ loss when each example can be maliciously corrupted within a small ball. We give a simple and complete characterization of the set of surrogate loss functions that are \emph{consistent}, i.e., that can replace t… ▽ More

    Submitted 22 December, 2023; v1 submitted 17 May, 2023; originally announced May 2023.

    Comments: 17 pages, published in NeurIps 2023. version 3: added acknowledgements, no other changes. version 2: reorganized Section 4 and added proofs of the approximate complimentary slackness theorems. arXiv admin note: text overlap with arXiv:2206.09099

  8. arXiv:2206.09099   

    cs.LG math.ST

    The Consistency of Adversarial Training for Binary Classification

    Authors: Natalie S. Frank, Jonathan Niles-Weed

    Abstract: Robustness to adversarial perturbations is of paramount concern in modern machine learning. One of the state-of-the-art methods for training robust classifiers is adversarial training, which involves minimizing a supremum-based surrogate risk. The statistical consistency of surrogate risks is well understood in the context of standard machine learning, but not in the adversarial setting. In this p… ▽ More

    Submitted 17 May, 2023; v1 submitted 17 June, 2022; originally announced June 2022.

    Comments: There was an error in the main theorem of the paper (Theorem 7)

  9. arXiv:2206.09098  [pdf, ps, other

    cs.LG math.ST

    Existence and Minimax Theorems for Adversarial Surrogate Risks in Binary Classification

    Authors: Natalie S. Frank, Jonathan Niles-Weed

    Abstract: Adversarial training is one of the most popular methods for training methods robust to adversarial attacks, however, it is not well-understood from a theoretical perspective. We prove and existence, regularity, and minimax theorems for adversarial surrogate risks. Our results explain some empirical observations on adversarial robustness from prior work and suggest new directions in algorithm devel… ▽ More

    Submitted 10 December, 2023; v1 submitted 17 June, 2022; originally announced June 2022.

    Comments: 42 pages. version 2: corrects several errors and employs a significantly different proof technique. version 3: modifies the arXiv author list but has no other changes. version 4: improved exposition and fixed typos

  10. arXiv:2206.01961  [pdf, other

    cs.CV cs.AI cs.LG

    C$^3$Fusion: Consistent Contrastive Colon Fusion, Towards Deep SLAM in Colonoscopy

    Authors: Erez Posner, Adi Zholkover, Netanel Frank, Moshe Bouhnik

    Abstract: 3D colon reconstruction from Optical Colonoscopy (OC) to detect non-examined surfaces remains an unsolved problem. The challenges arise from the nature of optical colonoscopy data, characterized by highly reflective low-texture surfaces, drastic illumination changes and frequent tracking loss. Recent methods demonstrate compelling results, but suffer from: (1) frangible frame-to-frame (or frame-to… ▽ More

    Submitted 4 June, 2022; originally announced June 2022.

  11. arXiv:2112.01694  [pdf, other

    cs.LG stat.ML

    On the Existence of the Adversarial Bayes Classifier (Extended Version)

    Authors: Pranjal Awasthi, Natalie S. Frank, Mehryar Mohri

    Abstract: Adversarial robustness is a critical property in a variety of modern machine learning applications. While it has been the subject of several recent theoretical studies, many important questions related to adversarial robustness are still open. In this work, we study a fundamental question regarding Bayes optimality for adversarial robustness. We provide general sufficient conditions under which th… ▽ More

    Submitted 28 August, 2023; v1 submitted 2 December, 2021; originally announced December 2021.

    Comments: 27 pages, 3 figures. Version 2: Corrects 2 errors in the paper "On the Existence of the Adversarial Bayes Classifier" published in NeurIPS. Version 3: Update to acknowledgements

  12. arXiv:2104.09658  [pdf, other

    cs.LG stat.ML

    Calibration and Consistency of Adversarial Surrogate Losses

    Authors: Pranjal Awasthi, Natalie Frank, Anqi Mao, Mehryar Mohri, Yutao Zhong

    Abstract: Adversarial robustness is an increasingly critical property of classifiers in applications. The design of robust algorithms relies on surrogate losses since the optimization of the adversarial loss with most hypothesis sets is NP-hard. But which surrogate losses should be used and when do they benefit from theoretical guarantees? We present an extensive study of this question, including a detailed… ▽ More

    Submitted 4 May, 2021; v1 submitted 19 April, 2021; originally announced April 2021.

  13. arXiv:2007.11045  [pdf, other

    cs.LG stat.ML

    On the Rademacher Complexity of Linear Hypothesis Sets

    Authors: Pranjal Awasthi, Natalie Frank, Mehryar Mohri

    Abstract: Linear predictors form a rich class of hypotheses used in a variety of learning algorithms. We present a tight analysis of the empirical Rademacher complexity of the family of linear hypothesis classes with weight vectors bounded in $\ell_p$-norm for any $p \geq 1$. This provides a tight analysis of generalization using these hypothesis sets and helps derive sharp data-dependent learning guarantee… ▽ More

    Submitted 21 July, 2020; originally announced July 2020.

  14. arXiv:2004.13617  [pdf, other

    cs.LG stat.ML

    Adversarial Learning Guarantees for Linear Hypotheses and Neural Networks

    Authors: Pranjal Awasthi, Natalie Frank, Mehryar Mohri

    Abstract: Adversarial or test time robustness measures the susceptibility of a classifier to perturbations to the test input. While there has been a flurry of recent work on designing defenses against such perturbations, the theory of adversarial robustness is not well understood. In order to make progress on this, we focus on the problem of understanding generalization in adversarial settings, via the lens… ▽ More

    Submitted 28 April, 2020; originally announced April 2020.

  15. arXiv:2002.03842  [pdf, other

    cs.AI

    Overview of chemical ontologies

    Authors: Christian Pachl, Nils Frank, Jan Breitbart, Stefan Bräse

    Abstract: Ontologies order and interconnect knowledge of a certain field in a formal and semantic way so that they are machine-parsable. They try to define allwhere acceptable definition of concepts and objects, classify them, provide properties as well as interconnect them with relations (e.g. "A is a special case of B"). More precisely, Tom Gruber defines Ontologies as a "specification of a conceptualizat… ▽ More

    Submitted 7 February, 2020; originally announced February 2020.

    Comments: 2 Figures

  16. arXiv:1912.13107  [pdf, other

    cs.LG cs.MA stat.ML

    Improved Structural Discovery and Representation Learning of Multi-Agent Data

    Authors: Jennifer Hobbs, Matthew Holbrook, Nathan Frank, Long Sha, Patrick Lucey

    Abstract: Central to all machine learning algorithms is data representation. For multi-agent systems, selecting a representation which adequately captures the interactions among agents is challenging due to the latent group structure which tends to vary depending on context. However, in multi-agent systems with strong group structure, we can simultaneously learn this structure and map a set of agents to a c… ▽ More

    Submitted 30 December, 2019; originally announced December 2019.