Zum Hauptinhalt springen

Showing 1–50 of 62 results for author: Wong, E

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.00075  [pdf, other

    cs.AI cs.CL cs.CR cs.LG

    Logicbreaks: A Framework for Understanding Subversion of Rule-based Inference

    Authors: Anton Xue, Avishree Khare, Rajeev Alur, Surbhi Goel, Eric Wong

    Abstract: We study how to subvert language models from following the rules. We model rule-following as inference in propositional Horn logic, a mathematical system in which rules have the form "if $P$ and $Q$, then $R$" for some propositions $P$, $Q$, and $R$. We prove that although transformers can faithfully abide by such rules, maliciously crafted prompts can nevertheless mislead even theoretically const… ▽ More

    Submitted 21 June, 2024; originally announced July 2024.

  2. arXiv:2406.18534  [pdf, other

    cs.CL cs.LG

    Towards Compositionality in Concept Learning

    Authors: Adam Stein, Aaditya Naik, Yinjun Wu, Mayur Naik, Eric Wong

    Abstract: Concept-based interpretability methods offer a lens into the internals of foundation models by decomposing their embeddings into high-level concepts. These concept representations are most useful when they are compositional, meaning that the individual concepts compose to explain the full sample. We show that existing unsupervised concept extraction methods find concepts which are not compositiona… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

    Comments: Accepted at ICML 2024. 26 pages, 10 figures

  3. arXiv:2406.10952  [pdf, other

    cs.CL

    Avoiding Copyright Infringement via Machine Unlearning

    Authors: Guangyao Dou, Zheyuan Liu, Qing Lyu, Kaize Ding, Eric Wong

    Abstract: Pre-trained Large Language Models (LLMs) have demonstrated remarkable capabilities but also pose risks by learning and generating copyrighted material, leading to significant legal and ethical concerns. To address these issues, it is critical for model owners to be able to unlearn copyrighted content at various time steps. We explore the setting of sequential unlearning, where copyrighted content… ▽ More

    Submitted 16 June, 2024; originally announced June 2024.

  4. arXiv:2406.06246  [pdf, other

    cs.LG

    Data-Efficient Learning with Neural Programs

    Authors: Alaia Solko-Breslin, Seewon Choi, Ziyang Li, Neelay Velingker, Rajeev Alur, Mayur Naik, Eric Wong

    Abstract: Many computational tasks can be naturally expressed as a composition of a DNN followed by a program written in a traditional programming language or an API call to an LLM. We call such composites "neural programs" and focus on the problem of learning the DNN parameters when the training data consist of end-to-end input-output labels for the composite. When the program is written in a differentiabl… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

  5. arXiv:2406.00611  [pdf, other

    cs.LG stat.ME

    DISCRET: Synthesizing Faithful Explanations For Treatment Effect Estimation

    Authors: Yinjun Wu, Mayank Keoliya, Kan Chen, Neelay Velingker, Ziyang Li, Emily J Getzen, Qi Long, Mayur Naik, Ravi B Parikh, Eric Wong

    Abstract: Designing faithful yet accurate AI models is challenging, particularly in the field of individual treatment effect estimation (ITE). ITE prediction models deployed in critical settings such as healthcare should ideally be (i) accurate, and (ii) provide faithful explanations. However, current solutions are inadequate: state-of-the-art black-box models do not supply explanations, post-hoc explainers… ▽ More

    Submitted 2 June, 2024; originally announced June 2024.

    Comments: Accepted at ICML 2024. 22 pages, 5 figures

  6. arXiv:2405.06692  [pdf, ps, other

    cs.CL

    Analyzing Language Bias Between French and English in Conventional Multilingual Sentiment Analysis Models

    Authors: Ethan Parker Wong, Faten M'hiri

    Abstract: Inspired by the 'Bias Considerations in Bilingual Natural Language Processing' report by Statistics Canada, this study delves into potential biases in multilingual sentiment analysis between English and French. Given a 50-50 dataset of French and English, we aim to determine if there exists a language bias and explore how the incorporation of more diverse datasets in the future might affect the eq… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

    Comments: Undergraduate Research Project

  7. Practice-informed Patterns for Organising Large Groups in Distributed Mixed Reality Collaboration

    Authors: Emily Wong, Juan Sánchez Esquivel, Jens Emil Grønbæk, Germán Leiva, Eduardo Velloso

    Abstract: Collaborating across dissimilar, distributed spaces presents numerous challenges for computer-aided spatial communication. Mixed reality (MR) can blend selected surfaces, allowing collaborators to work in blended f-formations (facing formations), even when their workstations are physically misaligned. Since collaboration often involves more than just participant pairs, this research examines how w… ▽ More

    Submitted 9 May, 2024; v1 submitted 8 May, 2024; originally announced May 2024.

    Journal ref: CHI '24, Proceedings of the CHI Conference on Human Factors in Computing Systems, May 11-16 2024, Honolulu, HI, USA

  8. arXiv:2404.14299  [pdf, other

    quant-ph cs.SE

    A Cross-Platform Execution Engine for the Quantum Intermediate Representation

    Authors: Elaine Wong, Vicente Leyton Ortega, Daniel Claudino, Seth Johnson, Sharmin Afrose, Meenambika Gowrishankar, Anthony M. Cabrera, Travis S. Humble

    Abstract: Hybrid languages like the Quantum Intermediate Representation (QIR) are essential for programming systems that mix quantum and conventional computing models, while execution of these programs is often deferred to a system-specific implementation. Here, we describe and demonstrate the QIR Execution Engine (QIR-EE) for parsing, interpreting, and executing QIR across multiple hardware platforms. QIR-… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

  9. arXiv:2404.01318  [pdf, other

    cs.CR cs.LG

    JailbreakBench: An Open Robustness Benchmark for Jailbreaking Large Language Models

    Authors: Patrick Chao, Edoardo Debenedetti, Alexander Robey, Maksym Andriushchenko, Francesco Croce, Vikash Sehwag, Edgar Dobriban, Nicolas Flammarion, George J. Pappas, Florian Tramer, Hamed Hassani, Eric Wong

    Abstract: Jailbreak attacks cause large language models (LLMs) to generate harmful, unethical, or otherwise objectionable content. Evaluating these attacks presents a number of challenges, which the current collection of benchmarks and evaluation techniques do not adequately address. First, there is no clear standard of practice regarding jailbreaking evaluation. Second, existing works compute costs and suc… ▽ More

    Submitted 16 July, 2024; v1 submitted 27 March, 2024; originally announced April 2024.

    Comments: JailbreakBench v1.0: more attack artifacts, more test-time defenses, a more accurate jailbreak judge (Llama-3-70B with a custom prompt), a larger dataset of human preferences for selecting a jailbreak judge (300 examples), an over-refusal evaluation dataset (100 benign/borderline behaviors), a semantic refusal judge based on Llama-3-8B

  10. arXiv:2402.16192  [pdf, other

    cs.CL

    Defending Large Language Models against Jailbreak Attacks via Semantic Smoothing

    Authors: Jiabao Ji, Bairu Hou, Alexander Robey, George J. Pappas, Hamed Hassani, Yang Zhang, Eric Wong, Shiyu Chang

    Abstract: Aligned large language models (LLMs) are vulnerable to jailbreaking attacks, which bypass the safeguards of targeted LLMs and fool them into generating objectionable content. While initial defenses show promise against token-based threat models, there do not exist defenses that provide robustness against semantic attacks and avoid unfavorable trade-offs between robustness and nominal performance.… ▽ More

    Submitted 28 February, 2024; v1 submitted 25 February, 2024; originally announced February 2024.

    Comments: 37 pages

  11. arXiv:2401.13650  [pdf, other

    eess.IV cs.CV

    Tyche: Stochastic In-Context Learning for Medical Image Segmentation

    Authors: Marianne Rakic, Hallee E. Wong, Jose Javier Gonzalez Ortiz, Beth Cimini, John Guttag, Adrian V. Dalca

    Abstract: Existing learning-based solutions to medical image segmentation have two important shortcomings. First, for most new segmentation task, a new model has to be trained or fine-tuned. This requires extensive resources and machine learning expertise, and is therefore often infeasible for medical researchers and clinicians. Second, most existing segmentation methods produce a single deterministic segme… ▽ More

    Submitted 24 January, 2024; originally announced January 2024.

  12. arXiv:2312.07381  [pdf, other

    cs.CV eess.IV

    ScribblePrompt: Fast and Flexible Interactive Segmentation for Any Biomedical Image

    Authors: Hallee E. Wong, Marianne Rakic, John Guttag, Adrian V. Dalca

    Abstract: Biomedical image segmentation is a crucial part of both scientific research and clinical care. With enough labelled data, deep learning models can be trained to accurately automate specific biomedical image segmentation tasks. However, manually segmenting images to create training data is highly labor intensive and requires domain expertise. We present \emph{ScribblePrompt}, a flexible neural netw… ▽ More

    Submitted 16 July, 2024; v1 submitted 12 December, 2023; originally announced December 2023.

    Comments: Accepted by ECCV 2024. Project Website: https://scribbleprompt.csail.mit.edu Keywords: Interactive Segmentation, Medical Imaging, Segment Anything Model, SAM, Scribble Annotations, Prompt

  13. arXiv:2312.05716  [pdf, other

    cs.CV

    Initialization Matters for Adversarial Transfer Learning

    Authors: Andong Hua, Jindong Gu, Zhiyu Xue, Nicholas Carlini, Eric Wong, Yao Qin

    Abstract: With the prevalence of the Pretraining-Finetuning paradigm in transfer learning, the robustness of downstream tasks has become a critical concern. In this work, we delve into adversarial robustness in transfer learning and reveal the critical role of initialization, including both the pretrained model and the linear head. First, we discover the necessity of an adversarially robust pretrained model… ▽ More

    Submitted 30 March, 2024; v1 submitted 9 December, 2023; originally announced December 2023.

    Comments: CVPR 2024

  14. arXiv:2312.03231  [pdf, other

    cs.LG cs.AI cs.CV cs.HC eess.AS

    Deep Multimodal Fusion for Surgical Feedback Classification

    Authors: Rafal Kocielnik, Elyssa Y. Wong, Timothy N. Chu, Lydia Lin, De-An Huang, Jiayun Wang, Anima Anandkumar, Andrew J. Hung

    Abstract: Quantification of real-time informal feedback delivered by an experienced surgeon to a trainee during surgery is important for skill improvements in surgical training. Such feedback in the live operating room is inherently multimodal, consisting of verbal conversations (e.g., questions and answers) as well as non-verbal elements (e.g., through visual cues like pointing to anatomic elements). In th… ▽ More

    Submitted 5 December, 2023; originally announced December 2023.

    Journal ref: Published in Proceedings of Machine Learning for Health 2024

  15. arXiv:2310.16316  [pdf, other

    cs.LG cs.AI

    Sum-of-Parts Models: Faithful Attributions for Groups of Features

    Authors: Weiqiu You, Helen Qu, Marco Gatti, Bhuvnesh Jain, Eric Wong

    Abstract: An explanation of a machine learning model is considered "faithful" if it accurately reflects the model's decision-making process. However, explanations such as feature attributions for deep learning are not guaranteed to be faithful, and can produce potentially misleading interpretations. In this work, we develop Sum-of-Parts (SOP), a class of models whose predictions come with grouped feature at… ▽ More

    Submitted 24 October, 2023; originally announced October 2023.

  16. arXiv:2310.12508  [pdf, other

    cs.LG cs.AI

    SalUn: Empowering Machine Unlearning via Gradient-based Weight Saliency in Both Image Classification and Generation

    Authors: Chongyu Fan, Jiancheng Liu, Yihua Zhang, Eric Wong, Dennis Wei, Sijia Liu

    Abstract: With evolving data regulations, machine unlearning (MU) has become an important tool for fostering trust and safety in today's AI models. However, existing MU methods focusing on data and/or weight perspectives often suffer limitations in unlearning accuracy, stability, and cross-domain applicability. To address these challenges, we introduce the concept of 'weight saliency' for MU, drawing parall… ▽ More

    Submitted 4 April, 2024; v1 submitted 19 October, 2023; originally announced October 2023.

    Comments: Accepted by ICLR 2024 as a Spotlight paper

  17. arXiv:2310.08419  [pdf, other

    cs.LG cs.AI

    Jailbreaking Black Box Large Language Models in Twenty Queries

    Authors: Patrick Chao, Alexander Robey, Edgar Dobriban, Hamed Hassani, George J. Pappas, Eric Wong

    Abstract: There is growing interest in ensuring that large language models (LLMs) align with human values. However, the alignment of such models is vulnerable to adversarial jailbreaks, which coax LLMs into overriding their safety guardrails. The identification of these vulnerabilities is therefore instrumental in understanding inherent weaknesses and preventing future misuse. To this end, we propose Prompt… ▽ More

    Submitted 18 July, 2024; v1 submitted 12 October, 2023; originally announced October 2023.

  18. arXiv:2310.07135  [pdf, other

    cs.CL

    Comparing Styles across Languages

    Authors: Shreya Havaldar, Matthew Pressimone, Eric Wong, Lyle Ungar

    Abstract: Understanding how styles differ across languages is advantageous for training both humans and computers to generate culturally appropriate text. We introduce an explanation framework to extract stylistic differences from multilingual LMs and compare styles across languages. Our framework (1) generates comprehensive style lexica in any language and (2) consolidates feature importances from LMs into… ▽ More

    Submitted 4 December, 2023; v1 submitted 10 October, 2023; originally announced October 2023.

    Comments: Accepted to EMNLP 2023

  19. arXiv:2310.03684  [pdf, other

    cs.LG cs.AI stat.ML

    SmoothLLM: Defending Large Language Models Against Jailbreaking Attacks

    Authors: Alexander Robey, Eric Wong, Hamed Hassani, George J. Pappas

    Abstract: Despite efforts to align large language models (LLMs) with human intentions, widely-used LLMs such as GPT, Llama, and Claude are susceptible to jailbreaking attacks, wherein an adversary fools a targeted LLM into generating objectionable content. To address this vulnerability, we propose SmoothLLM, the first algorithm designed to mitigate jailbreaking attacks. Based on our finding that adversarial… ▽ More

    Submitted 11 June, 2024; v1 submitted 5 October, 2023; originally announced October 2023.

  20. arXiv:2308.06686  [pdf, other

    cs.DB cs.LG cs.SE

    TorchQL: A Programming Framework for Integrity Constraints in Machine Learning

    Authors: Aaditya Naik, Adam Stein, Yinjun Wu, Mayur Naik, Eric Wong

    Abstract: Finding errors in machine learning applications requires a thorough exploration of their behavior over data. Existing approaches used by practitioners are often ad-hoc and lack the abstractions needed to scale this process. We present TorchQL, a programming framework to evaluate and improve the correctness of machine learning applications. TorchQL allows users to write queries to specify and check… ▽ More

    Submitted 14 February, 2024; v1 submitted 13 August, 2023; originally announced August 2023.

  21. arXiv:2307.05902  [pdf, other

    cs.LG cs.AI

    Stability Guarantees for Feature Attributions with Multiplicative Smoothing

    Authors: Anton Xue, Rajeev Alur, Eric Wong

    Abstract: Explanation methods for machine learning models tend not to provide any formal guarantees and may not reflect the underlying decision-making process. In this work, we analyze stability as a property for reliable feature attribution methods. We prove that relaxed variants of stability are guaranteed if the model is sufficiently Lipschitz with respect to the masking of features. We develop a smoothi… ▽ More

    Submitted 26 October, 2023; v1 submitted 12 July, 2023; originally announced July 2023.

  22. arXiv:2306.14414  [pdf, ps, other

    math.NT cs.CR cs.IT math.CO

    Rationality of Four-Valued Families of Weil Sums of Binomials

    Authors: Daniel J. Katz, Allison E. Wong

    Abstract: We investigate the rationality of Weil sums of binomials of the form $W^{K,s}_u=\sum_{x \in K} ψ(x^s - u x)$, where $K$ is a finite field whose canonical additive character is $ψ$, and where $u$ is an element of $K^{\times}$ and $s$ is a positive integer relatively prime to $|K^\times|$, so that $x \mapsto x^s$ is a permutation of $K$. The Weil spectrum for $K$ and $s$, which is the family of valu… ▽ More

    Submitted 6 April, 2024; v1 submitted 26 June, 2023; originally announced June 2023.

    Comments: 33 pages

    MSC Class: 11T24; 11L05; 11L40; 11T22; 11G25; 11T71; 94A55; 94A60; 94B15

  23. arXiv:2306.00976  [pdf, other

    cs.CL

    TopEx: Topic-based Explanations for Model Comparison

    Authors: Shreya Havaldar, Adam Stein, Eric Wong, Lyle Ungar

    Abstract: Meaningfully comparing language models is challenging with current explanation methods. Current explanations are overwhelming for humans due to large vocabularies or incomparable across models. We present TopEx, an explanation method that enables a level playing field for comparing language models via model-agnostic topics. We demonstrate how TopEx can identify similarities and differences between… ▽ More

    Submitted 1 June, 2023; v1 submitted 1 June, 2023; originally announced June 2023.

    Comments: Accepted to ICLR 2023, Tiny Papers Track

  24. arXiv:2305.16308  [pdf, other

    cs.LG

    Rectifying Group Irregularities in Explanations for Distribution Shift

    Authors: Adam Stein, Yinjun Wu, Eric Wong, Mayur Naik

    Abstract: It is well-known that real-world changes constituting distribution shift adversely affect model performance. How to characterize those changes in an interpretable manner is poorly understood. Existing techniques to address this problem take the form of shift explanations that elucidate how to map samples from the original distribution toward the shifted one by reducing the disparity between these… ▽ More

    Submitted 25 May, 2023; originally announced May 2023.

    Comments: 19 pages, 5 figures

  25. arXiv:2303.09603  [pdf, ps, other

    math.CO cs.SC

    Rigorous Analytic Combinatorics in Several Variables in SageMath

    Authors: Benjamin Hackl, Andrew Luo, Stephen Melczer, Jesse Selover, Elaine Wong

    Abstract: We introduce the new sage_acsv package for the SageMath computer algebra system, allowing users to rigorously compute asymptotics for a large variety of multivariate sequences with rational generating functions. Using Sage's support for exact computations over the algebraic number field, this package provides the first rigorous implementation of algorithms from the theory of analytic combinatorics… ▽ More

    Submitted 31 August, 2023; v1 submitted 16 March, 2023; originally announced March 2023.

    Comments: 8 pages; Package: https://pypi.org/project/sage-acsv/

    Journal ref: Séminaire Lotharingiende Combinatoire 89B (2023): Proceedings of the 35th FPSAC Conference, Article #90,12pp

  26. arXiv:2303.01433  [pdf, other

    cs.LG cs.AI

    Do Machine Learning Models Learn Statistical Rules Inferred from Data?

    Authors: Aaditya Naik, Yinjun Wu, Mayur Naik, Eric Wong

    Abstract: Machine learning models can make critical errors that are easily hidden within vast amounts of data. Such errors often run counter to rules based on human intuition. However, rules based on human knowledge are challenging to scale or to even formalize. We thereby seek to infer statistical rules from the data and quantify the extent to which a model has learned them. We propose a framework SQRL tha… ▽ More

    Submitted 6 June, 2023; v1 submitted 2 March, 2023; originally announced March 2023.

  27. arXiv:2302.11042  [pdf, other

    cs.CL cs.LG

    In-context Example Selection with Influences

    Authors: Tai Nguyen, Eric Wong

    Abstract: In-context learning (ICL) is a powerful paradigm emerged from large language models (LLMs). Despite its promises, ICL performance is known to be highly sensitive to input examples. In this work, we use $\textit{in-context influences}$ to analyze few-shot ICL performance directly from the in-context examples. Our proposed influence-based example selection method can identify both positive and negat… ▽ More

    Submitted 5 June, 2023; v1 submitted 21 February, 2023; originally announced February 2023.

  28. arXiv:2302.04237  [pdf, other

    cs.LG

    Black Box Adversarial Prompting for Foundation Models

    Authors: Natalie Maus, Patrick Chao, Eric Wong, Jacob Gardner

    Abstract: Prompting interfaces allow users to quickly adjust the output of generative models in both vision and language. However, small changes and design choices in the prompt can lead to significant differences in the output. In this work, we develop a black-box framework for generating adversarial prompts for unstructured image and text generation. These prompts, which can be standalone or prepended to… ▽ More

    Submitted 29 May, 2023; v1 submitted 8 February, 2023; originally announced February 2023.

  29. A Unified Approach to Unimodality of Gaussian Polynomials

    Authors: Christoph Koutschan, Ali K. Uncu, Elaine Wong

    Abstract: In 2013, Pak and Panova proved the strict unimodality property of $q$-binomial coefficients $\binom{\ell+m}{m}_q$ (as polynomials in $q$) based on the combinatorics of Young tableaux and the semigroup property of Kronecker coefficients. They showed it to be true for all $\ell,m\geq 8$ and a few other cases. We propose a different approach to this problem based on computer algebra, where we establi… ▽ More

    Submitted 31 August, 2023; v1 submitted 8 February, 2023; originally announced February 2023.

    Comments: Supplementary material at https://wongey.github.io/unimodality

    Journal ref: ISSAC 2023: Proceedings of the 2023 International Symposium on Symbolic and Algebraic Computation, July 2023, Pages 434-442

  30. arXiv:2301.13379  [pdf, other

    cs.CL

    Faithful Chain-of-Thought Reasoning

    Authors: Qing Lyu, Shreya Havaldar, Adam Stein, Li Zhang, Delip Rao, Eric Wong, Marianna Apidianaki, Chris Callison-Burch

    Abstract: While Chain-of-Thought (CoT) prompting boosts Language Models' (LM) performance on a gamut of complex reasoning tasks, the generated reasoning chain does not necessarily reflect how the model arrives at the answer (aka. faithfulness). We propose Faithful CoT, a reasoning framework involving two stages: Translation (Natural Language query $\rightarrow$ symbolic reasoning chain) and Problem Solving… ▽ More

    Submitted 20 September, 2023; v1 submitted 30 January, 2023; originally announced January 2023.

    Comments: IJCNLP-AACL 2023 camera-ready version

  31. arXiv:2211.08624  [pdf, ps, other

    cs.SD cs.LG eess.AS

    Leveraging Heteroscedastic Uncertainty in Learning Complex Spectral Mapping for Single-channel Speech Enhancement

    Authors: Kuan-Lin Chen, Daniel D. E. Wong, Ke Tan, Buye Xu, Anurag Kumar, Vamsi Krishna Ithapu

    Abstract: Most speech enhancement (SE) models learn a point estimate and do not make use of uncertainty estimation in the learning process. In this paper, we show that modeling heteroscedastic uncertainty by minimizing a multivariate Gaussian negative log-likelihood (NLL) improves SE performance at no extra cost. During training, our approach augments a model learning complex spectral mapping with a tempora… ▽ More

    Submitted 8 March, 2023; v1 submitted 15 November, 2022; originally announced November 2022.

    Comments: 5 pages. Accepted at ICASSP 2023

  32. arXiv:2209.08422  [pdf

    cs.LG stat.ML

    Computed Decision Weights and a New Learning Algorithm for Neural Classifiers

    Authors: Eugene Wong

    Abstract: In this paper we consider the possibility of computing rather than training the decision layer weights of a neural classifier. Such a possibility arises in two way, from making an appropriate choice of loss function and by solving a problem of constrained optimization. The latter formulation leads to a promising new learning process for pre-decision weights with both simplicity and efficacy.

    Submitted 17 September, 2022; originally announced September 2022.

  33. arXiv:2209.02446  [pdf, other

    cs.CY cs.MM

    Web3 Challenges and Opportunities for the Market

    Authors: Dan Sheridan, James Harris, Frank Wear, Jerry Cowell Jr, Easton Wong, Abbas Yazdinejad

    Abstract: The inability of a computer to think has been a limiter in its usefulness and a point of reassurance for humanity since the first computers were created. The semantic web is the first step toward removing that barrier, enabling computers to operate based on conceptual understanding, and AI and ML are the second. Both semantic knowledge and the ability to learn are fundamental to web3, as are block… ▽ More

    Submitted 6 September, 2022; originally announced September 2022.

  34. arXiv:2207.05739  [pdf, other

    cs.LG

    A Data-Based Perspective on Transfer Learning

    Authors: Saachi Jain, Hadi Salman, Alaa Khaddaj, Eric Wong, Sung Min Park, Aleksander Madry

    Abstract: It is commonly believed that in transfer learning including more pre-training data translates into better performance. However, recent evidence suggests that removing data from the source dataset can actually help too. In this work, we take a closer look at the role of the source dataset's composition in transfer learning and present a framework for probing its impact on downstream performance. Ou… ▽ More

    Submitted 12 July, 2022; originally announced July 2022.

  35. arXiv:2207.02842  [pdf, other

    cs.LG

    When does Bias Transfer in Transfer Learning?

    Authors: Hadi Salman, Saachi Jain, Andrew Ilyas, Logan Engstrom, Eric Wong, Aleksander Madry

    Abstract: Using transfer learning to adapt a pre-trained "source model" to a downstream "target task" can dramatically increase performance with seemingly no downside. In this work, we demonstrate that there can exist a downside after all: bias transfer, or the tendency for biases of the source model to persist even after adapting the model to the target class. Through a combination of synthetic and natural… ▽ More

    Submitted 6 July, 2022; originally announced July 2022.

  36. arXiv:2204.08945  [pdf, other

    cs.CV cs.AI cs.LG

    Missingness Bias in Model Debugging

    Authors: Saachi Jain, Hadi Salman, Eric Wong, Pengchuan Zhang, Vibhav Vineet, Sai Vemprala, Aleksander Madry

    Abstract: Missingness, or the absence of features from an input, is a concept fundamental to many model debugging tools. However, in computer vision, pixels cannot simply be removed from an image. One thus tends to resort to heuristics such as blacking out pixels, which may in turn introduce bias into the debugging process. We study such biases and, in particular, show how transformer-based architectures ca… ▽ More

    Submitted 13 June, 2022; v1 submitted 19 April, 2022; originally announced April 2022.

    Comments: Published at ICLR 2022

  37. arXiv:2202.13898   

    cs.SE

    DistAD: Software Anomaly Detection Based on Execution Trace Distribution

    Authors: Shiyi Kong, Jun Ai, Minyan Lu, Shuguang Wang, W. Eric Wong

    Abstract: Modern software systems have become increasingly complex, which makes them difficult to test and validate. Detecting software partial anomalies in complex systems at runtime can assist with handling unintended software behaviors, avoiding catastrophic software failures and improving software runtime availability. These detection techniques aim to identify the manifestation of faults (anomalies) be… ▽ More

    Submitted 26 April, 2022; v1 submitted 28 February, 2022; originally announced February 2022.

    Comments: need modification, the experiment results need carefully check

  38. CCOMPASSION: A Hybrid Cloudlet Placement Framework over Passive Optical Access Networks

    Authors: Sourav Mondal, Goutam Das, Elaine Wong

    Abstract: Cloud-based computing technology is one of the most significant technical advents of the last decade and extension of this facility towards access networks by aggregation of cloudlets is a step further. To fulfill the ravenous demand for computational resources entangled with the stringent latency requirements of computationally-heavy applications related to augmented reality, cognitive assistance… ▽ More

    Submitted 25 February, 2022; originally announced February 2022.

    Comments: This paper is published in 2018 IEEE Conference on Computer Communications (INFOCOM). Copyright @ IEEE

    Report number: 18150397

    Journal ref: IEEE INFOCOM 2018 - IEEE Conference on Computer Communications

  39. arXiv:2110.07719  [pdf, other

    cs.CV cs.AI cs.LG

    Certified Patch Robustness via Smoothed Vision Transformers

    Authors: Hadi Salman, Saachi Jain, Eric Wong, Aleksander Mądry

    Abstract: Certified patch defenses can guarantee robustness of an image classifier to arbitrary changes within a bounded contiguous region. But, currently, this robustness comes at a cost of degraded standard accuracies and slower inference times. We demonstrate how using vision transformers enables significantly better certified patch robustness that is also more computationally efficient and does not incu… ▽ More

    Submitted 11 October, 2021; originally announced October 2021.

  40. arXiv:2106.12041  [pdf, other

    physics.ao-ph cs.LG physics.geo-ph

    Analysis of the Evolution of Parametric Drivers of High-End Sea-Level Hazards

    Authors: Alana Hough, Tony E. Wong

    Abstract: Climate models are critical tools for developing strategies to manage the risks posed by sea-level rise to coastal communities. While these models are necessary for understanding climate risks, there is a level of uncertainty inherent in each parameter in the models. This model parametric uncertainty leads to uncertainty in future climate risks. Consequently, there is a need to understand how thos… ▽ More

    Submitted 10 June, 2021; originally announced June 2021.

  41. arXiv:2106.09117  [pdf, other

    cs.LG

    DeepSplit: Scalable Verification of Deep Neural Networks via Operator Splitting

    Authors: Shaoru Chen, Eric Wong, J. Zico Kolter, Mahyar Fazlyab

    Abstract: Analyzing the worst-case performance of deep neural networks against input perturbations amounts to solving a large-scale non-convex optimization problem, for which several past works have proposed convex relaxations as a promising alternative. However, even for reasonably-sized neural networks, these relaxations are not tractable, and so must be replaced by even weaker relaxations in practice. In… ▽ More

    Submitted 8 July, 2022; v1 submitted 16 June, 2021; originally announced June 2021.

    Comments: Published in IEEE Open Journal of Control Systems

  42. Binomial Determinants for Tiling Problems Yield to the Holonomic Ansatz

    Authors: Hao Du, Christoph Koutschan, Thotsaporn Thanatipanonda, Elaine Wong

    Abstract: We present and prove closed form expressions for some families of binomial determinants with signed Kronecker deltas that are located along an arbitrary diagonal in the corresponding matrix. They count cyclically symmetric rhombus tilings of hexagonal regions with triangular holes. We extend a previous systematic study of these families, where the locations of the Kronecker deltas depended on an a… ▽ More

    Submitted 21 September, 2021; v1 submitted 18 May, 2021; originally announced May 2021.

    Comments: 45 pages; Supplementary material at https://wongey.github.io/binom-det

    Journal ref: European Journal of Combinatorics, Volume 99, January 2022, 103437

  43. arXiv:2105.04857  [pdf, other

    cs.LG stat.ML

    Leveraging Sparse Linear Layers for Debuggable Deep Networks

    Authors: Eric Wong, Shibani Santurkar, Aleksander Mądry

    Abstract: We show how fitting sparse linear models over learned deep feature representations can lead to more debuggable neural networks. These networks remain highly accurate while also being more amenable to human interpretation, as we demonstrate quantiatively via numerical and human experiments. We further illustrate how the resulting sparse explanations can help to identify spurious correlations, expla… ▽ More

    Submitted 11 May, 2021; originally announced May 2021.

  44. Creative Telescoping on Multiple Sums

    Authors: Christoph Koutschan, Elaine Wong

    Abstract: We showcase a collection of practical strategies to deal with a problem arising from an analysis of integral estimators derived via quasi-Monte Carlo methods. The problem reduces to a triple binomial sum, thereby enabling us to open up the holonomic toolkit, which contains tools such as creative telescoping that can be used to deduce a recurrence satisfied by the sum. While applying these techniqu… ▽ More

    Submitted 17 March, 2021; v1 submitted 17 October, 2020; originally announced October 2020.

    Comments: 22 pages; Supplementary material at https://wongey.github.io/digital-nets-walsh/

    Journal ref: Mathematics in Computer Science, Vol. 15(3), Pages 483-498 (2021)

  45. arXiv:2007.08450  [pdf, other

    cs.LG stat.ML

    Learning perturbation sets for robust machine learning

    Authors: Eric Wong, J. Zico Kolter

    Abstract: Although much progress has been made towards robust deep learning, a significant gap in robustness remains between real-world perturbations and more narrowly defined sets typically studied in adversarial defenses. In this paper, we aim to bridge this gap by learning perturbation sets from data, in order to characterize real-world effects for robust training and evaluation. Specifically, we use a c… ▽ More

    Submitted 8 October, 2020; v1 submitted 16 July, 2020; originally announced July 2020.

  46. arXiv:2007.00147  [pdf, other

    cs.LG stat.ML

    Neural Network Virtual Sensors for Fuel Injection Quantities with Provable Performance Specifications

    Authors: Eric Wong, Tim Schneider, Joerg Schmitt, Frank R. Schmidt, J. Zico Kolter

    Abstract: Recent work has shown that it is possible to learn neural networks with provable guarantees on the output of the model when subject to input perturbations, however these works have focused primarily on defending against adversarial examples for image classifiers. In this paper, we study how these provable guarantees can be naturally applied to other real world settings, namely getting performance… ▽ More

    Submitted 30 June, 2020; originally announced July 2020.

  47. Walsh functions, scrambled $(0,m,s)$-nets, and negative covariance: applying symbolic computation to quasi-Monte Carlo integration

    Authors: Jaspar Wiart, Elaine Wong

    Abstract: We investigate base $b$ Walsh functions for which the variance of the integral estimator based on a scrambled $(0,m,s)$-net in base $b$ is less than or equal to that of the Monte-Carlo estimator based on the same number of points. First we compute the Walsh decomposition for the joint probability density function of two distinct points randomly chosen from a scrambled $(t,m,s)$-net in base $b$ in… ▽ More

    Submitted 11 June, 2020; originally announced June 2020.

    Comments: 27 pages; Supplementary material at https://wongey.github.io/digital-nets-walsh/

    Journal ref: Mathematics and Computers in Simulation, Volume 182, April 2021, Pages 277-295

  48. Centralized and Decentralized Non-Cooperative Load-Balancing Games among Federated Cloudlets

    Authors: Sourav Mondal, Goutam Das, Elaine Wong

    Abstract: Edge computing servers like cloudlets from different service providers compensate scarce computational, memory, and energy resources of mobile devices, are distributed across access networks. However, depending on the mobility pattern and dynamically varying computational requirements of associated mobile devices, cloudlets at different parts of the network become either overloaded or under-loaded… ▽ More

    Submitted 5 May, 2021; v1 submitted 30 May, 2020; originally announced June 2020.

    Report number: 108847

  49. arXiv:2002.11569  [pdf, other

    cs.LG stat.ML

    Overfitting in adversarially robust deep learning

    Authors: Leslie Rice, Eric Wong, J. Zico Kolter

    Abstract: It is common practice in deep learning to use overparameterized networks and train for as long as possible; there are numerous studies that show, both theoretically and empirically, that such practices surprisingly do not unduly harm the generalization performance of the classifier. In this paper, we empirically study this phenomenon in the setting of adversarially trained deep networks, which are… ▽ More

    Submitted 4 March, 2020; v1 submitted 26 February, 2020; originally announced February 2020.

  50. An Additive Decomposition in S-Primitive Towers

    Authors: Hao Du, Jing Guo, Ziming Li, Elaine Wong

    Abstract: We consider the additive decomposition problem in primitive towers and present an algorithm to decompose a function in an S-primitive tower as a sum of a derivative in the tower and a remainder which is minimal in some sense. Special instances of S-primitive towers include differential fields generated by finitely many logarithmic functions and logarithmic integrals. A function in an S-primitive t… ▽ More

    Submitted 6 February, 2020; originally announced February 2020.

    Comments: This article has been submitted to ISSAC2020 for review. Supplementary material at https://wongey.github.io/add-decomp-sprimitive/

    Journal ref: ISSAC 2020: Proceedings of the 45th International Symposium on Symbolic and Algebraic Computation, July 2020, Pages 146-153