Skip to main content

Showing 1–50 of 51 results for author: Weng, T

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.04307  [pdf, other

    cs.CL cs.LG

    Crafting Large Language Models for Enhanced Interpretability

    Authors: Chung-En Sun, Tuomas Oikarinen, Tsui-Wei Weng

    Abstract: We introduce the Concept Bottleneck Large Language Model (CB-LLM), a pioneering approach to creating inherently interpretable Large Language Models (LLMs). Unlike traditional black-box LLMs that rely on post-hoc interpretation methods with limited neuron function insights, CB-LLM sets a new standard with its built-in interpretability, scalability, and ability to provide clear, accurate explanation… ▽ More

    Submitted 5 July, 2024; originally announced July 2024.

    Comments: Present at ICML 2024 Mechanistic Interpretability (MI) Workshop

  2. arXiv:2406.18062  [pdf, other

    cs.LG cs.AI

    Breaking the Barrier: Enhanced Utility and Robustness in Smoothed DRL Agents

    Authors: Chung-En Sun, Sicun Gao, Tsui-Wei Weng

    Abstract: Robustness remains a paramount concern in deep reinforcement learning (DRL), with randomized smoothing emerging as a key technique for enhancing this attribute. However, a notable gap exists in the performance of current smoothed DRL agents, often characterized by significantly low clean rewards and weak robustness. In response to this challenge, our study introduces innovative algorithms aimed at… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

    Comments: Published in ICML 2024

  3. arXiv:2406.16990  [pdf, other

    cs.SD cs.AI eess.AS

    AND: Audio Network Dissection for Interpreting Deep Acoustic Models

    Authors: Tung-Yu Wu, Yu-Xiang Lin, Tsui-Wei Weng

    Abstract: Neuron-level interpretations aim to explain network behaviors and properties by investigating neurons responsive to specific perceptual or structural input patterns. Although there is emerging work in the vision and language domains, none is explored for acoustic models. To bridge the gap, we introduce $\textit{AND}$, the first $\textbf{A}$udio $\textbf{N}$etwork $\textbf{D}$issection framework th… ▽ More

    Submitted 26 June, 2024; v1 submitted 24 June, 2024; originally announced June 2024.

    Comments: Accepted by ICML'24

    Journal ref: Forty-first International Conference on Machine Learning (2024)

  4. arXiv:2406.02240  [pdf, other

    cs.NI

    Quantum Computing in Wireless Communications and Networking: A Tutorial-cum-Survey

    Authors: Wei Zhao, Tangjie Weng, Yue Ruan, Zhi Liu, Xuangou Wu, Xiao Zheng, Nei Kato

    Abstract: Owing to its outstanding parallel computing capabilities, quantum computing (QC) has been a subject of continuous attention. With the gradual maturation of QC platforms, it has increasingly played a significant role in various fields such as transportation, pharmaceuticals, and industrial manufacturing,achieving unprecedented milestones. In modern society, wireless communication stands as an indis… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

  5. arXiv:2405.06855  [pdf, other

    cs.LG cs.CV

    Linear Explanations for Individual Neurons

    Authors: Tuomas Oikarinen, Tsui-Wei Weng

    Abstract: In recent years many methods have been developed to understand the internal workings of neural networks, often by describing the function of individual neurons in the model. However, these methods typically only focus on explaining the very highest activations of a neuron. In this paper we show this is not sufficient, and that the highest activation range is only responsible for a very small perce… ▽ More

    Submitted 10 May, 2024; originally announced May 2024.

    Comments: Published in ICML 2024

  6. arXiv:2404.19651  [pdf, other

    cs.LG cs.AI cs.CV

    Provably Robust Conformal Prediction with Improved Efficiency

    Authors: Ge Yan, Yaniv Romano, Tsui-Wei Weng

    Abstract: Conformal prediction is a powerful tool to generate uncertainty sets with guaranteed coverage using any predictive model, under the assumption that the training and test data are i.i.d.. Recently, it has been shown that adversarial examples are able to manipulate conformal methods to construct prediction sets with invalid coverage rates, as the i.i.d. assumption is violated. To address this issue,… ▽ More

    Submitted 30 April, 2024; originally announced April 2024.

  7. arXiv:2403.13771  [pdf, other

    cs.CV cs.LG

    Describe-and-Dissect: Interpreting Neurons in Vision Networks with Language Models

    Authors: Nicholas Bai, Rahul A. Iyer, Tuomas Oikarinen, Tsui-Wei Weng

    Abstract: In this paper, we propose Describe-and-Dissect (DnD), a novel method to describe the roles of hidden neurons in vision networks. DnD utilizes recent advancements in multimodal deep learning to produce complex natural language descriptions, without the need for labeled training data or a predefined set of concepts to choose from. Additionally, DnD is training-free, meaning we don't train any new mo… ▽ More

    Submitted 20 March, 2024; originally announced March 2024.

  8. arXiv:2312.10469  [pdf, other

    cs.LG stat.ML

    One step closer to unbiased aleatoric uncertainty estimation

    Authors: Wang Zhang, Ziwen Ma, Subhro Das, Tsui-Wei Weng, Alexandre Megretski, Luca Daniel, Lam M. Nguyen

    Abstract: Neural networks are powerful tools in various applications, and quantifying their uncertainty is crucial for reliable decision-making. In the deep learning field, the uncertainties are usually categorized into aleatoric (data) and epistemic (model) uncertainty. In this paper, we point out that the existing popular variance attenuation method highly overestimates aleatoric uncertainty. To address t… ▽ More

    Submitted 20 December, 2023; v1 submitted 16 December, 2023; originally announced December 2023.

  9. arXiv:2311.18496  [pdf, other

    cs.CV

    Accurate Segmentation of Optic Disc And Cup from Multiple Pseudo-labels by Noise-aware Learning

    Authors: Tengjin Weng, Yang Shen, Zhidong Zhao, Zhiming Cheng, Shuai Wang

    Abstract: Optic disc and cup segmentation plays a crucial role in automating the screening and diagnosis of optic glaucoma. While data-driven convolutional neural networks (CNNs) show promise in this area, the inherent ambiguity of segmenting objects and background boundaries in the task of optic disc and cup segmentation leads to noisy annotations that impact model performance. To address this, we propose… ▽ More

    Submitted 15 March, 2024; v1 submitted 30 November, 2023; originally announced November 2023.

    Comments: CSCWD 2024

  10. arXiv:2311.11669  [pdf, other

    cs.CV

    PMP-Swin: Multi-Scale Patch Message Passing Swin Transformer for Retinal Disease Classification

    Authors: Zhihan Yang, Zhiming Cheng, Tengjin Weng, Shucheng He, Yaqi Wang, Xin Ye, Shuai Wang

    Abstract: Retinal disease is one of the primary causes of visual impairment, and early diagnosis is essential for preventing further deterioration. Nowadays, many works have explored Transformers for diagnosing diseases due to their strong visual representation capabilities. However, retinal diseases exhibit milder forms and often present with overlapping signs, which pose great difficulties for accurate mu… ▽ More

    Submitted 20 November, 2023; originally announced November 2023.

    Comments: 9 pages, 7 figures

  11. arXiv:2311.10380  [pdf, other

    cs.CV

    MSE-Nets: Multi-annotated Semi-supervised Ensemble Networks for Improving Segmentation of Medical Image with Ambiguous Boundaries

    Authors: Shuai Wang, Tengjin Weng, Jingyi Wang, Yang Shen, Zhidong Zhao, Yixiu Liu, Pengfei Jiao, Zhiming Cheng, Yaqi Wang

    Abstract: Medical image segmentation annotations exhibit variations among experts due to the ambiguous boundaries of segmented objects and backgrounds in medical images. Although using multiple annotations for each image in the fully-supervised has been extensively studied for training deep models, obtaining a large amount of multi-annotated data is challenging due to the substantial time and manpower costs… ▽ More

    Submitted 17 November, 2023; originally announced November 2023.

  12. arXiv:2310.16332  [pdf, other

    cs.LG

    Corrupting Neuron Explanations of Deep Visual Features

    Authors: Divyansh Srivastava, Tuomas Oikarinen, Tsui-Wei Weng

    Abstract: The inability of DNNs to explain their black-box behavior has led to a recent surge of explainability methods. However, there are growing concerns that these explainability methods are not robust and trustworthy. In this work, we perform the first robustness analysis of Neuron Explanation Methods under a unified pipeline and show that these explanations can be significantly corrupted by random noi… ▽ More

    Submitted 24 October, 2023; originally announced October 2023.

    Journal ref: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2023, pp. 1877-1886

  13. arXiv:2310.07780  [pdf, other

    cs.LG

    Promoting Robustness of Randomized Smoothing: Two Cost-Effective Approaches

    Authors: Linbo Liu, Trong Nghia Hoang, Lam M. Nguyen, Tsui-Wei Weng

    Abstract: Randomized smoothing has recently attracted attentions in the field of adversarial robustness to provide provable robustness guarantees on smoothed neural network classifiers. However, existing works show that vanilla randomized smoothing usually does not provide good robustness performance and often requires (re)training techniques on the base classifier in order to boost the robustness of the re… ▽ More

    Submitted 11 October, 2023; originally announced October 2023.

  14. arXiv:2310.06200  [pdf, other

    cs.CL cs.LG

    The Importance of Prompt Tuning for Automated Neuron Explanations

    Authors: Justin Lee, Tuomas Oikarinen, Arjun Chatha, Keng-Chi Chang, Yilan Chen, Tsui-Wei Weng

    Abstract: Recent advances have greatly increased the capabilities of large language models (LLMs), but our understanding of the models and their safety has not progressed as fast. In this paper we aim to understand LLMs deeper by studying their individual neurons. We build upon previous work showing large language models such as GPT-4 can be useful in explaining what each neuron in a language model does. Sp… ▽ More

    Submitted 11 October, 2023; v1 submitted 9 October, 2023; originally announced October 2023.

  15. arXiv:2308.12820  [pdf, other

    cs.LG cs.CY stat.ML

    Prediction without Preclusion: Recourse Verification with Reachable Sets

    Authors: Avni Kothari, Bogdan Kulynych, Tsui-Wei Weng, Berk Ustun

    Abstract: Machine learning models are often used to decide who receives a loan, a job interview, or a public benefit. Models in such settings use features without considering their actionability. As a result, they can assign predictions that are fixed $-$ meaning that individuals who are denied loans and interviews are, in fact, precluded from access to credit and employment. In this work, we introduce a pr… ▽ More

    Submitted 1 May, 2024; v1 submitted 24 August, 2023; originally announced August 2023.

    Comments: ICLR 2024 Spotlight. The first two authors contributed equally

  16. arXiv:2306.02582  [pdf, other

    cs.CV

    Enhancing Point Annotations with Superpixel and Confidence Learning Guided for Improving Semi-Supervised OCT Fluid Segmentation

    Authors: Tengjin Weng, Yang Shen, Kai Jin, Zhiming Cheng, Yunxiang Li, Gewen Zhang, Shuai Wang, Yaqi Wang

    Abstract: Automatic segmentation of fluid in Optical Coherence Tomography (OCT) images is beneficial for ophthalmologists to make an accurate diagnosis. Although semi-supervised OCT fluid segmentation networks enhance their performance by introducing additional unlabeled data, the performance enhancement is limited. To address this, we propose Superpixel and Confident Learning Guide Point Annotations Networ… ▽ More

    Submitted 30 November, 2023; v1 submitted 5 June, 2023; originally announced June 2023.

    Comments: Submission to BSPC

  17. arXiv:2304.13346  [pdf, other

    cs.LG cs.CV

    Concept-Monitor: Understanding DNN training through individual neurons

    Authors: Mohammad Ali Khan, Tuomas Oikarinen, Tsui-Wei Weng

    Abstract: In this work, we propose a general framework called Concept-Monitor to help demystify the black-box DNN training processes automatically using a novel unified embedding space and concept diversity metric. Concept-Monitor enables human-interpretable visualization and indicators of the DNN training processes and facilitates transparency as well as deeper understanding on how DNNs develop along the d… ▽ More

    Submitted 26 April, 2023; originally announced April 2023.

  18. arXiv:2304.12584  [pdf, other

    physics.optics cs.LG

    Learning imaging mechanism directly from optical microscopy observations

    Authors: Ze-Hao Wang, Long-Kun Shan, Tong-Tian Weng, Tian-Long Chen, Qi-Yu Wang, Xiang-Dong Chen, Zhang-Yang Wang, Guang-Can Guo, Fang-Wen Sun

    Abstract: Optical microscopy image plays an important role in scientific research through the direct visualization of the nanoworld, where the imaging mechanism is described as the convolution of the point spread function (PSF) and emitters. Based on a priori knowledge of the PSF or equivalent PSF, it is possible to achieve more precise exploration of the nanoworld. However, it is an outstanding challenge t… ▽ More

    Submitted 25 April, 2023; originally announced April 2023.

  19. arXiv:2304.06129  [pdf, other

    cs.LG cs.CV

    Label-Free Concept Bottleneck Models

    Authors: Tuomas Oikarinen, Subhro Das, Lam M. Nguyen, Tsui-Wei Weng

    Abstract: Concept bottleneck models (CBM) are a popular way of creating more interpretable neural networks by having hidden layer neurons correspond to human-understandable concepts. However, existing CBMs and their variants have two crucial limitations: first, they need to collect labeled data for each of the predefined concepts, which is time consuming and labor intensive; second, the accuracy of a CBM is… ▽ More

    Submitted 5 June, 2023; v1 submitted 12 April, 2023; originally announced April 2023.

    Comments: Published at ICLR 2023. New v2(5 June 2023): added crowdsourced human study in Appendix B

  20. arXiv:2304.00601  [pdf, other

    cs.CV cs.LG

    Constructive Assimilation: Boosting Contrastive Learning Performance through View Generation Strategies

    Authors: Ligong Han, Seungwook Han, Shivchander Sudalairaj, Charlotte Loh, Rumen Dangovski, Fei Deng, Pulkit Agrawal, Dimitris Metaxas, Leonid Karlinsky, Tsui-Wei Weng, Akash Srivastava

    Abstract: Transformations based on domain expertise (expert transformations), such as random-resized-crop and color-jitter, have proven critical to the success of contrastive learning techniques such as SimCLR. Recently, several attempts have been made to replace such domain-specific, human-designed transformations with generated views that are learned. However for imagery data, so far none of these view-ge… ▽ More

    Submitted 8 April, 2023; v1 submitted 2 April, 2023; originally announced April 2023.

    Comments: Accepted at Generative Models for Computer Vision Workshop 2023

  21. arXiv:2302.05783  [pdf, other

    cs.LG

    ConCerNet: A Contrastive Learning Based Framework for Automated Conservation Law Discovery and Trustworthy Dynamical System Prediction

    Authors: Wang Zhang, Tsui-Wei Weng, Subhro Das, Alexandre Megretski, Luca Daniel, Lam M. Nguyen

    Abstract: Deep neural networks (DNN) have shown great capacity of modeling a dynamical system; nevertheless, they usually do not obey physics constraints such as conservation laws. This paper proposes a new learning framework named ConCerNet to improve the trustworthiness of the DNN based dynamics modeling to endow the invariant properties. ConCerNet consists of two steps: (i) a contrastive learning method… ▽ More

    Submitted 19 July, 2023; v1 submitted 11 February, 2023; originally announced February 2023.

    Comments: Accepted by ICML 2023

  22. arXiv:2301.11324  [pdf, other

    cs.LG

    Certified Interpretability Robustness for Class Activation Mapping

    Authors: Alex Gu, Tsui-Wei Weng, Pin-Yu Chen, Sijia Liu, Luca Daniel

    Abstract: Interpreting machine learning models is challenging but crucial for ensuring the safety of deep networks in autonomous driving systems. Due to the prevalence of deep learning based perception models in autonomous vehicles, accurately interpreting their predictions is crucial. While a variety of such methods have been proposed, most are shown to lack robustness. Yet, little has been done to provide… ▽ More

    Submitted 26 January, 2023; originally announced January 2023.

    Comments: 13 pages, 5 figures. Accepted to Machine Learning for Autonomous Driving Workshop at NeurIPS 2020

  23. arXiv:2211.02647  [pdf, other

    cs.RO

    Neural Grasp Distance Fields for Robot Manipulation

    Authors: Thomas Weng, David Held, Franziska Meier, Mustafa Mukadam

    Abstract: We formulate grasp learning as a neural field and present Neural Grasp Distance Fields (NGDF). Here, the input is a 6D pose of a robot end effector and output is a distance to a continuous manifold of valid grasps for an object. In contrast to current approaches that predict a set of discrete candidate grasps, the distance-based NGDF representation is easily interpreted as a cost, and minimizing t… ▽ More

    Submitted 28 December, 2023; v1 submitted 4 November, 2022; originally announced November 2022.

    Comments: Accepted to ICRA 2023

  24. arXiv:2210.11513  [pdf, other

    cs.LG

    Learning Sample Reweighting for Accuracy and Adversarial Robustness

    Authors: Chester Holtz, Tsui-Wei Weng, Gal Mishne

    Abstract: There has been great interest in enhancing the robustness of neural network classifiers to defend against adversarial perturbations through adversarial training, while balancing the trade-off between robust accuracy and standard accuracy. We propose a novel adversarial training framework that learns to reweight the loss associated with individual training samples based on a notion of class-conditi… ▽ More

    Submitted 20 October, 2022; originally announced October 2022.

  25. arXiv:2207.13891  [pdf, other

    cs.RO

    Quantifying Safety of Learning-based Self-Driving Control Using Almost-Barrier Functions

    Authors: Zhizhen Qin, Tsui-Wei Weng, Sicun Gao

    Abstract: Path-tracking control of self-driving vehicles can benefit from deep learning for tackling longstanding challenges such as nonlinearity and uncertainty. However, deep neural controllers lack safety guarantees, restricting their practical use. We propose a new approach of learning almost-barrier functions, which approximately characterizes the forward invariant set for the system under neural contr… ▽ More

    Submitted 8 August, 2022; v1 submitted 28 July, 2022; originally announced July 2022.

    Journal ref: International Conference on Intelligent Robots and Systems, IROS 2022, Kyoto, Japan, Oct 23 - Oct. 27, 2022

  26. arXiv:2207.11196  [pdf, other

    cs.RO

    Learning to Singulate Layers of Cloth using Tactile Feedback

    Authors: Sashank Tirumala, Thomas Weng, Daniel Seita, Oliver Kroemer, Zeynep Temel, David Held

    Abstract: Robotic manipulation of cloth has applications ranging from fabrics manufacturing to handling blankets and laundry. Cloth manipulation is challenging for robots largely due to their high degrees of freedom, complex dynamics, and severe self-occlusions when in folded or crumpled configurations. Prior work on robotic manipulation of cloth relies primarily on vision sensors alone, which may pose chal… ▽ More

    Submitted 22 July, 2022; originally announced July 2022.

    Comments: IROS 2022. See https://sites.google.com/view/reskin-cloth for supplementary material

  27. arXiv:2207.09249  [pdf, other

    cs.CY cs.DC

    Fabric-GC: A Blockchain-based Gantt Chart System for Cross-organizational Project Management

    Authors: Dun Li, Dezhi Han, Benhui Xia, Tien-Hsiung Weng, Arcangelo Castiglione, Kuan-Ching Li

    Abstract: Large-scale production is always associated with more and more development and interaction among peers, and many fields achieve higher economic benefits through project cooperation. However, project managers in the traditional centralized approach cannot rearrange their activities to cross-organizational project management. Thanks to its characteristics, the Blockchain can represent a valid soluti… ▽ More

    Submitted 19 July, 2022; originally announced July 2022.

  28. arXiv:2204.10965  [pdf, other

    cs.CV cs.AI cs.LG

    CLIP-Dissect: Automatic Description of Neuron Representations in Deep Vision Networks

    Authors: Tuomas Oikarinen, Tsui-Wei Weng

    Abstract: In this paper, we propose CLIP-Dissect, a new technique to automatically describe the function of individual hidden neurons inside vision networks. CLIP-Dissect leverages recent advances in multimodal vision/language models to label internal neurons with open-ended concepts without the need for any labeled data or human examples. We show that CLIP-Dissect provides more accurate descriptions than e… ▽ More

    Submitted 5 June, 2023; v1 submitted 22 April, 2022; originally announced April 2022.

    Comments: Published in ICLR 2023 Conference (Spotlight). New v5(5 June 2023) - Added crowdsourced user study in Appendix B, not included in ICLR publication

  29. arXiv:2202.03558  [pdf, other

    cs.LG cs.AI

    Attacking c-MARL More Effectively: A Data Driven Approach

    Authors: Nhan H. Pham, Lam M. Nguyen, Jie Chen, Hoang Thanh Lam, Subhro Das, Tsui-Wei Weng

    Abstract: In recent years, a proliferation of methods were developed for cooperative multi-agent reinforcement learning (c-MARL). However, the robustness of c-MARL agents against adversarial attacks has been rarely explored. In this paper, we propose to evaluate the robustness of c-MARL agents via a model-based approach, named c-MBA. Our proposed formulation can craft much stronger adversarial state perturb… ▽ More

    Submitted 10 September, 2023; v1 submitted 7 February, 2022; originally announced February 2022.

  30. arXiv:2111.06063  [pdf, other

    stat.ML cs.CV cs.LG math.OC

    On the Equivalence between Neural Network and Support Vector Machine

    Authors: Yilan Chen, Wei Huang, Lam M. Nguyen, Tsui-Wei Weng

    Abstract: Recent research shows that the dynamics of an infinitely wide neural network (NN) trained by gradient descent can be characterized by Neural Tangent Kernel (NTK) \citep{jacot2018neural}. Under the squared loss, the infinite-width NN trained by gradient descent with an infinitely small learning rate is equivalent to kernel regression with NTK \citep{arora2019exact}. However, the equivalence is only… ▽ More

    Submitted 7 July, 2022; v1 submitted 11 November, 2021; originally announced November 2021.

    Comments: 35th Conference on Neural Information Processing Systems (NeurIPS 2021)

  31. arXiv:2111.05623  [pdf, other

    cs.RO cs.CV

    FabricFlowNet: Bimanual Cloth Manipulation with a Flow-based Policy

    Authors: Thomas Weng, Sujay Bajracharya, Yufei Wang, Khush Agrawal, David Held

    Abstract: We address the problem of goal-directed cloth manipulation, a challenging task due to the deformability of cloth. Our insight is that optical flow, a technique normally used for motion estimation in video, can also provide an effective representation for corresponding cloth poses across observation and goal images. We introduce FabricFlowNet (FFN), a cloth manipulation policy that leverages flow a… ▽ More

    Submitted 10 April, 2022; v1 submitted 10 November, 2021; originally announced November 2021.

    Comments: CoRL 2021

  32. arXiv:2102.10454  [pdf, other

    cs.LG cs.AI cs.CV

    On Fast Adversarial Robustness Adaptation in Model-Agnostic Meta-Learning

    Authors: Ren Wang, Kaidi Xu, Sijia Liu, Pin-Yu Chen, Tsui-Wei Weng, Chuang Gan, Meng Wang

    Abstract: Model-agnostic meta-learning (MAML) has emerged as one of the most successful meta-learning techniques in few-shot learning. It enables us to learn a meta-initialization} of model parameters (that we call meta-model) to rapidly adapt to new tasks using a small amount of labeled training data. Despite the generalization power of the meta-model, it remains elusive that how adversarial robustness can… ▽ More

    Submitted 20 February, 2021; originally announced February 2021.

  33. arXiv:2102.01208  [pdf, ps, other

    cs.LG stat.ML

    Fast Training of Provably Robust Neural Networks by SingleProp

    Authors: Akhilan Boopathy, Tsui-Wei Weng, Sijia Liu, Pin-Yu Chen, Gaoyuan Zhang, Luca Daniel

    Abstract: Recent works have developed several methods of defending neural networks against adversarial attacks with certified guarantees. However, these techniques can be computationally costly due to the use of certification during training. We develop a new regularizer that is both more efficient than existing certified defenses, requiring only one additional forward propagation through a network, and can… ▽ More

    Submitted 1 February, 2021; originally announced February 2021.

    Comments: Published at AAAI 2021

  34. arXiv:2010.06651  [pdf, other

    cs.LG stat.ML

    Higher-Order Certification for Randomized Smoothing

    Authors: Jeet Mohapatra, Ching-Yun Ko, Tsui-Wei Weng, Pin-Yu Chen, Sijia Liu, Luca Daniel

    Abstract: Randomized smoothing is a recently proposed defense against adversarial attacks that has achieved SOTA provable robustness against $\ell_2$ perturbations. A number of publications have extended the guarantees to other metrics, such as $\ell_1$ or $\ell_\infty$, by using different smoothing measures. Although the current framework has been shown to yield near-optimal $\ell_p$ radii, the total safet… ▽ More

    Submitted 13 October, 2020; originally announced October 2020.

    Comments: Accepted to NeurIPS2020(spotlight)

  35. Cloth Region Segmentation for Robust Grasp Selection

    Authors: Jianing Qian, Thomas Weng, Luxin Zhang, Brian Okorn, David Held

    Abstract: Cloth detection and manipulation is a common task in domestic and industrial settings, yet such tasks remain a challenge for robots due to cloth deformability. Furthermore, in many cloth-related tasks like laundry folding and bed making, it is crucial to manipulate specific regions like edges and corners, as opposed to folds. In this work, we focus on the problem of segmenting and grasping these k… ▽ More

    Submitted 12 August, 2020; originally announced August 2020.

    Comments: Accepted at IROS 2020. The first two authors contributed equally and are listed in alphabetical order

    Journal ref: 2020 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)

  36. arXiv:2008.01976  [pdf, other

    cs.LG cs.AI cs.CR stat.ML

    Robust Deep Reinforcement Learning through Adversarial Loss

    Authors: Tuomas Oikarinen, Wang Zhang, Alexandre Megretski, Luca Daniel, Tsui-Wei Weng

    Abstract: Recent studies have shown that deep reinforcement learning agents are vulnerable to small adversarial perturbations on the agent's inputs, which raises concerns about deploying such agents in the real world. To address this issue, we propose RADIAL-RL, a principled framework to train reinforcement learning agents with improved robustness against $l_p$-norm bounded adversarial attacks. Our framewor… ▽ More

    Submitted 10 November, 2021; v1 submitted 5 August, 2020; originally announced August 2020.

  37. Multi-modal Transfer Learning for Grasping Transparent and Specular Objects

    Authors: Thomas Weng, Amith Pallankize, Yimin Tang, Oliver Kroemer, David Held

    Abstract: State-of-the-art object grasping methods rely on depth sensing to plan robust grasps, but commercially available depth sensors fail to detect transparent and specular objects. To improve grasping performance on such objects, we introduce a method for learning a multi-modal perception model by bootstrapping from an existing uni-modal model. This transfer learning approach requires only a pre-existi… ▽ More

    Submitted 29 May, 2020; originally announced June 2020.

    Comments: RA-L with presentation at ICRA 2020

    Journal ref: IEEE ROBOTICS AND AUTOMATION LETTERS, VOL. 5, NO. 3, JULY 2020. 3791-3798

  38. arXiv:1908.06353  [pdf, other

    cs.LG stat.ML

    Verification of Neural Network Control Policy Under Persistent Adversarial Perturbation

    Authors: Yuh-Shyang Wang, Tsui-Wei Weng, Luca Daniel

    Abstract: Deep neural networks are known to be fragile to small adversarial perturbations. This issue becomes more critical when a neural network is interconnected with a physical system in a closed loop. In this paper, we show how to combine recent works on neural network certification tools (which are mainly used in static settings such as image classification) with robust control theory to certify a neur… ▽ More

    Submitted 17 August, 2019; originally announced August 2019.

  39. arXiv:1906.04214  [pdf, other

    cs.LG cs.CR cs.SI stat.ML

    Topology Attack and Defense for Graph Neural Networks: An Optimization Perspective

    Authors: Kaidi Xu, Hongge Chen, Sijia Liu, Pin-Yu Chen, Tsui-Wei Weng, Mingyi Hong, Xue Lin

    Abstract: Graph neural networks (GNNs) which apply the deep neural networks to graph data have achieved significant performance for the task of semi-supervised node classification. However, only few work has addressed the adversarial robustness of GNNs. In this paper, we first present a novel gradient-based attack method that facilitates the difficulty of tackling discrete graph data. When comparing to curr… ▽ More

    Submitted 14 October, 2019; v1 submitted 10 June, 2019; originally announced June 2019.

    Comments: Accepted by IJCAI 2019, the 28th International Joint Conference on Artificial Intelligence

    Journal ref: International Joint Conference on Artificial Intelligence (IJCAI-2019)

  40. arXiv:1905.07387  [pdf, other

    cs.LG cs.CR cs.CV stat.ML

    POPQORN: Quantifying Robustness of Recurrent Neural Networks

    Authors: Ching-Yun Ko, Zhaoyang Lyu, Tsui-Wei Weng, Luca Daniel, Ngai Wong, Dahua Lin

    Abstract: The vulnerability to adversarial attacks has been a critical issue for deep neural networks. Addressing this issue requires a reliable way to evaluate the robustness of a network. Recently, several methods have been developed to compute $\textit{robustness quantification}$ for neural networks, namely, certified lower bounds of the minimum adversarial perturbation. Such methods, however, were devis… ▽ More

    Submitted 17 May, 2019; originally announced May 2019.

    Comments: 10 pages, Ching-Yun Ko and Zhaoyang Lyu contributed equally, accepted to ICML 2019. Please see arXiv source codes for the appendix by clicking [Other formats]

  41. arXiv:1901.07648  [pdf, other

    math.OC cs.LG stat.ML

    Finite-Sum Smooth Optimization with SARAH

    Authors: Lam M. Nguyen, Marten van Dijk, Dzung T. Phan, Phuong Ha Nguyen, Tsui-Wei Weng, Jayant R. Kalagnanam

    Abstract: The total complexity (measured as the total number of gradient computations) of a stochastic first-order optimization algorithm that finds a first-order stationary point of a finite-sum smooth nonconvex objective function $F(w)=\frac{1}{n} \sum_{i=1}^n f_i(w)$ has been proven to be at least $Ω(\sqrt{n}/ε)$ for $n \leq \mathcal{O}(ε^{-2})$ where $ε$ denotes the attained accuracy… ▽ More

    Submitted 22 April, 2019; v1 submitted 22 January, 2019; originally announced January 2019.

  42. arXiv:1812.08329  [pdf, other

    cs.LG cs.CR stat.ML

    PROVEN: Certifying Robustness of Neural Networks with a Probabilistic Approach

    Authors: Tsui-Wei Weng, Pin-Yu Chen, Lam M. Nguyen, Mark S. Squillante, Ivan Oseledets, Luca Daniel

    Abstract: With deep neural networks providing state-of-the-art machine learning models for numerous machine learning tasks, quantifying the robustness of these models has become an important area of research. However, most of the research literature merely focuses on the \textit{worst-case} setting where the input of the neural network is perturbed with noises that are constrained within an $\ell_p$ ball; a… ▽ More

    Submitted 7 January, 2019; v1 submitted 18 December, 2018; originally announced December 2018.

    Comments: updated ref [25]

  43. arXiv:1811.12395  [pdf, other

    stat.ML cs.CR cs.LG

    CNN-Cert: An Efficient Framework for Certifying Robustness of Convolutional Neural Networks

    Authors: Akhilan Boopathy, Tsui-Wei Weng, Pin-Yu Chen, Sijia Liu, Luca Daniel

    Abstract: Verifying robustness of neural network classifiers has attracted great interests and attention due to the success of deep neural networks and their unexpected vulnerability to adversarial perturbations. Although finding minimum adversarial distortion of neural networks (with ReLU activations) has been shown to be an NP-complete problem, obtaining a non-trivial lower bound of minimum distortion as… ▽ More

    Submitted 29 November, 2018; originally announced November 2018.

    Comments: Accepted by AAAI 2019

  44. arXiv:1811.00866  [pdf, other

    cs.LG cs.CR stat.ML

    Efficient Neural Network Robustness Certification with General Activation Functions

    Authors: Huan Zhang, Tsui-Wei Weng, Pin-Yu Chen, Cho-Jui Hsieh, Luca Daniel

    Abstract: Finding minimum distortion of adversarial examples and thus certifying robustness in neural network classifiers for given data points is known to be a challenging problem. Nevertheless, recently it has been shown to be possible to give a non-trivial certified lower bound of minimum adversarial distortion, and some recent progress has been made towards this direction by exploiting the piece-wise li… ▽ More

    Submitted 2 November, 2018; originally announced November 2018.

    Comments: Accepted by NIPS 2018. Huan Zhang and Tsui-Wei Weng contributed equally

  45. arXiv:1810.08640  [pdf, ps, other

    cs.LG cs.CR stat.ML

    On Extensions of CLEVER: A Neural Network Robustness Evaluation Algorithm

    Authors: Tsui-Wei Weng, Huan Zhang, Pin-Yu Chen, Aurelie Lozano, Cho-Jui Hsieh, Luca Daniel

    Abstract: CLEVER (Cross-Lipschitz Extreme Value for nEtwork Robustness) is an Extreme Value Theory (EVT) based robustness score for large-scale deep neural networks (DNNs). In this paper, we propose two extensions on this robustness score. First, we provide a new formal robustness guarantee for classifier functions that are twice differentiable. We apply extreme value theory on the new formal robustness gua… ▽ More

    Submitted 19 October, 2018; originally announced October 2018.

    Comments: Accepted by GlobalSIP 2018. Tsui-Wei Weng and Huan Zhang contributed equally

  46. arXiv:1804.09699  [pdf, other

    stat.ML cs.CR cs.CV cs.LG

    Towards Fast Computation of Certified Robustness for ReLU Networks

    Authors: Tsui-Wei Weng, Huan Zhang, Hongge Chen, Zhao Song, Cho-Jui Hsieh, Duane Boning, Inderjit S. Dhillon, Luca Daniel

    Abstract: Verifying the robustness property of a general Rectified Linear Unit (ReLU) network is an NP-complete problem [Katz, Barrett, Dill, Julian and Kochenderfer CAV17]. Although finding the exact minimum adversarial distortion is hard, giving a certified lower bound of the minimum distortion is possible. Current available methods of computing such a bound are either time-consuming or delivering low qua… ▽ More

    Submitted 2 October, 2018; v1 submitted 25 April, 2018; originally announced April 2018.

    Comments: Tsui-Wei Weng and Huan Zhang contributed equally

  47. arXiv:1801.10578  [pdf, other

    stat.ML cs.CR cs.LG

    Evaluating the Robustness of Neural Networks: An Extreme Value Theory Approach

    Authors: Tsui-Wei Weng, Huan Zhang, Pin-Yu Chen, Jinfeng Yi, Dong Su, Yupeng Gao, Cho-Jui Hsieh, Luca Daniel

    Abstract: The robustness of neural networks to adversarial examples has received great attention due to security implications. Despite various attack approaches to crafting visually imperceptible adversarial examples, little has been developed towards a comprehensive measure of robustness. In this paper, we provide a theoretical justification for converting robustness analysis into a local Lipschitz constan… ▽ More

    Submitted 31 January, 2018; originally announced January 2018.

    Comments: Accepted by Sixth International Conference on Learning Representations (ICLR 2018). Tsui-Wei Weng and Huan Zhang contributed equally

  48. arXiv:1701.03259  [pdf, ps, other

    physics.soc-ph cs.SI

    Multitarget search on complex networks: A logarithmic growth of global mean random cover time

    Authors: Tongfeng Weng, Jie Zhang, Michael Small, Ji Yang, Farshid Hassani Bijarbooneh, Pan Hui

    Abstract: We investigate multitarget search on complex networks and derive an exact expression for the mean random cover time that quantifies the expected time a walker needs to visit multiple targets. Based on this, we recover and extend some interesting results of multitarget search on networks. Specifically, we observe the logarithmic increase of the global mean random cover time with the target number f… ▽ More

    Submitted 12 September, 2017; v1 submitted 12 January, 2017; originally announced January 2017.

    Journal ref: Chaos 27, 093103 (2017)

  49. arXiv:1611.02256  [pdf, ps, other

    cs.CE math.NA stat.CO

    A Big-Data Approach to Handle Many Process Variations: Tensor Recovery and Applications

    Authors: Zheng Zhang, Tsui-Wei Weng, Luca Daniel

    Abstract: Fabrication process variations are a major source of yield degradation in the nano-scale design of integrated circuits (IC), microelectromechanical systems (MEMS) and photonic circuits. Stochastic spectral methods are a promising technique to quantify the uncertainties caused by process variations. Despite their superior efficiency over Monte Carlo for many design cases, these algorithms suffer fr… ▽ More

    Submitted 7 November, 2016; originally announced November 2016.

    Comments: 8 figures

    Journal ref: IEEE Transactions on Component, Packaging and Manufacturing Technology, 2017

  50. arXiv:1610.02878  [pdf, ps, other

    physics.soc-ph cs.SI

    Navigation by anomalous random walks on complex networks

    Authors: Tongfeng Weng, Jie Zhang, Moein Khajehnejad, Michael Small, Rui Zheng, Pan Hui

    Abstract: Anomalous random walks having long-range jumps are a critical branch of dynamical processes on networks, which can model a number of search and transport processes. However, traditional measurements based on mean first passage time are not useful as they fail to characterize the cost associated with each jump. Here we introduce a new concept of mean first traverse distance (MFTD) to characterize a… ▽ More

    Submitted 10 October, 2016; originally announced October 2016.