Zum Hauptinhalt springen

Showing 1–50 of 66 results for author: Weng, T

.
  1. arXiv:2408.15105  [pdf

    cond-mat.mtrl-sci cond-mat.dis-nn cond-mat.other cond-mat.str-el

    Resolving the pressure induced 'self-insertion' in skutterudite CoSb3

    Authors: Bihan Wang, Anna Pakhomova, Saiana Khandarkhaeva, Mirtha Pillaca, Peter Gille, Zhe Ren, Dmitry Lapkin, Dameli Assalauova, Pavel Alexeev, Ilya Sergeev, Satishkumar Kulkarni, Tsu-Chien Weng, Michael Sprung, Hanns-Peter Liermann, Ivan A. Vartanyants, Konstantin Glazyrin

    Abstract: CoSb3, a skutterudite compound, is key in studying thermoelectric materials. Under compression, it undergoes a 'self-insertion' isostructural transition, redistributing large Sb atoms among crystallographic sites. We investigated CoSb3's structural stability up to 70 GPa using single crystal X-ray diffraction and high-resolution X-ray scattering, including Bragg Coherent Diffraction Imaging. We ex… ▽ More

    Submitted 27 August, 2024; originally announced August 2024.

    Comments: Manuscript: 27 pages, 12 Figures

  2. arXiv:2408.01432  [pdf, other

    cs.CV cs.LG

    VLG-CBM: Training Concept Bottleneck Models with Vision-Language Guidance

    Authors: Divyansh Srivastava, Ge Yan, Tsui-Wei Weng

    Abstract: Concept Bottleneck Models (CBMs) provide interpretable prediction by introducing an intermediate Concept Bottleneck Layer (CBL), which encodes human-understandable concepts to explain models' decision. Recent works proposed to utilize Large Language Models (LLMs) and pre-trained Vision-Language Models (VLMs) to automate the training of CBMs, making it more scalable and automated. However, existing… ▽ More

    Submitted 18 July, 2024; originally announced August 2024.

  3. arXiv:2407.04307  [pdf, other

    cs.CL cs.LG

    Crafting Large Language Models for Enhanced Interpretability

    Authors: Chung-En Sun, Tuomas Oikarinen, Tsui-Wei Weng

    Abstract: We introduce the Concept Bottleneck Large Language Model (CB-LLM), a pioneering approach to creating inherently interpretable Large Language Models (LLMs). Unlike traditional black-box LLMs that rely on post-hoc interpretation methods with limited neuron function insights, CB-LLM sets a new standard with its built-in interpretability, scalability, and ability to provide clear, accurate explanation… ▽ More

    Submitted 5 July, 2024; originally announced July 2024.

    Comments: Present at ICML 2024 Mechanistic Interpretability (MI) Workshop

  4. arXiv:2406.18062  [pdf, other

    cs.LG cs.AI

    Breaking the Barrier: Enhanced Utility and Robustness in Smoothed DRL Agents

    Authors: Chung-En Sun, Sicun Gao, Tsui-Wei Weng

    Abstract: Robustness remains a paramount concern in deep reinforcement learning (DRL), with randomized smoothing emerging as a key technique for enhancing this attribute. However, a notable gap exists in the performance of current smoothed DRL agents, often characterized by significantly low clean rewards and weak robustness. In response to this challenge, our study introduces innovative algorithms aimed at… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

    Comments: Published in ICML 2024

  5. arXiv:2406.16990  [pdf, other

    cs.SD cs.AI eess.AS

    AND: Audio Network Dissection for Interpreting Deep Acoustic Models

    Authors: Tung-Yu Wu, Yu-Xiang Lin, Tsui-Wei Weng

    Abstract: Neuron-level interpretations aim to explain network behaviors and properties by investigating neurons responsive to specific perceptual or structural input patterns. Although there is emerging work in the vision and language domains, none is explored for acoustic models. To bridge the gap, we introduce $\textit{AND}$, the first $\textbf{A}$udio $\textbf{N}$etwork $\textbf{D}$issection framework th… ▽ More

    Submitted 26 June, 2024; v1 submitted 24 June, 2024; originally announced June 2024.

    Comments: Accepted by ICML'24

    Journal ref: Forty-first International Conference on Machine Learning (2024)

  6. arXiv:2406.02240  [pdf, other

    cs.NI

    Quantum Computing in Wireless Communications and Networking: A Tutorial-cum-Survey

    Authors: Wei Zhao, Tangjie Weng, Yue Ruan, Zhi Liu, Xuangou Wu, Xiao Zheng, Nei Kato

    Abstract: Owing to its outstanding parallel computing capabilities, quantum computing (QC) has been a subject of continuous attention. With the gradual maturation of QC platforms, it has increasingly played a significant role in various fields such as transportation, pharmaceuticals, and industrial manufacturing,achieving unprecedented milestones. In modern society, wireless communication stands as an indis… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

  7. arXiv:2405.06855  [pdf, other

    cs.LG cs.CV

    Linear Explanations for Individual Neurons

    Authors: Tuomas Oikarinen, Tsui-Wei Weng

    Abstract: In recent years many methods have been developed to understand the internal workings of neural networks, often by describing the function of individual neurons in the model. However, these methods typically only focus on explaining the very highest activations of a neuron. In this paper we show this is not sufficient, and that the highest activation range is only responsible for a very small perce… ▽ More

    Submitted 10 May, 2024; originally announced May 2024.

    Comments: Published in ICML 2024

  8. arXiv:2404.19651  [pdf, other

    cs.LG cs.AI cs.CV

    Provably Robust Conformal Prediction with Improved Efficiency

    Authors: Ge Yan, Yaniv Romano, Tsui-Wei Weng

    Abstract: Conformal prediction is a powerful tool to generate uncertainty sets with guaranteed coverage using any predictive model, under the assumption that the training and test data are i.i.d.. Recently, it has been shown that adversarial examples are able to manipulate conformal methods to construct prediction sets with invalid coverage rates, as the i.i.d. assumption is violated. To address this issue,… ▽ More

    Submitted 30 April, 2024; originally announced April 2024.

  9. arXiv:2403.13771  [pdf, other

    cs.CV cs.LG

    Describe-and-Dissect: Interpreting Neurons in Vision Networks with Language Models

    Authors: Nicholas Bai, Rahul A. Iyer, Tuomas Oikarinen, Tsui-Wei Weng

    Abstract: In this paper, we propose Describe-and-Dissect (DnD), a novel method to describe the roles of hidden neurons in vision networks. DnD utilizes recent advancements in multimodal deep learning to produce complex natural language descriptions, without the need for labeled training data or a predefined set of concepts to choose from. Additionally, DnD is training-free, meaning we don't train any new mo… ▽ More

    Submitted 20 March, 2024; originally announced March 2024.

  10. arXiv:2312.10469  [pdf, other

    cs.LG stat.ML

    One step closer to unbiased aleatoric uncertainty estimation

    Authors: Wang Zhang, Ziwen Ma, Subhro Das, Tsui-Wei Weng, Alexandre Megretski, Luca Daniel, Lam M. Nguyen

    Abstract: Neural networks are powerful tools in various applications, and quantifying their uncertainty is crucial for reliable decision-making. In the deep learning field, the uncertainties are usually categorized into aleatoric (data) and epistemic (model) uncertainty. In this paper, we point out that the existing popular variance attenuation method highly overestimates aleatoric uncertainty. To address t… ▽ More

    Submitted 20 December, 2023; v1 submitted 16 December, 2023; originally announced December 2023.

  11. arXiv:2311.18496  [pdf, other

    cs.CV

    Accurate Segmentation of Optic Disc And Cup from Multiple Pseudo-labels by Noise-aware Learning

    Authors: Tengjin Weng, Yang Shen, Zhidong Zhao, Zhiming Cheng, Shuai Wang

    Abstract: Optic disc and cup segmentation plays a crucial role in automating the screening and diagnosis of optic glaucoma. While data-driven convolutional neural networks (CNNs) show promise in this area, the inherent ambiguity of segmenting objects and background boundaries in the task of optic disc and cup segmentation leads to noisy annotations that impact model performance. To address this, we propose… ▽ More

    Submitted 15 March, 2024; v1 submitted 30 November, 2023; originally announced November 2023.

    Comments: CSCWD 2024

  12. arXiv:2311.11669  [pdf, other

    cs.CV

    PMP-Swin: Multi-Scale Patch Message Passing Swin Transformer for Retinal Disease Classification

    Authors: Zhihan Yang, Zhiming Cheng, Tengjin Weng, Shucheng He, Yaqi Wang, Xin Ye, Shuai Wang

    Abstract: Retinal disease is one of the primary causes of visual impairment, and early diagnosis is essential for preventing further deterioration. Nowadays, many works have explored Transformers for diagnosing diseases due to their strong visual representation capabilities. However, retinal diseases exhibit milder forms and often present with overlapping signs, which pose great difficulties for accurate mu… ▽ More

    Submitted 20 November, 2023; originally announced November 2023.

    Comments: 9 pages, 7 figures

  13. arXiv:2311.10380  [pdf, other

    cs.CV

    MSE-Nets: Multi-annotated Semi-supervised Ensemble Networks for Improving Segmentation of Medical Image with Ambiguous Boundaries

    Authors: Shuai Wang, Tengjin Weng, Jingyi Wang, Yang Shen, Zhidong Zhao, Yixiu Liu, Pengfei Jiao, Zhiming Cheng, Yaqi Wang

    Abstract: Medical image segmentation annotations exhibit variations among experts due to the ambiguous boundaries of segmented objects and backgrounds in medical images. Although using multiple annotations for each image in the fully-supervised has been extensively studied for training deep models, obtaining a large amount of multi-annotated data is challenging due to the substantial time and manpower costs… ▽ More

    Submitted 17 November, 2023; originally announced November 2023.

  14. arXiv:2310.16332  [pdf, other

    cs.LG

    Corrupting Neuron Explanations of Deep Visual Features

    Authors: Divyansh Srivastava, Tuomas Oikarinen, Tsui-Wei Weng

    Abstract: The inability of DNNs to explain their black-box behavior has led to a recent surge of explainability methods. However, there are growing concerns that these explainability methods are not robust and trustworthy. In this work, we perform the first robustness analysis of Neuron Explanation Methods under a unified pipeline and show that these explanations can be significantly corrupted by random noi… ▽ More

    Submitted 24 October, 2023; originally announced October 2023.

    Journal ref: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2023, pp. 1877-1886

  15. arXiv:2310.07780  [pdf, other

    cs.LG

    Promoting Robustness of Randomized Smoothing: Two Cost-Effective Approaches

    Authors: Linbo Liu, Trong Nghia Hoang, Lam M. Nguyen, Tsui-Wei Weng

    Abstract: Randomized smoothing has recently attracted attentions in the field of adversarial robustness to provide provable robustness guarantees on smoothed neural network classifiers. However, existing works show that vanilla randomized smoothing usually does not provide good robustness performance and often requires (re)training techniques on the base classifier in order to boost the robustness of the re… ▽ More

    Submitted 11 October, 2023; originally announced October 2023.

  16. arXiv:2310.06200  [pdf, other

    cs.CL cs.LG

    The Importance of Prompt Tuning for Automated Neuron Explanations

    Authors: Justin Lee, Tuomas Oikarinen, Arjun Chatha, Keng-Chi Chang, Yilan Chen, Tsui-Wei Weng

    Abstract: Recent advances have greatly increased the capabilities of large language models (LLMs), but our understanding of the models and their safety has not progressed as fast. In this paper we aim to understand LLMs deeper by studying their individual neurons. We build upon previous work showing large language models such as GPT-4 can be useful in explaining what each neuron in a language model does. Sp… ▽ More

    Submitted 11 October, 2023; v1 submitted 9 October, 2023; originally announced October 2023.

  17. arXiv:2308.12820  [pdf, other

    cs.LG cs.CY stat.ML

    Prediction without Preclusion: Recourse Verification with Reachable Sets

    Authors: Avni Kothari, Bogdan Kulynych, Tsui-Wei Weng, Berk Ustun

    Abstract: Machine learning models are often used to decide who receives a loan, a job interview, or a public benefit. Models in such settings use features without considering their actionability. As a result, they can assign predictions that are fixed $-$ meaning that individuals who are denied loans and interviews are, in fact, precluded from access to credit and employment. In this work, we introduce a pr… ▽ More

    Submitted 1 May, 2024; v1 submitted 24 August, 2023; originally announced August 2023.

    Comments: ICLR 2024 Spotlight. The first two authors contributed equally

  18. arXiv:2306.02582  [pdf, other

    cs.CV

    Enhancing Point Annotations with Superpixel and Confidence Learning Guided for Improving Semi-Supervised OCT Fluid Segmentation

    Authors: Tengjin Weng, Yang Shen, Kai Jin, Zhiming Cheng, Yunxiang Li, Gewen Zhang, Shuai Wang, Yaqi Wang

    Abstract: Automatic segmentation of fluid in Optical Coherence Tomography (OCT) images is beneficial for ophthalmologists to make an accurate diagnosis. Although semi-supervised OCT fluid segmentation networks enhance their performance by introducing additional unlabeled data, the performance enhancement is limited. To address this, we propose Superpixel and Confident Learning Guide Point Annotations Networ… ▽ More

    Submitted 30 November, 2023; v1 submitted 5 June, 2023; originally announced June 2023.

    Comments: Submission to BSPC

  19. arXiv:2304.13346  [pdf, other

    cs.LG cs.CV

    Concept-Monitor: Understanding DNN training through individual neurons

    Authors: Mohammad Ali Khan, Tuomas Oikarinen, Tsui-Wei Weng

    Abstract: In this work, we propose a general framework called Concept-Monitor to help demystify the black-box DNN training processes automatically using a novel unified embedding space and concept diversity metric. Concept-Monitor enables human-interpretable visualization and indicators of the DNN training processes and facilitates transparency as well as deeper understanding on how DNNs develop along the d… ▽ More

    Submitted 26 April, 2023; originally announced April 2023.

  20. arXiv:2304.12584  [pdf, other

    physics.optics cs.LG

    Learning imaging mechanism directly from optical microscopy observations

    Authors: Ze-Hao Wang, Long-Kun Shan, Tong-Tian Weng, Tian-Long Chen, Qi-Yu Wang, Xiang-Dong Chen, Zhang-Yang Wang, Guang-Can Guo, Fang-Wen Sun

    Abstract: Optical microscopy image plays an important role in scientific research through the direct visualization of the nanoworld, where the imaging mechanism is described as the convolution of the point spread function (PSF) and emitters. Based on a priori knowledge of the PSF or equivalent PSF, it is possible to achieve more precise exploration of the nanoworld. However, it is an outstanding challenge t… ▽ More

    Submitted 25 April, 2023; originally announced April 2023.

  21. arXiv:2304.06129  [pdf, other

    cs.LG cs.CV

    Label-Free Concept Bottleneck Models

    Authors: Tuomas Oikarinen, Subhro Das, Lam M. Nguyen, Tsui-Wei Weng

    Abstract: Concept bottleneck models (CBM) are a popular way of creating more interpretable neural networks by having hidden layer neurons correspond to human-understandable concepts. However, existing CBMs and their variants have two crucial limitations: first, they need to collect labeled data for each of the predefined concepts, which is time consuming and labor intensive; second, the accuracy of a CBM is… ▽ More

    Submitted 5 June, 2023; v1 submitted 12 April, 2023; originally announced April 2023.

    Comments: Published at ICLR 2023. New v2(5 June 2023): added crowdsourced human study in Appendix B

  22. arXiv:2304.00601  [pdf, other

    cs.CV cs.LG

    Constructive Assimilation: Boosting Contrastive Learning Performance through View Generation Strategies

    Authors: Ligong Han, Seungwook Han, Shivchander Sudalairaj, Charlotte Loh, Rumen Dangovski, Fei Deng, Pulkit Agrawal, Dimitris Metaxas, Leonid Karlinsky, Tsui-Wei Weng, Akash Srivastava

    Abstract: Transformations based on domain expertise (expert transformations), such as random-resized-crop and color-jitter, have proven critical to the success of contrastive learning techniques such as SimCLR. Recently, several attempts have been made to replace such domain-specific, human-designed transformations with generated views that are learned. However for imagery data, so far none of these view-ge… ▽ More

    Submitted 8 April, 2023; v1 submitted 2 April, 2023; originally announced April 2023.

    Comments: Accepted at Generative Models for Computer Vision Workshop 2023

  23. arXiv:2302.05783  [pdf, other

    cs.LG

    ConCerNet: A Contrastive Learning Based Framework for Automated Conservation Law Discovery and Trustworthy Dynamical System Prediction

    Authors: Wang Zhang, Tsui-Wei Weng, Subhro Das, Alexandre Megretski, Luca Daniel, Lam M. Nguyen

    Abstract: Deep neural networks (DNN) have shown great capacity of modeling a dynamical system; nevertheless, they usually do not obey physics constraints such as conservation laws. This paper proposes a new learning framework named ConCerNet to improve the trustworthiness of the DNN based dynamics modeling to endow the invariant properties. ConCerNet consists of two steps: (i) a contrastive learning method… ▽ More

    Submitted 19 July, 2023; v1 submitted 11 February, 2023; originally announced February 2023.

    Comments: Accepted by ICML 2023

  24. arXiv:2301.11324  [pdf, other

    cs.LG

    Certified Interpretability Robustness for Class Activation Mapping

    Authors: Alex Gu, Tsui-Wei Weng, Pin-Yu Chen, Sijia Liu, Luca Daniel

    Abstract: Interpreting machine learning models is challenging but crucial for ensuring the safety of deep networks in autonomous driving systems. Due to the prevalence of deep learning based perception models in autonomous vehicles, accurately interpreting their predictions is crucial. While a variety of such methods have been proposed, most are shown to lack robustness. Yet, little has been done to provide… ▽ More

    Submitted 26 January, 2023; originally announced January 2023.

    Comments: 13 pages, 5 figures. Accepted to Machine Learning for Autonomous Driving Workshop at NeurIPS 2020

  25. arXiv:2211.02647  [pdf, other

    cs.RO

    Neural Grasp Distance Fields for Robot Manipulation

    Authors: Thomas Weng, David Held, Franziska Meier, Mustafa Mukadam

    Abstract: We formulate grasp learning as a neural field and present Neural Grasp Distance Fields (NGDF). Here, the input is a 6D pose of a robot end effector and output is a distance to a continuous manifold of valid grasps for an object. In contrast to current approaches that predict a set of discrete candidate grasps, the distance-based NGDF representation is easily interpreted as a cost, and minimizing t… ▽ More

    Submitted 28 December, 2023; v1 submitted 4 November, 2022; originally announced November 2022.

    Comments: Accepted to ICRA 2023

  26. arXiv:2210.11513  [pdf, other

    cs.LG

    Learning Sample Reweighting for Accuracy and Adversarial Robustness

    Authors: Chester Holtz, Tsui-Wei Weng, Gal Mishne

    Abstract: There has been great interest in enhancing the robustness of neural network classifiers to defend against adversarial perturbations through adversarial training, while balancing the trade-off between robust accuracy and standard accuracy. We propose a novel adversarial training framework that learns to reweight the loss associated with individual training samples based on a notion of class-conditi… ▽ More

    Submitted 20 October, 2022; originally announced October 2022.

  27. arXiv:2207.13891  [pdf, other

    cs.RO

    Quantifying Safety of Learning-based Self-Driving Control Using Almost-Barrier Functions

    Authors: Zhizhen Qin, Tsui-Wei Weng, Sicun Gao

    Abstract: Path-tracking control of self-driving vehicles can benefit from deep learning for tackling longstanding challenges such as nonlinearity and uncertainty. However, deep neural controllers lack safety guarantees, restricting their practical use. We propose a new approach of learning almost-barrier functions, which approximately characterizes the forward invariant set for the system under neural contr… ▽ More

    Submitted 8 August, 2022; v1 submitted 28 July, 2022; originally announced July 2022.

    Journal ref: International Conference on Intelligent Robots and Systems, IROS 2022, Kyoto, Japan, Oct 23 - Oct. 27, 2022

  28. arXiv:2207.11196  [pdf, other

    cs.RO

    Learning to Singulate Layers of Cloth using Tactile Feedback

    Authors: Sashank Tirumala, Thomas Weng, Daniel Seita, Oliver Kroemer, Zeynep Temel, David Held

    Abstract: Robotic manipulation of cloth has applications ranging from fabrics manufacturing to handling blankets and laundry. Cloth manipulation is challenging for robots largely due to their high degrees of freedom, complex dynamics, and severe self-occlusions when in folded or crumpled configurations. Prior work on robotic manipulation of cloth relies primarily on vision sensors alone, which may pose chal… ▽ More

    Submitted 22 July, 2022; originally announced July 2022.

    Comments: IROS 2022. See https://sites.google.com/view/reskin-cloth for supplementary material

  29. arXiv:2207.09249  [pdf, other

    cs.CY cs.DC

    Fabric-GC: A Blockchain-based Gantt Chart System for Cross-organizational Project Management

    Authors: Dun Li, Dezhi Han, Benhui Xia, Tien-Hsiung Weng, Arcangelo Castiglione, Kuan-Ching Li

    Abstract: Large-scale production is always associated with more and more development and interaction among peers, and many fields achieve higher economic benefits through project cooperation. However, project managers in the traditional centralized approach cannot rearrange their activities to cross-organizational project management. Thanks to its characteristics, the Blockchain can represent a valid soluti… ▽ More

    Submitted 19 July, 2022; originally announced July 2022.

  30. arXiv:2204.10965  [pdf, other

    cs.CV cs.AI cs.LG

    CLIP-Dissect: Automatic Description of Neuron Representations in Deep Vision Networks

    Authors: Tuomas Oikarinen, Tsui-Wei Weng

    Abstract: In this paper, we propose CLIP-Dissect, a new technique to automatically describe the function of individual hidden neurons inside vision networks. CLIP-Dissect leverages recent advances in multimodal vision/language models to label internal neurons with open-ended concepts without the need for any labeled data or human examples. We show that CLIP-Dissect provides more accurate descriptions than e… ▽ More

    Submitted 5 June, 2023; v1 submitted 22 April, 2022; originally announced April 2022.

    Comments: Published in ICLR 2023 Conference (Spotlight). New v5(5 June 2023) - Added crowdsourced user study in Appendix B, not included in ICLR publication

  31. arXiv:2202.03558  [pdf, other

    cs.LG cs.AI

    Attacking c-MARL More Effectively: A Data Driven Approach

    Authors: Nhan H. Pham, Lam M. Nguyen, Jie Chen, Hoang Thanh Lam, Subhro Das, Tsui-Wei Weng

    Abstract: In recent years, a proliferation of methods were developed for cooperative multi-agent reinforcement learning (c-MARL). However, the robustness of c-MARL agents against adversarial attacks has been rarely explored. In this paper, we propose to evaluate the robustness of c-MARL agents via a model-based approach, named c-MBA. Our proposed formulation can craft much stronger adversarial state perturb… ▽ More

    Submitted 10 September, 2023; v1 submitted 7 February, 2022; originally announced February 2022.

  32. arXiv:2111.08752  [pdf, other

    cond-mat.mtrl-sci cond-mat.str-el

    The relativistic 5f electronic structure of delocalized $α$-U and localized $δ$-Pu from the self consistent vertex corrected GW approach and X-ray Emission Spectroscopy

    Authors: A. L. Kutepov, J. G. Tobin, S. -W. Yu, B. W. Chung, P. Roussel, S. Nowak, R. Alonso-Mori, T. Kroll, D. Nordlund, T. -C. Weng, D. Sokaras

    Abstract: The recently developed self-consistent vertex corrected GW method is used to calculate the 5f electronic structure in delocalized $α$-U and localized $δ$-Pu, each of which is confirmed by the historical experimental approaches of direct and inverse photoemission. Tender X-Ray Emission Spectroscopy (XES), in a novel application to 5f electronic structure, is used to experimentally prove the existen… ▽ More

    Submitted 16 November, 2021; originally announced November 2021.

    Comments: 13 pages, 11 figures, 7 tables

  33. arXiv:2111.06063  [pdf, other

    stat.ML cs.CV cs.LG math.OC

    On the Equivalence between Neural Network and Support Vector Machine

    Authors: Yilan Chen, Wei Huang, Lam M. Nguyen, Tsui-Wei Weng

    Abstract: Recent research shows that the dynamics of an infinitely wide neural network (NN) trained by gradient descent can be characterized by Neural Tangent Kernel (NTK) \citep{jacot2018neural}. Under the squared loss, the infinite-width NN trained by gradient descent with an infinitely small learning rate is equivalent to kernel regression with NTK \citep{arora2019exact}. However, the equivalence is only… ▽ More

    Submitted 7 July, 2022; v1 submitted 11 November, 2021; originally announced November 2021.

    Comments: 35th Conference on Neural Information Processing Systems (NeurIPS 2021)

  34. arXiv:2111.05623  [pdf, other

    cs.RO cs.CV

    FabricFlowNet: Bimanual Cloth Manipulation with a Flow-based Policy

    Authors: Thomas Weng, Sujay Bajracharya, Yufei Wang, Khush Agrawal, David Held

    Abstract: We address the problem of goal-directed cloth manipulation, a challenging task due to the deformability of cloth. Our insight is that optical flow, a technique normally used for motion estimation in video, can also provide an effective representation for corresponding cloth poses across observation and goal images. We introduce FabricFlowNet (FFN), a cloth manipulation policy that leverages flow a… ▽ More

    Submitted 10 April, 2022; v1 submitted 10 November, 2021; originally announced November 2021.

    Comments: CoRL 2021

  35. arXiv:2102.10454  [pdf, other

    cs.LG cs.AI cs.CV

    On Fast Adversarial Robustness Adaptation in Model-Agnostic Meta-Learning

    Authors: Ren Wang, Kaidi Xu, Sijia Liu, Pin-Yu Chen, Tsui-Wei Weng, Chuang Gan, Meng Wang

    Abstract: Model-agnostic meta-learning (MAML) has emerged as one of the most successful meta-learning techniques in few-shot learning. It enables us to learn a meta-initialization} of model parameters (that we call meta-model) to rapidly adapt to new tasks using a small amount of labeled training data. Despite the generalization power of the meta-model, it remains elusive that how adversarial robustness can… ▽ More

    Submitted 20 February, 2021; originally announced February 2021.

  36. arXiv:2102.01208  [pdf, ps, other

    cs.LG stat.ML

    Fast Training of Provably Robust Neural Networks by SingleProp

    Authors: Akhilan Boopathy, Tsui-Wei Weng, Sijia Liu, Pin-Yu Chen, Gaoyuan Zhang, Luca Daniel

    Abstract: Recent works have developed several methods of defending neural networks against adversarial attacks with certified guarantees. However, these techniques can be computationally costly due to the use of certification during training. We develop a new regularizer that is both more efficient than existing certified defenses, requiring only one additional forward propagation through a network, and can… ▽ More

    Submitted 1 February, 2021; originally announced February 2021.

    Comments: Published at AAAI 2021

  37. arXiv:2010.06651  [pdf, other

    cs.LG stat.ML

    Higher-Order Certification for Randomized Smoothing

    Authors: Jeet Mohapatra, Ching-Yun Ko, Tsui-Wei Weng, Pin-Yu Chen, Sijia Liu, Luca Daniel

    Abstract: Randomized smoothing is a recently proposed defense against adversarial attacks that has achieved SOTA provable robustness against $\ell_2$ perturbations. A number of publications have extended the guarantees to other metrics, such as $\ell_1$ or $\ell_\infty$, by using different smoothing measures. Although the current framework has been shown to yield near-optimal $\ell_p$ radii, the total safet… ▽ More

    Submitted 13 October, 2020; originally announced October 2020.

    Comments: Accepted to NeurIPS2020(spotlight)

  38. Cloth Region Segmentation for Robust Grasp Selection

    Authors: Jianing Qian, Thomas Weng, Luxin Zhang, Brian Okorn, David Held

    Abstract: Cloth detection and manipulation is a common task in domestic and industrial settings, yet such tasks remain a challenge for robots due to cloth deformability. Furthermore, in many cloth-related tasks like laundry folding and bed making, it is crucial to manipulate specific regions like edges and corners, as opposed to folds. In this work, we focus on the problem of segmenting and grasping these k… ▽ More

    Submitted 12 August, 2020; originally announced August 2020.

    Comments: Accepted at IROS 2020. The first two authors contributed equally and are listed in alphabetical order

    Journal ref: 2020 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)

  39. arXiv:2008.01976  [pdf, other

    cs.LG cs.AI cs.CR stat.ML

    Robust Deep Reinforcement Learning through Adversarial Loss

    Authors: Tuomas Oikarinen, Wang Zhang, Alexandre Megretski, Luca Daniel, Tsui-Wei Weng

    Abstract: Recent studies have shown that deep reinforcement learning agents are vulnerable to small adversarial perturbations on the agent's inputs, which raises concerns about deploying such agents in the real world. To address this issue, we propose RADIAL-RL, a principled framework to train reinforcement learning agents with improved robustness against $l_p$-norm bounded adversarial attacks. Our framewor… ▽ More

    Submitted 10 November, 2021; v1 submitted 5 August, 2020; originally announced August 2020.

  40. Multi-modal Transfer Learning for Grasping Transparent and Specular Objects

    Authors: Thomas Weng, Amith Pallankize, Yimin Tang, Oliver Kroemer, David Held

    Abstract: State-of-the-art object grasping methods rely on depth sensing to plan robust grasps, but commercially available depth sensors fail to detect transparent and specular objects. To improve grasping performance on such objects, we introduce a method for learning a multi-modal perception model by bootstrapping from an existing uni-modal model. This transfer learning approach requires only a pre-existi… ▽ More

    Submitted 29 May, 2020; originally announced June 2020.

    Comments: RA-L with presentation at ICRA 2020

    Journal ref: IEEE ROBOTICS AND AUTOMATION LETTERS, VOL. 5, NO. 3, JULY 2020. 3791-3798

  41. arXiv:1908.06353  [pdf, other

    cs.LG stat.ML

    Verification of Neural Network Control Policy Under Persistent Adversarial Perturbation

    Authors: Yuh-Shyang Wang, Tsui-Wei Weng, Luca Daniel

    Abstract: Deep neural networks are known to be fragile to small adversarial perturbations. This issue becomes more critical when a neural network is interconnected with a physical system in a closed loop. In this paper, we show how to combine recent works on neural network certification tools (which are mainly used in static settings such as image classification) with robust control theory to certify a neur… ▽ More

    Submitted 17 August, 2019; originally announced August 2019.

  42. arXiv:1906.04214  [pdf, other

    cs.LG cs.CR cs.SI stat.ML

    Topology Attack and Defense for Graph Neural Networks: An Optimization Perspective

    Authors: Kaidi Xu, Hongge Chen, Sijia Liu, Pin-Yu Chen, Tsui-Wei Weng, Mingyi Hong, Xue Lin

    Abstract: Graph neural networks (GNNs) which apply the deep neural networks to graph data have achieved significant performance for the task of semi-supervised node classification. However, only few work has addressed the adversarial robustness of GNNs. In this paper, we first present a novel gradient-based attack method that facilitates the difficulty of tackling discrete graph data. When comparing to curr… ▽ More

    Submitted 14 October, 2019; v1 submitted 10 June, 2019; originally announced June 2019.

    Comments: Accepted by IJCAI 2019, the 28th International Joint Conference on Artificial Intelligence

    Journal ref: International Joint Conference on Artificial Intelligence (IJCAI-2019)

  43. arXiv:1905.07387  [pdf, other

    cs.LG cs.CR cs.CV stat.ML

    POPQORN: Quantifying Robustness of Recurrent Neural Networks

    Authors: Ching-Yun Ko, Zhaoyang Lyu, Tsui-Wei Weng, Luca Daniel, Ngai Wong, Dahua Lin

    Abstract: The vulnerability to adversarial attacks has been a critical issue for deep neural networks. Addressing this issue requires a reliable way to evaluate the robustness of a network. Recently, several methods have been developed to compute $\textit{robustness quantification}$ for neural networks, namely, certified lower bounds of the minimum adversarial perturbation. Such methods, however, were devis… ▽ More

    Submitted 17 May, 2019; originally announced May 2019.

    Comments: 10 pages, Ching-Yun Ko and Zhaoyang Lyu contributed equally, accepted to ICML 2019. Please see arXiv source codes for the appendix by clicking [Other formats]

  44. arXiv:1902.07543  [pdf

    cond-mat.mes-hall cond-mat.mtrl-sci physics.comp-ph

    What retards the response of graphene based gaseous sensor

    Authors: Hui-Fen Zhang, Bo-Yuan Ning, Tsu-Chien Weng, Dong-Ping Wu, Xi-Jing Ning

    Abstract: Graphene based sensor to gas molecules should be ultrasensitive and ultrafast because of the single-atomic thickness of graphene, while the response is not fast. Usually, the measured response time for many molecules, such as CO, NH3, SO2, CO2 and NO2 and so on, is on the scale of minutes or longer. In the present work, we found via \emph{ab initio} calculations there exists a potential barrier la… ▽ More

    Submitted 20 February, 2019; originally announced February 2019.

    Comments: 23 pages, 9 figures

    Journal ref: Sensors and Actuators A: Physical, 295(2019) 188-192

  45. arXiv:1902.07388  [pdf, ps, other

    cond-mat.stat-mech physics.comp-ph

    Comparison of two efficient methods for calculating partition functions

    Authors: Le-Cheng Gong, Bo-Yuan Ning, Tsu-Chien Weng, Xi-Jing Ning

    Abstract: In the long-time pursuit of the solution to calculate the partition function (or free energy) of condensed matter, Monte-Carlo-based nested sampling should be the state-of-the-art method, and very recently, we established a direct integral approach that works at least four orders faster. In present work, the above two methods were applied to solid argon at temperatures up to $300$K, and the derive… ▽ More

    Submitted 19 February, 2019; originally announced February 2019.

    Comments: 6 pages, 4 figures

  46. arXiv:1902.06248  [pdf

    cond-mat.mtrl-sci cond-mat.stat-mech physics.comp-ph

    Searching for the optimum conditions for silicene growth by calculations of the free energy

    Authors: Yu-Peng Liu, Bo-Yuan Ning, Le-Cheng Gong, Tsu-Chien Weng, Xi-Jing Ning

    Abstract: Very recently we developed an efficient method to calculate the free energy of 2D materials on substrates and achieved high calculation precision for graphene or $γ$-graphyne on copper substrates. In the present work, the method was further confirmed to be accurate by molecular dynamic simulations of silicene on Ag substrate using empirical potential and was applied to predict the optimum conditio… ▽ More

    Submitted 17 February, 2019; originally announced February 2019.

    Comments: 11 pages, 4 figures

    Journal ref: Nanomaterials, 9(2019) 978

  47. arXiv:1901.09205  [pdf

    cond-mat.mtrl-sci cond-mat.mes-hall physics.comp-ph

    Calculating the free energy of 2D materials on substrates

    Authors: Yu-Peng Liu, Bo-Yuan Ning, Le-Cheng Gong, Tsu-Chien Weng, Xi-Jing Ning

    Abstract: A method was developed to calculate the free energy of 2D materials on substrates and was demonstrated by the system of graphene and γ-graphyne on copper substrate. The method works at least 3 orders faster than state-of-the-art algorithms, and the accuracy was tested by molecular dynamics simulations, showing that the precision for calculations of the internal energy achieves up to 0.03% in a tem… ▽ More

    Submitted 26 January, 2019; originally announced January 2019.

    Comments: 16 pages, 4 figures

    Journal ref: Nanomaterials, 9(2019) 978

  48. arXiv:1901.08233  [pdf, ps, other

    cond-mat.stat-mech physics.comp-ph

    Solution to the key problem of statistical physics -- calculations of partition function of many-body systems

    Authors: Bo-Yuan Ning, Le-Cheng Gong, Tsu-Chien Weng, Xi-Jing Ning

    Abstract: The key problem of statistical physics standing over one hundred years is how to exactly calculate the partition function (or free energy) of many-body interaction systems, which severely hinders application of the theory for realistic systems. Here we present a novel approach that works at least four orders faster than state-of-the-art algorithms to the problem and can be applied to predict therm… ▽ More

    Submitted 25 April, 2019; v1 submitted 23 January, 2019; originally announced January 2019.

    Comments: Request of supplementary information should be addressed to BYN and XJN

  49. arXiv:1901.07648  [pdf, other

    math.OC cs.LG stat.ML

    Finite-Sum Smooth Optimization with SARAH

    Authors: Lam M. Nguyen, Marten van Dijk, Dzung T. Phan, Phuong Ha Nguyen, Tsui-Wei Weng, Jayant R. Kalagnanam

    Abstract: The total complexity (measured as the total number of gradient computations) of a stochastic first-order optimization algorithm that finds a first-order stationary point of a finite-sum smooth nonconvex objective function $F(w)=\frac{1}{n} \sum_{i=1}^n f_i(w)$ has been proven to be at least $Ω(\sqrt{n}/ε)$ for $n \leq \mathcal{O}(ε^{-2})$ where $ε$ denotes the attained accuracy… ▽ More

    Submitted 22 April, 2019; v1 submitted 22 January, 2019; originally announced January 2019.

  50. arXiv:1812.08329  [pdf, other

    cs.LG cs.CR stat.ML

    PROVEN: Certifying Robustness of Neural Networks with a Probabilistic Approach

    Authors: Tsui-Wei Weng, Pin-Yu Chen, Lam M. Nguyen, Mark S. Squillante, Ivan Oseledets, Luca Daniel

    Abstract: With deep neural networks providing state-of-the-art machine learning models for numerous machine learning tasks, quantifying the robustness of these models has become an important area of research. However, most of the research literature merely focuses on the \textit{worst-case} setting where the input of the neural network is perturbed with noises that are constrained within an $\ell_p$ ball; a… ▽ More

    Submitted 7 January, 2019; v1 submitted 18 December, 2018; originally announced December 2018.

    Comments: updated ref [25]