Zum Hauptinhalt springen

Showing 101–150 of 287 results for author: Anandkumar, A

.
  1. arXiv:2205.03017  [pdf, other

    cs.LG math.PR

    Generative Adversarial Neural Operators

    Authors: Md Ashiqur Rahman, Manuel A. Florez, Anima Anandkumar, Zachary E. Ross, Kamyar Azizzadenesheli

    Abstract: We propose the generative adversarial neural operator (GANO), a generative model paradigm for learning probabilities on infinite-dimensional function spaces. The natural sciences and engineering are known to have many types of data that are sampled from infinite-dimensional function spaces, where classical finite-dimensional deep generative adversarial networks (GANs) may not be directly applicabl… ▽ More

    Submitted 12 October, 2022; v1 submitted 6 May, 2022; originally announced May 2022.

    Comments: Transactions on Machine Learning Research 2022

  2. arXiv:2204.12451  [pdf, other

    cs.CV

    Understanding The Robustness in Vision Transformers

    Authors: Daquan Zhou, Zhiding Yu, Enze Xie, Chaowei Xiao, Anima Anandkumar, Jiashi Feng, Jose M. Alvarez

    Abstract: Recent studies show that Vision Transformers(ViTs) exhibit strong robustness against various corruptions. Although this property is partly attributed to the self-attention mechanism, there is still a lack of systematic understanding. In this paper, we examine the role of self-attention in learning robust representations. Our study is motivated by the intriguing properties of the emerging visual gr… ▽ More

    Submitted 8 November, 2022; v1 submitted 26 April, 2022; originally announced April 2022.

  3. arXiv:2204.11167  [pdf, other

    cs.CV cs.AI cs.LG

    RelViT: Concept-guided Vision Transformer for Visual Relational Reasoning

    Authors: Xiaojian Ma, Weili Nie, Zhiding Yu, Huaizu Jiang, Chaowei Xiao, Yuke Zhu, Song-Chun Zhu, Anima Anandkumar

    Abstract: Reasoning about visual relationships is central to how humans interpret the visual world. This task remains challenging for current deep learning algorithms since it requires addressing three key technical problems jointly: 1) identifying object entities and their properties, 2) inferring semantic relations between pairs of entities, and 3) generalizing to novel object-relation combinations, i.e.,… ▽ More

    Submitted 11 June, 2022; v1 submitted 23 April, 2022; originally announced April 2022.

    Comments: ICLR 2022; Code: https://github.com/NVlabs/RelViT

  4. arXiv:2204.05088  [pdf, other

    cs.CV

    M$^2$BEV: Multi-Camera Joint 3D Detection and Segmentation with Unified Birds-Eye View Representation

    Authors: Enze Xie, Zhiding Yu, Daquan Zhou, Jonah Philion, Anima Anandkumar, Sanja Fidler, Ping Luo, Jose M. Alvarez

    Abstract: In this paper, we propose M$^2$BEV, a unified framework that jointly performs 3D object detection and map segmentation in the Birds Eye View~(BEV) space with multi-camera image inputs. Unlike the majority of previous works which separately process detection and segmentation, M$^2$BEV infers both tasks with a unified model and improves efficiency. M$^2$BEV efficiently transforms multi-view 2D image… ▽ More

    Submitted 19 April, 2022; v1 submitted 11 April, 2022; originally announced April 2022.

    Comments: Tech Report

  5. arXiv:2203.08616  [pdf, other

    cs.OH cs.LG

    Generic Lithography Modeling with Dual-band Optics-Inspired Neural Networks

    Authors: Haoyu Yang, Zongyi Li, Kumara Sastry, Saumyadip Mukhopadhyay, Mark Kilgard, Anima Anandkumar, Brucek Khailany, Vivek Singh, Haoxing Ren

    Abstract: Lithography simulation is a critical step in VLSI design and optimization for manufacturability. Existing solutions for highly accurate lithography simulation with rigorous models are computationally expensive and slow, even when equipped with various approximation techniques. Recently, machine learning has provided alternative solutions for lithography simulation tasks such as coarse-grained edge… ▽ More

    Submitted 12 March, 2022; originally announced March 2022.

    Comments: 9 pages, 9 figures; accepted at 59th Design Automation Conference

  6. arXiv:2203.06856  [pdf, other

    cs.CV cs.AI cs.RO

    ACID: Action-Conditional Implicit Visual Dynamics for Deformable Object Manipulation

    Authors: Bokui Shen, Zhenyu Jiang, Christopher Choy, Leonidas J. Guibas, Silvio Savarese, Anima Anandkumar, Yuke Zhu

    Abstract: Manipulating volumetric deformable objects in the real world, like plush toys and pizza dough, bring substantial challenges due to infinite shape variations, non-rigid motions, and partial observability. We introduce ACID, an action-conditional visual dynamics model for volumetric deformable objects based on structured implicit neural representations. ACID integrates two new techniques: implicit r… ▽ More

    Submitted 5 August, 2022; v1 submitted 14 March, 2022; originally announced March 2022.

    Comments: RSS 2022 Best Student Paper Award Finalist. Please check out more details at https://b0ku1.github.io/acid/

    Journal ref: Robotics: Science and Systems (RSS), 2022

  7. arXiv:2202.12181  [pdf, other

    cs.CV

    FreeSOLO: Learning to Segment Objects without Annotations

    Authors: Xinlong Wang, Zhiding Yu, Shalini De Mello, Jan Kautz, Anima Anandkumar, Chunhua Shen, Jose M. Alvarez

    Abstract: Instance segmentation is a fundamental vision task that aims to recognize and segment each object in an image. However, it requires costly annotations such as bounding boxes and segmentation masks for learning. In this work, we propose a fully unsupervised learning method that learns class-agnostic instance segmentation without any annotations. We present FreeSOLO, a self-supervised instance segme… ▽ More

    Submitted 25 April, 2022; v1 submitted 24 February, 2022; originally announced February 2022.

    Comments: 13 pages. Accepted to IEEE/CVF Conf. Comp. Vision Pattern Recognition (CVPR) 2022

  8. arXiv:2202.11214  [pdf, other

    physics.ao-ph cs.LG

    FourCastNet: A Global Data-driven High-resolution Weather Model using Adaptive Fourier Neural Operators

    Authors: Jaideep Pathak, Shashank Subramanian, Peter Harrington, Sanjeev Raja, Ashesh Chattopadhyay, Morteza Mardani, Thorsten Kurth, David Hall, Zongyi Li, Kamyar Azizzadenesheli, Pedram Hassanzadeh, Karthik Kashinath, Animashree Anandkumar

    Abstract: FourCastNet, short for Fourier Forecasting Neural Network, is a global data-driven weather forecasting model that provides accurate short to medium-range global predictions at $0.25^{\circ}$ resolution. FourCastNet accurately forecasts high-resolution, fast-timescale variables such as the surface wind speed, precipitation, and atmospheric water vapor. It has important implications for planning win… ▽ More

    Submitted 22 February, 2022; originally announced February 2022.

  9. arXiv:2202.04173  [pdf, other

    cs.CL cs.AI cs.CY cs.LG

    Exploring the Limits of Domain-Adaptive Training for Detoxifying Large-Scale Language Models

    Authors: Boxin Wang, Wei Ping, Chaowei Xiao, Peng Xu, Mostofa Patwary, Mohammad Shoeybi, Bo Li, Anima Anandkumar, Bryan Catanzaro

    Abstract: Pre-trained language models (LMs) are shown to easily generate toxic language. In this work, we systematically explore domain-adaptive training to reduce the toxicity of language models. We conduct this study on three dimensions: training corpus, model size, and parameter efficiency. For the training corpus, we propose to leverage the generative power of LMs and generate nontoxic datasets for doma… ▽ More

    Submitted 21 October, 2022; v1 submitted 8 February, 2022; originally announced February 2022.

    Comments: NeurIPS 2022

  10. arXiv:2202.01771  [pdf, other

    cs.LG cs.CL

    Pre-Trained Language Models for Interactive Decision-Making

    Authors: Shuang Li, Xavier Puig, Chris Paxton, Yilun Du, Clinton Wang, Linxi Fan, Tao Chen, De-An Huang, Ekin Akyürek, Anima Anandkumar, Jacob Andreas, Igor Mordatch, Antonio Torralba, Yuke Zhu

    Abstract: Language model (LM) pre-training is useful in many language processing tasks. But can pre-trained LMs be further leveraged for more general machine learning problems? We propose an approach for using LMs to scaffold learning and generalization in general sequential decision-making problems. In this approach, goals and observations are represented as a sequence of embeddings, and a policy network i… ▽ More

    Submitted 29 October, 2022; v1 submitted 3 February, 2022; originally announced February 2022.

  11. arXiv:2112.10239  [pdf, other

    quant-ph

    TensorLy-Quantum: Quantum Machine Learning with Tensor Methods

    Authors: Taylor L. Patti, Jean Kossaifi, Susanne F. Yelin, Anima Anandkumar

    Abstract: Simulation is essential for developing quantum hardware and algorithms. However, simulating quantum circuits on classical hardware is challenging due to the exponential scaling of quantum state space. While factorized tensors can greatly reduce this overhead, tensor network-based simulators are relatively few and often lack crucial functionalities. To address this deficiency, we created TensorLy-Q… ▽ More

    Submitted 19 December, 2021; originally announced December 2021.

    Comments: 6 pages, 2 figures

  12. arXiv:2112.07868  [pdf, other

    cs.CL cs.AI

    Few-shot Instruction Prompts for Pretrained Language Models to Detect Social Biases

    Authors: Shrimai Prabhumoye, Rafal Kocielnik, Mohammad Shoeybi, Anima Anandkumar, Bryan Catanzaro

    Abstract: Detecting social bias in text is challenging due to nuance, subjectivity, and difficulty in obtaining good quality labeled datasets at scale, especially given the evolving nature of social biases and society. To address these challenges, we propose a few-shot instruction-based method for prompting pre-trained language models (LMs). We select a few class-balanced exemplars from a small support repo… ▽ More

    Submitted 15 April, 2022; v1 submitted 14 December, 2021; originally announced December 2021.

    Comments: Submission revised with new results

  13. arXiv:2112.07746  [pdf, other

    cs.LG eess.SY math.OC stat.ML

    CEM-GD: Cross-Entropy Method with Gradient Descent Planner for Model-Based Reinforcement Learning

    Authors: Kevin Huang, Sahin Lale, Ugo Rosolia, Yuanyuan Shi, Anima Anandkumar

    Abstract: Current state-of-the-art model-based reinforcement learning algorithms use trajectory sampling methods, such as the Cross-Entropy Method (CEM), for planning in continuous control settings. These zeroth-order optimizers require sampling a large number of trajectory rollouts to select an optimal action, which scales poorly for large prediction horizons or high dimensional action spaces. First-order… ▽ More

    Submitted 14 December, 2021; originally announced December 2021.

  14. arXiv:2112.03235  [pdf, other

    cs.AI cs.CE cs.LG cs.MS

    Simulation Intelligence: Towards a New Generation of Scientific Methods

    Authors: Alexander Lavin, David Krakauer, Hector Zenil, Justin Gottschlich, Tim Mattson, Johann Brehmer, Anima Anandkumar, Sanjay Choudry, Kamil Rocki, Atılım Güneş Baydin, Carina Prunkl, Brooks Paige, Olexandr Isayev, Erik Peterson, Peter L. McMahon, Jakob Macke, Kyle Cranmer, Jiaxin Zhang, Haruko Wainwright, Adi Hanuka, Manuela Veloso, Samuel Assefa, Stephan Zheng, Avi Pfeffer

    Abstract: The original "Seven Motifs" set forth a roadmap of essential methods for the field of scientific computing, where a motif is an algorithmic method that captures a pattern of computation and data movement. We present the "Nine Motifs of Simulation Intelligence", a roadmap for the development and integration of the essential algorithms necessary for a merger of scientific computing, scientific simul… ▽ More

    Submitted 27 November, 2022; v1 submitted 6 December, 2021; originally announced December 2021.

  15. arXiv:2111.13587  [pdf, other

    cs.CV cs.LG

    Adaptive Fourier Neural Operators: Efficient Token Mixers for Transformers

    Authors: John Guibas, Morteza Mardani, Zongyi Li, Andrew Tao, Anima Anandkumar, Bryan Catanzaro

    Abstract: Vision transformers have delivered tremendous success in representation learning. This is primarily due to effective token mixing through self attention. However, this scales quadratically with the number of pixels, which becomes infeasible for high-resolution inputs. To cope with this challenge, we propose Adaptive Fourier Neural Operator (AFNO) as an efficient token mixer that learns to mix in t… ▽ More

    Submitted 27 March, 2022; v1 submitted 24 November, 2021; originally announced November 2021.

  16. arXiv:2111.08565  [pdf, other

    cs.LG cs.MA math.OC

    Polymatrix Competitive Gradient Descent

    Authors: Jeffrey Ma, Alistair Letcher, Florian Schäfer, Yuanyuan Shi, Anima Anandkumar

    Abstract: Many economic games and machine learning approaches can be cast as competitive optimization problems where multiple agents are minimizing their respective objective function, which depends on all agents' actions. While gradient descent is a reliable basic workhorse for single-agent optimization, it often leads to oscillation in competitive optimization. In this work we propose polymatrix competiti… ▽ More

    Submitted 16 November, 2021; originally announced November 2021.

  17. arXiv:2111.07999  [pdf, other

    cs.LG cs.AI cs.RO

    Adversarial Skill Chaining for Long-Horizon Robot Manipulation via Terminal State Regularization

    Authors: Youngwoon Lee, Joseph J. Lim, Anima Anandkumar, Yuke Zhu

    Abstract: Skill chaining is a promising approach for synthesizing complex behaviors by sequentially combining previously learned skills. Yet, a naive composition of skills fails when a policy encounters a starting state never seen during its training. For successful skill chaining, prior approaches attempt to widen the policy's starting state distribution. However, these approaches require larger state dist… ▽ More

    Submitted 15 November, 2021; originally announced November 2021.

    Comments: Published at the Conference on Robot Learning (CoRL) 2021

  18. arXiv:2111.03794  [pdf, other

    cs.LG math.NA

    Physics-Informed Neural Operator for Learning Partial Differential Equations

    Authors: Zongyi Li, Hongkai Zheng, Nikola Kovachki, David Jin, Haoxuan Chen, Burigede Liu, Kamyar Azizzadenesheli, Anima Anandkumar

    Abstract: In this paper, we propose physics-informed neural operators (PINO) that combine training data and physics constraints to learn the solution operator of a given family of parametric Partial Differential Equations (PDE). PINO is the first hybrid approach incorporating data and PDE constraints at different resolutions to learn the operator. Specifically, in PINO, we combine coarse-resolution training… ▽ More

    Submitted 29 July, 2023; v1 submitted 5 November, 2021; originally announced November 2021.

  19. arXiv:2111.01395  [pdf, other

    cs.LG cs.CR stat.ML

    Training Certifiably Robust Neural Networks with Efficient Local Lipschitz Bounds

    Authors: Yujia Huang, Huan Zhang, Yuanyuan Shi, J Zico Kolter, Anima Anandkumar

    Abstract: Certified robustness is a desirable property for deep neural networks in safety-critical applications, and popular training algorithms can certify robustness of a neural network by computing a global bound on its Lipschitz constant. However, such a bound is often loose: it tends to over-regularize the neural network and degrade its natural accuracy. A tighter Lipschitz bound may provide a better t… ▽ More

    Submitted 2 November, 2021; originally announced November 2021.

    Comments: NeurIPS 2021

  20. arXiv:2110.14538  [pdf, other

    cs.LG cs.MA

    Reinforcement Learning in Factored Action Spaces using Tensor Decompositions

    Authors: Anuj Mahajan, Mikayel Samvelyan, Lei Mao, Viktor Makoviychuk, Animesh Garg, Jean Kossaifi, Shimon Whiteson, Yuke Zhu, Animashree Anandkumar

    Abstract: We present an extended abstract for the previously published work TESSERACT [Mahajan et al., 2021], which proposes a novel solution for Reinforcement Learning (RL) in large, factored action spaces using tensor decompositions. The goal of this abstract is twofold: (1) To garner greater interest amongst the tensor research community for creating methods and analysis for approximate RL, (2) To elucid… ▽ More

    Submitted 27 October, 2021; originally announced October 2021.

    Journal ref: 2nd Workshop on Quantum Tensor Networks in Machine Learning (NeurIPS 2021)

  21. arXiv:2110.13771  [pdf, other

    cs.CV cs.LG

    AugMax: Adversarial Composition of Random Augmentations for Robust Training

    Authors: Haotao Wang, Chaowei Xiao, Jean Kossaifi, Zhiding Yu, Anima Anandkumar, Zhangyang Wang

    Abstract: Data augmentation is a simple yet effective way to improve the robustness of deep neural networks (DNNs). Diversity and hardness are two complementary dimensions of data augmentation to achieve robustness. For example, AugMix explores random compositions of a diverse set of augmentations to enhance broader coverage, while adversarial training generates adversarially hard samples to spot the weakne… ▽ More

    Submitted 1 January, 2022; v1 submitted 26 October, 2021; originally announced October 2021.

    Comments: NeurIPS, 2021

  22. arXiv:2110.12661  [pdf, other

    cs.LG cs.CV

    ZerO Initialization: Initializing Neural Networks with only Zeros and Ones

    Authors: Jiawei Zhao, Florian Schäfer, Anima Anandkumar

    Abstract: Deep neural networks are usually initialized with random weights, with adequately selected initial variance to ensure stable signal propagation during training. However, selecting the appropriate variance becomes challenging especially as the number of layers grows. In this work, we replace random weight initialization with a fully deterministic initialization scheme, viz., ZerO, which initializes… ▽ More

    Submitted 4 November, 2022; v1 submitted 25 October, 2021; originally announced October 2021.

  23. arXiv:2110.10873  [pdf, other

    cs.CV cs.AI cs.LG

    Controllable and Compositional Generation with Latent-Space Energy-Based Models

    Authors: Weili Nie, Arash Vahdat, Anima Anandkumar

    Abstract: Controllable generation is one of the key requirements for successful adoption of deep generative models in real-world applications, but it still remains as a great challenge. In particular, the compositional ability to generate novel concept combinations is out of reach for most current models. In this work, we use energy-based models (EBMs) to handle compositional generation over a set of attrib… ▽ More

    Submitted 3 December, 2021; v1 submitted 20 October, 2021; originally announced October 2021.

    Comments: 32 pages, NeurIPS 2021

  24. arXiv:2110.00704  [pdf, other

    cs.RO cs.AI cs.LG

    OSCAR: Data-Driven Operational Space Control for Adaptive and Robust Robot Manipulation

    Authors: Josiah Wong, Viktor Makoviychuk, Anima Anandkumar, Yuke Zhu

    Abstract: Learning performant robot manipulation policies can be challenging due to high-dimensional continuous actions and complex physics-based dynamics. This can be alleviated through intelligent choice of action space. Operational Space Control (OSC) has been used as an effective task-space controller for manipulation. Nonetheless, its strength depends on the underlying modeling fidelity, and is prone t… ▽ More

    Submitted 1 October, 2021; originally announced October 2021.

  25. arXiv:2109.14854  [pdf, other

    eess.SY math.OC

    Stability Constrained Reinforcement Learning for Real-Time Voltage Control

    Authors: Yuanyuan Shi, Guannan Qu, Steven Low, Anima Anandkumar, Adam Wierman

    Abstract: Deep reinforcement learning (RL) has been recognized as a promising tool to address the challenges in real-time control of power systems. However, its deployment in real-world power systems has been hindered by a lack of formal stability and safety guarantees. In this paper, we propose a stability constrained reinforcement learning method for real-time voltage control in distribution grids and we… ▽ More

    Submitted 30 September, 2021; originally announced September 2021.

  26. arXiv:2109.12456  [pdf, other

    cs.LG cs.AI cs.CV

    Auditing AI models for Verified Deployment under Semantic Specifications

    Authors: Homanga Bharadhwaj, De-An Huang, Chaowei Xiao, Anima Anandkumar, Animesh Garg

    Abstract: Auditing trained deep learning (DL) models prior to deployment is vital for preventing unintended consequences. One of the biggest challenges in auditing is the lack of human-interpretable specifications for the DL models that are directly useful to the auditor. We address this challenge through a sequence of semantically-aligned unit tests, where each unit test verifies whether a predefined speci… ▽ More

    Submitted 1 November, 2021; v1 submitted 25 September, 2021; originally announced September 2021.

    Comments: Preprint; Under review

  27. arXiv:2109.03814  [pdf, other

    cs.CV

    Panoptic SegFormer: Delving Deeper into Panoptic Segmentation with Transformers

    Authors: Zhiqi Li, Wenhai Wang, Enze Xie, Zhiding Yu, Anima Anandkumar, Jose M. Alvarez, Ping Luo, Tong Lu

    Abstract: Panoptic segmentation involves a combination of joint semantic segmentation and instance segmentation, where image contents are divided into two types: things and stuff. We present Panoptic SegFormer, a general framework for panoptic segmentation with transformers. It contains three innovative components: an efficient deeply-supervised mask decoder, a query decoupling strategy, and an improved pos… ▽ More

    Submitted 18 March, 2022; v1 submitted 8 September, 2021; originally announced September 2021.

    Comments: Accepted to CVPR 2022

  28. arXiv:2109.03697  [pdf, other

    physics.geo-ph cs.LG

    U-FNO -- An enhanced Fourier neural operator-based deep-learning model for multiphase flow

    Authors: Gege Wen, Zongyi Li, Kamyar Azizzadenesheli, Anima Anandkumar, Sally M. Benson

    Abstract: Numerical simulation of multiphase flow in porous media is essential for many geoscience applications. Machine learning models trained with numerical simulation data can provide a faster alternative to traditional simulators. Here we present U-FNO, a novel neural network architecture for solving multiphase flow problems with superior accuracy, speed, and data efficiency. U-FNO is designed based on… ▽ More

    Submitted 4 May, 2022; v1 submitted 3 September, 2021; originally announced September 2021.

  29. arXiv:2108.13826  [pdf, other

    cs.CV

    Self-Calibrating Neural Radiance Fields

    Authors: Yoonwoo Jeong, Seokjun Ahn, Christopher Choy, Animashree Anandkumar, Minsu Cho, Jaesik Park

    Abstract: In this work, we propose a camera self-calibration algorithm for generic cameras with arbitrary non-linear distortions. We jointly learn the geometry of the scene and the accurate camera parameters without any calibration objects. Our camera model consists of a pinhole model, a fourth order radial distortion, and a generic noise model that can learn arbitrary non-linear camera distortions. While t… ▽ More

    Submitted 2 September, 2021; v1 submitted 31 August, 2021; originally announced August 2021.

    Comments: Accepted in ICCV21, Project Page: https://postech-cvlab.github.io/SCNeRF/

  30. arXiv:2108.11959  [pdf, ps, other

    cs.LG eess.SY math.OC

    Finite-time System Identification and Adaptive Control in Autoregressive Exogenous Systems

    Authors: Sahin Lale, Kamyar Azizzadenesheli, Babak Hassibi, Anima Anandkumar

    Abstract: Autoregressive exogenous (ARX) systems are the general class of input-output dynamical systems used for modeling stochastic linear dynamical systems (LDS) including partially observable LDS such as LQG systems. In this work, we study the problem of system identification and adaptive control of unknown ARX systems. We provide finite-time learning guarantees for the ARX systems under both open-loop… ▽ More

    Submitted 26 August, 2021; originally announced August 2021.

    Comments: 3rd Annual Learning for Dynamics & Control Conference (L4DC)

  31. Neural Operator: Learning Maps Between Function Spaces

    Authors: Nikola Kovachki, Zongyi Li, Burigede Liu, Kamyar Azizzadenesheli, Kaushik Bhattacharya, Andrew Stuart, Anima Anandkumar

    Abstract: The classical development of neural networks has primarily focused on learning mappings between finite dimensional Euclidean spaces or finite sets. We propose a generalization of neural networks to learn operators, termed neural operators, that map between infinite dimensional function spaces. We formulate the neural operator as a composition of linear integral operators and nonlinear activation f… ▽ More

    Submitted 2 May, 2024; v1 submitted 18 August, 2021; originally announced August 2021.

    Journal ref: The Journal of Machine Learning Research (2023), Volume 24, Issue 1, Article No 89, pp 4061-4157

  32. Tensor Methods in Computer Vision and Deep Learning

    Authors: Yannis Panagakis, Jean Kossaifi, Grigorios G. Chrysos, James Oldfield, Mihalis A. Nicolaou, Anima Anandkumar, Stefanos Zafeiriou

    Abstract: Tensors, or multidimensional arrays, are data structures that can naturally represent visual data of multiple dimensions. Inherently able to efficiently capture structured, latent semantic spaces and high-order interactions, tensors have a long history of applications in a wide span of computer vision problems. With the advent of the deep learning paradigm shift in computer vision, tensors have be… ▽ More

    Submitted 7 July, 2021; originally announced July 2021.

    Comments: Proceedings of the IEEE (2021)

  33. arXiv:2107.02192  [pdf, other

    cs.CV cs.CL cs.LG cs.MM

    Long-Short Transformer: Efficient Transformers for Language and Vision

    Authors: Chen Zhu, Wei Ping, Chaowei Xiao, Mohammad Shoeybi, Tom Goldstein, Anima Anandkumar, Bryan Catanzaro

    Abstract: Transformers have achieved success in both language and vision domains. However, it is prohibitively expensive to scale them to long sequences such as long documents or high-resolution images, because self-attention mechanism has quadratic time and memory complexities with respect to the input sequence length. In this paper, we propose Long-Short Transformer (Transformer-LS), an efficient self-att… ▽ More

    Submitted 7 December, 2021; v1 submitted 5 July, 2021; originally announced July 2021.

    Comments: Published at NeurIPS 2021

  34. arXiv:2107.00299  [pdf, other

    physics.chem-ph

    OrbNet Denali: A machine learning potential for biological and organic chemistry with semi-empirical cost and DFT accuracy

    Authors: Anders S. Christensen, Sai Krishna Sirumalla, Zhuoran Qiao, Michael B. O'Connor, Daniel G. A. Smith, Feizhi Ding, Peter J. Bygrave, Animashree Anandkumar, Matthew Welborn, Frederick R. Manby, Thomas F. Miller III

    Abstract: We present OrbNet Denali, a machine learning model for electronic structure that is designed as a drop-in replacement for ground-state density functional theory (DFT) energy calculations. The model is a message-passing neural network that uses symmetry-adapted atomic orbital features from a low-cost quantum calculation to predict the energy of a molecule. OrbNet Denali is trained on a vast dataset… ▽ More

    Submitted 2 July, 2021; v1 submitted 1 July, 2021; originally announced July 2021.

  35. arXiv:2106.13914  [pdf, other

    cs.LG cs.AR

    LNS-Madam: Low-Precision Training in Logarithmic Number System using Multiplicative Weight Update

    Authors: Jiawei Zhao, Steve Dai, Rangharajan Venkatesan, Brian Zimmer, Mustafa Ali, Ming-Yu Liu, Brucek Khailany, Bill Dally, Anima Anandkumar

    Abstract: Representing deep neural networks (DNNs) in low-precision is a promising approach to enable efficient acceleration and memory reduction. Previous methods that train DNNs in low-precision typically keep a copy of weights in high-precision during the weight updates. Directly training with low-precision weights leads to accuracy degradation due to complex interactions between the low-precision number… ▽ More

    Submitted 23 August, 2022; v1 submitted 25 June, 2021; originally announced June 2021.

  36. arXiv:2106.13304  [pdf, ps, other

    quant-ph

    Variational Quantum Optimization with Multi-Basis Encodings

    Authors: Taylor L. Patti, Jean Kossaifi, Anima Anandkumar, Susanne F. Yelin

    Abstract: Despite extensive research efforts, few quantum algorithms for classical optimization demonstrate realizable quantum advantage. The utility of many quantum algorithms is limited by high requisite circuit depth and nonconvex optimization landscapes. We tackle these challenges by introducing a new variational quantum algorithm that benefits from two innovations: multi-basis graph encodings and nonli… ▽ More

    Submitted 26 January, 2022; v1 submitted 24 June, 2021; originally announced June 2021.

    Comments: 10 pages, 4 figures. Corrected circuit structure, added citations, clarified key points. Updated title, method nomenclature, manuscript order, and format

  37. arXiv:2106.11921  [pdf, other

    cs.CV cs.LG

    Not All Labels Are Equal: Rationalizing The Labeling Costs for Training Object Detection

    Authors: Ismail Elezi, Zhiding Yu, Anima Anandkumar, Laura Leal-Taixe, Jose M. Alvarez

    Abstract: Deep neural networks have reached high accuracy on object detection but their success hinges on large amounts of labeled data. To reduce the labels dependency, various active learning strategies have been proposed, typically based on the confidence of the detector. However, these methods are biased towards high-performing classes and can lead to acquired datasets that are not good representatives… ▽ More

    Submitted 29 November, 2021; v1 submitted 22 June, 2021; originally announced June 2021.

    Comments: Includes supplementary material

  38. arXiv:2106.09678  [pdf, other

    cs.LG cs.AI cs.CV cs.RO

    SECANT: Self-Expert Cloning for Zero-Shot Generalization of Visual Policies

    Authors: Linxi Fan, Guanzhi Wang, De-An Huang, Zhiding Yu, Li Fei-Fei, Yuke Zhu, Anima Anandkumar

    Abstract: Generalization has been a long-standing challenge for reinforcement learning (RL). Visual RL, in particular, can be easily distracted by irrelevant factors in high-dimensional observation space. In this work, we consider robust policy learning which targets zero-shot generalization to unseen visual environments with large distributional shift. We propose SECANT, a novel self-expert cloning techniq… ▽ More

    Submitted 17 June, 2021; originally announced June 2021.

    Comments: ICML 2021. Website: https://linxifan.github.io/secant-site/

  39. arXiv:2106.06898  [pdf, other

    cs.LG math.DS

    Learning Dissipative Dynamics in Chaotic Systems

    Authors: Zongyi Li, Miguel Liu-Schiaffini, Nikola Kovachki, Burigede Liu, Kamyar Azizzadenesheli, Kaushik Bhattacharya, Andrew Stuart, Anima Anandkumar

    Abstract: Chaotic systems are notoriously challenging to predict because of their sensitivity to perturbations and errors due to time stepping. Despite this unpredictable behavior, for many dissipative systems the statistics of the long term trajectories are governed by an invariant measure supported on a set, known as the global attractor; for many problems this set is finite dimensional, even if the state… ▽ More

    Submitted 27 September, 2022; v1 submitted 12 June, 2021; originally announced June 2021.

  40. arXiv:2106.00136  [pdf, other

    cs.LG

    Tesseract: Tensorised Actors for Multi-Agent Reinforcement Learning

    Authors: Anuj Mahajan, Mikayel Samvelyan, Lei Mao, Viktor Makoviychuk, Animesh Garg, Jean Kossaifi, Shimon Whiteson, Yuke Zhu, Animashree Anandkumar

    Abstract: Reinforcement Learning in large action spaces is a challenging problem. Cooperative multi-agent reinforcement learning (MARL) exacerbates matters by imposing various constraints on communication and observability. In this work, we consider the fundamental hurdle affecting both value-based and policy-gradient approaches: an exponential blowup of the action space with the number of agents. For value… ▽ More

    Submitted 31 May, 2021; originally announced June 2021.

    Comments: 38th International Conference on Machine Learning, PMLR 139, 2021

  41. arXiv:2105.15203  [pdf, other

    cs.CV cs.LG

    SegFormer: Simple and Efficient Design for Semantic Segmentation with Transformers

    Authors: Enze Xie, Wenhai Wang, Zhiding Yu, Anima Anandkumar, Jose M. Alvarez, Ping Luo

    Abstract: We present SegFormer, a simple, efficient yet powerful semantic segmentation framework which unifies Transformers with lightweight multilayer perception (MLP) decoders. SegFormer has two appealing features: 1) SegFormer comprises a novel hierarchically structured Transformer encoder which outputs multiscale features. It does not need positional encoding, thereby avoiding the interpolation of posit… ▽ More

    Submitted 28 October, 2021; v1 submitted 31 May, 2021; originally announced May 2021.

    Comments: Accepted by NeurIPS 2021

  42. arXiv:2105.14655  [pdf, other

    cs.LG physics.chem-ph

    Informing Geometric Deep Learning with Electronic Interactions to Accelerate Quantum Chemistry

    Authors: Zhuoran Qiao, Anders S. Christensen, Matthew Welborn, Frederick R. Manby, Anima Anandkumar, Thomas F. Miller III

    Abstract: Predicting electronic energies, densities, and related chemical properties can facilitate the discovery of novel catalysts, medicines, and battery materials. By developing a physics-inspired equivariant neural network, we introduce a method to learn molecular representations based on the electronic interactions among atomic orbitals. Our method, OrbNet-Equi, leverages efficient tight-binding simul… ▽ More

    Submitted 1 April, 2022; v1 submitted 30 May, 2021; originally announced May 2021.

    Journal ref: Proceedings of the National Academy of Sciences 119.31 (2022): e2205221119

  43. arXiv:2105.08692  [pdf, other

    cs.AI

    Coach-Player Multi-Agent Reinforcement Learning for Dynamic Team Composition

    Authors: Bo Liu, Qiang Liu, Peter Stone, Animesh Garg, Yuke Zhu, Animashree Anandkumar

    Abstract: In real-world multi-agent systems, agents with different capabilities may join or leave without altering the team's overarching goals. Coordinating teams with such dynamic composition is challenging: the optimal team strategy varies with the composition. We propose COPA, a coach-player framework to tackle this problem. We assume the coach has a global view of the environment and coordinates the pl… ▽ More

    Submitted 3 September, 2021; v1 submitted 18 May, 2021; originally announced May 2021.

    Comments: International Conference on Machine Learning

  44. arXiv:2105.06464  [pdf, other

    cs.CV cs.LG

    DiscoBox: Weakly Supervised Instance Segmentation and Semantic Correspondence from Box Supervision

    Authors: Shiyi Lan, Zhiding Yu, Christopher Choy, Subhashree Radhakrishnan, Guilin Liu, Yuke Zhu, Larry S. Davis, Anima Anandkumar

    Abstract: We introduce DiscoBox, a novel framework that jointly learns instance segmentation and semantic correspondence using bounding box supervision. Specifically, we propose a self-ensembling framework where instance segmentation and semantic correspondence are jointly guided by a structured teacher in addition to the bounding box supervision. The teacher is a structured energy model incorporating a pai… ▽ More

    Submitted 5 June, 2021; v1 submitted 13 May, 2021; originally announced May 2021.

    Comments: Tech Report

  45. arXiv:2104.14134  [pdf, other

    math.OC cs.LG eess.SY

    Stable Online Control of Linear Time-Varying Systems

    Authors: Guannan Qu, Yuanyuan Shi, Sahin Lale, Anima Anandkumar, Adam Wierman

    Abstract: Linear time-varying (LTV) systems are widely used for modeling real-world dynamical systems due to their generality and simplicity. Providing stability guarantees for LTV systems is one of the central problems in control theory. However, existing approaches that guarantee stability typically lead to significantly sub-optimal cumulative control cost in online settings where only current or short-te… ▽ More

    Submitted 29 April, 2021; v1 submitted 29 April, 2021; originally announced April 2021.

    Comments: 3rd Annual Learning for Dynamics & Control Conference (L4DC)

  46. arXiv:2104.07916  [pdf, other

    cs.CV

    Augmenting Deep Classifiers with Polynomial Neural Networks

    Authors: Grigorios G Chrysos, Markos Georgopoulos, Jiankang Deng, Jean Kossaifi, Yannis Panagakis, Anima Anandkumar

    Abstract: Deep neural networks have been the driving force behind the success in classification tasks, e.g., object and audio recognition. Impressive results and generalization have been achieved by a variety of recently proposed architectures, the majority of which are seemingly disconnected. In this work, we cast the study of deep classifiers under a unifying framework. In particular, we express state-of-… ▽ More

    Submitted 11 August, 2022; v1 submitted 16 April, 2021; originally announced April 2021.

    Comments: Accepted at ECCV'22

  47. arXiv:2104.05702  [pdf, other

    cs.CV cs.LG

    Image-Level or Object-Level? A Tale of Two Resampling Strategies for Long-Tailed Detection

    Authors: Nadine Chang, Zhiding Yu, Yu-Xiong Wang, Anima Anandkumar, Sanja Fidler, Jose M. Alvarez

    Abstract: Training on datasets with long-tailed distributions has been challenging for major recognition tasks such as classification and detection. To deal with this challenge, image resampling is typically introduced as a simple but effective approach. However, we observe that long-tailed detection differs from classification since multiple classes may be present in one image. As a result, image resamplin… ▽ More

    Submitted 18 October, 2021; v1 submitted 12 April, 2021; originally announced April 2021.

    Comments: Accepted to ICML 2021

  48. arXiv:2104.02290  [pdf, other

    cs.CV

    Contrastive Syn-to-Real Generalization

    Authors: Wuyang Chen, Zhiding Yu, Shalini De Mello, Sifei Liu, Jose M. Alvarez, Zhangyang Wang, Anima Anandkumar

    Abstract: Training on synthetic data can be beneficial for label or data-scarce scenarios. However, synthetically trained models often suffer from poor generalization in real domains due to domain gaps. In this work, we make a key observation that the diversity of the learned feature embeddings plays an important role in the generalization performance. To this end, we propose contrastive synthetic-to-real g… ▽ More

    Submitted 6 April, 2021; originally announced April 2021.

    Comments: Accepted in ICLR 2021

  49. arXiv:2103.07403  [pdf, other

    cs.RO cs.AI eess.SY

    Generating and Characterizing Scenarios for Safety Testing of Autonomous Vehicles

    Authors: Zahra Ghodsi, Siva Kumar Sastry Hari, Iuri Frosio, Timothy Tsai, Alejandro Troccoli, Stephen W. Keckler, Siddharth Garg, Anima Anandkumar

    Abstract: Extracting interesting scenarios from real-world data as well as generating failure cases is important for the development and testing of autonomous systems. We propose efficient mechanisms to both characterize and generate testing scenarios using a state-of-the-art driving simulator. For any scenario, our method generates a set of possible driving paths and identifies all the possible safe drivin… ▽ More

    Submitted 12 March, 2021; originally announced March 2021.

  50. arXiv:2102.12596  [pdf, other

    cs.SI cs.LG

    Dynamic Social Media Monitoring for Fast-Evolving Online Discussions

    Authors: Maya Srikanth, Anqi Liu, Nicholas Adams-Cohen, Jian Cao, R. Michael Alvarez, Anima Anandkumar

    Abstract: Tracking and collecting fast-evolving online discussions provides vast data for studying social media usage and its role in people's public lives. However, collecting social media data using a static set of keywords fails to satisfy the growing need to monitor dynamic conversations and to study fast-changing topics. We propose a dynamic keyword search method to maximize the coverage of relevant in… ▽ More

    Submitted 24 February, 2021; originally announced February 2021.

    Comments: Preprint, Under Review