Showing 1–41 of 41 results for author: Hu, E

  1. arXiv:2408.09807  [pdf, other]

    cs.AI

    World Models Increase Autonomy in Reinforcement Learning

    Authors: Zhao Yang, Thomas M. Moerland, Mike Preuss, Aske Plaat, Edward S. Hu

    Abstract: Reinforcement learning (RL) is an appealing paradigm for training intelligent agents, enabling policy acquisition from the agent's own autonomously acquired experience. However, the training process of RL is far from automatic, requiring extensive human effort to reset the agent and environments. To tackle the challenging reset-free setting, we first demonstrate the superiority of model-based (MB)…

    Submitted 20 August, 2024; v1 submitted 19 August, 2024; originally announced August 2024.

  2. arXiv:2407.05645  [pdf, other]

    cs.CV cs.MM

    OneDiff: A Generalist Model for Image Difference Captioning

    Authors: Erdong Hu, Longteng Guo, Tongtian Yue, Zijia Zhao, Shuning Xue, Jing Liu

    Abstract: In computer vision, Image Difference Captioning (IDC) is crucial for accurately describing variations between closely related images. Traditional IDC methods often rely on specialist models, which restrict their applicability across varied contexts. This paper introduces the OneDiff model, a novel generalist approach that utilizes a robust vision-language model architecture, integrating a siamese…

    Submitted 16 July, 2024; v1 submitted 8 July, 2024; originally announced July 2024.

  3. arXiv:2405.14853  [pdf, other]

    cs.LG cs.AI cs.RO

    Privileged Sensing Scaffolds Reinforcement Learning

    Authors: Edward S. Hu, James Springer, Oleh Rybkin, Dinesh Jayaraman

    Abstract: We need to look at our shoelaces as we first learn to tie them but having mastered this skill, can do it from touch alone. We call this phenomenon "sensory scaffolding": observation streams that are not needed by a master might yet aid a novice learner. We consider such sensory scaffolding setups for training artificial agents. For example, a robot arm may need to be deployed with just a low-cost,…

    Submitted 23 May, 2024; originally announced May 2024.

    Comments: ICLR 2024 Spotlight version

  4. arXiv:2404.05802  [pdf, other]

    cs.CE cs.CV cs.MM

    BatSort: Enhanced Battery Classification with Transfer Learning for Battery Sorting and Recycling

    Authors: Yunyi Zhao, Wei Zhang, Erhai Hu, Qingyu Yan, Cheng Xiang, King Jet Tseng, Dusit Niyato

    Abstract: Battery recycling is a critical process for minimizing environmental harm and resource waste for used batteries. However, it is challenging, largely because sorting batteries is costly and hardly automated to group batteries based on battery types. In this paper, we introduce a machine learning-based approach for battery-type classification and address the daunting problem of data scarcity for the…

    Submitted 8 April, 2024; originally announced April 2024.

  5. arXiv:2402.04513  [pdf, other]

    cs.LG cs.CL

    Online Cascade Learning for Efficient Inference over Streams

    Authors: Lunyiu Nie, Zhimin Ding, Erdong Hu, Christopher Jermaine, Swarat Chaudhuri

    Abstract: Large Language Models (LLMs) have a natural role in answering complex queries about data streams, but the high computational cost of LLM inference makes them infeasible in many such tasks. We propose online cascade learning, the first approach to address this challenge. The objective here is to learn a "cascade" of models, starting with lower-capacity models (such as logistic regression) and endin…

    Submitted 17 June, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

    Comments: ICML 2024 Main Conference Paper

  6. arXiv:2402.03988  [pdf, other]

    eess.AS cs.CL cs.SD

    REBORN: Reinforcement-Learned Boundary Segmentation with Iterative Training for Unsupervised ASR

    Authors: Liang-Hsuan Tseng, En-Pei Hu, Cheng-Han Chiang, Yuan Tseng, Hung-yi Lee, Lin-shan Lee, Shao-Hua Sun

    Abstract: Unsupervised automatic speech recognition (ASR) aims to learn the mapping between the speech signal and its corresponding textual transcription without the supervision of paired speech-text data. A word/phoneme in the speech signal is represented by a segment of speech signal with variable length and unknown boundary, and this segmental structure makes learning the mapping between speech and text…

    Submitted 28 May, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

  7. arXiv:2312.12655  [pdf, other]

    cs.LG cs.AI cs.CL

    Can Transformers Learn Sequential Function Classes In Context?

    Authors: Ryan Campbell, Emma Guo, Evan Hu, Reya Vir, Ethan Hsiao

    Abstract: In-context learning (ICL) has revolutionized the capabilities of transformer models in NLP. In our project, we extend the understanding of the mechanisms underpinning ICL by exploring whether transformers can learn from sequential, non-textual function class data distributions. We introduce a novel sliding window sequential function class and employ toy-sized transformers with a GPT-2 architecture…

    Submitted 20 December, 2023; v1 submitted 19 December, 2023; originally announced December 2023.

    Comments: 8 pages, 8 figures

  8. arXiv:2310.05513  [pdf, other]

    cs.SD cs.CL eess.AS

    Findings of the 2023 ML-SUPERB Challenge: Pre-Training and Evaluation over More Languages and Beyond

    Authors: Jiatong Shi, William Chen, Dan Berrebbi, Hsiu-Hsuan Wang, Wei-Ping Huang, En-Pei Hu, Ho-Lam Chuang, Xuankai Chang, Yuxun Tang, Shang-Wen Li, Abdelrahman Mohamed, Hung-yi Lee, Shinji Watanabe

    Abstract: The 2023 Multilingual Speech Universal Performance Benchmark (ML-SUPERB) Challenge expands upon the acclaimed SUPERB framework, emphasizing self-supervised models in multilingual speech recognition and language identification. The challenge comprises a research track focused on applying ML-SUPERB to specific multilingual subjects, a Challenge Track for model submissions, and a New Language Track w…

    Submitted 9 October, 2023; originally announced October 2023.

    Comments: Accepted by ASRU

  9. arXiv:2310.04363  [pdf, other]

    cs.LG cs.CL

    Amortizing intractable inference in large language models

    Authors: Edward J. Hu, Moksh Jain, Eric Elmoznino, Younesse Kaddar, Guillaume Lajoie, Yoshua Bengio, Nikolay Malkin

    Abstract: Autoregressive large language models (LLMs) compress knowledge from their training data through next-token conditional distributions. This limits tractable querying of this knowledge to start-to-end autoregressive sampling. However, many tasks of interest -- including sequence continuation, infilling, and other forms of constrained generation -- involve sampling from intractable posterior distribu…

    Submitted 13 March, 2024; v1 submitted 6 October, 2023; originally announced October 2023.

    Comments: ICLR 2024; 23 pages; code: https://github.com/GFNOrg/gfn-lm-tuning

  10. arXiv:2309.03237  [pdf, other]

    cs.LG cs.IT math.OC

    Federated Learning Over Images: Vertical Decompositions and Pre-Trained Backbones Are Difficult to Beat

    Authors: Erdong Hu, Yuxin Tang, Anastasios Kyrillidis, Chris Jermaine

    Abstract: We carefully evaluate a number of algorithms for learning in a federated environment, and test their utility for a variety of image classification tasks. We consider many issues that have not been adequately considered before: whether learning over data sets that do not have diverse sets of images affects the results; whether to use a pre-trained feature extraction "backbone"; how to evaluate lear…

    Submitted 5 September, 2023; originally announced September 2023.

    Comments: 16 pages, 7 figures, Accepted at ICCV2023

  11. arXiv:2306.00751  [pdf, other]

    cs.CL cs.LG

    Differentiable Tree Operations Promote Compositional Generalization

    Authors: Paul Soulos, Edward Hu, Kate McCurdy, Yunmo Chen, Roland Fernandez, Paul Smolensky, Jianfeng Gao

    Abstract: In the context of structure-to-structure transformation tasks, learning sequences of discrete symbolic operations poses significant challenges due to their non-differentiability. To facilitate the learning of these symbolic sequences, we introduce a differentiable tree interpreter that compiles high-level symbolic tree operations into subsymbolic matrix operations on tensors. We present a novel Di…

    Submitted 1 June, 2023; originally announced June 2023.

    Comments: ICML 2023. Code available at https://github.com/psoulos/dtm

  12. arXiv:2305.10615  [pdf, other]

    cs.SD cs.CL eess.AS

    ML-SUPERB: Multilingual Speech Universal PERformance Benchmark

    Authors: Jiatong Shi, Dan Berrebbi, William Chen, Ho-Lam Chung, En-Pei Hu, Wei Ping Huang, Xuankai Chang, Shang-Wen Li, Abdelrahman Mohamed, Hung-yi Lee, Shinji Watanabe

    Abstract: Speech processing Universal PERformance Benchmark (SUPERB) is a leaderboard to benchmark the performance of Self-Supervised Learning (SSL) models on various speech processing tasks. However, SUPERB largely considers English speech in its evaluation. This paper presents multilingual SUPERB (ML-SUPERB), covering 143 languages (ranging from high-resource to endangered), and considering both automatic…

    Submitted 11 August, 2023; v1 submitted 17 May, 2023; originally announced May 2023.

    Comments: Accepted by Interspeech

  13. arXiv:2303.13002  [pdf, other]

    cs.LG cs.AI cs.RO

    Planning Goals for Exploration

    Authors: Edward S. Hu, Richard Chang, Oleh Rybkin, Dinesh Jayaraman

    Abstract: Dropped into an unknown environment, what should an agent do to quickly learn about the environment and how to accomplish diverse tasks within it? We address this question within the goal-conditioned reinforcement learning paradigm, by identifying how the agent should set its goals at training time to maximize exploration. We propose "Planning Exploratory Goals" (PEG), a method that sets goals for…

    Submitted 22 March, 2023; originally announced March 2023.

    Comments: Camera Ready version for ICLR2023 Spotlight

  14. arXiv:2303.04242  [pdf, other]

    cs.CR

    MEV in fixed gas price blockchains: Terra Classic as a case of study

    Authors: Facundo Carrillo, Elaine Hu

    Abstract: Maximum extractable value (MEV) has been extensively studied. In most papers, the researchers have worked with the Ethereum blockchain almost exclusively. Even though Ethereum and other blockchains have dynamic gas prices, this is not the case for all blockchains; many of them have fixed gas prices. Extending the research to other blockchains with fixed gas price could broaden the scope of the exi…

    Submitted 7 March, 2023; originally announced March 2023.

  15. arXiv:2302.06576  [pdf, other]

    cs.LG stat.ML

    GFlowNet-EM for learning compositional latent variable models

    Authors: Edward J. Hu, Nikolay Malkin, Moksh Jain, Katie Everett, Alexandros Graikos, Yoshua Bengio

    Abstract: Latent variable models (LVMs) with discrete compositional latents are an important but challenging setting due to a combinatorially large number of possible configurations of the latents. A key tradeoff in modeling the posteriors over latents is between expressivity and tractable optimization. For algorithms based on expectation-maximization (EM), the E-step is often intractable without restrictiv…

    Submitted 3 June, 2023; v1 submitted 13 February, 2023; originally announced February 2023.

    Comments: ICML 2023; code: https://github.com/GFNOrg/GFlowNet-EM

  16. arXiv:2301.12950  [pdf, other]

    cs.LG cs.AI cs.PL cs.RO

    Hierarchical Programmatic Reinforcement Learning via Learning to Compose Programs

    Authors: Guan-Ting Liu, En-Pei Hu, Pu-Jen Cheng, Hung-yi Lee, Shao-Hua Sun

    Abstract: Aiming to produce reinforcement learning (RL) policies that are human-interpretable and can generalize better to novel scenarios, Trivedi et al. (2021) present a method (LEAPS) that first learns a program embedding space to continuously parameterize diverse programs from a pre-generated program dataset, and then searches for a task-solving program in the learned program embedding space when given…

    Submitted 31 May, 2023; v1 submitted 30 January, 2023; originally announced January 2023.

    Comments: ICML 2023

  17. arXiv:2301.10859  [pdf, other]

    cs.LG cs.AI

    Salesforce CausalAI Library: A Fast and Scalable Framework for Causal Analysis of Time Series and Tabular Data

    Authors: Devansh Arpit, Matthew Fernandez, Itai Feigenbaum, Weiran Yao, Chenghao Liu, Wenzhuo Yang, Paul Josel, Shelby Heinecke, Eric Hu, Huan Wang, Stephen Hoi, Caiming Xiong, Kun Zhang, Juan Carlos Niebles

    Abstract: We introduce the Salesforce CausalAI Library, an open-source library for causal analysis using observational data. It supports causal discovery and causal inference for tabular and time series data, of discrete, continuous and heterogeneous types. This library includes algorithms that handle linear and non-linear causal relationships between variables, and uses multi-processing for speed-up. We al…

    Submitted 22 September, 2023; v1 submitted 25 January, 2023; originally announced January 2023.

  18. arXiv:2212.08961  [pdf, other]

    cs.LG cs.RO

    Training Robots to Evaluate Robots: Example-Based Interactive Reward Functions for Policy Learning

    Authors: Kun Huang, Edward S. Hu, Dinesh Jayaraman

    Abstract: Physical interactions can often help reveal information that is not readily apparent. For example, we may tug at a table leg to evaluate whether it is built well, or turn a water bottle upside down to check that it is watertight. We propose to train robots to acquire such interactive behaviors automatically, for the purpose of evaluating the result of an attempted robotic skill execution. These ev…

    Submitted 17 December, 2022; originally announced December 2022.

    Comments: CoRL 2022

  19. arXiv:2210.00580  [pdf, other]

    cs.LG stat.ML

    GFlowNets and variational inference

    Authors: Nikolay Malkin, Salem Lahlou, Tristan Deleu, Xu Ji, Edward Hu, Katie Everett, Dinghuai Zhang, Yoshua Bengio

    Abstract: This paper builds bridges between two families of probabilistic algorithms: (hierarchical) variational inference (VI), which is typically used to model distributions over continuous spaces, and generative flow networks (GFlowNets), which have been used for distributions over discrete structures such as graphs. We demonstrate that, in certain cases, VI algorithms are equivalent to special cases of…

    Submitted 2 March, 2023; v1 submitted 2 October, 2022; originally announced October 2022.

    Comments: ICLR 2023 final version; code: https://github.com/GFNOrg/GFN_vs_HVI

  20. Enjoy the Ride Consciously with CAWA: Context-Aware Advisory Warnings for Automated Driving

    Authors: Erfan Pakdamanian, Erzhen Hu, Shili Sheng, Sarit Kraus, Seongkook Heo, Lu Feng

    Abstract: In conditionally automated driving, drivers decoupled from driving while immersed in non-driving-related tasks (NDRTs) could potentially either miss the system-initiated takeover request (TOR) or a sudden TOR may startle them. To better prepare drivers for a safer takeover in an emergency, we propose novel context-aware advisory warnings (CAWA) for automated driving to gently inform drivers. This…

    Submitted 29 August, 2022; originally announced August 2022.

    Comments: Proceeding of the 14th International Conference on Automotive User Interfaces and Interactive Vehicular Applications (AutomotiveUI '22)

  21. arXiv:2203.06942  [pdf, other]

    cs.CL cs.IR

    Hyperlink-induced Pre-training for Passage Retrieval in Open-domain Question Answering

    Authors: Jiawei Zhou, Xiaoguang Li, Lifeng Shang, Lan Luo, Ke Zhan, Enrui Hu, Xinyu Zhang, Hao Jiang, Zhao Cao, Fan Yu, Xin Jiang, Qun Liu, Lei Chen

    Abstract: To alleviate the data scarcity problem in training question answering systems, recent works propose additional intermediate pre-training for dense passage retrieval (DPR). However, there still remains a large discrepancy between the provided upstream signals and the downstream question-passage relevance, which leads to less improvement. To bridge this gap, we propose the HyperLink-induced Pre-trai…

    Submitted 12 April, 2022; v1 submitted 14 March, 2022; originally announced March 2022.

    Comments: Accepted by ACL 2022 main conference; The dataset and code are available at https://github.com/jzhoubu/HLP

  22. arXiv:2203.03466  [pdf, other]

    cs.LG cond-mat.dis-nn cs.NE

    Tensor Programs V: Tuning Large Neural Networks via Zero-Shot Hyperparameter Transfer

    Authors: Greg Yang, Edward J. Hu, Igor Babuschkin, Szymon Sidor, Xiaodong Liu, David Farhi, Nick Ryder, Jakub Pachocki, Weizhu Chen, Jianfeng Gao

    Abstract: Hyperparameter (HP) tuning in deep learning is an expensive process, prohibitively so for neural networks (NNs) with billions of parameters. We show that, in the recently discovered Maximal Update Parametrization (muP), many optimal HPs remain stable even as model size changes. This leads to a new HP tuning paradigm we call muTransfer: parametrize the target model in muP, tune the HP indirectly on…

    Submitted 28 March, 2022; v1 submitted 7 March, 2022; originally announced March 2022.

    Comments: NeurIPS 2021

  23. A "Distance Matters" Paradox: Facilitating Intra-Team Collaboration Can Harm Inter-Team Collaboration

    Authors: Xinlan Emily Hu, Rebecca Hinds, Melissa A. Valentine, Michael S. Bernstein

    Abstract: By identifying the socio-technical conditions required for teams to work effectively remotely, the Distance Matters framework has been influential in CSCW since its introduction in 2000. Advances in collaboration technology and practices have since brought teams increasingly closer to achieving these conditions. This paper presents a ten-month ethnography in a remote organization, where we observe…

    Submitted 4 February, 2022; originally announced February 2022.

    Comments: Accepted at CSCW 2022 (The 25th ACM Conference on Computer-Supported Cooperative Work and Social Computing)

    Journal ref: Proc. ACM Hum.-Comput. Interact. 6, CSCW1, Article 48 (April 2022), 36 pages

  24. arXiv:2111.09266  [pdf, other]

    cs.LG cs.AI stat.ML

    GFlowNet Foundations

    Authors: Yoshua Bengio, Salem Lahlou, Tristan Deleu, Edward J. Hu, Mo Tiwari, Emmanuel Bengio

    Abstract: Generative Flow Networks (GFlowNets) have been introduced as a method to sample a diverse set of candidates in an active learning context, with a training objective that makes them approximately sample in proportion to a given reward function. In this paper, we show a number of additional theoretical properties of GFlowNets. They can be used to estimate joint probability distributions and the corr…

    Submitted 10 July, 2023; v1 submitted 17 November, 2021; originally announced November 2021.

  25. arXiv:2110.07431  [pdf, other]

    cs.CL

    Towards More Effective and Economic Sparsely-Activated Model

    Authors: Hao Jiang, Ke Zhan, Jianwei Qu, Yongkang Wu, Zhaoye Fei, Xinyu Zhang, Lei Chen, Zhicheng Dou, Xipeng Qiu, Zikai Guo, Ruofei Lai, Jiawen Wu, Enrui Hu, Yinxia Zhang, Yantao Jia, Fan Yu, Zhao Cao

    Abstract: The sparsely-activated models have achieved great success in natural language processing through large-scale parameters and relatively low computational cost, and gradually become a feasible technique for training and implementing extremely large models. Due to the limit of communication cost, activating multiple experts is hardly affordable during training and inference. Therefore, previous work…

    Submitted 14 October, 2021; originally announced October 2021.

  26. arXiv:2110.05053  [pdf]

    cs.SE

    Integrating Structural Description of Data Format Information into Programming to Auto-generate File Reading Programs

    Authors: Xinghua Cheng, Erjie Hu, Di Hu

    Abstract: File reading is the basis for data sharing and scientific computing. However, manual programming for file reading is labour-intensive and time-consuming, as data formats are heterogeneous and complex. To address such an issue, this study proposes a novel approach for the automatic generation of file reading programs based on structured and self-described data format information. This approach prov…

    Submitted 11 October, 2021; originally announced October 2021.

    Comments: 28 pages, 17 figures

    ACM Class: K.6.3

  27. Guided Generation of Cause and Effect

    Authors: Zhongyang Li, Xiao Ding, Ting Liu, J. Edward Hu, Benjamin Van Durme

    Abstract: We present a conditional text generation framework that posits sentential expressions of possible causes and effects. This framework depends on two novel resources we develop in the course of this work: a very large-scale collection of English sentences expressing causal patterns CausalBank; and a refinement over previous work on constructing large lexical causal knowledge graphs Cause Effect Grap…

    Submitted 20 July, 2021; originally announced July 2021.

    Comments: accepted in IJCAI 2020 main track

  28. arXiv:2107.09047  [pdf, other]

    cs.LG cs.CV cs.RO

    Know Thyself: Transferable Visual Control Policies Through Robot-Awareness

    Authors: Edward S. Hu, Kun Huang, Oleh Rybkin, Dinesh Jayaraman

    Abstract: Training visual control policies from scratch on a new robot typically requires generating large amounts of robot-specific data. How might we leverage data previously collected on another robot to reduce or even completely remove this need for robot-specific data? We propose a "robot-aware control" paradigm that achieves this by exploiting readily available knowledge about the robot. We then insta…

    Submitted 17 October, 2022; v1 submitted 19 July, 2021; originally announced July 2021.

    Comments: Updated to ICLR22 version

  29. arXiv:2106.09685  [pdf, other]

    cs.CL cs.AI cs.LG

    LoRA: Low-Rank Adaptation of Large Language Models

    Authors: Edward J. Hu, Yelong Shen, Phillip Wallis, Zeyuan Allen-Zhu, Yuanzhi Li, Shean Wang, Lu Wang, Weizhu Chen

    Abstract: An important paradigm of natural language processing consists of large-scale pre-training on general domain data and adaptation to particular tasks or domains. As we pre-train larger models, full fine-tuning, which retrains all model parameters, becomes less feasible. Using GPT-3 175B as an example -- deploying independent instances of fine-tuned models, each with 175B parameters, is prohibitively…

    Submitted 16 October, 2021; v1 submitted 17 June, 2021; originally announced June 2021.

    Comments: Draft V2 includes better baselines, experiments on GLUE, and more on adapter latency

  30. arXiv:2103.09190  [pdf, other]

    cs.SE

    Using Grammar Patterns to Interpret Test Method Name Evolution

    Authors: Anthony Peruma, Emily Hu, Jiajun Chen, Eman Abdullah Alomar, Mohamed Wiem Mkaouer, Christian D. Newman

    Abstract: It is good practice to name test methods such that they are comprehensible to developers; they must be written in such a way that their purpose and functionality are clear to those who will maintain them. Unfortunately, there is little automated support for writing or maintaining the names of test methods. This can lead to inconsistent and low-quality test names and increase the maintenance cost o…

    Submitted 16 March, 2021; originally announced March 2021.

  31. arXiv:2011.14522  [pdf, other]

    cs.LG cond-mat.dis-nn cs.NE

    Feature Learning in Infinite-Width Neural Networks

    Authors: Greg Yang, Edward J. Hu

    Abstract: As its width tends to infinity, a deep neural network's behavior under gradient descent can become simplified and predictable (e.g. given by the Neural Tangent Kernel (NTK)), if it is parametrized appropriately (e.g. the NTK parametrization). However, we show that the standard and NTK parametrizations of a neural network do not admit infinite-width limits that can learn features, which is crucial…

    Submitted 15 July, 2022; v1 submitted 29 November, 2020; originally announced November 2020.

    Comments: 4th paper in the Tensor Programs series. Appearing in ICML 2021

  32. arXiv:2007.00320  [pdf, other]

    cs.CL

    Iterative Paraphrastic Augmentation with Discriminative Span Alignment

    Authors: Ryan Culkin, J. Edward Hu, Elias Stengel-Eskin, Guanghui Qin, Benjamin Van Durme

    Abstract: We introduce a novel paraphrastic augmentation strategy based on sentence-level lexically constrained paraphrasing and discriminative span alignment. Our approach allows for the large-scale expansion of existing resources, or the rapid creation of new resources from a small, manually-produced seed corpus. We illustrate our framework on the Berkeley FrameNet Project, a large-scale language understa…

    Submitted 1 July, 2020; originally announced July 2020.

  33. arXiv:2004.12478  [pdf, other]

    cs.LG cs.CR stat.ML

    Improved Image Wasserstein Attacks and Defenses

    Authors: Edward J. Hu, Adith Swaminathan, Hadi Salman, Greg Yang

    Abstract: Robustness against image perturbations bounded by an $\ell_p$ ball has been well studied in recent literature. Perturbations in the real world, however, rarely exhibit the pixel independence that $\ell_p$ threat models assume. A recently proposed Wasserstein distance-bounded threat model is a promising alternative that limits the perturbation to pixel mass movements. We point out and rectify flaws…

    Submitted 9 May, 2023; v1 submitted 26 April, 2020; originally announced April 2020.

    Comments: Best paper award at ICLR Trustworthy ML Workshop 2020

  34. arXiv:2003.10045  [pdf, other]

    cs.CV cs.LG eess.IV

    Architectural Resilience to Foreground-and-Background Adversarial Noise

    Authors: Carl Cheng, Evan Hu

    Abstract: Adversarial attacks in the form of imperceptible perturbations of normal images have been extensively studied, and for every new defense methodology created, multiple adversarial attacks are found to counteract it. In particular, a popular style of attack, exemplified in recent years by DeepFool and Carlini-Wagner, relies solely on white-box scenarios in which full access to the predictive model a…

    Submitted 7 June, 2020; v1 submitted 22 March, 2020; originally announced March 2020.

    Comments: 9 pages, 8 figures; updated email addresses

  35. arXiv:2002.08118  [pdf, other]

    cs.LG cs.CV cs.NE stat.ML

    Randomized Smoothing of All Shapes and Sizes

    Authors: Greg Yang, Tony Duan, J. Edward Hu, Hadi Salman, Ilya Razenshteyn, Jerry Li

    Abstract: Randomized smoothing is the current state-of-the-art defense with provable robustness against $\ell_2$ adversarial attacks. Many works have devised new randomized smoothing schemes for other metrics, such as $\ell_1$ or $\ell_\infty$; however, substantial effort was needed to derive such new guarantees. This begs the question: can we find a general theory for randomized smoothing? We propose a n…

    Submitted 23 July, 2020; v1 submitted 19 February, 2020; originally announced February 2020.

    Comments: 9 pages main text, 49 pages total

  36. arXiv:1912.07670  [pdf, other]

    cs.RO cs.AI cs.LG

    To Follow or not to Follow: Selective Imitation Learning from Observations

    Authors: Youngwoon Lee, Edward S. Hu, Zhengyu Yang, Joseph J. Lim

    Abstract: Learning from demonstrations is a useful way to transfer a skill from one agent to another. While most imitation learning methods aim to mimic an expert skill by following the demonstration step-by-step, imitating every step in the demonstration often becomes infeasible when the learner and its environment are different from the demonstration. In this paper, we propose a method that can imitate a…

    Submitted 16 December, 2019; originally announced December 2019.

    Comments: Published at the Conference on Robot Learning (CoRL) 2019

  37. arXiv:1911.07246  [pdf, other]

    cs.RO cs.AI cs.CV cs.LG

    IKEA Furniture Assembly Environment for Long-Horizon Complex Manipulation Tasks

    Authors: Youngwoon Lee, Edward S. Hu, Zhengyu Yang, Alex Yin, Joseph J. Lim

    Abstract: The IKEA Furniture Assembly Environment is one of the first benchmarks for testing and accelerating the automation of complex manipulation tasks. The environment is designed to advance reinforcement learning from simple toy tasks to complex tasks requiring both long-term planning and sophisticated low-level control. Our environment supports over 80 different furniture models, Sawyer and Baxter rob…

    Submitted 17 November, 2019; originally announced November 2019.

    Comments: Simulator

  38. arXiv:1905.04629  [pdf, other]

    cs.LG stat.ML

    Efficient Low-Rank Semidefinite Programming with Robust Loss Functions

    Authors: Quanming Yao, Hangsi Yang, En-Liang Hu, James Kwok

    Abstract: In real-world applications, it is important for machine learning algorithms to be robust against data outliers or corruptions. In this paper, we focus on improving the robustness of a large class of learning algorithms that are formulated as low-rank semi-definite programming (SDP) problems. Traditional formulations use square loss, which is notorious for being sensitive to outliers. We propose to…

    Submitted 2 June, 2021; v1 submitted 11 May, 2019; originally announced May 2019.

    Comments: Preprint version. Final version is accepted to "IEEE Transactions on Pattern Analysis and Machine Intelligence"

  39. arXiv:1901.03644  [pdf, other]

    cs.CL

    ParaBank: Monolingual Bitext Generation and Sentential Paraphrasing via Lexically-constrained Neural Machine Translation

    Authors: J. Edward Hu, Rachel Rudinger, Matt Post, Benjamin Van Durme

    Abstract: We present ParaBank, a large-scale English paraphrase dataset that surpasses prior work in both quantity and quality. Following the approach of ParaNMT, we train a Czech-English neural machine translation (NMT) system to generate novel paraphrases of English reference sentences. By adding lexical constraints to the NMT decoding procedure, however, we are able to produce multiple high-quality sente…

    Submitted 11 January, 2019; originally announced January 2019.

    Comments: To be presented at AAAI 2019. 8 pages

  40. arXiv:1811.01198  [pdf, other]

    cs.LG math.OC stat.ML

    A biconvex optimization for solving semidefinite programs via bilinear factorization

    Authors: En-Liang Hu

    Abstract: Many problems in machine learning can be reduced to learning a low-rank positive semidefinite matrix (denoted as $Z$), which leads to a semidefinite program (SDP). Existing SDP solvers based on classical convex optimization are expensive for large-scale problems. Exploiting the low rank of the solution, Burer-Monteiro's method reformulated SDP as a nonconvex problem via the $quadratic$ factorization (…

    Submitted 28 September, 2021; v1 submitted 3 November, 2018; originally announced November 2018.

  41. arXiv:1804.08207  [pdf, ps, other]

    cs.CL

    Collecting Diverse Natural Language Inference Problems for Sentence Representation Evaluation

    Authors: Adam Poliak, Aparajita Haldar, Rachel Rudinger, J. Edward Hu, Ellie Pavlick, Aaron Steven White, Benjamin Van Durme

    Abstract: We present a large-scale collection of diverse natural language inference (NLI) datasets that help provide insight into how well a sentence representation captures distinct types of reasoning. The collection results from recasting 13 existing datasets from 7 semantic phenomena into a common NLI structure, resulting in over half a million labeled context-hypothesis pairs in total. We refer to our c…

    Submitted 29 August, 2018; v1 submitted 22 April, 2018; originally announced April 2018.

    Comments: To be presented at EMNLP 2018. 15 pages