Showing 1–11 of 11 results for author: Acero, F

Searching in archive cs.
  1. arXiv:2407.04467

    cs.AI cs.CL cs.GT

    Are Large Language Models Strategic Decision Makers? A Study of Performance and Bias in Two-Player Non-Zero-Sum Games

    Authors: Nathan Herr, Fernando Acero, Roberta Raileanu, María Pérez-Ortiz, Zhibin Li

    Abstract: Large Language Models (LLMs) have been increasingly used in real-world settings, yet their strategic abilities remain largely unexplored. Game theory provides a good framework for assessing the decision-making abilities of LLMs in interactions with other agents. Although prior studies have shown that LLMs can solve these tasks with carefully curated prompts, they fail when the problem setting or p…

    Submitted 16 July, 2024; v1 submitted 5 July, 2024; originally announced July 2024.

    Comments: 8 pages (19 with appendix), 6 figures in the main body (4 in the appendix), 4 tables in the main body

  2. arXiv:2404.19664

    cs.RO cs.LG

    Towards Generalist Robot Learning from Internet Video: A Survey

    Authors: Robert McCarthy, Daniel C. H. Tan, Dominik Schmidt, Fernando Acero, Nathan Herr, Yilun Du, Thomas G. Thuruthel, Zhibin Li

    Abstract: This survey presents an overview of methods for learning from video (LfV) in the context of reinforcement learning (RL) and robotics. We focus on methods capable of scaling to large internet video datasets and, in the process, extracting foundational knowledge about the world's dynamics and physical human behaviour. Such methods hold great promise for developing general-purpose robots. We open w…

    Submitted 7 June, 2024; v1 submitted 30 April, 2024; originally announced April 2024.

    Comments: Updated formatting. Reduced paper length and made other minor improvements

  3. arXiv:2403.16667

    cs.AI

    Deep Reinforcement Learning and Mean-Variance Strategies for Responsible Portfolio Optimization

    Authors: Fernando Acero, Parisa Zehtabi, Nicolas Marchesotti, Michael Cashmore, Daniele Magazzeni, Manuela Veloso

    Abstract: Portfolio optimization involves determining the optimal allocation of portfolio assets in order to maximize a given investment objective. Traditionally, some form of mean-variance optimization is used with the aim of maximizing returns while minimizing risk; more recently, however, deep reinforcement learning formulations have been explored. Increasingly, investors have demonstrated an interest in…

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: Presented at the AAAI 2024 Workshop on AI in Finance for Social Impact

  4. arXiv:2403.14328

    cs.RO cs.AI cs.LG

    Distilling Reinforcement Learning Policies for Interpretable Robot Locomotion: Gradient Boosting Machines and Symbolic Regression

    Authors: Fernando Acero, Zhibin Li

    Abstract: Recent advancements in reinforcement learning (RL) have led to remarkable achievements in robot locomotion capabilities. However, the complexity and "black-box" nature of neural network-based RL policies hinder their interpretability and broader acceptance, particularly in applications demanding high levels of safety and reliability. This paper introduces a novel approach to distill neural RL po…

    Submitted 21 March, 2024; originally announced March 2024.

  5. Modular Neural Network Policies for Learning In-Flight Object Catching with a Robot Hand-Arm System

    Authors: Wenbin Hu, Fernando Acero, Eleftherios Triantafyllidis, Zhaocheng Liu, Zhibin Li

    Abstract: We present a modular framework designed to enable a robot hand-arm system to learn how to catch flying objects, a task that requires fast, reactive, and accurately-timed robot motions. Our framework consists of five core modules: (i) an object state estimator that learns object trajectory prediction, (ii) a catching pose quality network that learns to score and rank object poses for catching, (iii…

    Submitted 21 December, 2023; originally announced December 2023.

    Comments: 8 pages. Accepted and presented at IEEE IROS 2023

  6. arXiv:2307.08816

    cs.LG cs.AI math.OC

    Accelerating Cutting-Plane Algorithms via Reinforcement Learning Surrogates

    Authors: Kyle Mana, Fernando Acero, Stephen Mak, Parisa Zehtabi, Michael Cashmore, Daniele Magazzeni, Manuela Veloso

    Abstract: Discrete optimization belongs to the set of $\mathcal{NP}$-hard problems, spanning fields such as mixed-integer programming and combinatorial optimization. A current standard approach to solving convex discrete optimization problems is the use of cutting-plane algorithms, which reach optimal solutions by iteratively adding inequalities known as cuts to refine a feasible set. Despite the e…

    Submitted 27 February, 2024; v1 submitted 17 July, 2023; originally announced July 2023.

    Comments: Extended version (includes Supplementary Material). Accepted at AAAI 24 Main Track with Oral Presentation

  7. arXiv:2307.00125

    cs.RO cs.LG

    RObotic MAnipulation Network (ROMAN) – Hybrid Hierarchical Learning for Solving Complex Sequential Tasks

    Authors: Eleftherios Triantafyllidis, Fernando Acero, Zhaocheng Liu, Zhibin Li

    Abstract: Solving long sequential tasks poses a significant challenge in embodied artificial intelligence. Enabling a robotic system to perform diverse sequential tasks with a broad range of manipulation skills is an active area of research. In this work, we present a Hybrid Hierarchical Learning framework, the Robotic Manipulation Network (ROMAN), to address the challenge of solving multiple complex tasks…

    Submitted 7 July, 2023; v1 submitted 30 June, 2023; originally announced July 2023.

    Comments: To appear in Nature Machine Intelligence. Includes the main and supplementary manuscript. Total of 70 pages, with a total of 9 Figures and 17 Tables

  8. arXiv:2306.04026

    cs.LG cs.AI cs.RO

    Value Functions are Control Barrier Functions: Verification of Safe Policies using Control Theory

    Authors: Daniel C. H. Tan, Fernando Acero, Robert McCarthy, Dimitrios Kanoulas, Zhibin Li

    Abstract: Guaranteeing safe behaviour of reinforcement learning (RL) policies poses significant challenges for safety-critical applications, despite RL's generality and scalability. To address this, we propose a new approach to apply verification methods from control theory to learned value functions. By analyzing task structures for safety preservation, we formalize original theorems that establish links b…

    Submitted 5 December, 2023; v1 submitted 6 June, 2023; originally announced June 2023.

  9. Semi-Blind Source Separation with Learned Constraints

    Authors: Rémi Carloni Gertosio, Jérôme Bobin, Fabio Acero

    Abstract: Blind source separation (BSS) algorithms are unsupervised methods, which are the cornerstone of hyperspectral data analysis by allowing for physically meaningful data decompositions. BSS problems being ill-posed, the resolution requires efficient regularization schemes to better distinguish between the sources and yield interpretable solutions. For that purpose, we investigate a semi-supervised so…

    Submitted 27 September, 2022; originally announced September 2022.

    Journal ref: Signal Processing, Volume 202, January 2023, 108776

  10. arXiv:2109.14026

    cs.RO cs.CV cs.LG

    Learning Perceptual Locomotion on Uneven Terrains using Sparse Visual Observations

    Authors: Fernando Acero, Kai Yuan, Zhibin Li

    Abstract: To proactively navigate and traverse various terrains, active use of visual perception becomes indispensable. We aim to investigate the feasibility and performance of using sparse visual observations to achieve perceptual locomotion over a range of common terrains (steps, ramps, gaps, and stairs) in human-centered environments. We formulate a selection of sparse visual inputs suitable for locomoti…

    Submitted 26 May, 2022; v1 submitted 28 September, 2021; originally announced September 2021.

    Comments: Video summary can be found at https://youtu.be/vtp43jYQ5w4

  11. arXiv:2109.04322

    cs.RO

    Learning Vision-Guided Dynamic Locomotion Over Challenging Terrains

    Authors: Zhaocheng Liu, Fernando Acero, Zhibin Li

    Abstract: Legged robots have become increasingly powerful and popular in recent years for their potential to bring the mobility of autonomous agents to the next level. This work presents a deep reinforcement learning approach that learns a robust Lidar-based perceptual locomotion policy in a partially observable environment using Proximal Policy Optimisation. Visual perception is critical to actively overc…

    Submitted 9 September, 2021; originally announced September 2021.

    Comments: 9 pages, 27 figures, 1 table