Zum Hauptinhalt springen

Showing 1–3 of 3 results for author: Del Verme, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2403.07718  [pdf, other

    cs.LG cs.AI

    WorkArena: How Capable Are Web Agents at Solving Common Knowledge Work Tasks?

    Authors: Alexandre Drouin, Maxime Gasse, Massimo Caccia, Issam H. Laradji, Manuel Del Verme, Tom Marty, Léo Boisvert, Megh Thakkar, Quentin Cappart, David Vazquez, Nicolas Chapados, Alexandre Lacoste

    Abstract: We study the use of large language model-based agents for interacting with software via web browsers. Unlike prior work, we focus on measuring the agents' ability to perform tasks that span the typical daily work of knowledge workers utilizing enterprise software systems. To this end, we propose WorkArena, a remote-hosted benchmark of 33 tasks based on the widely-used ServiceNow platform. We also… ▽ More

    Submitted 23 July, 2024; v1 submitted 12 March, 2024; originally announced March 2024.

    Comments: 21 pages, 11 figures, preprint

  2. arXiv:2110.08307  [pdf, other

    cs.LG cs.AI

    GrowSpace: Learning How to Shape Plants

    Authors: Yasmeen Hitti, Ionelia Buzatu, Manuel Del Verme, Mark Lefsrud, Florian Golemo, Audrey Durand

    Abstract: Plants are dynamic systems that are integral to our existence and survival. Plants face environment changes and adapt over time to their surrounding conditions. We argue that plant responses to an environmental stimulus are a good example of a real-world problem that can be approached within a reinforcement learning (RL)framework. With the objective of controlling a plant by moving the light sourc… ▽ More

    Submitted 15 October, 2021; originally announced October 2021.

  3. arXiv:2001.01620  [pdf, other

    cs.LG cs.AI stat.ML

    Optimal Options for Multi-Task Reinforcement Learning Under Time Constraints

    Authors: Manuel Del Verme, Bruno Castro da Silva, Gianluca Baldassarre

    Abstract: Reinforcement learning can greatly benefit from the use of options as a way of encoding recurring behaviours and to foster exploration. An important open problem is how can an agent autonomously learn useful options when solving particular distributions of related tasks. We investigate some of the conditions that influence optimality of options, in settings where agents have a limited time budget… ▽ More

    Submitted 6 January, 2020; originally announced January 2020.