-
Hearing the shape of an arena with spectral swarm robotics
Authors:
Leo Cazenille,
Nicolas Lobato-Dauzier,
Alessia Loi,
Mika Ito,
Olivier Marchal,
Nathanael Aubert-Kato,
Nicolas Bredeche,
Anthony J. Genot
Abstract:
Swarm robotics promises adaptability to unknown situations and robustness against failures. However, it still struggles with global tasks that require understanding the broader context in which the robots operate, such as identifying the shape of the arena in which the robots are embedded. Biological swarms, such as shoals of fish, flocks of birds, and colonies of insects, routinely solve global geometrical problems through the diffusion of local cues. This paradigm can be explicitly described by mathematical models that could be directly computed and exploited by a robotic swarm. Diffusion over a domain is mathematically encapsulated by the Laplacian, a linear operator that measures the local curvature of a function. Crucially, the geometry of a domain can generally be reconstructed from the eigenspectrum of its Laplacian. Here we introduce spectral swarm robotics, where robots diffuse information to their neighbors to emulate the Laplacian operator, enabling them to "hear" the spectrum of their arena. We reveal a universal scaling that links the optimal number of robots (a global parameter) with their optimal radius of interaction (a local parameter). We experimentally validate spectral swarm robotics under challenging conditions with the one-shot classification of arena shapes using a sparse swarm of Kilobots. Spectral methods can assist with challenging tasks where robots need to build an emergent consensus on their environment, such as adaptation to unknown terrains, division of labor, or quorum sensing. Spectral methods may extend beyond robotics to analyze and coordinate swarms of agents of various natures, such as traffic or crowds, and to better understand the long-range dynamics of natural systems emerging from short-range interactions.
Submitted 25 March, 2024;
originally announced March 2024.
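The link between a domain's shape and its Laplacian spectrum, described in the abstract above, can be illustrated with a small centralized sketch. This is not the authors' distributed algorithm: it assumes robots are points scattered uniformly at random, that two robots are neighbors when they lie within a fixed interaction radius, and that the spectrum is computed in one place with NumPy. The function name `graph_laplacian_spectrum` and all parameter values are hypothetical.

```python
import numpy as np

def graph_laplacian_spectrum(positions, radius, k=6):
    """Return the k smallest eigenvalues of the unnormalized graph Laplacian
    L = D - A of a neighborhood graph (assumed model, not the paper's)."""
    # Pairwise Euclidean distances between all robots.
    diffs = positions[:, None, :] - positions[None, :, :]
    dists = np.linalg.norm(diffs, axis=-1)
    # Two distinct robots are neighbors when closer than the radius.
    adjacency = (dists <= radius) & ~np.eye(len(positions), dtype=bool)
    adjacency = adjacency.astype(float)
    # Degree matrix minus adjacency matrix.
    laplacian = np.diag(adjacency.sum(axis=1)) - adjacency
    # eigvalsh returns eigenvalues of a symmetric matrix in ascending order.
    return np.linalg.eigvalsh(laplacian)[:k]

rng = np.random.default_rng(0)
# Hypothetical arenas of equal area: a unit square and a 4 x 0.25 strip.
square = rng.uniform(0.0, 1.0, size=(200, 2))
strip = rng.uniform(0.0, 1.0, size=(200, 2)) * np.array([4.0, 0.25])

spec_square = graph_laplacian_spectrum(square, radius=0.2)
spec_strip = graph_laplacian_spectrum(strip, radius=0.2)
print("square:", np.round(spec_square, 3))
print("strip: ", np.round(spec_strip, 3))
```

Comparing the two printed spectra gives a rough sense of how arena geometry can show up in the low eigenvalues; a swarm would have to estimate such quantities through local exchanges rather than a global eigendecomposition.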
-
Joint Intrinsic Motivation for Coordinated Exploration in Multi-Agent Deep Reinforcement Learning
Authors:
Maxime Toquebiau,
Nicolas Bredeche,
Faïz Benamar,
Jae-Yun Jun
Abstract:
Multi-agent deep reinforcement learning (MADRL) problems often encounter the challenge of sparse rewards. This challenge becomes even more pronounced when coordination among agents is necessary. As performance depends not only on one agent's behavior but rather on the joint behavior of multiple agents, finding an adequate solution becomes significantly harder. In this context, a group of agents can benefit from actively exploring different joint strategies in order to determine the most efficient one. In this paper, we propose an approach for rewarding strategies where agents collectively exhibit novel behaviors. We present JIM (Joint Intrinsic Motivation), a multi-agent intrinsic motivation method that follows the centralized learning with decentralized execution paradigm. JIM rewards joint trajectories based on a centralized measure of novelty designed to function in continuous environments. We demonstrate the strengths of this approach both in a synthetic environment designed to reveal shortcomings of state-of-the-art MADRL methods, and in simulated robotic tasks. Results show that joint exploration is crucial for solving tasks where the optimal strategy requires a high level of coordination.
Submitted 6 February, 2024;
originally announced February 2024.
-
Self-Aligning Active Agents with Inertia and Active Torque
Authors:
Jeremy Fersula,
Nicolas Bredeche,
Olivier Dauchot
Abstract:
We extend the study of the inertial effects on the dynamics of active agents to the case where self-alignment is present. In contrast with the most common models of active particles, we find that self-alignment, which couples the rotational dynamics to the translational one, produces unexpected and non-trivial dynamics, already at the deterministic level. Examining first the motion of a free particle, we contrast the role of inertia depending on the sign of the self-aligning torque. When positive, inertia does not alter the steady-state linear motion of an achiral self-propelled particle. On the contrary, for a negative self-aligning torque, inertia leads to the destabilization of the linear motion into a spontaneously broken chiral symmetry orbiting dynamics. When an active torque, or bias, is added to the angular dynamics, the bifurcation becomes imperfect in favor of the chiral orientation selected by the bias. In the case of a positive self-alignment, the interplay of the active torque and inertia leads to the emergence, out of a saddle-node bifurcation, of truly new solutions, which coexist with the simply biased linear motion. In the context of a free particle, the rotational inertia leaves unchanged the families of steady-state solutions but can modify their stability properties. The situation is radically different when considering the case of a collision with a wall, where a very singular oscillating dynamics takes place, which can only be captured if both translational and rotational inertia are present.
Submitted 31 January, 2024;
originally announced January 2024.
-
Is Limited Information Enough? An Approximate Multi-agent Coverage Control in Non-Convex Discrete Environments
Authors:
Tatsuya Iwase,
Aurélie Beynier,
Nicolas Bredeche,
Nicolas Maudet,
Jason R. Marden
Abstract:
Conventional distributed approaches to coverage control may suffer from lack of convergence and poor performance because agents have limited information, especially in non-convex discrete environments. To address this issue, we extend the approach of [Marden 2016], which demonstrates how a limited degree of inter-agent communication can be exploited to overcome such pitfalls in one-dimensional discrete environments. The focus of this paper is on extending such results to general dimensional settings. We show that the extension is convergent and keeps the approximation ratio of 2, meaning that any stable solution is guaranteed to have a performance within 50% of the optimal one. The experimental results show that our algorithm outperforms several state-of-the-art algorithms, and also that the runtime is scalable.
Submitted 7 April, 2024; v1 submitted 8 January, 2024;
originally announced January 2024.
-
Reactive Stepping for Humanoid Robots using Reinforcement Learning: Application to Standing Push Recovery on the Exoskeleton Atalante
Authors:
Alexis Duburcq,
Fabian Schramm,
Guilhem Boéris,
Nicolas Bredeche,
Yann Chevaleyre
Abstract:
State-of-the-art reinforcement learning is now able to learn versatile locomotion, balancing and push-recovery capabilities for bipedal robots in simulation. Yet, the reality gap has mostly been overlooked and the simulated results hardly transfer to real hardware. Either the transfer is unsuccessful in practice because the physics is over-simplified and hardware limitations are ignored, or regularity is not guaranteed, and unexpected hazardous motions can occur. This paper presents a reinforcement learning framework capable of learning robust standing push recovery for bipedal robots that transfers smoothly to reality, using only instantaneous proprioceptive observations. By combining original termination conditions and policy smoothness conditioning, we achieve stable learning, sim-to-real transfer and safety using a policy without memory or explicit history. Reward engineering is then used to give insights into how to keep balance. We demonstrate its performance in reality on the lower-limb medical exoskeleton Atalante.
Submitted 31 July, 2022; v1 submitted 2 March, 2022;
originally announced March 2022.
-
Morphological computation and decentralized learning in a swarm of sterically interacting robots
Authors:
Matan Yah Ben Zion,
Jeremy Fersula,
Nicolas Bredeche,
Olivier Dauchot
Abstract:
Whereas naturally occurring swarms thrive when crowded, physical interactions in robotic swarms are either avoided or carefully controlled, thus limiting their operational density. Here we present a mechanical design rule that allows robots to act in a collision-dominated environment. We introduce the Morphobots -- a robotic swarm platform developed to implement embodied computation through a morpho-functional design. By engineering a 3D-printed exoskeleton we encode a re-orientation response to an external body force (such as gravity) or a surface force (such as a collision). We show that the force-orientation response is generic, and can augment existing swarm-robotic platforms (e.g., Kilobots) as well as custom robots even 10 times larger. At the individual level, the exoskeleton improves motility and stability, and also makes it possible to encode two contrasting dynamical behaviors in response to an external force or a collision (including collision with a wall or a movable obstacle, and on a dynamically tilting plane). This force-orientation response adds a mechanical layer to the robot's sense-act cycle at the swarm level, leveraging steric interactions for collective phototaxis when crowded. Enabling collisions also promotes information flow, facilitating online distributed learning. Each robot runs an embedded algorithm that ultimately optimizes collective performance. We identify an effective parameter that controls the force-orientation response and explore its implications in swarms that transition from dilute to crowded. Experiments with both physical swarms (of up to 64 robots) and simulated swarms (of up to 8192 agents) show that the effect of morphological computation increases with growing swarm size.
Submitted 22 February, 2023; v1 submitted 12 November, 2021;
originally announced November 2021.
-
Meta-control of social learning strategies
Authors:
Anil Yaman,
Nicolas Bredeche,
Onur Çaylak,
Joel Z. Leibo,
Sang Wan Lee
Abstract:
Social learning, copying others' behavior without actual experience, offers a cost-effective means of knowledge acquisition. However, it raises the fundamental question of which individuals have reliable information: successful individuals versus the majority. The former and the latter are known respectively as success-based and conformist social learning strategies. We show here that while the success-based strategy fully exploits the benign environment of low uncertainty, it fails in uncertain environments. On the other hand, the conformist strategy can effectively mitigate this adverse effect. Based on these findings, we hypothesized that meta-control of individual and social learning strategies provides effective and sample-efficient learning in volatile and uncertain environments. Simulations on a set of environments with various levels of volatility and uncertainty confirmed our hypothesis. The results imply that meta-control of social learning affords agents the leverage to resolve environmental uncertainty with minimal exploration cost, by exploiting others' learning as an external knowledge base.
Submitted 7 March, 2022; v1 submitted 18 June, 2021;
originally announced June 2021.
-
Policy Search with Rare Significant Events: Choosing the Right Partner to Cooperate with
Authors:
Paul Ecoffet,
Nicolas Fontbonne,
Jean-Baptiste André,
Nicolas Bredeche
Abstract:
This paper focuses on a class of reinforcement learning problems where significant events are rare and limited to a single positive reward per episode. A typical example is that of an agent who has to choose a partner to cooperate with, while a large number of partners are simply not interested in cooperating, regardless of what the agent has to offer. We address this problem in a continuous state and action space with two different kinds of search methods: a gradient policy search method and a direct policy search method using an evolution strategy. We show that when significant events are rare, gradient information is also scarce, making it difficult for policy gradient search methods to find an optimal policy, with or without a deep neural architecture. On the other hand, we show that direct policy search methods are invariant to the rarity of significant events, which is yet another confirmation of the unique role evolutionary algorithms have to play as a reinforcement learning method.
Submitted 11 March, 2021;
originally announced March 2021.
-
DREAM Architecture: a Developmental Approach to Open-Ended Learning in Robotics
Authors:
Stephane Doncieux,
Nicolas Bredeche,
Léni Le Goff,
Benoît Girard,
Alexandre Coninx,
Olivier Sigaud,
Mehdi Khamassi,
Natalia Díaz-Rodríguez,
David Filliat,
Timothy Hospedales,
A. Eiben,
Richard Duro
Abstract:
Robots are still limited to controlled conditions that the robot designer knows in enough detail to endow the robot with the appropriate models or behaviors. Learning algorithms add some flexibility with the ability to discover the appropriate behavior given either some demonstrations or a reward to guide its exploration with a reinforcement learning algorithm. Reinforcement learning algorithms rely on the definition of state and action spaces that define reachable behaviors. Their adaptation capability critically depends on the representations of these spaces: small and discrete spaces result in fast learning while large and continuous spaces are challenging and either require a long training period or prevent the robot from converging to an appropriate behavior. Besides the operational cycle of policy execution and the learning cycle, which works at a slower time scale to acquire new policies, we introduce the redescription cycle, a third cycle working at an even slower time scale to generate or adapt the required representations to the robot, its environment and the task. We introduce the challenges raised by this cycle and we present DREAM (Deferred Restructuring of Experience in Autonomous Machines), a developmental cognitive architecture to bootstrap this redescription process stage by stage, build new state representations with appropriate motivations, and transfer the acquired knowledge across domains or tasks or even across robots. We describe results obtained so far with this approach and end with a discussion of the questions it raises in Neuroscience.
Submitted 13 May, 2020;
originally announced May 2020.
-
Online Trajectory Planning Through Combined Trajectory Optimization and Function Approximation: Application to the Exoskeleton Atalante
Authors:
Alexis Duburcq,
Yann Chevaleyre,
Nicolas Bredeche,
Guilhem Boéris
Abstract:
Autonomous robots require online trajectory planning capability to operate in the real world. Efficient offline trajectory planning methods already exist, but are computationally demanding, preventing their use online. In this paper, we present a novel algorithm called Guided Trajectory Learning that learns a function approximation of solutions computed through trajectory optimization while ensuring accurate and reliable predictions. This function approximation is then used online to generate trajectories. This algorithm is designed to be easy to implement, and practical since it does not require massive computing power. It is readily applicable to any robotic system and effortless to set up on real hardware since robust control strategies are usually already available. We demonstrate the computational performance of our algorithm on flat-foot walking with the self-balanced exoskeleton Atalante.
Submitted 4 March, 2020; v1 submitted 1 October, 2019;
originally announced October 2019.
-
Exploring Self-Assembling Behaviors in a Swarm of Bio-micro-robots using Surrogate-Assisted MAP-Elites
Authors:
Leo Cazenille,
Nicolas Bredeche,
Nathanael Aubert-Kato
Abstract:
Swarms of molecular robots are a promising approach to create specific shapes at the microscopic scale through self-assembly. However, controlling their behavior is a challenging problem as it involves complex non-linear dynamics and high experimental variability. Hand-crafting a molecular controller will often be time-consuming and give sub-optimal results. Optimization methods, like the bioNEAT algorithm, were previously employed to partially overcome these difficulties, but they still had to cope with deceptive high-dimensional search spaces and computationally expensive simulations. Here, we describe a novel approach to solve this problem by using MAP-Elites, an algorithm that searches for both high-performing and diverse solutions. We then apply it to a molecular robotic framework we recently introduced that allows sensing, signaling and self-assembly at the micro-scale and show that MAP-Elites outperforms previous approaches. Additionally, we propose a surrogate model of micro-robots physics and chemical reaction dynamics to reduce the computational costs of simulation. We show that the resulting methodology is capable of optimizing controllers with similar accuracy as when using only a full-fledged realistic model, with half the computational budget.
Submitted 1 October, 2019;
originally announced October 2019.
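The abstract above describes MAP-Elites as a search for solutions that are both high-performing and diverse. A minimal sketch of that idea on a toy problem is given below, assuming a regular grid over a two-dimensional behaviour descriptor and Gaussian mutation. The toy fitness and descriptor, the helper names (`to_cell`, `map_elites`, `toy_fitness`, `toy_descriptor`) and all parameter values are hypothetical; the surrogate model and the molecular-robot simulation from the paper are not represented.

```python
import random

def to_cell(descriptor_values, bins):
    """Map a descriptor in [0, 1]^d to a grid cell index (tuple of ints)."""
    return tuple(min(bins - 1, max(0, int(v * bins))) for v in descriptor_values)

def map_elites(fitness, descriptor, dim, bins=10, init=100,
               iterations=2000, sigma=0.1, seed=0):
    """Toy MAP-Elites: keep the best solution found in each descriptor cell."""
    rng = random.Random(seed)
    archive = {}  # cell -> (fitness value, solution)

    def try_insert(solution):
        cell = to_cell(descriptor(solution), bins)
        value = fitness(solution)
        # A newcomer replaces the current elite only if it is strictly better.
        if cell not in archive or value > archive[cell][0]:
            archive[cell] = (value, solution)

    # Bootstrap the archive with uniformly random solutions.
    for _ in range(init):
        try_insert([rng.random() for _ in range(dim)])
    # Main loop: pick a random elite, mutate it, and try to place the child.
    for _ in range(iterations):
        _, parent = rng.choice(list(archive.values()))
        child = [min(1.0, max(0.0, g + rng.gauss(0.0, sigma))) for g in parent]
        try_insert(child)
    return archive

def toy_fitness(x):
    # Hypothetical objective: stay close to the centre of the search space.
    return -sum((g - 0.5) ** 2 for g in x)

def toy_descriptor(x):
    # Hypothetical behaviour descriptor: the first two genes.
    return x[:2]

elites = map_elites(toy_fitness, toy_descriptor, dim=5)
best_value = max(value for value, _ in elites.values())
print(f"{len(elites)} cells filled, best fitness {best_value:.4f}")
```

The returned archive holds one elite per occupied cell, so it can be read both as an optimisation result (the best fitness overall) and as a map of which behaviours were reachable.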
-
Automatic Calibration of Artificial Neural Networks for Zebrafish Collective Behaviours using a Quality Diversity Algorithm
Authors:
Leo Cazenille,
Nicolas Bredeche,
José Halloy
Abstract:
During the last two decades, various models have been proposed for fish collective motion. These models are mainly developed to decipher the biological mechanisms of social interaction between animals. They consider very simple homogeneous unbounded environments and it is not clear that they can accurately simulate the collective trajectories. Moreover, when the models are more accurate, the question of their scalability to either larger groups or more elaborate environments remains open. This study deals with learning how to simulate realistic collective motion of a group of zebrafish, using real-world tracking data. The objective is to devise an agent-based model that can be implemented on an artificial robotic fish that can blend into a collective of real fish. We present a novel approach that uses Quality Diversity algorithms, a class of algorithms that emphasise exploration over pure optimisation. In particular, we use CVT-MAP-Elites, a variant of the state-of-the-art MAP-Elites algorithm for high dimensional search space. Results show that Quality Diversity algorithms not only outperform classic evolutionary reinforcement learning methods at the macroscopic level (i.e. group behaviour), but are also able to generate more realistic biomimetic behaviours at the microscopic level (i.e. individual behaviour).
Submitted 22 July, 2019;
originally announced July 2019.
-
Modelling zebrafish collective behaviours with multilayer perceptrons optimised by evolutionary algorithms
Authors:
Leo Cazenille,
Nicolas Bredeche,
José Halloy
Abstract:
Collective movements are pervasive behaviours among social organisms and have led to the development of many models. However, modelling animal trajectories and social interactions in simple bounded environments remains a challenge. Moreover, advances in the understanding of the sensory-motor loop and the information processing by animals are leading to revisions of the traditional assumptions made in decision-making algorithms. In this context, we develop a methodology based on artificial neural networks (ANN) to describe the collective motion of small zebrafish groups in a bounded environment. Although ANN models are commonly used in artificial systems, they are still under-explored for modelling animal collective behaviours. Here, we present a methodology to calibrate Multilayer Perceptrons by learning from real fish experimental data. The ANNs are trained using either supervised learning or various forms of evolutionary reinforcement learning methods (using the CMA-ES and NSGA-III algorithms). We reveal that ANN models trained using evolutionary methods are capable of generating realistic collective motions for groups of 5 zebrafish including the tank wall effects, a feature that is lacking in previous models. Finally, we also discuss the benefits of optimised ANNs as candidates for driving robotic lures with biologically realistic behaviour, a method that is becoming increasingly popular to gather data and validate assumptions on collective behaviours.
Submitted 26 November, 2018;
originally announced November 2018.
-
Evolutionary optimisation of neural network models for fish collective behaviours in mixed groups of robots and zebrafish
Authors:
Leo Cazenille,
Nicolas Bredeche,
José Halloy
Abstract:
Animal and robot social interactions are interesting both for ethological studies and robotics. On the one hand, the robots can be tools and models to analyse animal collective behaviours; on the other hand, the robots and their artificial intelligence are directly confronted and compared to the natural animal collective intelligence. The first step is to design robots and their behavioural controllers that are capable of socially interacting with animals. Designing such behavioural bio-mimetic controllers remains an important challenge as they have to reproduce the animal behaviours and have to be calibrated on experimental data. Most animal collective behavioural models are designed by modellers based on experimental data. This process is long and costly because it is difficult to identify the relevant behavioural features that are then used as a priori knowledge in model building. Here, we want to model the fish individual and collective behaviours in order to develop robot controllers. We explore the use of optimised black-box models based on artificial neural networks (ANN) to model fish behaviours. While the ANN may not be biomimetic but rather bio-inspired, they can be used to link perception to motor responses. These models are designed to be implementable as robot controllers to form mixed-groups of fish and robots, using little a priori knowledge of the fish behaviours. We present a methodology with multilayer perceptron or echo state networks that are optimised through evolutionary algorithms to accurately model the fish individual and collective behaviours in a bounded rectangular arena. We assess the biomimetism of the generated models and compare them to the fish experimental behaviours.
Submitted 9 August, 2018;
originally announced August 2018.
-
How to Blend a Robot within a Group of Zebrafish: Achieving Social Acceptance through Real-time Calibration of a Multi-level Behavioural Model
Authors:
Leo Cazenille,
Yohann Chemtob,
Frank Bonnet,
Alexey Gribovskiy,
Francesco Mondada,
Nicolas Bredeche,
Jose Halloy
Abstract:
We have previously shown how to socially integrate a fish robot into a group of zebrafish thanks to biomimetic behavioural models. The models have to be calibrated on experimental data to present correct behavioural features. This calibration is essential to enhance the social integration of the robot into the group. When calibrated, the behavioural model of fish behaviour is implemented to drive a robot with closed-loop control of social interactions into a group of zebrafish. This approach can be useful to form mixed-groups, and study animal individual and collective behaviour by using biomimetic autonomous robots capable of responding to the animals in long-standing experiments. Here, we show a methodology for continuous real-time calibration and refinement of a multi-level behavioural model. The real-time calibration, by an evolutionary algorithm, is based on simulation of the model to correspond to the observed fish behaviour in real-time. The calibrated model is updated on the robot and tested during the experiments. This method makes it possible to cope with changes of dynamics in fish behaviour. Moreover, each fish presents individual behavioural differences. Thus, each trial is done with naive fish groups that display behavioural variability. This real-time calibration methodology can optimise the robot behaviours during the experiments. Our implementation of this methodology runs on three different computers that perform individual tracking, data-analysis, multi-objective evolutionary algorithms, simulation of the fish robot and adaptation of the robot behavioural models, all in real-time.
Submitted 29 May, 2018;
originally announced May 2018.
-
Embodied Evolution in Collective Robotics: A Review
Authors:
Nicolas Bredeche,
Evert Haasdijk,
Abraham Prieto
Abstract:
This paper provides an overview of evolutionary robotics techniques applied to on-line distributed evolution for robot collectives -- namely, embodied evolution. It provides a definition of embodied evolution as well as a thorough description of the underlying concepts and mechanisms. The paper also presents a comprehensive summary of research published in the field since its inception (1999-2017), providing various perspectives to identify the major trends. In particular, we identify a shift from considering embodied evolution as a parallel search method within small robot collectives (fewer than 10 robots) to embodied evolution as an on-line distributed learning method for designing collective behaviours in swarm-like collectives. The paper concludes with a discussion of applications and open questions, providing a milestone for past and an inspiration for future research.
Submitted 9 February, 2018; v1 submitted 26 September, 2017;
originally announced September 2017.
-
Automated optimisation of multi-level models of collective behaviour in a mixed society of animals and robots
Authors:
Leo Cazenille,
Nicolas Bredeche,
José Halloy
Abstract:
Animal and robotic collective behaviours can exhibit complex dynamics that require multi-level descriptions. Here, we are interested in developing a multi-level modeling framework for the use of robots in studies of animal collective decision-making. In this context, robots can be useful for validating models in silico, inducing calibrated repetitive stimuli to trigger animal responses, or modulating and controlling animal collective behaviour. However, designing appropriate biomimetic robotic behaviour faces a major challenge: how to go from the collective decision dynamics observed with animals to an actual algorithmic implementation in robots. In previous work, this was mainly done by hand, often by taking inspiration from human-designed models. Typically, models of behaviour are either macroscopic (differential equations of the population dynamics) or microscopic (the explicit spatio-temporal state of each individual). Only microscopic models can easily be implemented as robot controllers. Here, we address the problem of automating the design of lower-level description models that can be implemented in robots and exhibit the same collective dynamics as a given higher-level model. We apply evolutionary algorithms to simultaneously optimise the parameters of models accounting for different levels of description. This methodology is applied to an experimentally validated shelter-selection problem solved by gregarious insects and robots. We successfully design and automatically calibrate both a microscopic and a hybrid model exhibiting the same dynamics as a macroscopic one. Our framework can be used for multi-level modeling of collective behaviour in animal or robot populations and bio-hybrid systems.
Submitted 2 February, 2016;
originally announced February 2016.
-
Automated Analysis of Behavioural Variability and Filial Imprinting of Chicks (G. gallus), using Autonomous Robots
Authors:
A. Gribovskiy,
F. Mondada,
J. L. Deneubourg,
L. Cazenille,
N. Bredeche,
J. Halloy
Abstract:
Inter-individual variability has various impacts on animal social behaviour. This implies that not only collective behaviours have to be studied but also the behavioural variability of each member composing the groups. To understand those effects on group behaviour, we develop a quantitative methodology based on automated ethograms and autonomous robots to study the inter-individual variability among social animals. We choose chicks of Gallus gallus domesticus as a classic social animal model system for their suitability for laboratory and controlled experimentation. Moreover, even domesticated chickens present social structures implying forms of leadership and filial imprinting. We develop an imprinting methodology on autonomous robots to study the individual and social behaviour of freely moving animals. This makes it possible to quantify the behaviours of large numbers of animals. We develop an automated experimental methodology that allows relatively fast controlled experiments and efficient data analysis. Our analyses are based on high-throughput data allowing a fine quantification of individual behavioural traits. We quantify the efficiency of various state-of-the-art algorithms to automate data analysis and produce automated ethograms. We show that the use of robots makes it possible to provide controlled and quantified stimuli to the animals in the absence of human intervention. We quantify the individual behaviour of 205 chicks obtained from hatching after synchronized fecundation. Our results show a high variability of individual behaviours and of imprinting quality and success. Three classes of chicks are observed with various levels of imprinting. Our study shows that the concomitant use of autonomous robots and automated ethograms allows detailed and quantitative analysis of behavioural patterns of animals in controlled laboratory experiments.
Submitted 7 September, 2015;
originally announced September 2015.
-
Roborobo! a Fast Robot Simulator for Swarm and Collective Robotics
Authors:
Nicolas Bredeche,
Jean-Marc Montanier,
Berend Weel,
Evert Haasdijk
Abstract:
Roborobo! is a multi-platform, highly portable robot simulator for large-scale collective robotics experiments. Roborobo! is coded in C++ and follows the KISS guideline ("Keep it simple"). Its only external dependency is therefore the widely available SDL library for fast 2D graphics. Roborobo! is based on a Khepera/ePuck model. It is targeted at fast single- and multi-robot simulation, and has already been used in more than a dozen published studies mainly concerned with evolutionary swarm robotics, including environment-driven self-adaptation and distributed evolutionary optimization, as well as online onboard embodied evolution and embodied morphogenesis.
Submitted 10 April, 2013;
originally announced April 2013.
-
Robust Multi-Cellular Developmental Design
Authors:
Alexandre Devert,
Nicolas Bredèche,
Marc Schoenauer
Abstract:
This paper introduces a continuous model for Multi-cellular Developmental Design. The cells are fixed on a 2D grid and exchange "chemicals" with their neighbors during the growth process. The quantity of chemicals that a cell produces, as well as the differentiation value of the cell in the phenotype, are controlled by a Neural Network (the genotype) that takes as inputs the chemicals produced by the neighboring cells at the previous time step. In the proposed model, the number of iterations of the growth process is not pre-determined, but emerges during evolution: only organisms for which the growth process stabilizes give a phenotype (the stable state); the others are declared nonviable. The controller is optimized using the NEAT algorithm, which optimizes both the topology and the weights of the Neural Networks. Though each cell only receives local information from its neighbors, the experimental results of the proposed approach on the 'flags' problems (the phenotype must match a given 2D pattern) are almost as good as those of a direct regression approach using the same model with global information. Moreover, the resulting multi-cellular organisms exhibit almost perfect self-healing characteristics.
Submitted 9 May, 2007;
originally announced May 2007.
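The growth process described in the Robust Multi-Cellular Developmental Design abstract above can be illustrated with a minimal sketch. It is not the authors' implementation: the names `make_controller` and `grow`, the four-cell neighbourhood, the zero-valued border, the tolerance and the step limit are all assumptions made for illustration, and the NEAT-evolved network is replaced by a single tanh layer with fixed random weights. Only the overall loop follows the abstract: each cell reads its neighbours' chemicals from the previous step, and an organism yields a phenotype only if the process stabilizes.

```python
import math
import random

# Assumed neighbourhood: the four adjacent grid cells (von Neumann).
NEIGHBOURS = ((-1, 0), (1, 0), (0, -1), (0, 1))


def make_controller(n_inputs, n_outputs, rng, scale=0.2):
    """Hypothetical stand-in for an evolved network: one tanh layer with
    fixed random weights (the last weight of each row acts as a bias)."""
    weights = [[rng.uniform(-scale, scale) for _ in range(n_inputs + 1)]
               for _ in range(n_outputs)]

    def controller(inputs):
        return [math.tanh(row[-1] + sum(w * x for w, x in zip(row, inputs)))
                for row in weights]

    return controller


def grow(controller, width, height, n_chem=1, max_steps=200, tol=1e-6):
    """Run the growth process on a width x height grid.

    The controller maps the neighbours' chemicals (previous step) to
    n_chem new chemical values plus one differentiation value.
    Returns (phenotype, steps) once the chemicals stop changing, or None
    if the organism has not stabilized after max_steps (non-viable).
    """
    chem = [[[0.0] * n_chem for _ in range(width)] for _ in range(height)]
    for step in range(1, max_steps + 1):
        new_chem = [[None] * width for _ in range(height)]
        phenotype = [[0.0] * width for _ in range(height)]
        delta = 0.0
        for y in range(height):
            for x in range(width):
                inputs = []
                for dy, dx in NEIGHBOURS:
                    ny, nx = y + dy, x + dx
                    if 0 <= ny < height and 0 <= nx < width:
                        inputs.extend(chem[ny][nx])
                    else:
                        inputs.extend([0.0] * n_chem)  # assumed empty border
                out = controller(inputs)
                new_chem[y][x] = out[:n_chem]
                phenotype[y][x] = out[n_chem]
                change = max(abs(a - b) for a, b in zip(out[:n_chem], chem[y][x]))
                delta = max(delta, change)
        chem = new_chem
        if delta < tol:
            return phenotype, step
    return None


if __name__ == "__main__":
    result = grow(make_controller(4, 2, random.Random(0)), width=5, height=5)
    print("non-viable" if result is None else f"stable after {result[1]} steps")
```

In an evolutionary loop, a `None` result would correspond to an organism declared nonviable, while a returned phenotype would be compared against the target 2D pattern. The small weight scale is chosen here only so that the random controller settles; an evolved controller carries no such guarantee.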