Search | arXiv e-print repository

Dimensionality Reduction of Dynamics on Lie Manifolds via Structure-Aware Canonical Correlation Analysis

Authors: Wooyoung Chung, Daniel Polani, Stas Tiomkin

Abstract: Incorporating prior knowledge into a data-driven modeling problem can drastically improve performance, reliability, and generalization outside of the training sample. The stronger the structural properties, the more effective these improvements become. Manifolds are a powerful nonlinear generalization of Euclidean space for modeling finite dimensions. Structural impositions in constrained systems… ▽ More Incorporating prior knowledge into a data-driven modeling problem can drastically improve performance, reliability, and generalization outside of the training sample. The stronger the structural properties, the more effective these improvements become. Manifolds are a powerful nonlinear generalization of Euclidean space for modeling finite dimensions. Structural impositions in constrained systems increase when applying group structure, converting them into Lie manifolds. The range of their applications is very wide and includes the important case of robotic tasks. Canonical Correlation Analysis (CCA) can construct a hierarchical sequence of maximal correlations of up to two paired data sets in these Euclidean spaces. We present a method to generalize this concept to Lie Manifolds and demonstrate its efficacy through the substantial improvements it achieves in making structure-consistent predictions about changes in the state of a robotic hand. △ Less

Submitted 16 November, 2023; originally announced November 2023.

arXiv:2310.16555 [pdf, ps, other]

Towards Information Theory-Based Discovery of Equivariances

Authors: Hippolyte Charvin, Nicola Catenacci Volpi, Daniel Polani

Abstract: The presence of symmetries imposes a stringent set of constraints on a system. This constrained structure allows intelligent agents interacting with such a system to drastically improve the efficiency of learning and generalization, through the internalisation of the system's symmetries into their information-processing. In parallel, principled models of complexity-constrained learning and behavio… ▽ More The presence of symmetries imposes a stringent set of constraints on a system. This constrained structure allows intelligent agents interacting with such a system to drastically improve the efficiency of learning and generalization, through the internalisation of the system's symmetries into their information-processing. In parallel, principled models of complexity-constrained learning and behaviour make increasing use of information-theoretic methods. Here, we wish to marry these two perspectives and understand whether and in which form the information-theoretic lens can "see" the effect of symmetries of a system. For this purpose, we propose a novel variant of the Information Bottleneck principle, which has served as a productive basis for many principled studies of learning and information-constrained adaptive behaviour. We show (in the discrete case and under a specific technical assumption) that our approach formalises a certain duality between symmetry and information parsimony: namely, channel equivariances can be characterised by the optimal mutual information-preserving joint compression of the channel's input and output. This information-theoretic treatment furthermore suggests a principled notion of "soft" equivariance, whose "coarseness" is measured by the amount of input-output mutual information preserved by the corresponding optimal compression. This new notion offers a bridge between the field of bounded rationality and the study of symmetries in neural representations. The framework may also allow (exact and soft) equivariances to be automatically discovered. △ Less

Submitted 29 May, 2024; v1 submitted 25 October, 2023; originally announced October 2023.

Comments: 23 pages, 0 figures

arXiv:2306.12345 [pdf, other]

The Effect of Noise on the Emergence of Continuous Norms and its Evolutionary Dynamics

Authors: Stavros Anagnou, Daniel Polani, Christoph Salge

Abstract: We examine the effect of noise on societies of agents using an agent-based model of evolutionary norm emergence. Generally, we see that noisy societies are more selfish, smaller and discontent, and are caught in rounds of perpetual punishment preventing them from flourishing. Surprisingly, despite the effect of noise on the population, it does not seem to evolve away. We carry out further analysis… ▽ More We examine the effect of noise on societies of agents using an agent-based model of evolutionary norm emergence. Generally, we see that noisy societies are more selfish, smaller and discontent, and are caught in rounds of perpetual punishment preventing them from flourishing. Surprisingly, despite the effect of noise on the population, it does not seem to evolve away. We carry out further analysis and provide reasons for why this may be the case. Furthermore, we claim that our framework that evolves the noise/ambiguity of norms may be a new way to model the tight/loose framework of norms, suggesting that despite ambiguous norms detrimental effect on society, evolution does not favour clarity. △ Less

Submitted 30 December, 2023; v1 submitted 21 June, 2023; originally announced June 2023.

Comments: Accepted for publication in the Proceedings of the Artificial Life Conference 2023 (ALIFE 2023), MIT Press

arXiv:2303.03881 [pdf]

Spatial, Social and Data Gaps in On-Demand Mobility Services: Towards a Supply-Oriented MaaS

Authors: Ronit Purian, Daniel Polani

Abstract: After a decade of on-demand mobility services that change spatial behaviors in metropolitan areas, the Shared Autonomous Vehicle (SAV) service is expected to increase traffic congestion and unequal access to transport services. A paradigm of scheduled supply that is aware of demand but not on-demand is proposed, introducing coordination and social and behavioral understanding, urban cognition and… ▽ More After a decade of on-demand mobility services that change spatial behaviors in metropolitan areas, the Shared Autonomous Vehicle (SAV) service is expected to increase traffic congestion and unequal access to transport services. A paradigm of scheduled supply that is aware of demand but not on-demand is proposed, introducing coordination and social and behavioral understanding, urban cognition and empowerment of agents, into a novel informational framework. Daily routines and other patterns of spatial behaviors outline a fundamental demand layer in a supply-oriented paradigm that captures urban dynamics and spatial-temporal behaviors, mostly in groups. Rather than real-time requests and instant responses that reward unplanned actions, and beyond just reservation of travels in timetables, the intention is to capture mobility flows in scheduled travels along the day considering time of day, places, passengers etc. Regulating goal-directed behaviors and caring for service resources and the overall system welfare is proposed to minimize uncertainty, considering the capacity of mobility interactions to hold value, i.e., Motility as a Service (MaaS). The principal-agent problem in the smart city is a problem of collective action among service providers and users that create expectations based on previous actions and reactions in mutual systems. Planned behavior that accounts for service coordination is expected to stabilize excessive rides and traffic load, and to induce a cognitive gain, thus balancing information load and facilitating cognitive effort. △ Less

Submitted 20 February, 2023; originally announced March 2023.

Comments: 30 pages, 1 figure, 3 tables. September 30, 2021

arXiv:2301.00005 [pdf, other]

Intrinsic Motivation in Dynamical Control Systems

Authors: Stas Tiomkin, Ilya Nemenman, Daniel Polani, Naftali Tishby

Abstract: Biological systems often choose actions without an explicit reward signal, a phenomenon known as intrinsic motivation. The computational principles underlying this behavior remain poorly understood. In this study, we investigate an information-theoretic approach to intrinsic motivation, based on maximizing an agent's empowerment (the mutual information between its past actions and future states).… ▽ More Biological systems often choose actions without an explicit reward signal, a phenomenon known as intrinsic motivation. The computational principles underlying this behavior remain poorly understood. In this study, we investigate an information-theoretic approach to intrinsic motivation, based on maximizing an agent's empowerment (the mutual information between its past actions and future states). We show that this approach generalizes previous attempts to formalize intrinsic motivation, and we provide a computationally efficient algorithm for computing the necessary quantities. We test our approach on several benchmark control problems, and we explain its success in guiding intrinsically motivated behaviors by relating our information-theoretic control function to fundamental properties of the dynamical system representing the combined agent-environment system. This opens the door for designing practical artificial, intrinsically motivated controllers and for linking animal behaviors to their dynamical properties. △ Less

Submitted 29 December, 2022; originally announced January 2023.

arXiv:2111.03699 [pdf, other]

A space of goals: the cognitive geometry of informationally bounded agents

Authors: Karen Archer, Nicola Catenacci Volpi, Franziska Bröker, Daniel Polani

Abstract: Traditionally, Euclidean geometry is treated by scientists as a priori and objective. However, when we take the position of an agent, the problem of selecting a best route should also factor in the abilities of the agent, its embodiment and particularly its cognitive effort. In this paper we consider geometry in terms of travel between states within a world by incorporating information processing… ▽ More Traditionally, Euclidean geometry is treated by scientists as a priori and objective. However, when we take the position of an agent, the problem of selecting a best route should also factor in the abilities of the agent, its embodiment and particularly its cognitive effort. In this paper we consider geometry in terms of travel between states within a world by incorporating information processing costs with the appropriate spatial distances. This induces a geometry that increasingly differs from the original geometry of the given world as information costs become increasingly important. We visualise this "cognitive geometry" by projecting it onto 2- and 3-dimensional spaces showing distinct distortions reflecting the emergence of epistemic and information-saving strategies as well as pivot states. The analogies between traditional cost-based geometries and those induced by additional informational costs invite a generalisation of the notion of geodesics as cheapest routes towards the notion of infodesics. In this perspective, the concept of infodesics is inspired by the property of geodesics that, travelling from a given start location to a given goal location along a geodesic, not only the goal, but all points along the way are visited at optimal cost from the start. △ Less

Submitted 2 November, 2022; v1 submitted 5 November, 2021; originally announced November 2021.

Comments: Includes supplementary material, 6 figures in the main document, 5 figures in the supplementary material. Replacing preprint with author accepted manuscript

arXiv:2008.12568 [pdf, other]

Causal blankets: Theory and algorithmic framework

Authors: Fernando E. Rosas, Pedro A. M. Mediano, Martin Biehl, Shamil Chandaria, Daniel Polani

Abstract: We introduce a novel framework to identify perception-action loops (PALOs) directly from data based on the principles of computational mechanics. Our approach is based on the notion of causal blanket, which captures sensory and active variables as dynamical sufficient statistics -- i.e. as the "differences that make a difference." Moreover, our theory provides a broadly applicable procedure to con… ▽ More We introduce a novel framework to identify perception-action loops (PALOs) directly from data based on the principles of computational mechanics. Our approach is based on the notion of causal blanket, which captures sensory and active variables as dynamical sufficient statistics -- i.e. as the "differences that make a difference." Moreover, our theory provides a broadly applicable procedure to construct PALOs that requires neither a steady-state nor Markovian dynamics. Using our theory, we show that every bipartite stochastic process has a causal blanket, but the extent to which this leads to an effective PALO formulation varies depending on the integrated information of the bipartition. △ Less

Submitted 29 September, 2020; v1 submitted 28 August, 2020; originally announced August 2020.

arXiv:2006.14796 [pdf, other]

AvE: Assistance via Empowerment

Authors: Yuqing Du, Stas Tiomkin, Emre Kiciman, Daniel Polani, Pieter Abbeel, Anca Dragan

Abstract: One difficulty in using artificial agents for human-assistive applications lies in the challenge of accurately assisting with a person's goal(s). Existing methods tend to rely on inferring the human's goal, which is challenging when there are many potential goals or when the set of candidate goals is difficult to identify. We propose a new paradigm for assistance by instead increasing the human's… ▽ More One difficulty in using artificial agents for human-assistive applications lies in the challenge of accurately assisting with a person's goal(s). Existing methods tend to rely on inferring the human's goal, which is challenging when there are many potential goals or when the set of candidate goals is difficult to identify. We propose a new paradigm for assistance by instead increasing the human's ability to control their environment, and formalize this approach by augmenting reinforcement learning with human empowerment. This task-agnostic objective preserves the person's autonomy and ability to achieve any eventual state. We test our approach against assistance based on goal inference, highlighting scenarios where our method overcomes failure modes stemming from goal ambiguity or misspecification. As existing methods for estimating empowerment in continuous domains are computationally hard, precluding its use in real time learned assistance, we also propose an efficient empowerment-inspired proxy metric. Using this, we are able to successfully demonstrate our method in a shared autonomy user study for a challenging simulated teleoperation task with human-in-the-loop training. △ Less

Submitted 7 January, 2021; v1 submitted 26 June, 2020; originally announced June 2020.

Comments: Final version from NeurIPS 2020 Conference Proceedings

arXiv:2002.05936 [pdf, other]

doi 10.1177/10597123211066153

Human Perception of Intrinsically Motivated Autonomy in Human-Robot Interaction

Authors: Marcus M. Scheunemann, Christoph Salge, Daniel Polani, Kerstin Dautenhahn

Abstract: A challenge in using robots in human-inhabited environments is to design behavior that is engaging, yet robust to the perturbations induced by human interaction. Our idea is to imbue the robot with intrinsic motivation (IM) so that it can handle new situations and appears as a genuine social other to humans and thus be of more interest to a human interaction partner. Human-robot interaction (HRI)… ▽ More A challenge in using robots in human-inhabited environments is to design behavior that is engaging, yet robust to the perturbations induced by human interaction. Our idea is to imbue the robot with intrinsic motivation (IM) so that it can handle new situations and appears as a genuine social other to humans and thus be of more interest to a human interaction partner. Human-robot interaction (HRI) experiments mainly focus on scripted or teleoperated robots, that mimic characteristics such as IM to control isolated behavior factors. This article presents a "robotologist" study design that allows comparing autonomously generated behaviors with each other, and, for the first time, evaluates the human perception of IM-based generated behavior in robots. We conducted a within-subjects user study (N=24) where participants interacted with a fully autonomous Sphero BB8 robot with different behavioral regimes: one realizing an adaptive, intrinsically motivated behavior and the other being reactive, but not adaptive. The robot and its behaviors are intentionally kept minimal to concentrate on the effect induced by IM. A quantitative analysis of post-interaction questionnaires showed a significantly higher perception of the dimension "Warmth" compared to the reactive baseline behavior. Warmth is considered a primary dimension for social attitude formation in human social cognition. A human perceived as warm (friendly, trustworthy) experiences more positive social interactions. △ Less

Submitted 29 November, 2021; v1 submitted 14 February, 2020; originally announced February 2020.

Comments: 32 pages, 3 tables, 3 figures; to be published in Adaptive Behavior (accepted 15-Nov-2021)

MSC Class: 68T40; 68T05 (Primary) 93C85; 94A15 (Secondary) ACM Class: I.2.9; I.2.6; G.3; J.m

arXiv:1910.05979 [pdf, other]

Information Decomposition based on Cooperative Game Theory

Authors: Nihat Ay, Daniel Polani, Nathaniel Virgo

Abstract: We offer a new approach to the information decomposition problem in information theory: given a 'target' random variable co-distributed with multiple 'source' variables, how can we decompose the mutual information into a sum of non-negative terms that quantify the contributions of each random variable, not only individually but also in combination? We derive our composition from cooperative game t… ▽ More We offer a new approach to the information decomposition problem in information theory: given a 'target' random variable co-distributed with multiple 'source' variables, how can we decompose the mutual information into a sum of non-negative terms that quantify the contributions of each random variable, not only individually but also in combination? We derive our composition from cooperative game theory. It can be seen as assigning a "fair share" of the mutual information to each combination of the source variables. Our decomposition is based on a different lattice from the usual 'partial information decomposition' (PID) approach, and as a consequence our decomposition has a smaller number of terms: it has analogs of the synergy and unique information terms, but lacks terms corresponding to redundancy. Because of this, it is able to obey equivalents of the axioms known as 'local positivity' and 'identity', which cannot be simultaneously satisfied by a PID measure. △ Less

Submitted 14 October, 2019; originally announced October 2019.

Comments: under review by Kybernetika journal

arXiv:1904.10066 [pdf, other]

Bold Hearts Team Description for RoboCup 2019 (Humanoid Kid Size League)

Authors: Marcus M. Scheunemann, Sander G. van Dijk, Rebecca Miko, Daniel Barry, George M. Evans, Alessandra Rossi, Daniel Polani

Abstract: We participated in the RoboCup 2018 competition in Montreal with our newly developed BoldBot based on the Darwin-OP and mostly self-printed custom parts. This paper is about the lessons learnt from that competition and further developments for the RoboCup 2019 competition. Firstly, we briefly introduce the team along with an overview of past achievements. We then present a simple, standalone 2D si… ▽ More We participated in the RoboCup 2018 competition in Montreal with our newly developed BoldBot based on the Darwin-OP and mostly self-printed custom parts. This paper is about the lessons learnt from that competition and further developments for the RoboCup 2019 competition. Firstly, we briefly introduce the team along with an overview of past achievements. We then present a simple, standalone 2D simulator we use for simplifying the entry for new members with making basic RoboCup concepts quickly accessible. We describe our approach for semantic-segmentation for our vision used in the 2018 competition, which replaced the lookup-table (LUT) implementation we had before. We also discuss the extra structural support we plan to add to the printed parts of the BoldBot and our transition to ROS 2 as our new middleware. Lastly, we will present a collection of open-source contributions of our team. △ Less

Submitted 22 April, 2019; originally announced April 2019.

Comments: Technical report

arXiv:1806.08083 [pdf, ps, other]

Expanding the Active Inference Landscape: More Intrinsic Motivations in the Perception-Action Loop

Authors: Martin Biehl, Christian Guckelsberger, Christoph Salge, Simón C. Smith, Daniel Polani

Abstract: Active inference is an ambitious theory that treats perception, inference and action selection of autonomous agents under the heading of a single principle. It suggests biologically plausible explanations for many cognitive phenomena, including consciousness. In active inference, action selection is driven by an objective function that evaluates possible future actions with respect to current, inf… ▽ More Active inference is an ambitious theory that treats perception, inference and action selection of autonomous agents under the heading of a single principle. It suggests biologically plausible explanations for many cognitive phenomena, including consciousness. In active inference, action selection is driven by an objective function that evaluates possible future actions with respect to current, inferred beliefs about the world. Active inference at its core is independent from extrinsic rewards, resulting in a high level of robustness across e.g.\ different environments or agent morphologies. In the literature, paradigms that share this independence have been summarised under the notion of intrinsic motivations. In general and in contrast to active inference, these models of motivation come without a commitment to particular inference and action selection mechanisms. In this article, we study if the inference and action selection machinery of active inference can also be used by alternatives to the originally included intrinsic motivation. The perception-action loop explicitly relates inference and action selection to the environment and agent memory, and is consequently used as foundation for our analysis. We reconstruct the active inference approach, locate the original formulation within, and show how alternative intrinsic motivations can be used while keeping many of the original features intact. Furthermore, we illustrate the connection to universal reinforcement learning by means of our formalism. Active inference research may profit from comparisons of the dynamics induced by alternative intrinsic motivations. Research on intrinsic motivations may profit from an additional way to implement intrinsically motivated agents that also share the biological plausibility of active inference. △ Less

Submitted 21 June, 2018; originally announced June 2018.

Comments: 53 pages, 6 figures, 2 tables

MSC Class: 62F15; 91B06 ACM Class: I.2.0; I.2.6; I.5.0; I.5.1

arXiv:1706.03576 [pdf, ps, other]

doi 10.7551/ecal_a_015

Action and perception for spatiotemporal patterns

Authors: Martin Biehl, Daniel Polani

Abstract: This is a contribution to the formalization of the concept of agents in multivariate Markov chains. Agents are commonly defined as entities that act, perceive, and are goal-directed. In a multivariate Markov chain (e.g. a cellular automaton) the transition matrix completely determines the dynamics. This seems to contradict the possibility of acting entities within such a system. Here we present de… ▽ More This is a contribution to the formalization of the concept of agents in multivariate Markov chains. Agents are commonly defined as entities that act, perceive, and are goal-directed. In a multivariate Markov chain (e.g. a cellular automaton) the transition matrix completely determines the dynamics. This seems to contradict the possibility of acting entities within such a system. Here we present definitions of actions and perceptions within multivariate Markov chains based on entity-sets. Entity-sets represent a largely independent choice of a set of spatiotemporal patterns that are considered as all the entities within the Markov chain. For example, the entity-set can be chosen according to operational closure conditions or complete specific integration. Importantly, the perception-action loop also induces an entity-set and is a multivariate Markov chain. We then show that our definition of actions leads to non-heteronomy and that of perceptions specialize to the usual concept of perception in the perception-action loop. △ Less

Submitted 12 June, 2017; originally announced June 2017.

Comments: 8 pages, 2 figures, accepted at the European Conference on Artificial Life 2017, Lyon, France

MSC Class: 92B20 ACM Class: G.3; H.1.1; I.2.11; I.5.m; J.3

Journal ref: Proceedings of The Fourteenth European Conference on Artificial Life (September 2017) p.68-75

arXiv:1701.04984 [pdf, other]

Control Capacity of Partially Observable Dynamic Systems in Continuous Time

Authors: Stas Tiomkin, Daniel Polani, Naftali Tishby

Abstract: Stochastic dynamic control systems relate in a prob- abilistic fashion the space of control signals to the space of corresponding future states. Consequently, stochastic dynamic systems can be interpreted as an information channel between the control space and the state space. In this work we study this control-to-state informartion capacity of stochastic dynamic systems in continuous-time, when t… ▽ More Stochastic dynamic control systems relate in a prob- abilistic fashion the space of control signals to the space of corresponding future states. Consequently, stochastic dynamic systems can be interpreted as an information channel between the control space and the state space. In this work we study this control-to-state informartion capacity of stochastic dynamic systems in continuous-time, when the states are observed only partially. The control-to-state capacity, known as empowerment, was shown in the past to be useful in solving various Artificial Intelligence & Control benchmarks, and was used to replace problem-specific utilities. The higher the value of empowerment is, the more optional future states an agent may reach by using its controls inside a given time horizon. The contribution of this work is that we derive an efficient solution for computing the control-to-state information capacity for a linear, partially-observed Gaussian dynamic control system in continuous time, and discover new relationships between control-theoretic and information-theoretic properties of dynamic systems. Particularly, using the derived method, we demonstrate that the capacity between the control signal and the system output does not grow without limits with the length of the control signal. This means that only the near-past window of the control signal contributes effectively to the control-to-state capacity, while most of the information beyond this window is irrelevant for the future state of the dynamic system. We show that empowerment depends on a time constant of a dynamic system. △ Less

Submitted 18 January, 2017; originally announced January 2017.

Comments: 11 [ages, 7 figures

MSC Class: 93C41;

arXiv:1605.05676 [pdf, other]

doi 10.7551/978-0-262-33936-0-ch115

Towards information based spatiotemporal patterns as a foundation for agent representation in dynamical systems

Authors: Martin Biehl, Takashi Ikegami, Daniel Polani

Abstract: We present some arguments why existing methods for representing agents fall short in applications crucial to artificial life. Using a thought experiment involving a fictitious dynamical systems model of the biosphere we argue that the metabolism, motility, and the concept of counterfactual variation should be compatible with any agent representation in dynamical systems. We then propose an informa… ▽ More We present some arguments why existing methods for representing agents fall short in applications crucial to artificial life. Using a thought experiment involving a fictitious dynamical systems model of the biosphere we argue that the metabolism, motility, and the concept of counterfactual variation should be compatible with any agent representation in dynamical systems. We then propose an information-theoretic notion of \emph{integrated spatiotemporal patterns} which we believe can serve as the basic building block of an agent definition. We argue that these patterns are capable of solving the problems mentioned before. We also test this in some preliminary experiments. △ Less

Submitted 18 May, 2016; originally announced May 2016.

Comments: 8 pages, 3 figures

MSC Class: 92B20 ACM Class: G.3; I.2.11; I.5.1; J.3

Journal ref: Proceedings of the Artificial Life Conference 2016

arXiv:1505.04142 [pdf, other]

An informational study of the evolution of codes and of emerging concepts in populations of agents

Authors: Andres C. Burgos, Daniel Polani

Abstract: We consider the problem of the evolution of a code within a structured population of agents. The agents try to maximise their information about their environment by acquiring information from the outputs of other agents in the population. A naive use of information-theoretic methods would assume that every agent knows how to "interpret" the information offered by other agents. However, this assume… ▽ More We consider the problem of the evolution of a code within a structured population of agents. The agents try to maximise their information about their environment by acquiring information from the outputs of other agents in the population. A naive use of information-theoretic methods would assume that every agent knows how to "interpret" the information offered by other agents. However, this assumes that one "knows" which other agents one observes, and thus which code they use. In our model, however, we wish to preclude that: it is not clear which other agents an agent is observing, and the resulting usable information is therefore influenced by the universality of the code used and by which agents an agent is "listening" to. We further investigate whether an agent who does not directly perceive the environment can distinguish states by observing other agents' outputs. For this purpose, we consider a population of different types of agents "talking" about different concepts, and try to extract new ones by considering their outputs only. △ Less

Submitted 15 May, 2015; originally announced May 2015.

Comments: Accepted in Artificial Life Journal

arXiv:1505.00956 [pdf, other]

Informational parasites in code evolution

Authors: Andres C. Burgos, Daniel Polani

Abstract: In a previous study, we considered an information-theoretic model of code evolution. In it, agents obtain information about their (common) environment by the perception of messages of other agents, which is determined by an interaction probability (the structure of the population). For an agent to understand another agent's messages, the former must either know the identity of the latter, or the c… ▽ More In a previous study, we considered an information-theoretic model of code evolution. In it, agents obtain information about their (common) environment by the perception of messages of other agents, which is determined by an interaction probability (the structure of the population). For an agent to understand another agent's messages, the former must either know the identity of the latter, or the code producing the messages must be universally interpretable. A universal code, however, introduces a vulnerability: a parasitic entity can take advantage of it. Here, we investigate this problem. In our specific setting, we consider a parasite to be an agent that tries to inflict as much damage as possible in the mutual understanding of the population (i.e. the parasite acts as a disinformation agent). We show that, after introducing a parasite in the population, the former adopts a code such that it captures the information about the environment that is missing in the population. Such agent would be of great value, but only if the rest of the population could understand its messages. However, it is of little use here, since the parasite utilises the most common messages in the population to express different concepts. Now we let the population respond by updating their codes such that, in this arms race, they again maximise their mutual understanding. As a result, there is a code drift in the population where the utilisation of the messages of the parasite is avoided. A consequence of this is that the information that the parasite possesses but the agents lack becomes understandable and readily available. △ Less

Submitted 2 June, 2015; v1 submitted 5 May, 2015; originally announced May 2015.

Comments: Accepted for the 13th European Conference on Artificial Life (ECAL 2015)

arXiv:1505.00950 [pdf, other]

Cooperation and antagonism in information exchange in a growth scenario with two species

Authors: Andres C. Burgos, Daniel Polani

Abstract: We consider a simple information-theoretic model of communication, in which two species of bacteria have the option of exchanging information about their environment, thereby improving their chances of survival. For this purpose, we model a system consisting of two species whose dynamics in the world are modelled by a bet-hedging strategy. It is well known that such models lend themselves to elega… ▽ More We consider a simple information-theoretic model of communication, in which two species of bacteria have the option of exchanging information about their environment, thereby improving their chances of survival. For this purpose, we model a system consisting of two species whose dynamics in the world are modelled by a bet-hedging strategy. It is well known that such models lend themselves to elegant information-theoretical interpretations by relating their respective long-term growth rate to the information the individual species has about its environment. We are specifically interested in modelling how this dynamics are affected when the species interact cooperatively or in an antagonistic way in a scenario with limited resources. For this purpose, we consider the exchange of environmental information between the two species in the framework of a game. Our results show that a transition from a cooperative to an antagonistic behaviour in a species results as a response to a change in the availability of resources. Species cooperate in abundance of resources, while they behave antagonistically in scarcity. △ Less

Submitted 3 April, 2016; v1 submitted 5 May, 2015; originally announced May 2015.

Comments: To appear in the Journal of Theoretical Biology

arXiv:1406.1767 [pdf, other]

doi 10.3390/e16052789

Changing the Environment Based on Empowerment as Intrinsic Motivation

Authors: Christoph Salge, Cornelius Glackin, Daniel Polani

Abstract: One aspect of intelligence is the ability to restructure your own environment so that the world you live in becomes more beneficial to you. In this paper we investigate how the information-theoretic measure of agent empowerment can provide a task-independent, intrinsic motivation to restructure the world. We show how changes in embodiment and in the environment change the resulting behaviour of th… ▽ More One aspect of intelligence is the ability to restructure your own environment so that the world you live in becomes more beneficial to you. In this paper we investigate how the information-theoretic measure of agent empowerment can provide a task-independent, intrinsic motivation to restructure the world. We show how changes in embodiment and in the environment change the resulting behaviour of the agent and the artefacts left in the world. For this purpose, we introduce an approximation of the established empowerment formalism based on sparse sampling, which is simpler and significantly faster to compute for deterministic dynamics. Sparse sampling also introduces a degree of randomness into the decision making process, which turns out to beneficial for some cases. We then utilize the measure to generate agent behaviour for different agent embodiments in a Minecraft-inspired three dimensional block world. The paradigmatic results demonstrate that empowerment can be used as a suitable generic intrinsic motivation to not only generate actions in given static environments, as shown in the past, but also to modify existing environmental conditions. In doing so, the emerging strategies to modify an agent's environment turn out to be meaningful to the specific agent capabilities, i.e., de facto to its embodiment. △ Less

Submitted 3 June, 2014; originally announced June 2014.

Comments: 31 pages, 8 figures, published in Entropy (http://www.mdpi.com/1099-4300/16/5/2789), much extended version of http://arxiv.org/abs/1310.3692

Journal ref: Entropy 16, no. 5: 2789-2819 (2014)

arXiv:1406.1502 [pdf, ps, other]

doi 10.7551/978-0-262-32621-6-ch154

Towards designing artificial universes for artificial agents under interaction closure

Authors: Martin Biehl, Christoph Salge, Daniel Polani

Abstract: We are interested in designing artificial universes for artifi- cial agents. We view artificial agents as networks of high- level processes on top of of a low-level detailed-description system. We require that the high-level processes have some intrinsic explanatory power and we introduce an extension of informational closure namely interaction closure to capture this. Then we derive a method to d… ▽ More We are interested in designing artificial universes for artifi- cial agents. We view artificial agents as networks of high- level processes on top of of a low-level detailed-description system. We require that the high-level processes have some intrinsic explanatory power and we introduce an extension of informational closure namely interaction closure to capture this. Then we derive a method to design artificial universes in the form of finite Markov chains which exhibit high-level pro- cesses that satisfy the property of interaction closure. We also investigate control or information transfer which we see as an building block for networks representing artificial agents. △ Less

Submitted 5 June, 2014; originally announced June 2014.

Comments: 8 pages, 3 figures; accepted for publication in ALIFE 14 proceedings

MSC Class: G.3; H.1.1; I.2.0; I.6.m; J.2; J.3

arXiv:1406.1034 [pdf, ps, other]

Don't Believe Everything You Hear; Preserving Relevant Information by Discarding Social Information

Authors: Christoph Salge, Daniel Polani

Abstract: Integrating information gained by observing others via Social Bayesian Learning can be beneficial for an agent's performance, but can also enable population wide information cascades that perpetuate false beliefs through the agent population. We show how agents can influence the observation network by changing their probability of observing others, and demonstrate the existence of a population-wid… ▽ More Integrating information gained by observing others via Social Bayesian Learning can be beneficial for an agent's performance, but can also enable population wide information cascades that perpetuate false beliefs through the agent population. We show how agents can influence the observation network by changing their probability of observing others, and demonstrate the existence of a population-wide equilibrium, where the advantages and disadvantages of the Social Bayesian update are balanced. We also use the formalism of relevant information to illustrate how negative information cascades are characterized by processing increasing amounts of non-relevant information. △ Less

Submitted 4 June, 2014; originally announced June 2014.

Comments: 8 pages, 4 figures, accepted for publication in Proceedings of Alife14

arXiv:1310.3692 [pdf, other]

Changing the Environment based on Intrinsic Motivation

Authors: Christoph Salge, Daniel Polani

Abstract: One of the remarkable feats of intelligent life is that it restructures the world it lives in for its own benefit. This extended abstract outlines how the information-theoretic principle of empowerment, as an intrinsic motivation, can be used to restructure the environment an agent lives in. We present a first qualitative evaluation of how an agent in a 3d-gridworld builds a staircase-like structu… ▽ More One of the remarkable feats of intelligent life is that it restructures the world it lives in for its own benefit. This extended abstract outlines how the information-theoretic principle of empowerment, as an intrinsic motivation, can be used to restructure the environment an agent lives in. We present a first qualitative evaluation of how an agent in a 3d-gridworld builds a staircase-like structure, which reflects the agent's embodiment. △ Less

Submitted 14 October, 2013; originally announced October 2013.

Comments: 3 page, 1 figure, extended abstract of work presented at the Workshop for "Guided Self-Organization" 2013 (http://prokopenko.net/gso6.html)

arXiv:1310.1863 [pdf, other]

Empowerment -- an Introduction

Authors: Christoph Salge, Cornelius Glackin, Daniel Polani

Abstract: This book chapter is an introduction to and an overview of the information-theoretic, task independent utility function "Empowerment", which is defined as the channel capacity between an agent's actions and an agent's sensors. It quantifies how much influence and control an agent has over the world it can perceive. This book chapter discusses the general idea behind empowerment as an intrinsic mot… ▽ More This book chapter is an introduction to and an overview of the information-theoretic, task independent utility function "Empowerment", which is defined as the channel capacity between an agent's actions and an agent's sensors. It quantifies how much influence and control an agent has over the world it can perceive. This book chapter discusses the general idea behind empowerment as an intrinsic motivation and showcases several previous applications of empowerment to demonstrate how empowerment can be applied to different sensor-motor configuration, and how the same formalism can lead to different observed behaviors. Furthermore, we also present a fast approximation for empowerment in the continuous domain. △ Less

Submitted 8 October, 2013; v1 submitted 7 October, 2013; originally announced October 2013.

Comments: 46 pages, 8 figures, to be published in Prokopenko, M., editor, Guided Self-Organization: Inception. Springer. In Press

arXiv:1207.2080 [pdf, ps, other]

doi 10.1103/PhysRevE.87.012130

A Bivariate Measure of Redundant Information

Authors: Malte Harder, Christoph Salge, Daniel Polani

Abstract: We define a measure of redundant information based on projections in the space of probability distributions. Redundant information between random variables is information that is shared between those variables. But in contrast to mutual information, redundant information denotes information that is shared about the outcome of a third variable. Formalizing this concept, and being able to measure it… ▽ More We define a measure of redundant information based on projections in the space of probability distributions. Redundant information between random variables is information that is shared between those variables. But in contrast to mutual information, redundant information denotes information that is shared about the outcome of a third variable. Formalizing this concept, and being able to measure it, is required for the non-negative decomposition of mutual information into redundant and synergistic information. Previous attempts to formalize redundant or synergistic information struggle to capture some desired properties. We introduce a new formalism for redundant information and prove that it satisfies all the properties necessary outlined in earlier work, as well as an additional criterion that we propose to be necessary to capture redundancy. We also demonstrate the behaviour of this new measure for several examples, compare it to previous measures and apply it to the decomposition of transfer entropy. △ Less

Submitted 20 July, 2012; v1 submitted 9 July, 2012; originally announced July 2012.

Comments: 16 pages, 15 figures, 1 table, added citation to Griffith et al 2012, Maurer et al 1999

MSC Class: 62B10

arXiv:1201.6626 [pdf, ps, other]

Learning RoboCup-Keepaway with Kernels

Authors: Tobias Jung, Daniel Polani

Abstract: We apply kernel-based methods to solve the difficult reinforcement learning problem of 3vs2 keepaway in RoboCup simulated soccer. Key challenges in keepaway are the high-dimensionality of the state space (rendering conventional discretization-based function approximation like tilecoding infeasible), the stochasticity due to noise and multiple learning agents needing to cooperate (meaning that the… ▽ More We apply kernel-based methods to solve the difficult reinforcement learning problem of 3vs2 keepaway in RoboCup simulated soccer. Key challenges in keepaway are the high-dimensionality of the state space (rendering conventional discretization-based function approximation like tilecoding infeasible), the stochasticity due to noise and multiple learning agents needing to cooperate (meaning that the exact dynamics of the environment are unknown) and real-time learning (meaning that an efficient online implementation is required). We employ the general framework of approximate policy iteration with least-squares-based policy evaluation. As underlying function approximator we consider the family of regularization networks with subset of regressors approximation. The core of our proposed solution is an efficient recursive implementation with automatic supervised selection of relevant basis functions. Simulation results indicate that the behavior learned through our approach clearly outperforms the best results obtained earlier with tilecoding by Stone et al. (2005). △ Less

Submitted 31 January, 2012; originally announced January 2012.

Journal ref: JMLR Workshop and Conference Proceedings (1st Gaussian Processes in Practice Workshop, 2006)

arXiv:1201.6583 [pdf, ps, other]

Empowerment for Continuous Agent-Environment Systems

Authors: Tobias Jung, Daniel Polani, Peter Stone

Abstract: This paper develops generalizations of empowerment to continuous states. Empowerment is a recently introduced information-theoretic quantity motivated by hypotheses about the efficiency of the sensorimotor loop in biological organisms, but also from considerations stemming from curiosity-driven learning. Empowemerment measures, for agent-environment systems with stochastic transitions, how much in… ▽ More This paper develops generalizations of empowerment to continuous states. Empowerment is a recently introduced information-theoretic quantity motivated by hypotheses about the efficiency of the sensorimotor loop in biological organisms, but also from considerations stemming from curiosity-driven learning. Empowemerment measures, for agent-environment systems with stochastic transitions, how much influence an agent has on its environment, but only that influence that can be sensed by the agent sensors. It is an information-theoretic generalization of joint controllability (influence on environment) and observability (measurement by sensors) of the environment by the agent, both controllability and observability being usually defined in control theory as the dimensionality of the control/observation spaces. Earlier work has shown that empowerment has various interesting and relevant properties, e.g., it allows us to identify salient states using only the dynamics, and it can act as intrinsic reward without requiring an external reward. However, in this previous work empowerment was limited to the case of small-scale and discrete domains and furthermore state transition probabilities were assumed to be known. The goal of this paper is to extend empowerment to the significantly more important and relevant case of continuous vector-valued state spaces and initially unknown state transition probabilities. The continuous state space is addressed by Monte-Carlo approximation; the unknown transitions are addressed by model learning and prediction for which we apply Gaussian processes regression with iterated forecasting. In a number of well-known continuous control tasks we examine the dynamics induced by empowerment and include an application to exploration and online model learning. △ Less

Submitted 31 January, 2012; originally announced January 2012.

Journal ref: Adaptive Behavior 19(1),2011

Showing 1–26 of 26 results for author: Polani, D