-
Consensus Function from an $L_p^q-$norm Regularization Term for its Use as Adaptive Activation Functions in Neural Networks
Authors:
Juan Heredia-Juesas,
José Á. Martínez-Lorenzo
Abstract:
The design of a neural network is usually carried out by defining the number of layers, the number of neurons per layer, their connections or synapses, and the activation function that they will execute. The training process tries to optimize the weights assigned to those connections, together with the biases of the neurons, to better fit the training data. However, the activation functions are, in general, fixed in the design process and not modified during training, meaning that their behavior is unrelated to the training data set. In this paper, we propose the definition and use of an implicit, parametric, non-linear activation function that adapts its shape during the training process. This increases the number of parameters to optimize within the network, but it allows greater flexibility and generalizes the concept of neural networks. Furthermore, it simplifies the architectural design, since the same activation function definition can be employed in each neuron, letting the training process optimize their parameters and, thus, their behavior. Our proposed activation function comes from the definition of the consensus variable from the optimization of a linear underdetermined problem with an $L_p^q$ regularization term, via the Alternating Direction Method of Multipliers (ADMM). We refer to neural networks that use this type of activation function as $pq-$networks. Preliminary results show that these neural networks with adaptive activation functions reduce the error in regression and classification examples, compared to equivalent regular feedforward neural networks with fixed activation functions.
Submitted 30 June, 2022;
originally announced June 2022.
-
Preliminary Experimental Results of Context-Aware Teams of Multiple Autonomous Agents Operating under Constrained Communications
Authors:
Jose Martinez-Lorenzo,
Jeff Hudack,
Yutao Jing,
Michael Shaham,
Zixuan Liang,
Abdullah Al Bashit,
Yushu Wu,
Weite Zhang,
Matthew Skopin,
Juan Heredia-Juesas,
Yuntao Ma,
Tristan Sweeney,
Nicolas Ares,
Ari Fox
Abstract:
This work presents and experimentally tests the framework used by our context-aware, distributed team of small Unmanned Aerial Systems (SUAS) capable of operating in real time, in an autonomous fashion, and under constrained communications. Our framework relies on a three-layered approach: (1) operational layer, where fast temporal and narrow spatial decisions are made; (2) tactical layer, where temporal and spatial decisions are made for a team of agents; and (3) strategic layer, where slow temporal and wide spatial decisions are made for the team of agents. These three layers are coordinated by an ad-hoc, software-defined communications network, which ensures sparse but timely delivery of messages amongst groups and teams of agents at each layer, even under constrained communications. Experimental results are presented for a team of 10 small unmanned aerial systems tasked with searching for and monitoring a person in an open area. At the operational layer, our use case presents an agent autonomously performing searching, detection, localization, classification, identification, tracking, and following of the person, while avoiding malicious collisions. At the tactical layer, our experimental use case presents the cooperative interaction of a group of multiple agents that enables the monitoring of the targeted person over wider spatial and temporal regions. At the strategic layer, our use case involves the detection of complex behaviors (i.e., the person being followed enters a car and runs away, or the person being followed exits the car and runs away) that require strategic responses to successfully accomplish the mission.
Submitted 25 March, 2021;
originally announced March 2021.
-
Decentralized Reinforcement Learning for Multi-Target Search and Detection by a Team of Drones
Authors:
Roi Yehoshua,
Juan Heredia-Juesas,
Yushu Wu,
Christopher Amato,
Jose Martinez-Lorenzo
Abstract:
Target search and detection encompasses a variety of decision problems such as coverage, surveillance, search, observation, and pursuit-evasion, among others. In this paper, we develop a multi-agent deep reinforcement learning (MADRL) method to coordinate a group of aerial vehicles (drones) for the purpose of locating a set of static targets in an unknown area. To that end, we have designed a realistic drone simulator that replicates the dynamics and perturbations of a real experiment, including statistical inferences taken from experimental data for its modeling. Our reinforcement learning method, which used this simulator for training, was able to find near-optimal policies for the drones. In contrast to other state-of-the-art MADRL methods, our method is fully decentralized during both learning and execution, can handle high-dimensional and continuous observation spaces, and does not require tuning of additional hyperparameters.
Submitted 17 March, 2021;
originally announced March 2021.
-
Consensus and Sectioning-based ADMM with Norm-1 Regularization for Imaging with a Compressive Reflector Antenna
Authors:
Juan Heredia-Juesas,
Ali Molaei,
Luis Tirado,
Jose A. Martinez-Lorenzo
Abstract:
This paper presents three distributed techniques to find a sparse solution of the underdetermined linear problem $\textbf{g}=\textbf{Hu}$ with a norm-1 regularization, based on the Alternating Direction Method of Multipliers (ADMM). These techniques divide the matrix $\textbf{H}$ into submatrices by rows, columns, or both rows and columns, leading to the so-called consensus-based ADMM, sectioning-based ADMM, and consensus and sectioning-based ADMM, respectively. These techniques are applied in particular to millimeter-wave imaging through the use of a Compressive Reflector Antenna (CRA). The CRA is a hardware device designed to increase the sensing capacity of an imaging system and reduce the mutual information among measurements, allowing effective imaging of sparse targets with the use of Compressive Sensing (CS) techniques. Consensus-based ADMM has been shown to accelerate the imaging process, and sectioning-based ADMM has been shown to greatly reduce the amount of information to be exchanged among the computational nodes. In this paper, the mathematical formulation and graphical interpretation of these two techniques, together with the consensus and sectioning-based ADMM approach, are presented. The imaging quality, the imaging time, the convergence, and the communication efficiency among the nodes are analyzed and compared. The distributed capabilities of the ADMM-based approaches, together with the high sensing capacity of the CRA, allow the imaging of metallic targets in a 3D domain in quasi-real time with a reduced amount of information exchanged among the nodes.
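For orientation, the sketch below shows the standard single-node ADMM iteration for the norm-1 regularized problem named in the abstract. It is not the authors' distributed method: it does not split $\textbf{H}$ by rows or columns. It assumes the common objective $\tfrac{1}{2}\|\textbf{Hu}-\textbf{g}\|_2^2+\lambda\|\textbf{v}\|_1$ subject to $\textbf{u}=\textbf{v}$. The function names `soft_threshold` and `admm_l1`, the parameters `lam` and `rho`, and the random test problem are all illustrative assumptions.

```python
import numpy as np

def soft_threshold(x, kappa):
    # Proximal operator of kappa * ||x||_1 (element-wise shrinkage).
    return np.sign(x) * np.maximum(np.abs(x) - kappa, 0.0)

def admm_l1(H, g, lam=0.1, rho=1.0, n_iter=200):
    # Assumed objective: 0.5*||H u - g||^2 + lam*||v||_1  subject to  u = v.
    # lam and rho are hypothetical tuning values, not taken from the paper.
    n = H.shape[1]
    A = H.T @ H + rho * np.eye(n)   # system matrix of the u-update
    Htg = H.T @ g
    u = np.zeros(n)
    v = np.zeros(n)                 # sparse copy of u (the variable u must agree with)
    s = np.zeros(n)                 # scaled dual variable
    for _ in range(n_iter):
        u = np.linalg.solve(A, Htg + rho * (v - s))  # least-squares step
        v = soft_threshold(u + s, lam / rho)         # sparsity step
        s = s + u - v                                # dual update
    return v

# Hypothetical underdetermined test problem: 20 measurements, 50 unknowns.
rng = np.random.default_rng(0)
H = rng.standard_normal((20, 50))
u_true = np.zeros(50)
u_true[[3, 17, 41]] = [1.5, -2.0, 1.0]
g = H @ u_true
u_hat = admm_l1(H, g, lam=0.05, rho=1.0, n_iter=500)
```

In the row-wise (consensus) variant described in the abstract, each node would presumably run a u-update on its own block of rows of $\textbf{H}$ while sharing a common $\textbf{v}$; that splitting is omitted here for brevity.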
Submitted 13 November, 2018;
originally announced November 2018.