Presentation + Paper
28 October 2022 Learning without gradients: multi-agent reinforcement learning approach to optimization
Amir Morcos, Hong Man, Aaron West, Brian Maguire
Author Affiliations +
Abstract
The field of Reinforcement Learning continues to show promise in solving old problems in new innovative ways. Thanks to the algorithms’ ability to learn without an explicit set of labeled training data, the action, environment, reward approach has lured many researches into framing old problems in this manner. Recent publications have demonstrated how utilizing a multi-agent reinforcement learning approach can lead to a superior policy for optimization algorithm over the current standards. The challenge with the aforementioned approaches is the inclusion of the gradient in the state-space. This forces a costly calculation that is often the bottle neck in most machine learning problems, often limiting or preventing training at the edge or on the front lines. While previous works dating back decades have demonstrated the ability to train simple machine learning models without the use of gradients, none have done so using a policy which leverages previous experiences to solve the problem more quickly. This work will show how a Multi-Agent Reinforcement Learning approach can be used to optimize models in training without the need for the gradient of the loss function, effectively eliminating the need for backpropagation and significantly reducing the computational power required to train a model. Furthermore, the work will examine conditions under which the agents failed to find an optimal solution. As well as how this approach can be beneficial in complex defense applications.
Conference Presentation
© (2022) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Amir Morcos, Hong Man, Aaron West, and Brian Maguire "Learning without gradients: multi-agent reinforcement learning approach to optimization", Proc. SPIE 12276, Artificial Intelligence and Machine Learning in Defense Applications IV, 1227606 (28 October 2022); https://doi.org/10.1117/12.2636231
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Machine learning

Data modeling

Neural networks

Performance modeling

Defense and security

Nomenclature

Optimization (mathematics)

Back to Top