Novel First Order Bayesian Optimization with an Application to Reinforcement Learning

J., Prabuchandran K. ; Penubothula, Santosh ; Kamanchi, Chandramouli ; Bhatnagar, Shalabh (2021) Novel First Order Bayesian Optimization with an Application to Reinforcement Learning Applied Intelligence, 51 (3). pp. 1565-1579. ISSN 0924-669X

Full text not available from this repository.

Official URL: http://doi.org/10.1007/s10489-020-01896-w

Related URL: http://dx.doi.org/10.1007/s10489-020-01896-w

Abstract

Zeroth Order Bayesian Optimization (ZOBO) methods optimize an unknown function based on its black-box evaluations at the query locations. Unlike most optimization procedures, ZOBO methods fail to utilize gradient information even when it is available. On the other hand, First Order Bayesian Optimization (FOBO) methods exploit the available gradient information to arrive at better solutions faster. However, the existing FOBO methods do not utilize a crucial information that the gradient is zero at the optima. Further, the inherent sequential nature of the FOBO methods incur high computational cost limiting their wide applicability. To alleviate the aforementioned difficulties of FOBO methods, we propose a relaxed statistical model to leverage the gradient information that directly searches for points where gradient vanishes. To accomplish this, we develop novel acquisition algorithms that search for global optima effectively. Unlike the existing FOBO methods, the proposed methods are parallelizable. Through extensive experimentation on standard test functions, we compare the performance of our methods over the existing methods. Furthermore, we explore an application of the proposed FOBO methods in the context of policy gradient reinforcement learning.

Item Type:Article
Source:Copyright of this article belongs to Springer Nature.
Keywords:Bayesian Optimization; Reinforcement Learning; First Order Methods; Policy Gradient.
ID Code:116422
Deposited On:12 Apr 2021 05:52
Last Modified:12 Apr 2021 05:52

Repository Staff Only: item control page