„policy gradient methods“
Suchergebnisse
11 Treffer
-
Control Randomisation Approach for Policy Gradient and Application to Reinforcement Learning in Optimal Switching
-
A Monte Carlo Policy Gradient Method with Local Search for Binary Optimization
-
Quantum Policy Gradient Algorithms
-
Parametric estimation of stochastic differential equations via online gradient descent
-
Softmax policy gradient methods can take exponential time to converge
-
Approximate Gradient Methods in Policy-Space Optimization of Markov Reward Processes
-
Robust gradient boosting for generalized additive models for location, scale and shape
-
Geographic patterns of seed dormancy strategies along latitudinal and climatic gradients, Japanese East Asian islands
-
Geometry and convergence of natural policy gradient methods
-
Diagnostic test for misspecification of a random-effect distribution using the gradient function
-
Gradient-projection and policy-iteration methods for solving optimization problems in STEOR-networks