arlbench.core.algorithms.ppo.models¶
Models for the PPO algorithm.
Classes
|
A CNN-based Actor-Critic network for PPO. |
|
An MLP-based Actor-Critic network for PPO. |
- class arlbench.core.algorithms.ppo.models.CNNActorCritic(action_dim, activation='tanh', hidden_size=512, discrete=True, parent=<flax.linen.module._Sentinel object>, name=None)[source]¶
Bases:
Module
A CNN-based Actor-Critic network for PPO. Based on NatureCNN https://github.com/DLR-RM/stable-baselines3/blob/master/stable_baselines3/common/torch_layers.py#L48.