Ez greedy
mighty.mighty_exploration.ez_greedy
Temporally extended epsilon-greedy (ez-greedy) exploration.
EZGreedy
Bases: EpsilonGreedy
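Epsilon-greedy exploration with temporally extended exploratory actions (ez-greedy): the number of steps to skip is drawn from a Zipf distribution.
:param algo: algorithm name
:param model: model
:param epsilon: exploration epsilon
:param zipf_param: parametrizes the Zipf distribution for skipping
:return: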
Source code in mighty/mighty_exploration/ez_greedy.py
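The interaction between epsilon and zipf_param can be illustrated with a minimal, standalone sketch. This is not Mighty's implementation: the class EZGreedySketch, the helper sample_zipf_truncated, and the max_len cutoff are hypothetical names introduced here, and the Zipf distribution is truncated to a finite support so that only the standard library is needed.

```python
import random


def sample_zipf_truncated(rng, zipf_param, max_len=100):
    """Draw n in 1..max_len with probability proportional to n ** -zipf_param.

    Assumption: truncating at max_len is an illustrative simplification.
    """
    support = list(range(1, max_len + 1))
    weights = [n ** -zipf_param for n in support]
    return rng.choices(support, weights=weights, k=1)[0]


class EZGreedySketch:
    """Hypothetical illustration of ez-greedy; not Mighty's EZGreedy class."""

    def __init__(self, n_actions, epsilon=0.1, zipf_param=2.0, seed=0):
        self.n_actions = n_actions
        self.epsilon = epsilon
        self.zipf_param = zipf_param
        self.rng = random.Random(seed)
        self.steps_left = 0      # remaining repeats of the held random action
        self.held_action = None  # exploratory action currently being repeated

    def act(self, q_values):
        # Keep repeating a previously sampled exploratory action.
        if self.steps_left > 0:
            self.steps_left -= 1
            return self.held_action
        # With probability epsilon, pick a random action and draw how many
        # steps to hold it from the (truncated) Zipf distribution.
        if self.rng.random() < self.epsilon:
            self.held_action = self.rng.randrange(self.n_actions)
            duration = sample_zipf_truncated(self.rng, self.zipf_param)
            self.steps_left = duration - 1
            return self.held_action
        # Otherwise act greedily with respect to the value estimates.
        return max(range(len(q_values)), key=q_values.__getitem__)
```

In this sketch a larger zipf_param concentrates the duration on 1, which makes the behaviour approach plain epsilon-greedy, while a smaller value produces longer runs of the same exploratory action.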
__call__
Get action.
:param s: state
:param return_logp: return logprobs
:param metrics: current metric dict
:param eval: eval mode
:return: action or (action, logprobs)
Source code in mighty/mighty_exploration/mighty_exploration_policy.py
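The documented return convention (an action alone, or an (action, logprobs) pair when return_logp is set) might look roughly like the following for a plain epsilon-greedy policy. This is a hedged sketch, not Mighty's code: select_action and eval_mode are hypothetical names, and it assumes that eval mode disables exploration and that the log-probability is taken under the epsilon-greedy action distribution.

```python
import math
import random


def select_action(q_values, epsilon, return_logp=False, eval_mode=False, rng=random):
    """Hypothetical helper mirroring the documented calling convention."""
    n = len(q_values)
    greedy = max(range(n), key=q_values.__getitem__)
    # Assumption: eval mode turns exploration off entirely.
    eps = 0.0 if eval_mode else epsilon
    action = rng.randrange(n) if rng.random() < eps else greedy
    if not return_logp:
        return action
    # Probability of the chosen action under the epsilon-greedy policy:
    # eps / n for every action, plus (1 - eps) for the greedy one.
    prob = eps / n + (1.0 - eps) * (action == greedy)
    return action, math.log(prob)
```

For example, select_action([0.1, 0.9, 0.3], epsilon=0.5, eval_mode=True) returns the greedy index, and adding return_logp=True returns that index together with its log-probability.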
explore
Explore.
:param s: state
:param return_logp: return logprobs
:param _: not used
:return: action or (action, logprobs)