a2rl.utils.action_reward

a2rl.utils.action_reward(df, lag, mask=False)

Tests the effect of the action on the reward in the data by estimating the conditional entropy H(reward|prev_action).

Parameters:

df (WiDataFrame) – a discretized dataframe.
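lag (int) – the lag to apply between the previous action and the reward.
mask (bool) – whether to mask the result (see Returns). Defaults to False.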

Return type:

Returns:

The conditional entropy of the future reward given the lagged action. The result is masked if the information gain is better than random.

See also

Reference: https://en.wikipedia.org/wiki/Entropy_(information_theory)
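
Example

To illustrate the quantity H(reward|prev_action), the following is a minimal standard-library sketch, not the a2rl implementation. The helpers `conditional_entropy` and `entropy` and the toy `actions` and `rewards` sequences are hypothetical names introduced for this illustration, and the sketch assumes the lag is counted in rows and entropies are measured in bits.

```python
import math
from collections import Counter


def entropy(values):
    """Empirical Shannon entropy H(X) in bits (hypothetical helper)."""
    n = len(values)
    return sum((c / n) * math.log2(n / c) for c in Counter(values).values())


def conditional_entropy(rewards, actions, lag=1):
    """Empirical H(reward_t | action_{t-lag}) in bits (hypothetical helper)."""
    if lag < 1 or lag >= len(rewards):
        raise ValueError("lag must be between 1 and len(rewards) - 1")
    # Pair each reward with the action taken `lag` rows earlier.
    pairs = list(zip(actions[:-lag], rewards[lag:]))
    n = len(pairs)
    joint = Counter(pairs)
    marginal = Counter(a for a, _ in pairs)
    # H(R|A) = sum over (a, r) of p(a, r) * log2(p(a) / p(a, r))
    return sum((c / n) * math.log2(marginal[a] / c) for (a, _), c in joint.items())


# Toy discretized data: each reward equals the action taken one row earlier.
actions = [0, 1, 0, 1, 1, 0, 0, 1]
rewards = [0, 0, 1, 0, 1, 1, 0, 0]

h_reward = entropy(rewards[1:])                        # uncertainty about the reward alone
h_cond = conditional_entropy(rewards, actions, lag=1)  # uncertainty left once the action is known
info_gain = h_reward - h_cond                          # how much the action explains the reward
```

In this toy data the previous action fully determines the reward, so the conditional entropy is zero and the information gain equals the entropy of the reward. A conditional entropy close to the unconditional entropy would instead suggest that the action carries little information about the reward at that lag.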

© Copyright 2022, AWS ProServe.
