a2rl.utils.action_effective#

a2rl.utils.action_effective(df, lag, mask=False)[source]#

Test for the effect of the action on the state in the data H(state|prev_action) based on their conditional entropies.

Parameters:
  • df (WiDataFrame) – a discretized dataframe.

  • lag (int) – int for the lag.

Return type:

float

Returns:

Returns the conditional entropy of future states given various lags. It is masked if the information gain is better than random