a2rl.utils.stationary_policy#
- a2rl.utils.stationary_policy(df, lag, mask=False)[source]#
Test for a stationary policy in the data H(action|state) based on their conditional entropies.
- Parameters:
df (
WiDataFrame
) – a discretized dataframe.lag (
int
) – int for the lag.
- Return type:
- Returns:
Returns the conditional entropy of action given various lags. It is masked if the information gain is better than random