a2rl.utils.stationary_policy#
- a2rl.utils.stationary_policy(df, lag, mask=False)[source]#
- Test for a stationary policy in the data H(action|state) based on their conditional entropies. - Parameters:
- df ( - WiDataFrame) – a discretized dataframe.
- lag ( - int) – int for the lag.
 
- Return type:
- Returns:
- Returns the conditional entropy of action given various lags. It is masked if the information gain is better than random