Utilities
Utility functions.
Test whether the action affects the state in the data, based on the conditional entropy H(state|prev_action).
Test whether the action affects the reward in the data, based on the conditional entropy H(reward|prev_action).
Utility to perform backtesting on a simulator.
Test whether the information gain of the input series given a conditioning series, based on H(Y|X), is better than random.
The conditional entropy H(Y|X) of the input series given a conditioning series.
Generate different types of data for your testing.
The entropy of the input series.
Raise
Calculate the information gain of the input series given a conditioning series, based on H(Y|X).
Test for the Markov property in the data, based on the conditional entropy H(state|prev_state, prev_action).
Test for the key MDP properties based on their conditional entropies.
Test for the key MDP properties based on their conditional entropies.
Plot the results of the MDP checks on a dataframe.
Test for a reward function in the data, based on the conditional entropy H(r|state,action).
Minimal implementation that fixes the random seeds in Python's random module, NumPy, and PyTorch.
Test for a stationary policy in the data, based on the conditional entropy H(action|state).
Concatenate all columns into one.
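
The entropy-based quantities named above can be illustrated with a minimal standard-library sketch. It is not the library's implementation: the function names `shannon_entropy`, `cond_entropy`, and `info_gain` are hypothetical, the inputs are assumed to be equal-length sequences of discrete (hashable) values, and information gain is taken to be the textbook quantity H(Y) - H(Y|X).

```python
import math
from collections import Counter


def shannon_entropy(ys):
    """Empirical Shannon entropy H(Y), in bits, of a discrete sequence."""
    n = len(ys)
    return -sum((c / n) * math.log2(c / n) for c in Counter(ys).values())


def cond_entropy(ys, xs):
    """Empirical conditional entropy H(Y|X) = H(X, Y) - H(X)."""
    joint = list(zip(xs, ys))  # paired samples estimate the joint distribution
    return shannon_entropy(joint) - shannon_entropy(xs)


def info_gain(ys, xs):
    """Textbook information gain H(Y) - H(Y|X) (an assumption of this sketch)."""
    return shannon_entropy(ys) - cond_entropy(ys, xs)


# Hypothetical toy data: the state is fully determined by the previous action.
prev_action = [0, 1, 0, 1, 0, 1, 0, 1]
state = [0, 1, 0, 1, 0, 1, 0, 1]
determined_gain = info_gain(state, prev_action)

# Hypothetical toy data: the reward is independent of the previous action.
action = [0, 0, 1, 1]
reward = [0, 1, 0, 1]
independent_gain = info_gain(reward, action)
```

In the first case knowing the previous action removes all uncertainty about the state, so H(state|prev_action) is zero and the gain equals H(state). In the second case the conditioning series carries no information and the gain is zero, which is the kind of baseline a "better than random" test would compare against.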