a2rl.utils.data_generator_simple#
- a2rl.utils.data_generator_simple(markov_order=0, action_effect=False, policy=False, reward_function=False)[source]#
Generate different types of data for your testing.
- Parameters:
- Return type:
- Reference:
if markov_order=0 then the states are randomly generated if markov_order=1 then the next state is affected by the previous one only if markov_order>1 then the next state is affected by a mixture of the previous history. Keep this number less than 10.
if action_affect = True then the actions can affect the state as well by using a different transition function
if policy = True then there is a consistent rule choosing the action
if reward_function = True then the reward function is calculated on the states