Reinforcement Learning Python example