Discounting and Reward Learning in the Exercise Domain