Reward summary:
Environment rewards per step (trajectory): [1.0]
Environment reward total: 1.000
Decision rewards:
turn=0, ach_delta=0, unique_delta=0, achievements=[]
turn=1, ach_delta=0, unique_delta=0, achievements=[]
turn=2, ach_delta=0, unique_delta=0, achievements=[]
turn=3, ach_delta=0, unique_delta=0, achievements=[]
turn=4, ach_delta=0, unique_delta=0, achievements=[]
turn=5, ach_delta=0, unique_delta=0, achievements=[]
turn=6, ach_delta=0, unique_delta=0, achievements=[]
turn=7, ach_delta=0, unique_delta=0, achievements=[]
turn=8, ach_delta=0, unique_delta=0, achievements=[]
turn=9, ach_delta=0, unique_delta=0, achievements=[]
turn=10, ach_delta=0, unique_delta=0, achievements=[]
turn=11, ach_delta=0, unique_delta=0, achievements=[]
turn=12, ach_delta=0, unique_delta=0, achievements=[]
turn=13, ach_delta=0, unique_delta=0, achievements=[]
turn=14, ach_delta=0, unique_delta=0, achievements=[]
turn=15, ach_delta=0, unique_delta=0, achievements=[]
turn=16, ach_delta=0, unique_delta=0, achievements=[]
turn=17, ach_delta=0, unique_delta=0, achievements=[]
turn=18, ach_delta=0, unique_delta=0, achievements=[]
turn=19, ach_delta=0, unique_delta=0, achievements=[]
Outcome rewards (episode returns): [1.0]