Description
python: A list is provided in the cell below. The contents of the list are intended to represent the rewards earned by a reinforcement learning agent during an episode. Calculate the return earned by the agent, assuming a discount factor of 0.99. Store the return in a variable named G. Print the value of G rounded to 4 decimal places. When writing your solution, please store the discount factor in a variable named gamma and use this variable in your calculations rather than hard-coding in the value 0.99 in your calculations. This is not strictly necessary to accomplish this task, but it is a good programming practice.rewards = [12, 7, -6, 3, 6, -10, -5, -14]Within your function, you should use a loop to calculate the sum of pairs of corresponding elements one at a time. Each time you calculate a new sum, you should append it to the list that you will ultimately return
Diana Sands –
great work as usual thanks
Rajasekhar Patibandla –
Thanks a lots, I really appreciate your effort it is an excellent report.
Connie Cannon –
I have received phenomenal service that is consistent. The quality of work is A+ and the prices are very reasonable.
Kirsten Gorby –
It was amazing work and in a very short time.
Jonathan Cooper –
The assignment was excellent and I have scored 100 percent. I am very proud and happy with you.