◾Intro

🔻References

◾Main

🔻Grid World example


🔸Components

🔸Agent's goal and policy

The agent's goal is to select an action for each state so as to maximize the total sum of rewards. This mapping from states to actions is the policy.
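
To make the idea of "one action per state" concrete, here is a minimal sketch of a policy as a plain state-to-action table, together with a helper that sums the rewards collected by following it. Everything specific in it is an assumption made for illustration and does not come from these notes: the 3x3 grid size, the four moves in `ACTIONS`, the goal cell, the reward values, and the names `step` and `total_reward`.

```python
from typing import Dict, Tuple

State = Tuple[int, int]  # (row, col)

# Assumed action set: four compass moves as (row delta, col delta).
ACTIONS: Dict[str, Tuple[int, int]] = {
    "up": (-1, 0),
    "down": (1, 0),
    "left": (0, -1),
    "right": (0, 1),
}

# Assumed environment parameters (hypothetical values).
N_ROWS, N_COLS = 3, 3
GOAL: State = (2, 2)
STEP_REWARD = -1.0   # reward for any move that does not reach the goal
GOAL_REWARD = 10.0   # reward for the move that enters the goal cell


def step(state: State, action: str) -> Tuple[State, float]:
    """Apply one deterministic move; moves into a wall leave the agent in place."""
    dr, dc = ACTIONS[action]
    row = min(max(state[0] + dr, 0), N_ROWS - 1)
    col = min(max(state[1] + dc, 0), N_COLS - 1)
    next_state = (row, col)
    reward = GOAL_REWARD if next_state == GOAL else STEP_REWARD
    return next_state, reward


def total_reward(policy: Dict[State, str], start: State, max_steps: int = 50) -> float:
    """Follow the policy from `start` and return the undiscounted sum of rewards."""
    state, total = start, 0.0
    for _ in range(max_steps):
        if state == GOAL:
            break
        state, reward = step(state, policy[state])
        total += reward
    return total


# A policy is one chosen action per state: go right until the last column, then down.
policy: Dict[State, str] = {
    (r, c): ("right" if c < N_COLS - 1 else "down")
    for r in range(N_ROWS)
    for c in range(N_COLS)
}

print(total_reward(policy, start=(0, 0)))
```

Under these assumed rewards, a better policy is simply one whose table yields a larger total from the same start state, which is what "maximize the total sum of rewards" is asking for.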

🔸Actions in grid world
