Partial observability, memory, and recurrent networks
One major difference between the Hallway
and GridWorld
examples is their perception of state, or observation. We already know that the GridWorld
agent used visual observations, but we never really got into what state input the Hallway
agent used. As it turns out, the Hallway
agent collects observations of state in a different manner. It is important for us to understand the difference, so open up Unity and go through the following exercise:
- Make sure the
Hallway
example scene is loaded. Check back to the previous exercise if you need help. - Locate the
Agent
object in theHierarchy
window. You can use the search bar at the top of the window to find it quicker. - Find the
Hallway Agent
component/script in theInspector
window. - Click the target icon beside the component and select
Edit Script...
. This will open your previously set code editor. - Locate the
CollectObservation
method in the script shown in the following code:
public override void...