On the page
POMDP, you will find a
Python example of solving a Partially Observable
Markov Decision Process (POMDP). In this example, an agent is
dropped at an unknown location in a maze. The agent
has a map of the maze and can see its immediate surroundings. How
can the agent determine where it is? (Once it knows, reaching the exit is
much easier.)
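
To make the localization question concrete, here is a minimal sketch of one common approach: keep a belief (a probability for each free cell) and update it after every observation and every move. This is not the code from the linked page. The maze layout, the function names, and the assumption of noise-free sensing and motion are all hypothetical choices made for illustration.

```python
# Hypothetical maze for illustration: '#' is a wall, '.' is a free cell.
MAZE = [
    "#####",
    "#...#",
    "#.#.#",
    "#.#.#",
    "#####",
]

# Row/column offsets for the four moves, in the fixed order N, S, W, E.
MOVES = {"N": (-1, 0), "S": (1, 0), "W": (0, -1), "E": (0, 1)}


def is_free(r, c):
    return MAZE[r][c] == "."


def observe(cell):
    """What the agent sees: which of its N, S, W, E neighbours are walls."""
    r, c = cell
    return tuple(not is_free(r + dr, c + dc) for dr, dc in MOVES.values())


def step(cell, move):
    """Assumed noise-free motion: the agent stays put if it hits a wall."""
    r, c = cell
    dr, dc = MOVES[move]
    target = (r + dr, c + dc)
    return target if is_free(*target) else cell


def initial_belief():
    """Uniform belief: the agent could be in any free cell."""
    free = [(r, c) for r, row in enumerate(MAZE)
            for c, ch in enumerate(row) if ch == "."]
    return {cell: 1.0 / len(free) for cell in free}


def update_on_observation(belief, obs):
    """Keep only the cells whose surroundings match obs, then renormalize."""
    posterior = {cell: p for cell, p in belief.items() if observe(cell) == obs}
    total = sum(posterior.values())
    return {cell: p / total for cell, p in posterior.items()}


def update_on_move(belief, move):
    """Shift the probability mass of each cell to where the move leads."""
    new_belief = {}
    for cell, p in belief.items():
        nxt = step(cell, move)
        new_belief[nxt] = new_belief.get(nxt, 0.0) + p
    return new_belief


def localize(true_cell, moves):
    """Simulate the agent and return its belief after the given moves."""
    belief = update_on_observation(initial_belief(), observe(true_cell))
    for move in moves:
        true_cell = step(true_cell, move)
        belief = update_on_move(belief, move)
        belief = update_on_observation(belief, observe(true_cell))
    return belief


if __name__ == "__main__":
    print(localize((3, 1), []))          # two candidate cells remain
    print(localize((3, 1), ["N", "N"]))  # a single cell remains
```

In this maze the two dead ends look identical, so the first observation leaves two candidate cells. Walking north twice brings the agent to a corner whose surroundings are unique, and the belief collapses to a single cell. With noisy sensors or motion, the filtering step would weight cells by likelihood instead of discarding them.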