Coding the GridWorld Example from DeepMind’s Reinforcement Learning Course in Python

Fig 3.2 [1]
Here is a description of the GridWorld example [1]
Fig 3.3 [1]
Formula 3.14 [1]

So now the value function of the current state , i.e. first row, first column is -0.50

References:

--

--

--

I build cool stuff… Sometimes weird too

Love podcasts or audiobooks? Learn on the go with our new app.

Recommended from Medium

Fisher Linear Discriminant Analysis(LDA)

LSTM vs GRU: Understanding the 2 major Neural Networks Ruling Character Wise Text Prediction

Step-By-Step Process of Implementing A Machine Learning Project

How to make a chess engine

Machine Learning(ML) — Data Preprocessing

Crack Data Science Interviews: Essential Machine Learning Concepts

Crack Data Science Interviews: Essential Machine Learning Concepts

Table Extraction using Deep Learning

Machine Learning Continuous Integration with MLflow

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store
Diganta Kalita

Diganta Kalita

I build cool stuff… Sometimes weird too

More from Medium

Machine Learning…

Accuracy is not accurate.

Linear regression from scratch: Math and Python implementation

Loss functions in Machine Learning