A Short Introduction To Q-learning - Hado van Hasselt

A Short Introduction To Some Reinforcement Learning Algorithms

By Hado van Hasselt

Q-learning

Previous -- Up -- Next

Q-learning is perhaps the most well known reinforcement learning algorithm. Its equation is:

Q_{t+1}(s_t,a_t) \overset{\alpha_t}{\longleftarrow} r_t + \gamma \max_a Q_t(s_{t+1},a)

Neutral characteristics

Advantages

Disadvantages

Algorithm

The Q-learning algorithm in schematic form:

Q-learning algorithm

Compare with the similar, but different algorithms: QV-learning, Sarsa and Expected-Sarsa.

Selected relevant publications:

Quick links:

Previous -- Up -- Next

Contact

My contact data can be found here.