Temporal Difference Learning

Description

Temporal difference (TD) learning refers to a class of model-free reinforcement learning methods which learn by bootstrapping from the current estimate of the value function. These methods sample from the environment, like Monte Carlo methods, and perform updates based on current estimates, like dynamic programming methods

Technical Details

Framework	MITRE D3FEND
Ontology URI	d3f:TemporalDifferenceLearning
Local Identifier	TemporalDifferenceLearning
Publication Status	Exists in ontology only

Relationships

Parent Tactics

D3A-MFRL Model-free Reinforcement Learning
Model Model

Child Concepts

D3A-AC Actor-Critic

Related Techniques

D3A-SAR SARSA (Unpublished)
D3A-PG Policy Gradient (Unpublished)
D3A-QL Q-Learning (Unpublished)

D3FEND