Actor-Critic

ID: D3A-AC | Type: Technique | Ontology: d3f:Actor-Critic
Unpublished

Description

Actor-Critic is a temporal-difference (TD) variant of the policy gradient method. It uses two networks: an actor and a critic. The actor decides which action to take, and the critic tells the actor how good that action was and how the policy should be adjusted. The actor learns with the policy gradient approach, while the critic evaluates the actions produced by the actor by estimating the value function.
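The interaction between the two components can be illustrated with a minimal sketch. It is not taken from D3FEND and makes several simplifying assumptions: the actor and critic are small lookup tables rather than neural networks, the environment is a hypothetical five-state corridor with a reward of 1 at the right end, and the names `train`, `step`, and `softmax` as well as all hyperparameter values are chosen only for illustration.

```python
import math
import random


def softmax(prefs):
    """Turn action preferences into a probability distribution."""
    m = max(prefs)
    exps = [math.exp(p - m) for p in prefs]
    total = sum(exps)
    return [e / total for e in exps]


def step(state, action, n_states):
    """Hypothetical corridor: action 1 moves right, action 0 moves left.
    Reaching the rightmost state ends the episode with reward 1."""
    nxt = min(state + 1, n_states - 1) if action == 1 else max(state - 1, 0)
    done = nxt == n_states - 1
    return nxt, (1.0 if done else 0.0), done


def train(episodes=500, n_states=5, gamma=0.95,
          actor_lr=0.1, critic_lr=0.2, max_steps=200, seed=0):
    rng = random.Random(seed)
    # Actor: one preference per (state, action); assumed tabular for brevity.
    theta = [[0.0, 0.0] for _ in range(n_states)]
    # Critic: estimated state value V(s).
    value = [0.0] * n_states

    for _ in range(episodes):
        state = 0
        for _ in range(max_steps):
            # Actor samples an action from its current policy.
            probs = softmax(theta[state])
            action = 0 if rng.random() < probs[0] else 1
            nxt, reward, done = step(state, action, n_states)

            # Critic: one-step TD error, its judgement of the action taken.
            target = reward + (0.0 if done else gamma * value[nxt])
            td_error = target - value[state]
            value[state] += critic_lr * td_error

            # Actor: policy-gradient step scaled by the TD error.
            # For a softmax policy, d log pi(a|s) / d theta[s][b]
            # equals (1 if b == a else 0) - pi(b|s).
            for b in range(2):
                grad_log = (1.0 if b == action else 0.0) - probs[b]
                theta[state][b] += actor_lr * td_error * grad_log

            state = nxt
            if done:
                break
    return theta, value
```

Calling `theta, value = train()` returns the learned action preferences and state values; `softmax(theta[s])` gives the policy for state `s`. A positive TD error makes the chosen action more likely and a negative one makes it less likely, which is the feedback role the description assigns to the critic.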

Technical Details

Framework: MITRE D3FEND
Ontology URI: d3f:Actor-Critic
Local Identifier: Actor-Critic
Publication Status: Exists in ontology only

Relationships

Parent Tactics