Policy Gradient

ID: D3A-PG | Type: Technique | Ontology: d3f:PolicyGradient
Unpublished

Description

The objective of a Reinforcement Learning Policy Gradient agent is to maximize the “expected” reward when following a policy

Technical Details

Framework MITRE D3FEND
Ontology URI d3f:PolicyGradient
Local Identifier PolicyGradient
Publication Status Exists in ontology only

Relationships

Parent Tactics

Child Concepts