Temporal Difference Learning is an unsupervised technique in which the learning agent learns to predict the expected value of a variable occurring at the end of a sequence of states. Reinforcement learning (RL) extends this technique by allowing the learned state-values to guide actions which subsequently change the environment state. Temporal Difference Learning algorithm is related to the temporal difference model of animal learning. It is an approach to learning how to predict a quantity that depends on future values of a given signal.
More Post
Latest Post
-
Cathodic Modification
-
Anodic Protection (AP)
-
New Maps Assist Decision-makers in Considering Albedo when Planting Trees
-
Experts Fear that Climate Change will Exacerbate the Spread of Infectious Diseases
-
Curbside Pickup enhances Organic Waste Composting and Decreases Methane Emissions
-
Key Concepts of Electromagnetic Induction