________ is given a system of rewards and punishments.
Answer:
Reinforcement Learning (RL) is given a system of rewards and punishments. Reinforcement learning is broader than monitored or unsupervised learning to reach an objective or to just gain from incentives and penalties from environmental contact. Algorithms are learning to adapt to the environment, in all the other terms. TD learning appears to be close to how individuals learn in this sort of situation, but Q-learning others still have their benefits.
At the same time, a learning problem and a sub-field of machine learning have been applied to reinforcement learning. A learning problem requires learning to operate a programme to optimise some numerical value representing a long-term target.