| A part of Machine Learning where an agent learns to behave optimally, by performing actions that will either be rewarded or punished. Question Posted on 18 Jun 2020 |