Abstract
Reinforcement Learning is one of the hottest issues in current AI research fields. It's a effective method in solving some machine learning problems. It's high efficiency, simpler programming, easier understanding, and better performance. Here I will share my understanding. If there are something wrong, thanks for correct. In reinforcement learning, the learner is a decision-making agent that takes actions in an environment and receives reward (or penalty) for its actions in trying to solve a problem. After a set of trial-and-error runs, it should learn the best policy, which is the sequence of actions that maximize the total reward.