News

Reinforcement learning is how AlphaZero learned to become a chess master. DeepMind reported that during the program’s first nine hours of training, in December 2017, it played 44 million games against ...