Print version: Lapan, Maxim. Deep reinforcement learning hands-on : apply modern RL methods, with deep Q-networks, value iteration, policy gradients, TRPO, AlphaGo Zero and more. Birmingham, England : Packt Publishing, c2018. 547 pages. ISBN 9781788834247.