MARC View

000			a
999			_c33374 _d33374
008			241114b xxu\|\|\|\|\| \|\|\|\| 00\| 0 eng d
020			_a9789811906374
082			_a006.31 _bPLA
100			_aPlaat, Aske
245			_aDeep reinforcement learning
260			_bSpringer, _c2022 _aSingapore :
300			_axiv, 406 p. ; _bill., _c25 cm.
365			_b3240.00 _c₹ _d01
504			_aIncludes bibliographical references and index.
520			_aDeep reinforcement learning has attracted considerable attention recently. Impressive results have been achieved in such diverse fields as autonomous driving, game playing, molecular recombination, and robotics. In all these fields, computer programs have taught themselves to understand problems that were previously considered to be very difficult. In the game of Go, the program AlphaGo has even learned to outmatch three of the worlds leading players.Deep reinforcement learning takes its inspiration from the fields of biology and psychology. Biology has inspired the creation of artificial neural networks and deep learning, while psychology studies how animals and humans learn, and how subjects desired behavior can be reinforced with positive and negative stimuli. When we see how reinforcement learning teaches a simulated robot to walk, we are reminded of how children learn, through playful exploration. Techniques that are inspired by biology and psychology work amazingly well in computers: animal behavior and the structure of the brain as new blueprints for science and engineering. In fact, computers truly seem to possess aspects of human behavior; as such, this field goes to the heart of the dream of artificial intelligence. These research advances have not gone unnoticed by educators. Many universities have begun offering courses on the subject of deep reinforcement learning. The aim of this book is to provide an overview of the field, at the proper level of detail for a graduate course in artificial intelligence. It covers the complete field, from the basic algorithms of Deep Q-learning, to advanced topics such as multi-agent reinforcement learning and meta learning.
650			_aHuman-computer interaction
650			_aReinforcement learning
650			_aClimate change
650			_aConditional probability
650			_aExtraneous variables
650			_aNull hypothesis
650			_aProbability distribution
650			_aSexual selection
650			_aStandard deviation
942			_2ddc _cBK