000 - LEADER |
fixed length control field |
a |
008 - FIXED-LENGTH DATA ELEMENTS--GENERAL INFORMATION |
fixed length control field |
250818b xxu||||| |||| 00| 0 eng d |
020 ## - INTERNATIONAL STANDARD BOOK NUMBER |
International Standard Book Number |
9780262039246 |
Terms of availability |
(hbk) |
082 ## - DEWEY DECIMAL CLASSIFICATION NUMBER |
Classification number |
006.3 |
Item number |
SUT |
100 ## - MAIN ENTRY--PERSONAL NAME |
Personal name |
Sutton, Richard S. |
245 ## - TITLE STATEMENT |
Title |
Reinforcement learning : an introduction |
250 ## - EDITION STATEMENT |
Edition statement |
2nd ed. |
260 ## - PUBLICATION, DISTRIBUTION, ETC. (IMPRINT) |
Place of publication, distribution, etc |
Cambridge, Massachusetts : |
Name of publisher, distributor, etc |
MIT Press, |
Date of publication, distribution, etc |
2018 |
300 ## - PHYSICAL DESCRIPTION |
Extent |
xxii, 526 p. : |
Other physical details |
ill. ; |
Dimensions |
24 cm. |
365 ## - TRADE PRICE |
Price amount |
9650.00 |
Price type code |
₹ |
Unit of pricing |
01 |
490 ## - SERIES STATEMENT |
Series statement |
Adaptive computation and machine learning series |
504 ## - BIBLIOGRAPHY, ETC. NOTE |
Bibliography, etc |
Includes bibliographical references and index. |
520 ## - SUMMARY, ETC. |
Summary, etc |
In Reinforcement Learning, Richard Sutton and Andrew Barto provide a clear and simple account of the key ideas and algorithms of reinforcement learning. Their discussion ranges from the history of the field's intellectual foundations to the most recent developments and applications. The only necessary mathematical background is familiarity with elementary concepts of probability.--Jacket. Reinforcement learning, one of the most active research areas in artificial intelligence, is a computational approach to learning whereby an agent tries to maximize the total amount of reward it receives while interacting with a complex, uncertain environment. In Reinforcement Learning, Richard Sutton and Andrew Barto provide a clear and simple account of the field's key ideas and algorithms.--Provided by publisher. |
650 ## - SUBJECT ADDED ENTRY--TOPICAL TERM |
Topical term or geographic name as entry element |
Artificial intelligence |
|
Topical term or geographic name as entry element |
Bellman equation |
|
Topical term or geographic name as entry element |
Dynamic programming |
|
Topical term or geographic name as entry element |
Function approximation |
|
Topical term or geographic name as entry element |
Monte Carlo methods |
|
Topical term or geographic name as entry element |
Markov property |
|
Topical term or geographic name as entry element |
Q-learning |
700 ## - ADDED ENTRY--PERSONAL NAME |
Personal name |
Barto, Andrew G. |
942 ## - ADDED ENTRY ELEMENTS (KOHA) |
Source of classification or shelving scheme |
|
Item type |
Books |