Algorithms for reinforcement learning (Record no. 29072)

000 -LEADER
fixed length control field nam a22 7a 4500
008 - FIXED-LENGTH DATA ELEMENTS--GENERAL INFORMATION
fixed length control field 180820b xxu||||| |||| 00| 0 eng d
020 ## - INTERNATIONAL STANDARD BOOK NUMBER
International Standard Book Number 9781608454921
082 ## - DEWEY DECIMAL CLASSIFICATION NUMBER
Classification number 006.31
Item number SZE
100 ## - MAIN ENTRY--PERSONAL NAME
Personal name Szepesvari, Csaba
245 ## - TITLE STATEMENT
Title Algorithms for reinforcement learning
260 ## - PUBLICATION, DISTRIBUTION, ETC. (IMPRINT)
Name of publisher, distributor, etc Morgan & Claypool,
Date of publication, distribution, etc 2010
Place of publication, distribution, etc UK:
300 ## - PHYSICAL DESCRIPTION
Extent xii, 89 p. :
Other physical details ill.;
Dimensions 23.5 cm.
365 ## - TRADE PRICE
Price type code US$
Price amount 35.00
504 ## - BIBLIOGRAPHY, ETC. NOTE
Bibliography, etc Includes bibliographical references.
520 ## - SUMMARY, ETC.
Summary, etc Reinforcement learning is a learning paradigm concerned with learning to control a system so as to maximize a numerical performance measure that expresses a long-term objective. What distinguishes reinforcement learning from supervised learning is that only partial feedback is given to the learner about the learner's predictions. Further, the predictions may have long term effects through influencing the future state of the controlled system. Thus, time plays a special role. The goal in reinforcement learning is to develop efficient learning algorithms, as well as to understand the algorithms'
650 ## - SUBJECT ADDED ENTRY--TOPICAL TERM
Topical term or geographic name as entry element Machine learning
Topical term or geographic name as entry element Natural gradient
Topical term or geographic name as entry element Policy gradient
Topical term or geographic name as entry element Actor-critic methods
Topical term or geographic name as entry element Q-learning
Topical term or geographic name as entry element PAC-learning
Topical term or geographic name as entry element Planning
Topical term or geographic name as entry element Simulation
Topical term or geographic name as entry element Online learning
Topical term or geographic name as entry element Active learning
Topical term or geographic name as entry element Bias-variance tradeoff
Topical term or geographic name as entry element Overfitting
Topical term or geographic name as entry element Least-squares methods
Topical term or geographic name as entry element Stochastic gradient methods
Topical term or geographic name as entry element Function approximation
Topical term or geographic name as entry element Simulation optimization
Topical term or geographic name as entry element Two-timescale stochastic approximation
Topical term or geographic name as entry element Monte-Carlo methods
Topical term or geographic name as entry element Stochastic approximation
Topical term or geographic name as entry element Mathematical models
Topical term or geographic name as entry element Temporal difference learning
Topical term or geographic name as entry element Engineering &​ Applied Sciences
Topical term or geographic name as entry element Markov decision processes
942 ## - ADDED ENTRY ELEMENTS (KOHA)
Source of classification or shelving scheme
Item type Books
Holdings
Withdrawn status Lost status Source of classification or shelving scheme Damaged status Not for loan Permanent location Current location Date acquired Source of acquisition Cost, normal purchase price Total Checkouts Full call number Barcode Checked out Date last seen Date last borrowed Koha item type
          DAIICT DAIICT 2018-08-20 Kushal Books 2523.50 23 006.31 SZE 031611 2024-12-16 2024-06-13 2024-06-13 Books

Powered by Koha