Published by Athena Scientific in Belmont, Mass.
Written in English


  • Neural networks (Computer science)
  • Mathematical optimization
  • Dynamic programming

Book details:

Edition Notes

Includes bibliographical references (p. 475-486) and index.

Statement: Dimitri P. Bertsekas and John N. Tsitsiklis.
Series: Optimization and neural computation series
Contributions: Tsitsiklis, John N.
LC Classifications: QA76.87 .B47 1996
The Physical Object
Pagination: xiii, 491 p.
Number of Pages: 491
ID Numbers
Open Library: OL1022653M
ISBN 10: 1886529108
LC Control Number: 96085338

The book develops a comprehensive analysis of neuro-dynamic programming algorithms, and guides the reader to their successful application through case studies from complex problem areas. Review by George Cybenko for IEEE Computational Science and Engineering, May. Neuro-dynamic programming (or "reinforcement learning", the term used in the artificial intelligence literature) uses neural networks and other approximation architectures to overcome the bottlenecks that limit the applicability of dynamic programming.

This book provides the first systematic presentation of the science and the art behind this exciting and far-reaching methodology. Neuro-dynamic programming comprises algorithms for solving large-scale stochastic control problems. Many ideas underlying these algorithms originated in the field of artificial intelligence and were motivated to some extent by descriptive models of animal behavior. Approximate DP (neuro-dynamic programming) allows the practical application of dynamic programming to large and complex problems, and it has become the central focus of dynamic programming research.

Neuro-Dynamic Programming was, and is, a foundational reference for anyone wishing to work in the field that goes under names such as approximate dynamic programming, adaptive dynamic programming, reinforcement learning or, as a result of this book, neuro-dynamic programming.