Dynamic Programming and Stochastic Control, Academic Press, 1976.

Establishes a connection between rollout and model predictive control, one of the most prominent control system design methodologies.

More specifically, I am going to talk about the unbelievably awesome Linear Quadratic Regulator, which is used quite often in the optimal control world, and also address some of the similarities between optimal control and the recently hyped reinforcement learning.

Reinforcement Learning (RL) addresses the problem of controlling a dynamical system so as to maximize a notion of reward accumulated over time.

REINFORCEMENT LEARNING AND OPTIMAL CONTROL BOOK, Athena Scientific, July 2019. Publication: 2020, 376 pages, hardcover.

In a generalizable end-to-end fashion, muscle activations are learned given current and desired position-velocity pairs.

Kretchmar and Anderson (1997), Comparison of CMACs and Radial Basis Functions for Local Function Approximators in Reinforcement Learning.

Bertsekas, 2018, ISBN 978-1-886529-46-5, 360 pages. Contents, Preface, Selected Sections.

Bertsekas and Tsitsiklis (1995), Neuro-Dynamic Programming.

He is the recipient of the 2001 A. R. Ragazzini ACC education award, the 2009 INFORMS expository writing award, the 2014 Khachiyan Prize, the 2014 AACC Bellman Heritage Award, and the 2015 SIAM/MOS George B. Dantzig Prize.