Batch Asynchronous Stochastic Approximation and Applications to Temporal Difference Learning
STCS Colloquium
Speaker: M. Vidyasagar
Organisers: Sandeep K Juneja
Time: Wednesday, 11 May 2022, 11:00 to 12:00
Venue: In person @ A-201 and also via Zoom
(This talk consists of joint work with Prof. Rajeeva L.

A Simple Convergence Proof for Stochastic Approximation and Applications to Reinforcement Learning
STCS Colloquium
Speaker: M. Vidyasagar
Organisers: Sandeep K Juneja
Time: Tuesday, 10 May 2022, 16:00 to 17:00
Venue: In person @ A-201 and also via Zoom
Since its invention by Robbins and Monro in 1951, the stochastic approximation (SA) algorithm has been a widely used tool for solving equations or minimizing functions from noisy measurements.