Home > Articles

This chapter is from the book

6.9 Further Reading

  • “Neuronlike Adaptive Elements That Can Solve Difficult Learning Control Problems,” Barto et al., 1983 [11].

  • “High Dimensional Continuous Control with Generalized Advantage Estimation,” Schulman et al., 2015 [123].

  • “Trust Region Policy Optimization,” Schulman et al., 2015 [122].

InformIT Promotional Mailings & Special Offers

I would like to receive exclusive offers and hear about products from InformIT and its family of brands. I can unsubscribe at any time.