Packt+ | Advance your knowledge in tech

You're reading from Learn Unity ML-Agents ??? Fundamentals of Unity Machine Learning Incorporate new powerful ML algorithms such as Deep Reinforcement Learning for games

Product type Paperback

Published in Jun 2018

Publisher Packt

ISBN-13 9781789138139

Length 204 pages

Edition 1st Edition

Languages

Tools

Deep Reinforcement Learning

Concepts

Deep Reinforcement Learning

Table of Contents (13) Chapters

Title Page

Dedication

Packt Upsell

Contributors

Preface

1. Introducing Machine Learning and ML-Agents

2. The Bandit and Reinforcement Learning FREE CHAPTER

3. Deep Reinforcement Learning with Python

4. Going Deeper with Deep Learning

5. Playing the Game

6. Terrarium Revisited – A Multi-Agent Ecosystem

1. Other Books You May Enjoy

Leave a review - let other readers know what you think

Index

Proximal policy optimization

Thus far, our discussion of RL has looked at simpler techniques for building agents with bandits and Q-learning. Q-learning is a popular algorithm, and as we learned, deep Q neural networks provide us with a great foundation to use to solve more difficult problems, such as a cart balancing a pole. The following table summarizes the various RL algorithms, what conditions they are capable of working in, and how they function:

Algorithm	Model	Policy	Action	Observation	Operator
Q-Learning	Model-free	Off-policy	Discrete	Discrete	Q value
SARSA – State Action Reward State Action	Model-free	On-policy	Discrete	Discrete	Q value
DQN – Deep Q Network	Model-free	Off-policy	Discrete	Continuous	Q value
DDPG – Deep Deterministic Policy Gradient	Model-free	Off-policy	Continuous	Continuous	Q value
TRPO – Trust Region Policy Optimization	Model-free	Off-policy	Continuous	Continuous	Advantage
PPO – Proximal Policy Optimization	Model-free	Off-policy	Continuous	Continuous	Advantage