Packt+ | Advance your knowledge in tech

You're reading from Deep Learning with Theano Perform large-scale numerical and scientific computations efficiently

Product type Paperback

Published in Jul 2017

Publisher Packt

ISBN-13 9781786465825

Length 300 pages

Edition 1st Edition

Tools

Theano

Concepts

Deep Learning

Author (1):

Bourez

View More author details

Table of Contents (22) Chapters

Deep Learning with Theano

Credits

About the Author

Acknowledgments

About the Reviewers

www.PacktPub.com

Customer Feedback

Preface

1. Theano Basics FREE CHAPTER

2. Classifying Handwritten Digits with a Feedforward Network

3. Encoding Word into Vector

4. Generating Text with a Recurrent Neural Net

5. Analyzing Sentiment with a Bidirectional LSTM

6. Locating with Spatial Transformer Networks

7. Classifying Images with Residual Networks

8. Translating and Explaining with Encoding – decoding Networks

9. Selecting Relevant Inputs or Memories with the Mechanism of Attention

10. Predicting Times Sequences with Advanced RNN

11. Learning from the Environment with Reinforcement

12. Learning Features with Unsupervised Generative Networks

13. Extending Deep Learning with Theano

Index

Summary

Classification is a very wide topic in machine learning. It consists of predicting a class or a category, as we have shown with our handwritten digits example. In Chapter 7, Classifying Images with Residual Networks, we'll see how to classify a wider set of natural images and objects.

Classification can be applied to different problems and the cross-entropy/negative log likelihood is the common loss function to solve them through gradient descent. There are many other loss functions for problems such as regression (mean square error loss) or unsupervised joint learning (hinge loss).

In this chapter, we have been using a very simple update rule as gradient descent named stochastic gradient descent, and presented some other gradient descent variants (Momentum, Nesterov, RMSprop, ADAM, ADAGRAD, ADADELTA). There has been some research into second order optimizations, such as Hessian Free, or K-FAC, which provided better results in deep or recurrent networks but remain complex and costly...