Skip-gram Word2Vec implementation
Now that we understand the mathematical details of how skip-gram models work, we are going to implement skip-gram, which encodes words into real-valued vectors with useful properties (hence the name Word2Vec). By implementing this architecture, you will get an idea of how the process of learning such a representation works.
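Before diving into the full implementation, it may help to see what the skip-gram training data looks like. The sketch below is illustrative only; the function name and window size are assumptions, not part of the implementation we build later. It turns a toy sentence into the (center, context) word pairs that skip-gram learns from:

```python
def skip_gram_pairs(tokens, window_size=2):
    """Generate (center, context) training pairs for skip-gram."""
    pairs = []
    for i, center in enumerate(tokens):
        # context words within window_size positions of the center word
        start = max(0, i - window_size)
        end = min(len(tokens), i + window_size + 1)
        for j in range(start, end):
            if j != i:
                pairs.append((center, tokens[j]))
    return pairs

tokens = "the quick brown fox jumps over the lazy dog".split()
print(skip_gram_pairs(tokens)[:5])
# [('the', 'quick'), ('the', 'brown'), ('quick', 'the'), ('quick', 'brown'), ('quick', 'fox')]
```

The model is then trained to predict the context word given the center word, and the learned weights become the word vectors.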
Text is the main input for many natural language processing applications, such as machine translation, sentiment analysis, and text-to-speech systems. So, learning a real-valued representation of text will help us apply different deep learning techniques to these tasks.
In the early chapters of this book, we introduced something called one-hot encoding, which produces a vector of zeros except for a one at the index of the word that the vector represents. So, you may wonder why we are not using it here. This method is very inefficient because you usually have a large set of distinct words, maybe something like 50,000 words, and one-hot encoding then produces a 50,000-dimensional vector for every word, with a single one and the rest zeros.
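To make the inefficiency concrete, here is a small sketch (the toy vocabulary and the 3-dimensional embedding size are made-up values for illustration) that contrasts a one-hot vector with the kind of dense, real-valued vector Word2Vec learns:

```python
import numpy as np

vocab = ["the", "cat", "sat", "on", "mat"]  # in practice this could be ~50,000 words
word_to_index = {word: i for i, word in enumerate(vocab)}

# One-hot encoding: a vector of zeros with a single 1 at the word's index.
# Its length grows with the vocabulary, and almost every entry is wasted.
def one_hot(word):
    vec = np.zeros(len(vocab))
    vec[word_to_index[word]] = 1.0
    return vec

print(one_hot("cat"))  # [0. 1. 0. 0. 0.]

# Dense embedding: each word maps to a short real-valued vector (here 3 dimensions).
# In Word2Vec these values are learned rather than randomly initialized as here.
embedding_matrix = np.random.rand(len(vocab), 3)
print(embedding_matrix[word_to_index["cat"]])
```

Besides being compact, the dense vectors can capture similarity between words, which one-hot vectors cannot: every pair of one-hot vectors is equally distant from every other pair.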