Further reading
Please refer to the following articles:
Efficient Estimation of Word Representations in Vector Space, Tomas Mikolov, Kai Chen, Greg Corrado, Jeffrey Dean, Jan 2013
Factor-based Compositional Embedding Models, Mo Yu, 2014
Character-level Convolutional Networks for Text Classification, Xiang Zhang, Junbo Zhao, Yann LeCun, 2015
Distributed Representations of Words and Phrases and their Compositionality, Tomas Mikolov, Ilya Sutskever, Kai Chen, Greg Corrado, Jeffrey Dean, 2013
Using the Output Embedding to Improve Language Models, Ofir Press, Lior Wolf, Aug 2016