Datasets and pre-trained model description
We are going to use the Large Movie Review dataset for training and testing the mode. Additionally, we will be using the Sentiment labeled Sentences dataset for making a single prediction on reviews on products, movies, and restaurants.
Large Movie Review dataset for training and testing
The former one is a dataset for binary sentiment classification containing substantially more data than previous benchmark datasets. The dataset can be downloaded from http://ai.stanford.edu/~amaas/data/sentiment/. Alternatively, I have utilized a Java method that comes from DL4J examples that also downloads and extracts this dataset.
I would like to acknowledge the following publications: Andrew L. Maas, Raymond E. Daly, Peter T. Pham, Dan Huang, Andrew Y. Ng, and Christopher Potts. (2011), Learning Word Vectors for Sentiment Analysis, The 49th Annual Meeting of the Association for Computational Linguistics (ACL 2011).
This dataset contains 50,000 movie reviews along...