Packt+ | Advance your knowledge in tech

You're reading from Raspberry Pi 3 Cookbook for Python Programmers Unleash the potential of Raspberry Pi 3 with over 100 recipes

Product type Paperback

Published in Apr 2018

Publisher

ISBN-13 9781788629874

Length 552 pages

Edition 3rd Edition

Languages

Python

Tools

Raspberry Pi

Concepts

Single Board Computers

Authors (2):

Steven Lawrence Fernandes

Tim Cox

View More author details

Table of Contents (23) Chapters

Title Page

Dedication

Packt Upsell

Contributors

Preface

1. Getting Started with a Raspberry Pi 3 Computer FREE CHAPTER

2. Dividing Text Data and Building Text Classifiers

3. Using Python for Automation and Productivity

4. Predicting Sentiments in Words

5. Creating Games and Graphics

6. Detecting Edges and Contours in Images

7. Creating 3D Graphics

8. Building Face Detector and Face Recognition Applications

9. Using Python to Drive Hardware

10. Sensing and Displaying Real-World Data

11. Building Neural Network Modules for Optical Character Recognition

12. Building Robots

13. Interfacing with Technology

14. Can I Recommend a Movie for You?

1. Hardware and Software List

2. Other Books You May Enjoy

Leave a review - let other readers know what you think

Index

Building a text classifier

Classifier units are normally considered to separate a database into various classes. The Naive Bayes classifier scheme is widely considered in literature to segregate the texts based on the trained model. This section of the chapter initially considers a text database with keywords; feature extraction extracts the key phrases from the text and trains the classifier system. Then, term frequency-inverse document frequency (tf-idf) transformation is implemented to specify the importance of the word. Finally, the output is predicted and printed using the classifier system.

How to do it...

Include the following lines in a new Python file to add datasets:

from sklearn.datasets import fetch_20newsgroups 
category_mapping = {'misc.forsale': 'Sellings', 'rec.motorcycles': 'Motorbikes', 
        'rec.sport.baseball': 'Baseball', 'sci.crypt': 'Cryptography', 
        'sci.space': 'OuterSpace'} 
 
training_content = fetch_20newsgroups(subset='train', 
categories=category_mapping...