Packt+ | Advance your knowledge in tech

0

All Products

Best Sellers

New Releases

Books

Videos

Audiobooks

Free Learning

OpenCV 3 Computer Vision with Python Cookbook

You're reading from OpenCV 3 Computer Vision with Python Cookbook Leverage the power of OpenCV 3 and Python to build computer vision applications

Product type Paperback

Published in Mar 2018

Publisher Packt

ISBN-13 9781788474443

Length 306 pages

Edition 1st Edition

Languages

Python

Tools

OpenCV

Concepts

Computer Vision

Authors (2):

Aleksei Spizhevoi

Rybnikov

View More author details

Table of Contents (15) Chapters

Title Page

Packt Upsell

Contributors

Preface

1. I/O and GUI

2. Matrices, Colors, and Filters FREE CHAPTER

3. Contours and Segmentation

4. Object Detection and Machine Learning

5. Deep Learning

6. Linear Algebra

7. Detectors and Descriptors

8. Image and Video Processing

9. Multiple View Geometry

1. Other Books You May Enjoy

Leave a review - let other readers know what you think

Index

Optical character recognition using different machine learning models

In this recipe, you will learn how to train KNN- and SVM-based digit recognition models. It's a simple Optical Character Recognition (OCR) system that can be extended for other characters as well. OCR is a powerful instrument used in many practical applications for recognizing text documents, automatically reading traffic sign messages, and so on.

Getting ready

Before you proceed with this recipe, you will need to install the OpenCV 3.x Python API package and the matplotlib package.

How to do it...

Import the modules:

import cv2
import numpy as np

Specify a few constants:

CELL_SIZE = 20     # Digit image size. 
NCLASSES = 10      # Number of digits.
TRAIN_RATIO = 0.8  # Part of all samples used for training.

Read the digits image and prepare the labels:

digits_img = cv2.imread('../data/digits.png', 0)
digits = [np.hsplit(r, digits_img.shape[1] // CELL_SIZE) 
          for r in np.vsplit(digits_img, digits_img.shape[0] // CELL_SIZE...

The rest of the chapter is locked

Register for a free Packt account to unlock a world of extra content!

A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.

Unlock this book and the full library FREE for 7 days

Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of

Start free trial

Renews at $15.99/month. Cancel anytime

Authors (2)

Aleksei Spizhevoi

Aleksei Spizhevoi

Alexey Spizhevoy has over 7 years of experience in computer vision R&D. He has worked for 5 years at Itseez, the main OpenCV contributor, before it was acquired by Intel. He has contributed to video stabilization and photo stitching modules into OpenCV library. He has successfully participated in numerous Computer Vision projects in such areas as 3D reconstruction, video conferencing, object detection and tracking, semantic segmentation, driving assistance, and others. He holds a master's degree in computer science, and he is currently pursuing PhD.

See other products by Aleksei Spizhevoi

Rybnikov

Rybnikov

Aleksandr Rybnikov has over 5 years of experience in C++ programming, including 3 years in the Computer Vision (CV) domain. He worked at Itseez, a company that supported and developed OpenCV, and then at Intel. He enriched OpenCV's dnn module by adding support of another two Deep Learning (DL) frameworks and many features, along with improving the existing functionality. As an engineer, he participated in CV and DL projects such as iris recognition, object detection, semantic segmentation, 6-DOF pose estimation, and digital hologram reconstruction. He has a master's degree in physics.

See other products by Rybnikov

Other recommended products

Related to this chapter

OpenCV 3 Computer Vision Application Programming Cookbook

OpenCV 3 Computer Vision Application Programming Cookbook

Feb 2017 15h 48m

OpenCV 3.x with Python By Example

OpenCV 3.x with Python By Example

Computer vision is found everywhere in modern technology. OpenCV for Python enables us to run computer vision algorithms in real time. With the advent of powerful machines, we have more processing power to work with. Using this technology, we can seamlessly integrate our computer vision applications into the cloud. Focusing on OpenCV 3.x and Python 3.6, this book will walk you through all the building blocks needed to build amazing computer vision applications with ease.

Jan 2018 8h 56m

Mastering OpenCV 4 with Python

Mastering OpenCV 4 with Python

Mastering OpenCV 4 with Python is a comprehensive guide to help you to get acquainted with various computer vision algorithms running in real-time. This book will help you to build complete projects on image processing, motion detection, and image segmentation where you can gain advanced computer vision techniques.

Mar 2019 17h 44m

The Computer Vision Workshop

The Computer Vision Workshop

With The Computer Vision Workshop, you'll explore the basic and advanced techniques in video and image processing using OpenCV and Python. It is filled with real-world exercises and activities that will make the learning process easy and enjoyable.

Jul 2020 18h 56m

OpenCV 4 Computer Vision Application Programming Cookbook

OpenCV 4 Computer Vision Application Programming Cookbook

This book will present a variety of CV algorithms using the standard library. It will implement any shortfall that might come in CV by practicing the recipes that implement various tasks such as image processing and object recognition among others. It will help you in implementing CV algorithms to meet the technical requirement of your projects.

May 2019 16h 28m

Raspberry Pi Computer Vision Programming

Raspberry Pi Computer Vision Programming

You will learn the basics of hardware and software required for image processing and computer vision with Raspberry Pi and Python 3. You will have a look at all the major image processing, manipulation, and computer vision techniques and algorithms in detail using engaging examples. You will build a lot of real-life computer vision applications.

Jun 2020 10h 12m

Learning OpenCV 4 Computer Vision with Python 3

Learning OpenCV 4 Computer Vision with Python 3

Now in its third edition, this is the original book on OpenCV's Python bindings. Readers will learn a great range of techniques and algorithms, from the classics to the state-of-the-art, and from geometry to machine learning. All of this is in aid of solving practical computer vision problems in well-built applications.

Feb 2020 12h 24m

Computer Vision with Python 3

Computer Vision with Python 3

The field of computer vision involves designing and implementing algorithms to understand images and extract meaningful information from them. This book enables you to build real-world applications using Python and open source image processing libraries.

Aug 2017 6h 52m

Hands-On Algorithms for Computer Vision

Hands-On Algorithms for Computer Vision

The field of Computer Vision has seen advancements in terms of processing power and performance. Many algorithms are introduced to perform Computer Vision tasks efficiently. This book is a starting point for anyone interested in this field and wants to dig deeper into the most practical algorithms used by professional Computer Vision developers.

Jul 2018 9h 40m

Hands-On GPU-Accelerated Computer Vision with OpenCV and CUDA

Hands-On GPU-Accelerated Computer Vision with OpenCV and CUDA

This book is a guide to explore how accelerating of computer vision applications using GPUs will help you develop algorithms that work on complex image data in real time. It will solve the problems you face while deploying these algorithms on embedded platforms with the help of development boards from NVIDIA such as the Jetson TX1, Jetson TX2, and Jetson TK1.

Sep 2018 12h 40m

Learn OpenCV 4 By Building Projects

Learn OpenCV 4 By Building Projects

OpenCV is mainly used in Computer Vision and image processing and is considered to be one of the best open source libraries that helps developers focus on constructing complete projects on image processing, motion detection, and image segmentation. This book will be your guide to understanding the basic OpenCV concepts and algorithms.

Nov 2018 10h 20m

OpenCV 4 with Python Blueprints

OpenCV 4 with Python Blueprints

OpenCV 4 with Python Blueprints demonstrates how to develop a series of intermediate to advanced projects using OpenCV and Python, rather than teaching the core concepts of OpenCV in theoretical lessons. and allow you to understand the foundations and approaches used in modern computer vision.

Mar 2020 12h 12m

Personalised recommendations for you

Based on your interests and search pattern

Mathematics of Machine Learning

Mathematics of Machine Learning

Deepen your theoretical knowledge and enhance your ability to solve complex machine learning problems with structured guidance. Gain the confidence to engage with advanced ML literature and tailor algorithms to meet your project requirements.

May 2025 24h 20m

Generative AI with Python and PyTorch

Generative AI with Python and PyTorch

Learn how to create images and text using VAEs, GANs, LSTMs, and transformers. Implement applications in natural language processing and computer vision through practical tutorials.

Mar 2025 15h 0m

Practical Generative AI with ChatGPT

Practical Generative AI with ChatGPT

This book helps you unlock ChatGPT's potential to make your working life better. From prompt engineering to creating custom GPTs, you'll enhance your productivity, creativity, and efficiency with practical insights and advanced techniques.

Apr 2025 12h 52m

Generative AI with LangChain

Generative AI with LangChain

Gain a solid foundation in LangChain, agentic AI, and LangGraph, and learn to build production-ready systems with multi-agent architectures, advanced RAG pipelines, Tree of Thought reasoning, agent handoffs, and fine-grained error handling.

May 2025 15h 52m

Architecting Power BI Solutions in Microsoft Fabric

Architecting Power BI Solutions in Microsoft Fabric

Power BI provides several options to solve common data problems, and designing the correct solution for each scenario can be a daunting task. This book makes it easier by guiding you through designing optimal solutions using Power BI.

Apr 2025 14h 16m

Microsoft Identity and Access Administrator SC-300 Exam Guide

Microsoft Identity and Access Administrator SC-300 Exam Guide

This comprehensive guide covers key topics such as Microsoft Entra ID implementation, authentication and access management, external user management, and hybrid identity solutions, providing practical insights and techniques for SC-300 exam success.

Mar 2025 19h 48m

LLM Design Patterns

LLM Design Patterns

This book helps you gain practical skills to develop and deploy LLMs. You'll learn data prep, training, pruning, quantization, and evaluation, as well as explore RAG, advanced prompting, and optimization to build robust, scalable language models.

May 2025 17h 48m

Tableau Cookbook for Experienced Professionals

Tableau Cookbook for Experienced Professionals

Advance your Tableau knowledge beyond the basics, streamline dashboard performance, tackle advanced geospatial challenges, and unlock API potential while fortifying your corporate data infrastructure with proven best practices.

Apr 2025 12h 24m

Time Series Analysis with Spark

Time Series Analysis with Spark

This book offers a complete guide to time series analysis with Apache Spark and Databricks, covering essential concepts and advanced techniques including Generative AI to equip readers with skills for real-world challenges across industries.

Mar 2025 9h 56m

Hands-On Artificial Intelligence for IoT

Hands-On Artificial Intelligence for IoT

Transform IoT systems with the power of artificial intelligence using this hands-on guide. Dive into practical techniques and expert insights to innovate and optimize your IoT devices, making them smarter and more efficient.

May 2025 15h 44m