Image recognition is the process of identifying and detecting an object or a feature in a digital image or video. This concept is used in many applications like systems for factory automation, toll booth monitoring, and security surveillance. Typical image recognition algorithms include:

- Optical character recognition
- Pattern and gradient matching
- Face recognition
- License plate matching
- Scene change detection

(Figure 1).

A support vector machine (SVM) is a supervised learning algorithm that can be used for binary classification or regression. Support vector machines are popular in applications such as natural language processing, speech and image recognition, and computer vision.

A support vector machine constructs an optimal hyperplane as a decision surface such that the margin of separation between the two classes in the data is maximized. Support vectors refer to a small subset of the training observations that are used as support for the optimal location of the decision surface.

Support vector machines fall under a class of machine learning algorithms called kernel methods and are also referred to as kernel machines.

Training for a support vector machine has two phases:

- Transform predictors (input data) to a high-dimensional feature space. It is sufficient to just specify the kernel for this step and the data is never explicitly transformed to the feature space. This process is commonly known as the kernel trick.
- Solve a quadratic optimization problem to fit an optimal hyperplane to classify the transformed features into two classes. The number of transformed features is determined by the number of support vectors.

Popular kernels used with SVMs include:

Type of SVM | Mercer Kernel | Description |
---|---|---|

Gaussian or Radial Basis Function (RBF) | One class learning. σ is the width of the kernel | |

Linear | K(x = _{1},x_{2})x_{1}^{T}x_{2} |
Two class learning. |

Polynomial | K(x = (_{1},x_{2})x + 1)_{1}^{T}x_{2}^{p} |
p is the order of the polynomial |

Sigmoid | K(x = tanh (_{1},x_{2})β + _{0} x_{1}^{T}x_{2}β)_{1} |
It is a mercer kernel for certain β values only_{0} and β_{1} |

Cross correlation can be used for pattern matching and target tracking as shown in Figure 2.

An effective approach for image recognition includes using a technical computing environment for data analysis, visualization, and algorithm development.

Sign up here with your email

ConversionConversion EmoticonEmoticon