Study Note: 1. Introduction of pattern recognition.

在BBC一档关于数学的节目里,数学家兼主持人谈到了他对Pattern的痴迷。在他看来,数学就是对Pattern寻找和研究的过程。以下笔记将要涉及的当然不是他所说的数学也不是他所说的广义的Pattern,而是曾经在大学里因为上课打盹或者逃课而没能学完(一点都没能)的模式识别。从一开始就想要把它学好学精,But God knows what I was afraid of?  没有基石无以成大业。从头来过!

 

所有笔记内容(以问答方式)均来自于学习《Pattern Recognition 4th Edition》(主要部分)以及我的拙见(very very 次要部分)。以下是该书的原版出版社链接:http://www.elsevierdirect.com/product.jsp?isbn=9781597492720

 

 

 

 

 

 What is pattern recognition?

Pattern recognition is the scientific discipline whose goal is the classification of objects into a number of categories or classes.

 In what field the pattern recognition will be applicable?

Machine vision, Character recognition, Computer-aided diagnosis, Speech recognition, and Data mining and knowledge discovery in data bases.

 

What is feature, feature vector, l-dimensional feature space, training patterns?

Feature is the measurements used for classification. In more general case, l features constitute a feature vector.

Feature vectors are sometimes treated as random variables as the measurements resulting from different patterns exhibit a random variation.

All feature vectors constitute an l-dimensional feature space. The classification is to separate the space with decision surfaces.

The patterns (feature vectors) whose true class is known (called labeled) and which are used for the design of the classifier are known as training patterns (training feature vectors).

 

What the basic stages to design a classification system are?

1: Feature Generation Stage

2: Feature Selection Stage

3: Classifier Design Stage

    Optimality Criteria, Decision Surfaces

4: System Evaluation Stage

                 Classification Error Rate

 

What is supervised, unsupervised, and semi-supervised pattern recognition?

Supervised pattern recognition: A set of training data are available. The classifier is designed by exploiting the known information.

Unsupervised pattern recognition (Clustering): No available training data of known class labels. A set of feature vectors are given and the goal is to find the underlying similarities and cluster similar vectors together.

The major issue is to define the similarity between two vectors and choose an appropriate measure for it. Another issue is to choose an algorithmic scheme to cluster the vectors on basis of the adopted similarity measure.

Semi-supervised pattern recognition: Share the same goal with supervised pattern recognition. Labeled and unlabeled training data both exists. Recovering the related information from unlabeled data is useful to improve the system design. In other words, labeled data can be constraints in clustering task and provide knowledge that clustering algorithm has to respect.

 

 

发布了26 篇原创文章 · 获赞 9 · 访问量 10万+
發表評論
所有評論
還沒有人評論,想成為第一個評論的人麼? 請在上方評論欄輸入並且點擊發布.
相關文章