




The Science of Pattern Recognition. Achievements and Perspectives

Robert P.W. Duin (1) and Elzbieta Pękalska (2)

(1) ICT group, Faculty of Electr. Eng., Mathematics and Computer Science, Delft University of Technology, The Netherlands
(2) School of Computer Science, University of Manchester, United Kingdom, pekalska@cs.man.ac.uk

Summary. Automatic pattern recognition is usually considered as an engineering area which focusses on the development and evaluation of systems that imitate or assist humans in their ability of recognizing patterns. It may, however, also be considered as a science that studies the faculty of human beings (and possibly other biological systems) to discover, distinguish and characterize patterns in their environment and accordingly identify new observations. The engineering approach to pattern recognition is in this view an attempt to build systems that simulate this phenomenon. By doing that, scientific understanding is gained of what is needed in order to recognize patterns in general.

As in any science, understanding can be built from different, sometimes even opposite viewpoints. We will therefore introduce the main approaches to the science of pattern recognition as two dichotomies of complementary scenarios. They give rise to four different schools, roughly defined under the terms of expert systems, neural networks, structural pattern recognition and statistical pattern recognition.

We will briefly describe what has been achieved by these schools, what is common and what is specific, which limitations are encountered and which perspectives arise for the future. Finally, we will focus on the challenges facing pattern recognition in the decades to come. They mainly deal with weaker assumptions of the models to make the corresponding procedures for learning and recognition more widely applicable. In addition, new formalisms need to be developed.

Introduction

We are very familiar with the human ability of pattern recognition. Since our early years we have been able to recognize voices, faces, animals, fruits or inanimate objects. Before the speaking faculty is developed, an object like a ball is recognized, even if it barely resembles the balls seen before. So, apart from memory, the skills of abstraction and generalization are essential to find our way in the world. In later years we are able to deal with much more complex patterns that may not directly be
based on sensory observations. For example, we can observe the underlying theme in a discussion or subtle patterns in human relations. The latter may become apparent, e.g., only by listening to somebody's complaints about his personal problems at work that again occur in a completely new job. Without a direct participation in the events, we are able to see both analogy and similarity in examples as complex as social interaction between people. Here, we learn to distinguish the pattern from just two examples.

The pattern recognition ability may also be found in other biological systems: the cat knows the way home, the dog recognizes his boss from the footsteps or the bee finds the delicious flower. In these examples a direct connection can be made to sensory experiences. Memory alone is insufficient; an important role is that of generalization from observations which are similar, although not
identical to the previous ones. A scientific challenge is to find out how this may work.

Scientific questions may be approached by building models and, more explicitly, by creating simulators, i.e. artificial systems that roughly exhibit the same phenomenon as the object under study. Understanding will be gained while constructing such a system and evaluating it with respect to the real object. Such systems may be used to replace the original ones and may even improve some of their properties. On the other hand, they may also perform worse in other aspects. For instance, planes fly faster than
birds but are far from being autonomous. We should realize, however, that what is studied in this case may not be the bird itself, but more importantly, the ability to fly.

Much can be learned about flying in an attempt to imitate the bird, but also when differentiating from its exact behavior or appearance. By constructing fixed wings instead of freely movable ones, the insight into how to fly grows.

Finally, there are engineering aspects that may gradually deviate from the original scientific question. These are concerned with how to fly for a long time, with heavy loads, or by making less noise, and slowly shift the point of attention to other domains of knowledge.

The above shows that a distinction can be made between the scientific study of pattern recognition as the ability to abstract and generalize from observations and the applied technical area of the design of artificial pattern recognition devices, without neglecting the fact that they may highly profit from each other. Note that patterns can be distinguished on many levels, starting from simple characteristics of structural elements like strokes, through features of an individual towards a set of qualities in a group of individuals, to a composite of traits of concepts and their possible generalizations. A pattern may also denote a single individual as a representative for its population, model or concept. Pattern recognition deals, therefore, with patterns, regularities, characteristics or qualities that can be discussed on a low level of sensory measurements (such as pixels in an image) as well as on a high level of the derived and meaningful concepts (such as faces in images). In this work, we will focus on the scientific aspects, i.e. what we know about the way pattern recognition works and, especially, what can be learned from our attempts to build artificial recognition devices.

A number of authors have already discussed the science of pattern recognition based on their simulation and modeling attempts. One of the first, in the beginning of the sixties, was Sayre [64], who presented a philosophical study on perception, pattern recognition and classification. He made clear that classification is a task that can be fulfilled with some success, but recognition either happens or not. We can stimulate the recognition by focussing on some aspects of the question. Although we cannot set out to fully recognize an individual, we can at least start to classify objects on demand. The way Sayre distinguishes between recognition and classification is related to the two subfields discussed in traditional texts on pattern recognition, namely unsupervised and supervised learning. They fulfill two complementary
tasks. They act as automatic tools in the hand of a scientist who sets out to find the regularities in nature.

Unsupervised learning (also related to exploratory analysis or cluster analysis) gives the scientist an automatic system to indicate the presence of yet unspecified patterns (regularities) in the observations. They have to be confirmed (verified) by him. Here, in the terms of Sayre, a pattern is recognized.

Supervised learning is an automatic system that verifies (confirms) the patterns described by the scientist based on a representation defined by him. This is done by an automatic classification followed by an evaluation.

In spite of Sayre's discussion, the concepts of pattern recognition and classification are still frequently mixed up. In our discussion, classification is a significant component of the pattern recognition system, but unsupervised learning may also play a role there. Typically, such a system is first presented with a set of known objects, the training set, in some convenient representation. Learning relies on finding the data descriptions such that the system can correctly characterize, identify or classify novel examples. After appropriate preprocessing and
adaptations, various mechanisms are employed to train the entire system well. Numerous models and techniques are used and their performances are evaluated and compared by suitable criteria. If the final goal is prediction, the findings are validated by applying the best model to unseen data. If the
final goal is characterization, the findings may be validated by complexity of organization (relations between objects) as well as by interpretability of the results.

Fig. 1 shows the three main stages of pattern recognition systems: Representation, Generalization and Evaluation, and an intermediate
stage of Adaptation [20]. The system is trained and evaluated by a set of examples, the Design Set. The components are:

Design Set. It is used both for training and validating the system. Given the background knowledge, this set has to be chosen such that it is representative for the set of objects to be
recognized by the trained system. There are various ways to split it into suitable subsets for training, validation and testing. See e.g. [22, 32, 62, 77] for details.

Representation. Real world objects have to be represented in a formal way in order to be analyzed and compared by mechanical means such as a computer. Moreover, the observations derived from the sensors or other formal representations have to be integrated with the existing, explicitly formulated knowledge either on the objects themselves or on the class they may belong to. The issue of representation is an essential aspect
of pattern recognition and is different from classification. It largely influences the success of the stages to come.

Adaptation. It is an intermediate stage between Representation and Generalization, in which representations, learning methodology or problem statement are adapted or extended in order
to enhance the final recognition. This step may be neglected as being transparent, but its role is essential. It may reduce or simplify the representation, or it may enrich it by emphasizing particular aspects, e.g. by a nonlinear transformation of features that simplifies the next stage. Background knowledge may appropriately be (re)formulated and incorporated into a representation. If needed, additional representations may be considered to reflect other aspects of the problem. Exploratory data analysis (unsupervised learning) may be used to guide the choice of suitable learning strategies.

Generalization or Inference. In this stage we learn a concept from a training set, the set of known and appropriately represented examples, in such a way that predictions can be made on some unknown properties of new examples. We either generalize towards a concept or infer a set of general rules that describe the qualities of the training data. The most common property is the class or pattern it belongs to, which is the above mentioned classification task.

Evaluation. In this stage we estimate how our system performs on known training and validation data while training the entire system. If the results are unsatisfactory, then the previous steps have to be reconsidered.

Different disciplines emphasize or just exclusively study different parts of this system. For instance, perception and computer vision deal mainly with the representation aspects [21], while books on artificial neural networks [62],
machine learning [4, 53] and pattern classification [15] are usually restricted to generalization. It should be noted that these and other studies with the words “pattern” and “recognition” in the title often almost entirely neglect the issue of representation. We think, however, that the main goal of the field of pattern recognition is to study generalization in relation to representation [20].

In the context of representations, and especially images, generalization has been thoroughly studied by Grenander [36]. What is very specific and worthwhile is that he deals with infinite representations (say, unsampled images), thereby avoiding the frequently returning discussions on dimensionality and directly focussing on a high, abstract level of pattern learning. We would like to mention two other scientists that present very general discussions on the pattern recognition system: Watanabe [75] and Goldfarb [31, 32]. They both emphasize the structural approach to pattern recognition that we will discuss later on. Here objects are represented in a form that focusses on their structure. A generalization over such structural representations is very difficult if one aims to learn the concept, i.e. the underlying,
often implicit definition of a pattern class that is able to generate possible realizations. Goldfarb argues that traditionally used numeric representations are inadequate and that an entirely new, structural representation is necessary. We judge his research program as very ambitious, as he wants to learn the (generalized) structure of the concept from the structures of the examples. He thereby aims to make explicit what usually stays implicit. We admit that a way like his has to be followed if one ever wishes to reach more in concept learning than the ability to name the right class with a high probability, without having built a proper understanding.
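
The stages described in the text (Design Set, Representation, Adaptation, Generalization, Evaluation) can be illustrated with a minimal Python sketch. Nothing below is taken from the paper: the synthetic two-class data, the 70/30 split, the standardization step and the nearest-mean classifier are all assumptions chosen for brevity, and every function name is hypothetical.

```python
import random
import statistics


def make_design_set(n_per_class=50, seed=0):
    """Representation (assumed): each object is a 2-feature vector with a label."""
    rng = random.Random(seed)
    data = []
    # Two synthetic classes; the second feature has a much larger scale.
    for label, (cx, cy) in enumerate([(0.0, 0.0), (3.0, 300.0)]):
        for _ in range(n_per_class):
            data.append(([rng.gauss(cx, 1.0), rng.gauss(cy, 100.0)], label))
    rng.shuffle(data)
    return data


def split(design_set, train_fraction=0.7):
    """Design Set: hold out part of the examples for validation."""
    cut = int(len(design_set) * train_fraction)
    return design_set[:cut], design_set[cut:]


def fit_scaler(train):
    """Adaptation: estimate per-feature mean and standard deviation."""
    columns = list(zip(*(x for x, _ in train)))
    return [(statistics.mean(c), statistics.stdev(c)) for c in columns]


def adapt(x, scaler):
    """Adaptation: rescale a feature vector so features are comparable."""
    return [(v - m) / s for v, (m, s) in zip(x, scaler)]


def train_nearest_mean(train, scaler):
    """Generalization: summarize each class by the mean of its examples."""
    by_class = {}
    for x, y in train:
        by_class.setdefault(y, []).append(adapt(x, scaler))
    return {y: [statistics.mean(c) for c in zip(*xs)] for y, xs in by_class.items()}


def classify(x, means, scaler):
    """Assign x to the class whose mean is closest (squared Euclidean distance)."""
    z = adapt(x, scaler)
    return min(means, key=lambda y: sum((a - b) ** 2 for a, b in zip(z, means[y])))


def error_rate(examples, means, scaler):
    """Evaluation: fraction of misclassified examples."""
    wrong = sum(classify(x, means, scaler) != y for x, y in examples)
    return wrong / len(examples)


design_set = make_design_set()
train, validation = split(design_set)
scaler = fit_scaler(train)
means = train_nearest_mean(train, scaler)
train_error = error_rate(train, means, scaler)
validation_error = error_rate(validation, means, scaler)
print(f"training error: {train_error:.2f}, validation error: {validation_error:.2f}")
```

If the validation error were unsatisfactory, the loop described under Evaluation would apply: one would revisit the earlier stages, for instance by changing the features (Representation) or the rescaling (Adaptation), before retraining.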