




Pattern Recognition
Dr. Shi, Daming
Nanyang Technological University / Harbin Engineering University

What is Pattern Recognition?
Pattern recognition classifies raw data into the 'category' of the pattern. It is a branch of artificial intelligence concerned with the identification of visual or audio patterns by computers, for example character recognition, speech recognition, face recognition, etc.
There are two categories: syntactic (or structural) pattern recognition and statistical pattern recognition. Pattern Recognition = Pattern Classification.

A pattern recognition system works in two phases:
- Training phase: training data -> feature extraction -> learning (feature selection, clustering, discriminant function generation, grammar parsing) -> knowledge.
- Recognition phase: unknown data -> feature extraction -> recognition (statistical, structural), using the learned knowledge -> results.
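A minimal sketch of this two-phase pipeline in Python follows; the PatternRecognizer class, its nearest-mean "learning" step and the toy data are illustrative assumptions, not part of the original slides.

```python
import numpy as np

class PatternRecognizer:
    """Two-phase pipeline: train on labelled data, then classify unknown data."""

    def extract_features(self, raw):
        # Placeholder feature extraction: here the raw data is already a vector.
        return np.asarray(raw, dtype=float)

    def train(self, samples, labels):
        # Learning step: store one prototype (sample mean) per class as 'knowledge'.
        X = np.array([self.extract_features(s) for s in samples])
        labels = np.array(labels)
        self.classes = sorted(set(labels))
        self.prototypes = {c: X[labels == c].mean(axis=0) for c in self.classes}

    def recognize(self, unknown):
        # Recognition step: compare the unknown pattern against the stored knowledge.
        x = self.extract_features(unknown)
        return min(self.classes, key=lambda c: np.linalg.norm(x - self.prototypes[c]))

pr = PatternRecognizer()
pr.train([[0, 0], [0, 1], [5, 5], [6, 5]], ["A", "A", "B", "B"])
print(pr.recognize([4.5, 5.2]))  # -> "B"
```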
Categorisation
- Based on application areas: face recognition, speech recognition, character recognition, etc.
- Based on decision-making approaches: syntactic pattern recognition and statistical pattern recognition.

Syntactic Pattern Recognition
Any problem is described with a formal language, and the solution is obtained through grammatical parsing. (In memory of Prof. Fu, King-Sun and Prof. Shu Wenhao.)

Statistical Pattern Recognition
In the statistical approach, each pattern is viewed as a point in a multi-dimensional space. The decision boundaries are determined by the probability distributions of the patterns belonging to each class, which must either be specified or learned.

Scope of the Seminar
- Module 1: Distance-Based Classification
- Module 2: Probabilistic Classification
- Module 3: Linear Discriminant Analysis
- Module 4: Neural Networks for P.R.
- Module 5: Clustering
- Module 6: Feature Selection

Module 1: Distance-Based Classification

Overview
Distance-based classification is the most common type of pattern recognition technique, and its concepts are a basis for other classification techniques. First, a prototype is chosen through training to represent a class; then, the distance from an unknown datum to the class is calculated using the prototype.
Classification by Distance
Objects can be represented by vectors in a space. In training, we have labelled samples; in recognition, an unknown datum is classified by its distance to the classes. The question is how to represent classes.

Prototype
To find the pattern-to-class distance, we need to use a class prototype (pattern):
(1) Sample mean. For class c_i, take the mean of its training samples, m_i = (1/N_i) * sum of x over x in c_i.
(2) Most typical sample. Choose the sample of c_i whose total distance to the other samples of the class is minimized.
(3) Nearest neighbour. Choose the training sample nearest to the unknown pattern. Nearest-neighbour prototypes are sensitive to noise and outliers in the training set.
(4) k-nearest neighbours. The pattern y is classified into the class most common among its k nearest neighbours from the training samples. k-NN is more robust against noise, but is more computationally expensive. The chosen distance determines how 'near' is defined.

Distance Measures
The most familiar distance metric is the Euclidean distance, d(x, y) = sqrt(sum_j (x_j - y_j)^2). Another example is the Manhattan distance, d(x, y) = sum_j |x_j - y_j|. Many other distance measures exist.

Minimum Euclidean Distance (MED) Classifier
Assign x to the class whose prototype m_i is nearest in Euclidean distance. Equivalently, since ||x - m_i||^2 = x^T x - 2(m_i^T x - (1/2) m_i^T m_i), maximize the linear discriminant g_i(x) = m_i^T x - (1/2) m_i^T m_i.

Decision Boundary
Given a prototype and a distance metric, it is possible to find the decision boundary between classes; the boundary may be linear or nonlinear. Decision boundary = discriminant function. (Figures: linear and nonlinear boundaries in the lightness/length plane.)

Example
Any fish is a vector in the 2-dimensional space of length and lightness, and a decision boundary in this plane separates the fish classes.

Summary (Module 1)
- Classification by the distance from an unknown datum to class prototypes.
- Choosing a prototype: sample mean, most typical sample, nearest neighbour, k-nearest neighbours.
- Decision boundary = discriminant function.
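To tie the module together, here is a minimal NumPy sketch of its ingredients: the two distance measures, minimum-distance classification against class prototypes, and k-NN. The toy data are illustrative assumptions.

```python
import numpy as np

def euclidean(x, y):
    return np.sqrt(np.sum((x - y) ** 2))

def manhattan(x, y):
    return np.sum(np.abs(x - y))

def med_classify(x, prototypes, dist=euclidean):
    # Minimum-distance classification against per-class prototypes (e.g. sample means).
    return min(prototypes, key=lambda c: dist(x, prototypes[c]))

def knn_classify(x, X_train, y_train, k=3, dist=euclidean):
    # Classify x by majority vote among its k nearest training samples.
    d = [dist(x, xi) for xi in X_train]
    nearest = np.argsort(d)[:k]
    votes = [y_train[i] for i in nearest]
    return max(set(votes), key=votes.count)

X_train = np.array([[0, 0], [1, 0], [5, 5], [6, 5]])
y_train = ["A", "A", "B", "B"]
print(knn_classify(np.array([5.5, 4.8]), X_train, y_train, k=3))  # -> "B"
```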
Module 2: Probabilistic Classification

Review and Extend

Maximum A Posteriori (MAP) Classifier
Ideally, we want to favour the class with the highest probability for the given pattern: choose the C_i that maximizes P(C_i | x), where P(C_i | x) is the a posteriori probability of class C_i given x.

Bayesian Classification
Bayes' theorem: P(C_i | x) = P(x | C_i) P(C_i) / P(x), where P(x | C_i) is the class-conditional probability density (p.d.f.), which needs to be estimated from the available samples or otherwise assumed, and P(C_i) is the a priori probability of class C_i.

MAP Classifier
The Bayesian classifier, also known as the MAP classifier, assigns the pattern x to the class with the maximum weighted p.d.f., i.e. the maximum P(x | C_i) P(C_i).

Accuracy vs. Risk
In the real world, however, life is not just about accuracy: in some cases a small misclassification may result in a big disaster, for example in medical diagnosis or fraud detection. The MAP classifier is biased towards the most likely class (maximum likelihood classification).

Loss Function
On the other hand, in the case of P(C_1) >> P(C_2), the lowest error rate can be attained by always classifying as C_1; this is also known as the problem of imbalanced training data. A solution is to assign a loss to misclassification, which leads to conditional risk.

Conditional Risk
Instead of using the posterior P(C_i | x) alone, we use the conditional risk R(a_i | x) = sum_j lambda(a_i | C_j) P(C_j | x), where lambda(a_i | C_j) is the cost of action a_i given class C_j.
To minimize the overall risk, choose the action with the lowest conditional risk for the pattern: a* = argmin_i R(a_i | x).

Example
Assume that the amount of fraudulent activity is about 1% of the total credit card activity: C1 = fraud, P(C1) = 0.01; C2 = no fraud, P(C2) = 0.99. If losses are equal for both kinds of misclassification, the rule reduces to comparing the weighted p.d.f.s P(x|C1)P(C1) and P(x|C2)P(C2).

However, the losses are probably not the same. Classifying a fraudulent transaction as legitimate leads to direct dollar losses as well as intangible losses (e.g. reputation, hassles for consumers). Classifying a legitimate transaction as fraudulent inconveniences consumers, as their purchases are denied, which could lead to loss of future business. Let us assume that the ratio of loss for 'no fraud' to 'fraud' is 1 to 50, i.e. a missed fraud is 50 times more expensive than accidentally freezing a card due to legitimate use.

By including the loss function, the decision boundaries change significantly: instead of comparing P(x|C1)P(C1) with P(x|C2)P(C2), we compare 50 * P(x|C1)P(C1) with 1 * P(x|C2)P(C2).

Probability Density Function
Relatively speaking, it is much easier to estimate the a priori probability, e.g. simply take P(C_i) = N_i / N, the fraction of training samples belonging to class C_i. To estimate the p.d.f., we can
(1) assume a known parametric form of the p.d.f. and estimate its parameters, or
(2) estimate a non-parametric p.d.f. from the training samples.

Maximum Likelihood Parameter Estimation
Without loss of generality, consider a Gaussian density for P(x | C_i), with parameter values to be identified from the training examples of class C_i. We look for the parameters that maximize the likelihood of the training set; for the Gaussian these turn out to be the sample mean and the sample covariance matrix.

Density Estimation
If we do not know the specific form of the p.d.f., we need a different density estimation approach: a non-parametric technique that uses variations of histogram approximation.
(1) The simplest density estimation is to use 'bins': e.g. in the 1-D case, divide the x-axis into bins of length h and estimate the probability density in each bin as p ≈ k_N / (N * h), where k_N is the number of samples in the bin.
(2) Alternatively, we can take windows of unit volume and apply these windows to each sample; the overlap of the windows defines the estimated p.d.f. This technique is known as Parzen windows (kernels).

Summary (Module 2)
- Bayes' theorem.
- Maximum a posteriori classifier; with equal priors it reduces to the maximum likelihood classifier.
- Density estimation.
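As a worked complement to the fraud example above, here is a minimal sketch of the risk-weighted decision rule. The 1-D Gaussian class-conditional densities (over a hypothetical "suspicion score" feature) are an illustrative assumption, since the slides do not specify P(x|C_i).

```python
import numpy as np
from scipy.stats import norm

# Priors from the example: 1% fraud, 99% legitimate.
priors = {"fraud": 0.01, "no_fraud": 0.99}
# Illustrative (assumed) 1-D class-conditional densities over a 'suspicion score'.
pdfs = {"fraud": norm(5.0, 1.0).pdf, "no_fraud": norm(2.0, 1.0).pdf}
# Loss ratio from the example: a missed fraud costs 50x a false alarm.
loss_missed_fraud, loss_false_alarm = 50.0, 1.0

def decide(x):
    # Unnormalized posteriors P(x|C)P(C); the evidence P(x) cancels in the comparison.
    post = {c: pdfs[c](x) * priors[c] for c in priors}
    # Conditional risk of each action: R(a|x) = sum_j loss(a, C_j) P(C_j|x).
    r_say_fraud = loss_false_alarm * post["no_fraud"]
    r_say_no_fraud = loss_missed_fraud * post["fraud"]
    return "fraud" if r_say_fraud < r_say_no_fraud else "no_fraud"

print(decide(4.0))  # -> "fraud"; an unweighted MAP rule would still say "no_fraud" here
```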
Module 3: Linear Discriminant Analysis

Linear Classifier (1)
A linear classifier implements a discriminant function, or decision boundary, represented by a straight line (hyperplane) in the multidimensional space. Given an input x = (x1, ..., xm)^T, the decision boundary of a linear classifier is given by the discriminant function f(x) = w^T x + b, with weight vector w = (w1, ..., wm)^T.

Linear Classifier (2)
The output of the function f(x) for any input depends on the values of the weight vector and the input vector. For example, the following class definition may be employed:
- if f(x) > 0, then x is a ballet dancer;
- if f(x) <= 0, then x is a rugby player.

Linear Classifier (3)
The boundary f(x) = 0 is always orthogonal to the weight vector w, and the inner product of the input vector and the weight vector, w^T x, is the same for all points on the boundary, namely -b. (Figure: the line f(x) = 0 in the (x1, x2) plane, with f(x) > 0 on one side, f(x) < 0 on the other, and w normal to the line.)

Perceptron
The perceptron takes inputs x = (x1, ..., xm)^T with weights w = (w1, ..., wm)^T and bias b, forms the linear combination w^T x + b in a linear combiner, and passes it through an activation function to produce the output y. (Figure: inputs x1, x2 feeding a linear combiner with weights w1, w2 and bias b, followed by an activation function giving y.)

Multi-class Problem
(Illustrated by a figure in the original slides.)

Limitation of the Perceptron
A single-layer perceptron can perform pattern classification only on linearly separable patterns. (Figures: (a) linearly separable patterns; (b) non-linearly separable patterns.)
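Before moving on to generalized discriminants, here is a minimal sketch of the classic perceptron learning rule on linearly separable toy data; the learning rate and the data are illustrative assumptions.

```python
import numpy as np

def train_perceptron(X, y, epochs=100, lr=0.1):
    """Perceptron learning rule; y must contain labels in {-1, +1}."""
    w = np.zeros(X.shape[1])
    b = 0.0
    for _ in range(epochs):
        errors = 0
        for xi, yi in zip(X, y):
            if yi * (w @ xi + b) <= 0:   # misclassified (or on the boundary)
                w += lr * yi * xi        # rotate w towards the correct side
                b += lr * yi
                errors += 1
        if errors == 0:                  # converged: all samples separated
            break
    return w, b

# Linearly separable toy data: two clusters on a plane.
X = np.array([[2.0, 2.0], [3.0, 3.0], [-2.0, -1.0], [-3.0, -2.0]])
y = np.array([1, 1, -1, -1])
w, b = train_perceptron(X, y)
print(np.sign(X @ w + b))  # -> [ 1.  1. -1. -1.]
```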
Generalized Linear Discriminant Functions
- Decision boundaries which separate classes may not always be linear.
- The complexity of the boundaries may sometimes require the use of highly non-linear surfaces.
- A popular approach to generalizing the concept of linear decision functions is to consider a generalized decision function g(x) = w^T phi(x), where phi is a nonlinear mapping function.

Summary (Module 3)
- Linear classifiers and vector analysis.
- The perceptron.
- The perceptron cannot classify linearly non-separable patterns; MLP, RBF networks and SVMs address this.
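A small sketch of the generalized-discriminant idea, assuming a quadratic feature map phi; the circular toy data and the hand-chosen weights are illustrative, not from the slides.

```python
import numpy as np

def phi(x):
    # Illustrative nonlinear map: (x1, x2) -> (x1, x2, x1^2 + x2^2).
    return np.array([x[0], x[1], x[0] ** 2 + x[1] ** 2])

# Points inside a circle of radius 1 vs. outside: not linearly separable in 2-D.
inside = np.array([[0.1, 0.2], [-0.3, 0.1], [0.2, -0.2]])
outside = np.array([[1.5, 0.0], [0.0, -1.6], [-1.4, 1.2]])

# In the mapped 3-D space the third coordinate alone separates the classes,
# so g(x) = w^T phi(x) + b with w = (0, 0, 1) and b = -1 is a linear boundary there.
w, b = np.array([0.0, 0.0, 1.0]), -1.0
for x in np.vstack([inside, outside]):
    print(x, "outside" if w @ phi(x) + b > 0 else "inside")
```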
Module 4: Neural Networks for Pattern Recognition
Details in another seminar: Neural Networks.

Module 5: Clustering

Supervised Learning vs. Unsupervised Learning
- Supervised learning (the target output is known): for each training input pattern, the network is presented with the correct target answer (the desired output) by a teacher.
- Unsupervised learning (the target output is unknown): for each training input pattern, the network adjusts its weights without knowing the correct target. In unsupervised training, the network self-organizes to classify similar input patterns into clusters.

Clustering
A cluster is a set of patterns that are more similar to each other than to patterns not in the cluster. Given unlabelled samples and no information about the classes, we want to discover whether there are any naturally occurring clusters in the data. Two approaches: clustering by distance measure, and clustering by density estimation.

Clustering by Distance
Two issues: how to measure the similarity between samples, and how to evaluate a partitioning of a set into clusters. Typical distance metrics include the Euclidean distance, the Hamming distance, etc.

Goodness of Partitioning
We can use a measure of the scatter of each cluster to gauge how good the overall clustering is. In general, we would like compact clusters with a lot of space between them. We can use the measure of goodness to iteratively move samples from one cluster to another to optimize the grouping.

Criterion: Sum of Squared Error
This criterion represents each cluster by its mean vector m_i, in the sense that it minimizes the sum of the squared lengths of the errors x - m_i. The optimal partition is defined as the one that minimizes J_e, also called the minimum variance partition. It works fine when the clusters form well-separated compact clouds, less so when there are great differences in the numbers of samples in different clusters.

Criterion: Scatter
Scatter matrices are used as in multiple discriminant analysis: the within-cluster scatter matrix S_W and the between-cluster scatter matrix S_B, with S_T = S_B + S_W, where the total scatter S_T depends only on the set of samples (not on the partitioning). The criterion can be to minimize the within-cluster scatter or to maximize the between-cluster scatter. The trace (the sum of the diagonal elements) is the simplest scalar measure of a scatter matrix, as it is proportional to the sum of the variances in the coordinate directions. The formulas are collected below.
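For reference, the two criteria written out as equations; these are the standard definitions implied by the slide text, with D_i the i-th cluster, n_i its size, m_i its mean, and m the mean of all samples:

```latex
J_e = \sum_{i=1}^{K} \sum_{x \in D_i} \lVert x - m_i \rVert^2 ,
\qquad m_i = \frac{1}{n_i} \sum_{x \in D_i} x ,
\\[4pt]
S_W = \sum_{i=1}^{K} \sum_{x \in D_i} (x - m_i)(x - m_i)^{T} ,\qquad
S_B = \sum_{i=1}^{K} n_i \,(m_i - m)(m_i - m)^{T} ,\qquad
S_T = S_W + S_B .
```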
Iterative Optimization
Once a criterion function has been selected, clustering becomes a problem of discrete optimization. As the sample set is finite, there is a finite number of possible partitions, and the optimal one can always be found by exhaustive search. Most frequently, an iterative optimization procedure is adopted to select the optimal partition. The basic idea is to start from a reasonable initial partition and 'move' samples from one cluster to another, trying to minimize the criterion function. In general, this kind of approach guarantees local, not global, optimization.

K-Means Clustering
The k-means clustering algorithm:
1) Initialization: t = 0; choose random values for the initial centers c_k(t), k = 1, ..., K.
2) Sampling: draw a sample x from the training sample set.
3) Similarity matching: let k(x) denote the index of the best-matching (nearest) center.
4) Updating: for every k = 1, ..., K, update the centers.
5) Continuation: t = t + 1; go back to step (2) until no noticeable changes are observed.
(Figures: the evolution of the centers c1, c2, c3 over iterations.)
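A compact sketch of k-means in Python; note that the slides describe an online, sample-by-sample update, while this batch variant (an assumption made for brevity) updates every centre from its assigned samples.

```python
import numpy as np

def kmeans(X, K, iters=100, seed=0):
    rng = np.random.default_rng(seed)
    centers = X[rng.choice(len(X), K, replace=False)]   # 1) initialization
    for _ in range(iters):
        # 3) similarity matching: index of the nearest center for every sample
        d = np.linalg.norm(X[:, None, :] - centers[None, :, :], axis=2)
        assign = d.argmin(axis=1)
        # 4) updating: each center becomes the mean of its assigned samples
        new_centers = np.array([X[assign == k].mean(axis=0) if np.any(assign == k)
                                else centers[k] for k in range(K)])
        if np.allclose(new_centers, centers):           # 5) stop when nothing changes
            break
        centers = new_centers
    return centers, assign

X = np.vstack([np.random.randn(50, 2), np.random.randn(50, 2) + [5, 5]])
centers, assign = kmeans(X, K=2)
print(centers)  # two centers, near (0, 0) and (5, 5)
```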
Clustering by Density Estimation
Example: finding the nucleus and cytoplasm pixels (pels) in white blood cells. From the image grey-level histogram, set a threshold T at a valley (local minimum): if a pixel's value is above T, the pixel is cytoplasm; if below T, it is nucleus. This is clustering based on density estimation: peaks correspond to cluster centres, valleys to cluster boundaries.

Parameterized Density Estimation
We begin with a parameterized p.d.f., in which the only thing that must be learned is the value of an unknown parameter vector theta. We make the following assumptions:
- the samples come from a known number c of classes;
- the prior probabilities P(omega_j) for each class are known;
- the forms of P(x | omega_j, theta_j), j = 1, ..., c, are known;
- the values of the c parameter vectors theta_1, theta_2, ..., theta_c are unknown.

Mixture Density
The category labels are unknown, and the resulting density function is called a mixture density: P(x | theta) = sum_j P(x | omega_j, theta_j) P(omega_j). Our goal is to use samples drawn from this mixture density to estimate the unknown parameter vector theta. Once theta is known, we can decompose the mixture into its components and use a MAP classifier on the derived densities.

Chinese Ying-Yang Philosophy
Everything in the universe can be viewed as a product of a constant conflict between the opposites Ying and Yang (Ying: negative, female, invisible; Yang: positive, male, visible). The optimal status is reached when Ying and Yang achieve harmony.

Bayesian Ying-Yang Clustering
The aim is to find clusters y that partition the input data x: x is visible but y is invisible; x decides y in training, but y decides x in running. The joint density can be factored in two ways, p(x, y) = p(y | x) p(x) (Yang) and p(x, y) = p(x | y) p(y) (Ying).

Bayesian Ying-Yang Harmony Learning
(1) Model selection: the optimal model (cluster number) is chosen by minimising the difference between the Ying-Yang pair of factorisations.
(2) Parameter learning uses the EM algorithm, alternating an E-step (estimate the posterior p(y | x)) and an M-step (re-estimate the parameters).

Summary (Module 5)
- Clustering by distance: goodness of partitioning, k-means.
- Clustering by density estimation: Bayesian Ying-Yang (BYY) clustering.

Module 6: Feature Selection

Motivation
Classifier performance depends on a combination of the number of samples, the number of features, and the complexity of the classifier.
Q1: the more samples, the better? Q2: the more features, the better? Q3: the more complex, the better?
However, the number of samples is fixed when training; both considerations therefore require reducing the number of features.