版權(quán)說明:本文檔由用戶提供并上傳,收益歸屬內(nèi)容提供方,若內(nèi)容存在侵權(quán),請(qǐng)進(jìn)行舉報(bào)或認(rèn)領(lǐng)
文檔簡(jiǎn)介
StructuredModelingandLearninginGeneralizedDataCompressionandProcessingHongkaiXiong熊紅凱電子工程系上海交通大學(xué)13Jan.2016VALSE-Webinar
LectureSparseRepresentation
Sparserepresentation
where
ΨLNx=θΣMultimediaCommunicationHighdimensionUnicastOne-to-oneMulticastOne-to-manyMulticastMany-to-manyNetworksVideocoding:advancesinhigherdimensionandhigherresolution,ingoalsofbetterR-DbehaviorandgreatercompressionrateNetworks:developstomultipledatastreamingwithinheterogeneousnetworkstructure,ingoalsofhigherthroughputandtransmissionreliabilitySources1Dsignal(audio)2Dsignal(image)3Dsignal(video)SVCDVC3
4GeneralizedContextModelinginSignalProcessingWenruiDai,HongkaiXiong,J.Wang,S.Cheng,Y.F.Zheng,"GeneralizedContextModelingwithMulti-DirectionalStructuringandMDL-basedModelSelectionforHeterogeneousDataCompression",IEEETrans.SignalProcessing,2015.WenruiDai,HongkaiXiong,J.Wang,etal.,“Discriminativestructuredsetpredictionmodelingwithmax-marginMarkovnetworkforlosslessimagecoding,”IEEETransactionsonImageProcessing,2014.WenruiDai,HongkaiXiong,X.Jiang,etal.,“Structuredsetintrapredictionwithdiscriminativelearninginmax-marginMarkovnetworkforHighEfficiencyVideoCoding,”IEEETransactionsonCircuitsandSystemsforVideoTechnology,vol.23,no.11,pp.1941-1956,2013.HeterogeneousDataCompression:DataHeterogeneousdataaregeneratedbymultipleinterlacingsourcescompliedwithdifferentincompletedistributionstatisticsImage
&
Video:
SpatialcorrelationsarecharacterizedbypiecewisesmoothwithlocaloscillatorypatternslikemultiscaleedgesandtexturesGenome
sequence:
RepeatablepatternsofnucleotidesinvariousregionsExecutable
files:
Multipleinterlacingdatastreams,e.g.opcode,displacement,andimmediatedataHeterogeneousDataCompression:FrameworkContext
ModelData
to
be
predictedEstimated
probabilityCoderEncoded
bitstream01011101……Structured
probabilistic
model
Capture
regular
patternsOptimized
for
specific
dataContext-based
set
predictionClassic
context
modelVariable
orderSequential
predictionWeighted
estimationContextissuffixofdatasymbols
inclassicalcontextmodeling.Definethesetofsubsequenceswhosesuffixiss.
Disjointproperty:nostringinthecontextsetisasuffixofanyotherstringinthisset,orfor,;
Exhaustiveproperty:eachsubsequenceofdatasymbolscanfinditssuffixinthecontextset,or.7BackgroundGraphicalprobabilisticmodel
Structuredpredictionmodelcanberepresentedintheformofgraph.Eachnodefortherandomvariabletopredictandeachedgefortheinterdependenciesamongnodes.Estimatingjointorconditionaldistributionforitsnodes.Learningmethods
Markovrandomfield(MRF)Max-marginMarkovnetwork:MRF+SVMReasoningalgorithmsBeliefpropagation(BP)Expectationpropagation(EP)Foundation:StructuredPredictionModelStructured
Probabilistic
Model:
Motivation
ComplexdatastructureCannotbeanalyticallyrepresented
Adaptivelycapturefeatureswithlearning-basedalgorithmsIncompletedistributionCannotexactlyestimateparametersforactualdistribution
Context-basedpredictivemodelusinglearning-basedalgorithmsStructuralcoherenceCannotguaranteestructuralcoherenceofpredictiontaskwithisolatedprediction
Structuredprobabilisticmodeltoconstrainpredictiontaskwithstructuralcoherence10StructuralcoherenceAnexample:ImagesgeneratedwiththesamelocaldistributionAnaturalimagesisnota2Darrayofpixelsgeneratedwithprobabilisticdistribution.Thestructuralcoherenceismaintainedtokeepanimagemeaningful.
Structuredpredictionmodelisproposedtomaintainsuchcoherence.Intuition1:StructuralcoherenceforheterogeneousdataMotivationPerceptualIntuitionIntuition2:ComplexstructureforheterogeneousdataStatisticsofheterogeneousdataisnotsequentialandwithuniformdistribution,butisflexibleandwithinterlacedcomplexdistribution.Predictionbasedonsequential,contextswithuniformdistributionGeneralizedContextModelingMotivationPredictionbasedonflexiblyconstructedcontextswithinterlacedcomplexdistributionContribution:Structureconsistence
→
StructuredSetProblemDefinitionChallenge:LinearPredictionwithouthigh-ordernonlinearityIndependentPredictionwithoutinter-dependencySimilarPDF12Pixel-wisepredictiontoimpairthestructureParallelpredictiontokeepstructureStructuredProbabilisticModel:
ExampleLeastSquaresMSE:82.06StructuredProbabilisticModelMSE:68.754x4blockatcoordinate(401,113)inLENATheoreticalsupport:SequentialSourceCodingMotivationHarishViswanathan&Berger2000:
GivenrandomvariableX1andX2,underarbitrarydistortionD1andD2,therateforjointlydescribingthemisnogreaterthantheratetodescribethemseparately.Encoder1Encoder2Decoder1Decoder2ContributionGCMforheterogeneousdatacompressionStructuredprobabilisticmodelforgenomecompressionMRFfordependencybetweensideinformation&optimizedwithBPStructuredprobabilisticmodelforlosslessimagecodingM3NforjointspatialstatisticsandstructuralcoherenceStructuredprobabilisticmodelforintra-framevideocodingM3NoptimizedwithEPContributionWenruiDai@SJTU,201416GCMImageGenomeVideoExecutablesUniversalCodingHeterogeneousDataLearningSyntax&SemanticsInputspaceFeaturespaceDataDependencyHeterogeneousdataGenomicdataLongrepeatswithexceptionofinsertion,deletion,andsubstitution.ImageandVideoSpatiallyspannedalongthestructures,e.g.edge,texture,andetc.ExecutablesInterlaceddatastreams,e.g.opcodes,immediatedata,andetc.Background:HeterogeneousDataDefinition:
HeterogeneousData
Heterogeneousdataisgeneratedbyinterlacing
Mdatastreamswithatime-varyingrandomprocessThej-thdatastreamisemittedfromastationaryMarkovsourcewithorderSymbolisobtainedfromthe-thdatastreambyandisnotstationaryandwidesensestationaryClueInMemoryOfBenTaskar(1977-2013)Arisingstarinmachinelearning,computationallinguisticsandcomputervisionThefounderofMax-MarginMarkovNetworkPredictivemodelsforheterogeneousdatacompressionContextModelSymbolsforpredictingEstimatedprobabilityCodingengineEnocdedbits01011101……CaptureintrinsicstructureofcomplexdataFindadaptationtospecificdataOptimalsetpredictionbasedonobservedcontextsStructuredpredictionmodelGeneralizedContextModeling Scenario:CodingBasedonContextModeling
CurrentsymbolExtendedcontextSequentialcontextContextwithcombinatorialstructuringContextwithmulti-dimensionalextension2-DcontextM-DcontextGeneralizedcontextmodeling(GCM)withcombinatorialstructuring&multi-dimensionalextensionGeneralizedContextModelingTopologyIllustration
GraphicalmodelforGCMwithD-orderandM-directionalcontext.Symbolsforpredictingarecorrelatedwiththeirneighboringsymbols.Componentofcontextineachdirectionisservedasanobservationfortheprediction.Conditionalrandomfieldrepresentdependenciesamongsymbolsforpredictingandcontext-basedcorrelations.SymbolsforpredictingContextsGeneralizedContextModelingGraphicalModelforPredictionDefinition:
Context
SetGivencontexts,thesetofsubsequencescontainingsiswhereistheindexsetofs.
isavalidgeneralizedcontextmodel,ifitsatisfiesineachofitsdirectionExhaustiveproperty:foranysubsequenceinj-thdirection,thereexistssinsuchthatDisjointproperty:foranysubsequenceinj-thdirection,givenarbitrarys
ands’Modeling
&
Prediction:
Model
GraphTrellis-likegraphrootedfromM-aryvector(?,···
,?)Eachnodecorrespondstoanindexsetforfiniteordercombinationofpredictedsymbols.Givennode,ItssucceedingnodesatisfiesthatItsprecedingnodestatisfiesthat
possiblecontextstructureslocatinginDM+1verticalslicesforGCMwithgivenMandD.GeneralizedContextModeling ModelTreeexampleRepresentation
&
Prediction:Model
GraphModelgraphwithdepthD=3andM=2directionsSolid(red)anddashed(blue)pathssharesomecommonnodesModeltreetorepresentgeneralizedcontextmodelsandtheirsuccessiverelationship.
Minimumdescriptionlength(MDL)principletoselectoptimalclassofcontextmodels.
Normalizedmaximumlikelihood(NML)todetermineoptimalweightedprobabilityforprediction.GeneralizedContextModeling ProblemStatement
GeneralizedContextModeling ModelTreeModel
Selection:
Separable
Context
ModelingPredictionbasedoncontextswithmulti-directionalstructuringcanbemadeseparatelyineachofitsdirections.Given
whereanditselementsareThesizeofmodelclassgrowslinearlywithMNMLfunctionwithMDLprincipleformodelselection.ContextsineachdirectionarecomparedwiththeNMLfunctiontofindoptimalcontextforpredictingcurrentsymbol.ForM-interlacedautoregressivesources,themodelcomplexityisconstantwithdatasizeN.ModelcomplexityCodeassignmentfunctionGeneralizedContextModelingModelSelectionEstimatedprobabilityforgeneralizedcontextmodeling
Inasequentialway,foreachsymbolForeachcontexts,itsweightsis
whereGeneralizedContextModelingWeightedProbabilityforPredictionGivengeneralizedmodelclassMwithmaximumorderDandMdirections,themodelredundancyledbymulti-directionalextensioniswhereListhesizeofalphabet,η
isthecompensationforvariouscontexts.Themodelredundancyledbymulti-directionalextensiononlydependsonthemaximumorderDandthenumberofdirectionsM,butisindependentofsizeofdataN.GeneralizedContextModelingModelRedundancyInCalgarycorpus,GCMoutperformsCTWby7%-12%inexecutablefilesandseismicdata.Inexecutablefilecompression,GCMoutperformsPPMdandPPMonstrby10%and4%,respectively.GCMiscomparativetothebestcompressorPAQ8withlesscomputationalcomplexity.
ML-baseddoesnotfullyexploitthestatisticsinheterogeneousdata.Asanalternative,learningofstructuredpredictionmodelisproposed.GeneralizedContextModelingExperimentalResultsGivengeneralizedmodelclass
MwithmaximumorderDandMdirections,themodelredundancyledbycombinatorialstructuringiswhereListhesizeofalphabet,η
isthecompensationforvariouscontexts.Themodelredundancyledbymulti-directionalextensiononlydependsonthemaximumorderD,butisindependentofsizeofdataN.34ModelRedundancy
Discriminativepredictiondistinguishestheactualvaluesofpixelswithotherpossibleestimationstothemaxmarginbasedoncontexts,butcannotutilizethestructureforpredictions.
Markovnetworkmaintainsthestructuralcoherenceintheregionsforpredictingbutcannotoptimizethecontext-basedprediction.
Jointoptimizationbymax-marginMarkovnetworkConceptualDiagram:ImageContext-basedpredictionforeachpixelImagingconstraintsforsetofpixelsStructuralcoherenceSamplingEncodingImagingconstraintsforsetofpixelsContext-basedpredictionforeachpixelStructuralcoherenceDecodingReconstructionFlowingdiagramforstructuredsetpredictionmodelDiagram
Givenytheblockofencodingpixelsandxthereconstructedpixelsascontexts,itspredictionisderivedintheconcurrentform.
Localspatialstatisticsisrepresentedbythelinearcombinationoftheclassoffeaturefunctions.CollectionoffeaturefunctionsTrainedmodelparametersPredictionJointoptimizationStructuralcoherenceLossfunctionFeaturefunctionTrainedmodelparameterModelparameterwistrainedoverthecollectionoftrainingdataS={xi,yi}.
Thefeaturefunctions{fi}establishtheconditionalprobabilisticmodelforpredictionbasedonthevariouscontextsderivedfromthesupposedpredictivedirection
isthelossfunctionthatevaluatesthepredictionandadjuststhemodelparameterw.TrainingTheM-aryestimatedoutput?(i)forblockofpixelsyismeasuredoverthegeneratedgraphicalmodel.Log-GaussianfunctionfornodecliqueandDiracfunctionforedgeclique.Withpredictionerror?i=?(i)-y(i)andvarianceσ2overerrorsLossFunctionStandardquadraticprogramming(QP)forsolvingthemin-maxformulationsuffersthehighcomputationalcostfortheproblemswithlargealphabetsize.Asanalternative,itsdualisproposed.Sequentialminimaloptimization(SMO)breaksthedualproblemintoaseriesofsmall(QP)problemsforcliquesandtakesanascentsteptomodifytheleastnumberofvariableswhereandαi(y)isthemarginaldistributionfortheithclique.SMOiterativelychoosesthepairsofywithrespecttotheKKTconditionforsolution.SolutionJunctiontreeforloopyMarkovnetwork.Eachjunctionisgeneratedbyaddingedgestolinkcliques.Junctiontreeisunique.EachcliqueispredictedalongthejunctiontreeBeliefPropagation(BP)asmessagepassingalgorithmforinferenceandupdatethepotentialofeachcliqueSolutionTheoreticalupperboundforpredictionerrorTheorem:Giventhetrainedweightingvectorwandarbitraryconstantη>0,thepredictionerrorisasymptoticallyequivalenttotheoneobtainedoverthetrainingdatawithprobabilityatleast1-exp(-η).UpperboundforaveragepredictionerrorUpperboundforγ-relaxedaveragetrainingerrorAdditionaltermconvergestozerowhenNgrowsRemark:Thepredictionerrorisupper-boundedbythewell-tunedtrainingerror.TheTheoremensuresthepredictiveperformanceofthestructuredpredictionmodel.UpperBoundofPredictionErrorInviewofprobability,giventhetrainedweightingvectorwandarbitraryconstantη>0,withsufficientsampling,thereexistsε(L,γ,N,η)→0,satisfyingThepredictionerrorisupper-boundedbythewell-tunedtrainingerror.TheTheoremensuresthepredictiveperformanceofthestructuredsetpredictionmodel.UpperBoundofPredictionErrorCombiningwithvariance-basedpredictorforsmoothregions,structuredsetpredictionservesasanalternativemodeComparingthecodingcostoftwoalternativemodesfortheoptimaloneLog-GaussianlossfunctiontoobtainoptimalcodingoftheresidualbasedonallegedGaussiandistribution.ImplementationExperimentalResultsImageProposedMRPBMFTMWCALICJPEG-LSJPEG2000HDPhotoAirplane3.5363.5913.6023.6013.7433.8174.0134.247Baboon5.6355.6635.7145.7385.6666.0376.1076.149Balloon2.5482.5792.6492.6492.8252.9043.0313.320Barb3.7643.8153.9594.0844.4134.6914.6004.836Barb24.1754.2164.2764.3784.5304.6864.7895.024Camera3.9013.9494.0604.0984.1904.3144.5354.959Couple3.3233.3883.4483.4463.6093.6993.9154.318Goldhill4.1734.2074.2384.2664.3944.4774.6034.746Lena3.8773.8893.9293.9084.1024.2384.3034.477Peppers4.1634.1994.2414.2514.2464.5134.6294.850
PerformanceexceedsJPEG-LSby10%andJPEG2000losslessmodeby14%inaverageinbitsperpixel.
Performanceexceedstheminimumratepredictor(MRP,theoptimalpredictor)by1.35%inaverageinbitsperpixel.JointoptimizationStructuralcoherenceLossfunctionFeaturefunctionTrainedmodelparameterOptimaljointpredictionbymax-marginMarkovnetwork
Max-marginestimationdirectlyconditionedonthepredictedpixelsforcontext-basedprediction
Markovnetworktomaintainthestructuralcoherenceintheregionsforpredicting.ConceptualDiagram:VideoConceptualdescriptionforstructuredpredictionmodelLaplacianlossfunctionfortheM-aryestimatederror.Laplacianerrorsderivedforeachnodeandthestatetransitionoftheneighboringnodesforeachedge.Foreachnode,itspredictionerroriswheretheerror?i=?(i)-y(i)andvarianceσ2overerrors.LaplacianlossfunctionmeetswithDCTtransform.ThestructuredpredictionmodeloptimizeitforminimalcodinglengthunderHEVCframework.LossFunctionExpectationPropagationforMessagePassingUtilizeSMOforSolvingthestandardquadraticprogramming(QP)forthemax-marginMarkovnetwork.Accordingly,junctiontreeisgeneratedandmessagepassingalgorithmisconductedalongjunctiontreeforthemostprobablestatesofeachpixel.Thelossyintravideocodingdoesnotrequiretopropagatetheactualstatesalongthejunctiontree.Statisticslikemeansandvariancecannotbeselectedandpropagatedforrobustmessagepassingwithconvergence.Expectationpropagation(EP)utilizessuchstatisticssothattheactualdistributionisapproximatedwiththeexponentialfamily.Themetricforapproximationcanbevariedbasedonthevideodata.
PredictionbasedonEPisproventoconvergetoanupperbound.Structuredpredictionmodelasanalternativemode:MODE_STRUCTIntegrationintocurrentHEVCframeworkwithoutadditionalsyntaxelementModedecisionbyrate-distortionoptimizationLaplacian-basedlossfunctionfortheresidualobtainingbestcodingperformanceunderDCTtransformImplementationForeman_352×288BlowingBubbles_416×240BQMall_832×480Cactus_1920×1080
PerformanceexceedsHEVCcommontestmodelby2.26%inBD-rates.ThegaininBD-PSNRisupto0.38dB.
PerformanceexceedsHEVCwithcombinedintracoding(CIP)by1.31%inBD-rates.ExperimentalResultsProteinPhenotypeDNA(Genotype)PurineBases:Adenine(A);Guanine(G)PyramidineBases:Thymine(T);Cytosine(C)RNA51Thecentraldogmaofmolecularbiology:Aframeworkforunderstandingthenormalflowforthetransferofsequenceinformationbetweensequentialinformationcarrier.ProteinsareconstructedaccordingtoDNA.ConceptualDiagram:GenomeSequence52Background:CompressiveStructuresinGenomicDataTGTCTGCAGCAGCCGCTTGTCTGCAACAGCCGCTTGTCTGCACAGCCGCTTGTCTGCAGGCAGCCGCTACAGACGTGTCGGCGAReversiblePalindromeDeletionof‘G’Substitutionof‘G’with‘A’Insertionof‘G’DNAsequenceisrepeatedpatternsofnucleotides,namely‘A’,‘T’,‘G’,and‘C’Approximaterepeats:Exactrepeat,Insertion,Deletion,andSubstitutionReversiblepalindrome:substitute‘A’with‘T’and‘G’with‘C’,andviceversaReference-basedMethodsRLZ
RelativeLZcompressionwithrelatedreferencesequencesindicatedbytheirself-indexCannothandlealphabetotherthan{A,T,G,C,N}GRS
GeneralGenomeReSequencingtoolConsiderchromosomevariedsequencepercentageGReEn
CopymodelformatchingofexactrepeatsinreferenceStatisticalmodelforestimatingtheprobabilitiesofmatchingBackgroundMainConcernsApproximateorexactrepeatsofnucleotides
Variablerepeatsizes&offsetsofrepeats.
Exceptionofinsertion,deletionandsubstitutionofnucleotidesinrepeatsMotivations
Differencesbetweentargetandreferencesequencearenotuniformlydistributed,butsparseforcoding.Sideinformation,e.g.sizesandoffsetsarecorrelated,whichcanbepredictedbasedonstructuredpredictionmodel.MotivationUnderthehierarchicalpredictionstructure,thereferenceisselectedaccordingtothelossfunctionDifferencesequenceisthezerosequencewithemergenceofnon-zerosymbolatalowfrequency.Itissuitableforwavelettransformandsubsequentbitplanecoding.
Markovrandomfieldisestablishedforcorrelatedsideinformation,whichispredictedandupdatedwithBPalgorithm.FrameworkExampleforhierarchicalpredictionstructure:A(sub-)fragmentof16nucleotidesispredictedbasedona(sub-)fragmentof16nucleotidesinreferencesequence
溫馨提示
- 1. 本站所有資源如無特殊說明,都需要本地電腦安裝OFFICE2007和PDF閱讀器。圖紙軟件為CAD,CAXA,PROE,UG,SolidWorks等.壓縮文件請(qǐng)下載最新的WinRAR軟件解壓。
- 2. 本站的文檔不包含任何第三方提供的附件圖紙等,如果需要附件,請(qǐng)聯(lián)系上傳者。文件的所有權(quán)益歸上傳用戶所有。
- 3. 本站RAR壓縮包中若帶圖紙,網(wǎng)頁(yè)內(nèi)容里面會(huì)有圖紙預(yù)覽,若沒有圖紙預(yù)覽就沒有圖紙。
- 4. 未經(jīng)權(quán)益所有人同意不得將文件中的內(nèi)容挪作商業(yè)或盈利用途。
- 5. 人人文庫(kù)網(wǎng)僅提供信息存儲(chǔ)空間,僅對(duì)用戶上傳內(nèi)容的表現(xiàn)方式做保護(hù)處理,對(duì)用戶上傳分享的文檔內(nèi)容本身不做任何修改或編輯,并不能對(duì)任何下載內(nèi)容負(fù)責(zé)。
- 6. 下載文件中如有侵權(quán)或不適當(dāng)內(nèi)容,請(qǐng)與我們聯(lián)系,我們立即糾正。
- 7. 本站不保證下載資源的準(zhǔn)確性、安全性和完整性, 同時(shí)也不承擔(dān)用戶因使用這些下載資源對(duì)自己和他人造成任何形式的傷害或損失。
最新文檔
- 校園午餐食品安全承諾
- 2025屆高考英語一輪復(fù)習(xí)讀后續(xù)寫說課課件
- 建設(shè)工程施工合同書-2
- 山東省濟(jì)南市(2024年-2025年小學(xué)四年級(jí)語文)統(tǒng)編版摸底考試((上下)學(xué)期)試卷及答案
- 甘肅省甘南藏族自治州(2024年-2025年小學(xué)四年級(jí)語文)人教版綜合練習(xí)(上學(xué)期)試卷及答案
- 甘肅省酒泉市(2024年-2025年小學(xué)四年級(jí)語文)人教版摸底考試((上下)學(xué)期)試卷及答案
- 2022年秋10月自考05679憲法學(xué)練習(xí)考題含解析
- 2021年上半年全國(guó)自考憲法學(xué)模擬試題含解析
- 2021年春季自考法學(xué)專業(yè)本科憲法學(xué)試題含解析
- unit 4 課件教學(xué)課件
- 從倫敦鎳期貨事件看中外交易所風(fēng)險(xiǎn)防控差異
- 一年級(jí)下冊(cè)語文課件-經(jīng)典誦讀-小豬唏哩呼嚕 (共20張PPT)部編版
- 中國(guó)碳交易市場(chǎng)展望與林業(yè)碳匯前景解析
- 思想政治教育專業(yè)師范生雙導(dǎo)師制實(shí)施辦法
- 風(fēng)電場(chǎng)建設(shè)項(xiàng)目綠色施工方案
- 繪本《大腳丫跳芭蕾》PPT課件
- 《漢服》PPT課件(完整版)
- 自考00456教育科學(xué)研究方法復(fù)習(xí)要點(diǎn)
- 為中華之崛起而讀書設(shè)計(jì)
- 新起點(diǎn)小學(xué)一至六年級(jí)英語單詞帶音標(biāo)
- 熱處理調(diào)質(zhì)工藝守則及操作規(guī)程
評(píng)論
0/150
提交評(píng)論