云計(jì)算-cloud l主要需求和目標(biāo)_第1頁
云計(jì)算-cloud l主要需求和目標(biāo)_第2頁
云計(jì)算-cloud l主要需求和目標(biāo)_第3頁
云計(jì)算-cloud l主要需求和目標(biāo)_第4頁
云計(jì)算-cloud l主要需求和目標(biāo)_第5頁
免費(fèi)預(yù)覽已結(jié)束,剩余19頁可下載查看

下載本文檔

版權(quán)說明:本文檔由用戶提供并上傳,收益歸屬內(nèi)容提供方,若內(nèi)容存在侵權(quán),請(qǐng)進(jìn)行舉報(bào)或認(rèn)領(lǐng)

文檔簡介

副研究員博士 主要需求和目標(biāo)實(shí)現(xiàn)為程序員隱藏系統(tǒng)層細(xì)需要考慮的細(xì)節(jié)技術(shù)問題如何管理 數(shù)據(jù)?如何劃分?jǐn)?shù)據(jù)如何調(diào)度計(jì)算任務(wù)并分配map和reduce節(jié)點(diǎn)如果節(jié)點(diǎn)間需要共享或交換數(shù)據(jù)如何如何掌控節(jié)點(diǎn)的執(zhí)行完成情況?如何收集中間和最終的結(jié)果節(jié)點(diǎn)失效如何處理?如何恢復(fù)數(shù)節(jié)點(diǎn)擴(kuò)充后如何保證原有程序仍能正常運(yùn) 系統(tǒng)性能提升 TextText1:theweatheris Text2:todayisText3:goodweatheris Text4:todayhasgoodString[]text=args[0];HashTableht=newHashTable();for(i=0;i<3;++i){StringTokenizerst=newStringTokenizer(text[i]);while(st.hasMoreTokens()){Stringword=if(!ht.containsKey(word)){ht.put(word,newInteger(1));}else{intwc((Integer)ht.get(word)).intValue()+1;//計(jì)數(shù)加1ht.put(word,new}}}for(Iteratoritr=ht.KeySet().iterator();itr.hasNext();{Stringword=System.out.print(word+“:”+ }輸出:good: has:1;is: the: today: weather:

鍵值

鍵值

海量數(shù)

鍵值

初始鍵值

Barrier:AggregationBarrier:Aggregationand

中間結(jié)

計(jì)算結(jié)MapReducemap:(k1;v1)→[(k2;reduce:(k2;[v2])→[(k3;)][最終的某種形式的結(jié)果輸出[(k3;v3)]輸出:最終輸出結(jié)果[(k3;使用4個(gè)map節(jié)點(diǎn)map節(jié)點(diǎn)輸入:(text1theweatheris輸出:(the1weather1(is1(goodmap節(jié)點(diǎn)輸入:(text2todayismap節(jié)點(diǎn)map節(jié)點(diǎn)reduce節(jié)點(diǎn)輸入:(good,1),(good,1),(good,1),(good1),(good(is,1),(is,輸出(is,1),(is,輸出good:is:3weather:reduce節(jié)點(diǎn)輸入:(has,1),輸出:(has,1),(is,輸入:(the,1),today1),(today,(weather,1),(weather,1),(weather,輸出:(the,1),(today,2),weatherreduce節(jié)點(diǎn)輸入:(good,1),(good,1),(good,2)(good,(is,1),(is,輸出(is,1),(is,輸出good:is:3weather:reduce節(jié)點(diǎn)輸入:(has,1),輸出:(has,1),(is,輸入:(the,1),today1),(today,(weather,1),(weather,1),(weather,輸出:(the,1),(today,2),weatherHadoop:Mapper&ReducerHadoop:DriverHadoop:Run HadoopMapReducesplitsplitsplitsplit

partpartpartpartsplitsplitReducershas3primaryphases:shuffle,sortandHadoop:HowmanyTasksetuptakesawhile,soitisbestifthemapstakeatleastaminuteto用戶可以顯示配置Configuration.set(MRjobConfig.NUM_MAPSintbutitonlyprovidesahinttotheframework10TBofinputdata,blocksizeisYouwillendupwith80KHadoop:HowmanyThenumberofreducersforthejobissetbytheuserTherightnumberofreducesseemstobe0.95or1.75multipliedby(<no.of<no. umcontainersperWith0.95allofthereducescanlaunch yandstarttransferringmapoutputsthemapsWith1.75thefasternodeswillfinishtheirfirstroundofreducesandlaunchasecondwave ngamuchbetterjobofloadIncreasingthenumberofreducesincreasestheframeworkoverhead,butloadbalancingandlowersthecostofHadoop:PartitionercontrolsthepartitioningofthekeysoftheintermediateThetotalnumberofpartitionsisthesameasthenumberofreducetasksforthejob.Hencethiscontrolswhichofthemreducetaskstheintermediatekey(andhencetherecord)issenttoforreductionHashPartitioneristhedefaultHadoop:JobJobrepresentsaMapReducejobJobistheprimaryinterfaceforausertodescribeaMapReducejobtoHadoopframeworkforJobistypicallyusedtoHadoop:JobOptionally,JobisusedtospecifyotheradvancedfacetsofthejobsuchFilestobeputintheWhetherintermediateand/orjoboutputsaretobecompressed(andWhetherjobtaskscanbeexecutedinaspeculativeumnumberofattemptsperUserscanuseConfiguration.set(String,String)/Configuration.get(String)toset/getarbitraryparametersneededbyapplicationsCountersrepresentglobalCounterscouldbedefinedMapReduceInthemapand/orreducemethods,applicationscandefinearbitraryCounters(oftypeEnum)andupdatethemvia:Counters.incrCounter(Enum,Counters.incrCounter(String,String,ThesecountersarethengloballyaggregatedbytheHadoop:CurrentlyFileInputFormat,TextInputFormat,SequenceFileInputFormat,CombineFileInputFormat,CombineSequenceFileInputFormat,CombineTextInputFormat,CompositeInputFormat,FixedLengthInputFormat,SequenceFileAsBinaryInputFormat,Seq

溫馨提示

  • 1. 本站所有資源如無特殊說明,都需要本地電腦安裝OFFICE2007和PDF閱讀器。圖紙軟件為CAD,CAXA,PROE,UG,SolidWorks等.壓縮文件請(qǐng)下載最新的WinRAR軟件解壓。
  • 2. 本站的文檔不包含任何第三方提供的附件圖紙等,如果需要附件,請(qǐng)聯(lián)系上傳者。文件的所有權(quán)益歸上傳用戶所有。
  • 3. 本站RAR壓縮包中若帶圖紙,網(wǎng)頁內(nèi)容里面會(huì)有圖紙預(yù)覽,若沒有圖紙預(yù)覽就沒有圖紙。
  • 4. 未經(jīng)權(quán)益所有人同意不得將文件中的內(nèi)容挪作商業(yè)或盈利用途。
  • 5. 人人文庫網(wǎng)僅提供信息存儲(chǔ)空間,僅對(duì)用戶上傳內(nèi)容的表現(xiàn)方式做保護(hù)處理,對(duì)用戶上傳分享的文檔內(nèi)容本身不做任何修改或編輯,并不能對(duì)任何下載內(nèi)容負(fù)責(zé)。
  • 6. 下載文件中如有侵權(quán)或不適當(dāng)內(nèi)容,請(qǐng)與我們聯(lián)系,我們立即糾正。
  • 7. 本站不保證下載資源的準(zhǔn)確性、安全性和完整性, 同時(shí)也不承擔(dān)用戶因使用這些下載資源對(duì)自己和他人造成任何形式的傷害或損失。

評(píng)論

0/150

提交評(píng)論