數(shù)據(jù)記錄倡議DDI課件_第1頁
數(shù)據(jù)記錄倡議DDI課件_第2頁
數(shù)據(jù)記錄倡議DDI課件_第3頁
數(shù)據(jù)記錄倡議DDI課件_第4頁
數(shù)據(jù)記錄倡議DDI課件_第5頁
已閱讀5頁,還剩77頁未讀, 繼續(xù)免費閱讀

下載本文檔

版權(quán)說明:本文檔由用戶提供并上傳,收益歸屬內(nèi)容提供方,若內(nèi)容存在侵權(quán),請進行舉報或認(rèn)領(lǐng)

文檔簡介

1、International Workshop on Introduction to the DDI and the IHSN Microdata Management ToolkitBeijing, 17-19 June 2013UNITED NATIONSDEPARTMENT OF ECONOMIC AND SOCIAL AFFAIRSSTATISTICS DIVISIONNATIONAL BUREAU OF STATISTICS OF CHINADDI元數(shù)據(jù)標(biāo)準(zhǔn)及IHSN國際住戶調(diào)查網(wǎng)絡(luò)微觀數(shù)據(jù)管理工具國際培訓(xùn)班北京,2013年6月17日-19日聯(lián)合國經(jīng)濟和社會事務(wù)部統(tǒng)計司中華人民共和國國

2、家統(tǒng)計局Generic Statistical Business Process Model (GSBPM)Workshop objectives - ContextDescribes statistical processes (e.g., implementation of a survey) in 9 phases, each divided into sub-processes.A convenient tool for assessment, planning of statistical processes.Specify the needsDesignBuildCollectPr

3、ocessAnalyzeDisseminateArchiveEvaluateMetadata ManagementQuality Management3通用統(tǒng)計業(yè)務(wù)流程模型 (GSBPM)培訓(xùn)班目標(biāo) 背景描述統(tǒng)計流程(例如,實施一項調(diào)查)的9個階段,每個階段有各自的子流程。一個用來評估與規(guī)劃統(tǒng)計流程的便利工具。指明需求設(shè)計建立收集 處理分析傳播存檔評估元數(shù)據(jù)管理質(zhì)量管理4The workshop will introduce standards and tools for: Workshop objectivesMetadata managementThe DDI standardIHSN M

4、etadata EditorDisseminationPolicy, technical and ethical issuesNADA softwareArchivingPreservation of digital informationSpecify the needsDesignBuildCollectProcessAnalyzeDisseminateArchiveEvaluateMetadata ManagementQuality Management5培訓(xùn)班介紹標(biāo)準(zhǔn)和工具的目的是:培訓(xùn)班目標(biāo)元數(shù)據(jù)管理DDI標(biāo)準(zhǔn)IHSN元數(shù)據(jù)編輯軟件傳播政策,技術(shù)和道德問題NADA軟件存檔數(shù)字信息保存

5、指明需求設(shè)計建立收集處理分析傳播存檔評估元數(shù)據(jù)管理質(zhì)量管理6Part 1 Documenting your surveys and censuses using the DDI Metadata Standard and the IHSN Metadata Editor (Nesstar Publisher)Metadata management7第1部分 使用DDI元數(shù)據(jù)標(biāo)準(zhǔn)以及IHSN元數(shù)據(jù)編輯軟件(Nesstar發(fā)布軟件)記錄您的調(diào)查和普查元數(shù)據(jù)管理8To increase the credibility and transparency of their statistical out

6、putsTo preserve institutional memory To allow replication of data collection and analysisTo allow re-use or re-purposing of the metadataWhy do data producers need metadata?9為了增加其統(tǒng)計輸出的公信力和透明度為了保持機構(gòu)記憶為了允許復(fù)制數(shù)據(jù)收集和分析為了允許重復(fù)使用或重新利用元數(shù)據(jù)為何數(shù)據(jù)生產(chǎn)者需要元數(shù)據(jù)?10To fully understand the (micro)data and make good use of t

7、hemTo minimize the risk of misuse/misinterpretation, users need to fully understand the data. Why, by whom, when, and how data were collected and processed are important information.For making data discoverable in on-line catalogsUsers will know about the availability of your data by searching or br

8、owsing detailed metadata catalogs. Why do data users need metadata?11為了充分認(rèn)識(微觀)數(shù)據(jù)并很好的利用他們?yōu)榱吮M量減少誤用/曲解的風(fēng)險,使用者需要充分了解數(shù)據(jù)。數(shù)據(jù)收集和處理的重要信息包括:目的,收集者/處理者,時間和方式。為了便于搜尋在線目錄中的數(shù)據(jù)使用者通過搜索或瀏覽詳細(xì)的元數(shù)據(jù)目錄,將會知道是否可以獲得您的數(shù)據(jù)。為何數(shù)據(jù)使用者需要元數(shù)據(jù)?12The Data Documentation Initiative (DDI) metadata standard helps structure, preserve and s

9、hare survey or census metadataThe IHSN Microdata Management Toolkit, a.k.a. Nesstar Publisher, provides a free and user friendly solution to document and catalog surveys/censuses in compliance with the DDI standard and international best practicesStandards and tools13數(shù)據(jù)記錄倡議(DDI)元數(shù)據(jù)標(biāo)準(zhǔn)有助于結(jié)構(gòu)化,保存和分享調(diào)查或普

10、查的元數(shù)據(jù)IHSN 國際住戶調(diào)查網(wǎng)絡(luò)微觀數(shù)據(jù)管理工具包,又名Nesstar發(fā)布軟件,為記錄并編目符合DDI元數(shù)據(jù)標(biāo)準(zhǔn)和國際最佳實踐的調(diào)查/普查,提供了一個免費且用戶友好的解決方案。標(biāo)準(zhǔn)和工具14A checklist of what you need to know about a study and its datasetA structured and comprehensive list of hundreds of elements that may be used to document a survey datasetAn XML metadata standardDevelope

11、d by academic data centers / the DDI Alliance.Designed to encompass the kinds of data generated by surveys, censuses, administrative records.For microdata, not indicators.Two versions: Version 2.n (DDI codebook), used by the IHSN ToolkitVersion 3.n (DDI life cycle)What is the DDI?15一張列有您所需要知道的,有關(guān)一個研

12、究及其數(shù)據(jù)集信息的核對表一張結(jié)構(gòu)化的綜合列表,包含數(shù)百個元素,可用來記錄一項調(diào)查的數(shù)據(jù)集。一個XML格式的元數(shù)據(jù)標(biāo)準(zhǔn)由學(xué)術(shù)數(shù)據(jù)中心/DDI聯(lián)盟開發(fā)。旨在涵蓋由調(diào)查,普查,行政記錄產(chǎn)生的這類數(shù)據(jù)。用于微觀數(shù)據(jù),而非指標(biāo)。兩個版本:版本2.n (DDI碼本), 用于IHSN國際住戶調(diào)查網(wǎng)絡(luò)工具包版本3.n (DDI生命周期)什么是DDI元數(shù)據(jù)標(biāo)準(zhǔn)?16XML stands for eXtensible Markup Language. It is used to structure information to be shared on the Web or exchanged between s

13、oftware systems. XML is a file format, readable by any text editor (e.g., Notepad).XML tags text for meaning. HTML tags text for appearance. The “tags” are conceptually the same as “fields” in a database. In an XML file, the information is wrapped between an opening tag and a closing tag. The tag na

14、me indicates its content. What is XML ?17XML代表可擴展標(biāo)記語言,用于結(jié)構(gòu)化在網(wǎng)絡(luò)上共享或在軟件系統(tǒng)之間交換的信息。XML是一種文件格式,在任何文本編輯器(例如:Notepad)上可讀。XML語言的標(biāo)簽文本具有內(nèi)容含義。HTML語言的標(biāo)簽文本用于文字外觀。在XML語言下的數(shù)據(jù)庫中,“標(biāo)簽”和“字段”在概念上是相同的。在一個XML文件中,信息被包裹在開始標(biāo)簽和結(jié)束標(biāo)簽之間。標(biāo)簽名稱表示其內(nèi)容。什么是XML?18“The National Statistics Office (NSO) of Popstan conducted the Multiple I

15、ndicators Cluster Survey (MICS) with the financial support of UNICEF. 5,000 households, representing the overall population of the country, were randomly selected to participate in the survey, following a two-stage stratified sampling methodology. 4,900 of these households provided information.” In

16、XML/DDI this would look like this: Multiple Indicator Cluster Survey 2005 MICS 2005 National Statistics Office (NSO) United Nations Children Fund Popstan National 5,000 households, stratified two stages 98 percent DDI and XML - An example19“Popstan國國家統(tǒng)計局(NSO)在聯(lián)合國兒童基金會(UNICEF)的資金支持下,開展了多指標(biāo)類集調(diào)查(MICS)。

17、調(diào)查采用二階段分層抽樣法,從參與這項調(diào)查的全國總?cè)丝谥?,隨機抽取了5000戶家庭作為代表全體的樣本。其中4900戶家庭提供了信息。”在XML/DDI中,以上內(nèi)容呈現(xiàn)如下:多指標(biāo)類集調(diào)查 2005MICS 2005國家統(tǒng)計局 (NSO)聯(lián)合國兒童基金會Popstan國全國5000戶家庭, 二階段分層抽樣百分之98DDI和XML - 例子20Can be transformed into many kinds of outputs:Databases, HTML, PDF, on-line catalogs, othersPlain text files. Not specific to any

18、operating system or applicationEasy to generate using specialized tools such as the IHSN Metadata EditorAdvantage of XML21可以轉(zhuǎn)化為多種輸出:數(shù)據(jù)庫、HTML、PDF、在線目錄,及其他純文本文件,不是某個操作系統(tǒng)或應(yīng)用程序的特定文件使用特定工具生成非常便捷,例如IHSN國際住戶調(diào)查網(wǎng)絡(luò)元數(shù)據(jù)編輯軟件XML的優(yōu)勢22The DDI elements are organized in five sections:Document Description. Used to doc

19、ument the documentation process (“metadata on metadata”). Study Description. Information about the survey such as title, dates/method of data collection, sampling, funding, etc.Data File Description. Content, producer, version, etc.Variable Description. Literal question, universe, labels, derivation

20、 and imputation methods, etc.Other Material. Description of materials related to the study such as questionnaires, coding information, reports, interviewers manuals, data processing and analysis programs, etc.Structure of the DDI 2.0 standard23DDI元素由5部分組成:文檔描述:用來記錄文檔著錄過程(“元數(shù)據(jù)的元數(shù)據(jù)”)。研究描述:關(guān)于調(diào)查的信息,例如標(biāo)題

21、、數(shù)據(jù)收集的日期/方法、抽樣、資金等等。數(shù)據(jù)文件描述:內(nèi)容、生產(chǎn)者、版本等等。變量描述:字面問題、全域、標(biāo)簽、推導(dǎo)和估算方法,等等。其他相關(guān)信息:描述與研究相關(guān)的材料,例如問卷、編碼信息、報告、面試官手冊、數(shù)據(jù)處理和分析程序等等。DDI2.0標(biāo)準(zhǔn)的結(jié)構(gòu)24Workshop participants will install the IHSN Metadata Editor (a.k.a. Nesstar Publisher) and document a small census dataset.Exercises25培訓(xùn)班與會者將安裝IHSN國際住戶調(diào)查網(wǎng)絡(luò)元數(shù)據(jù)編輯軟件(又名Nesstar

22、發(fā)布軟件)并學(xué)習(xí)記錄一個小的普查數(shù)據(jù)集。練習(xí)26Content of the USB provided to participantsExercise data filesChinese version of:Popstan census data files (2) in Stata formatCensus questionnaireEnumerator manualSame content in EnglishSelected technical and policy guidelinesIHSN Metadata Editor software and templates27USB存儲

23、盤向與會者提供以下內(nèi)容練習(xí)的數(shù)據(jù)文件中文版本:Stata格式的人口普查數(shù)據(jù)(2個文件)人口普查問卷統(tǒng)計員手冊英文內(nèi)容相同技術(shù)和政策方面的指導(dǎo)原則IHSN國際住戶調(diào)查網(wǎng)元數(shù)據(jù)編輯軟件和模板28Run NesstarPublisherInstaller_v4.0.9.exe to install the softwareNext step is to install the IHSN templates Exercise 1 InstallationOpen the Template Manager29運行NesstarPublisherInstaller_v4.0.9.exe,安裝軟件下一步是安

24、裝IHSN國際住戶調(diào)查網(wǎng)絡(luò)模板練習(xí)1- 安裝打開模板管理程序30Exercise 1 InstallationThen select the added template and click “Use” to activate it. This will now be the default study template.Click on “Import” and select the English (EN) or Chinese (CN) template found in folder “Software”Repeat the exact same process for the Res

25、ource Description Template31練習(xí)1- 安裝然后選擇要添加的模板,點擊“使用”來激活它。這個模板將成為默認(rèn)的研究模板。點擊“導(dǎo)入”,在“Software (軟件)”文件夾中選擇英語(EN)或中文(CN)模板 重復(fù)相同的步驟來添加資源描述模板32The next steps will be to document the Census:Import the data files (Stata)Add metadata in the Document Description, Study Description, Data Files Description, and V

26、ariables Description sectionsAttach and document the questionnaire and manual as external resourcesExport the metadata to DDI (and RDF) formatsExercise 2 - Documentation33接下來的步驟是記錄普查:導(dǎo)入數(shù)據(jù)文件(Stata)添加文件描述,研究描述,數(shù)據(jù)文件描述,和變量描述部分的元數(shù)據(jù)將調(diào)查問卷和面試官手冊作為外部資源附加并記錄將元數(shù)據(jù)以DDI(和RDF)格式導(dǎo)出練習(xí)2 記錄34When should data be docume

27、nted?Much information loss, or never generatedDocument “as you go” not after completion of the operation. When documentation is done as a “l(fā)ast step”, much information is lost.35數(shù)據(jù)在何時應(yīng)該被記錄?“按進度”記錄每一步 而不是在調(diào)查結(jié)束以后。如果只在“最后一步”記錄數(shù)據(jù),許多信息已經(jīng)丟失。36Available at Software and guidelines/home/node/117/home/softwar

28、e/ddi-metadata-editor37可下載于軟件和指導(dǎo)原則/home/node/117/home/software/ddi-metadata-editor38Part 2 Formulating a microdata dissemination policy, disseminating data and metadata, and the IHSN National Data Archive (NADA) softwareMetadata and microdata dissemination39第2部分 制定一個微觀數(shù)據(jù)傳播政策,數(shù)據(jù)和元數(shù)據(jù)的傳播,以及IHSN國際住戶調(diào)查網(wǎng)絡(luò)

29、國家數(shù)據(jù)歸檔(NADA)軟件元數(shù)據(jù)和微觀數(shù)據(jù)傳播40Diversity of research work. Data producers usually publish tabular and analytical outputs. But they will never identify all the research questions that can be addressed using the data. Microdata dissemination encourages diversity (and quality) of analysis. Credibility/accep

30、tability of data. Broader access to metadata and microdata demonstrates the producers confidence in the data, by making replication (or correction) possible by independent parties.Benefits of dissemination41使研究工作多元化 :數(shù)據(jù)生產(chǎn)者通常發(fā)布表格和分析輸出。但他們絕不會辨識出這組數(shù)據(jù)能解決的所有研究問題。微觀數(shù)據(jù)的傳播促進了分析的多樣性(和質(zhì)量)。數(shù)據(jù)的公信力和認(rèn)可度:通過讓獨立的第三方

31、能夠復(fù)制(或修正)數(shù)據(jù),對元數(shù)據(jù)和微觀數(shù)據(jù)更廣泛的訪問顯示了生產(chǎn)者對數(shù)據(jù)的信心。傳播的優(yōu)點42Reduced duplication. Non accessibility to microdata forces users to conduct their own surveys. Microdata dissemination would reduce the risk of duplicated activities. It will also reduce the burden on respondents, and minimize the risk of inconsistent s

32、tudies on a same topic.Funding. Better use of data means better return for survey sponsors, who will thus be more inclined to support data collection activities. Quality of data. It is often through the use of data that insights for improvement for survey design can be identified. Benefits of dissem

33、ination43減少重復(fù):無法獲得微觀數(shù)據(jù)迫使用戶自己進行調(diào)查。微觀數(shù)據(jù)的傳播將減少重復(fù)工作的風(fēng)險。它也將減少受訪者的負(fù)擔(dān),并將同一主題不一致研究的風(fēng)險降到最低。資金:更好地利用數(shù)據(jù)意味著對調(diào)查贊助者更好的回報,從而使他們更傾向于支持?jǐn)?shù)據(jù)收集活動。數(shù)據(jù)質(zhì)量:往往在數(shù)據(jù)使用的過程中,會產(chǎn)生如何改進調(diào)查設(shè)計的見解。傳播的優(yōu)點44Exposure to criticism. Quality itself often puts a brake on microdata dissemination. Some data producers may fear to be exposed to criti

34、cism when data are not fully reliable, and to be confronted to the obligation to defend their results when challenged by secondary users.Loss of exclusivity. When disseminating microdata, data owners lose their exclusive right to discoveries. This is more of an issue for academic researchers than of

35、ficial producers. Costs and risks of dissemination45受到批評:質(zhì)量本身往往會阻礙微觀數(shù)據(jù)的傳播。一些數(shù)據(jù)生產(chǎn)者可能擔(dān)心當(dāng)數(shù)據(jù)不是完全可靠時會受到批評,并且在面臨二級用戶質(zhì)疑時,要承擔(dān)為自己的結(jié)果辯論的義務(wù)。喪失專用性:微觀數(shù)據(jù)的傳播使數(shù)據(jù)擁有者失去了他們對自己發(fā)現(xiàn)的數(shù)據(jù)的專用權(quán)。相比官方數(shù)據(jù)生產(chǎn)者,這對學(xué)術(shù)研究者來說是更大的一個問題。傳播的成本和風(fēng)險46Official vs. non-official results, and exposure to contradiction. Dissemination of microdata may

36、 lead to a proliferation of differing -and possibly contradictory- results and statistics. It may become more and more difficult to distinguish between official figures and other sources of statistics. Financial cost. Properly documenting and disseminating microdata has a cost. This includes not onl

37、y the costs of creating and documenting microdata files, but also the costs of creating access tools and safeguards, and of supporting enquiries made by the research community.Costs and risks of dissemination47官方與非官方結(jié)果,對比揭露矛盾:微觀數(shù)據(jù)的傳播可能激增不同的 - 并可能是相互矛盾的 - 結(jié)果和統(tǒng)計。傳播可能導(dǎo)致官方數(shù)據(jù)和其他來源的統(tǒng)計數(shù)據(jù)變得越來越難以區(qū)分。財務(wù)成本:妥善記錄

38、和傳播微觀數(shù)據(jù)是有成本的。這不僅包括創(chuàng)建和記錄微觀數(shù)據(jù)文件的成本,還包括建立訪問工具和保障措施,以及向研究界提供輔助問詢的成本。傳播的成本和風(fēng)險48Confidentiality. One of the biggest challenges of microdata dissemination is to minimize the risk of disclosure of any data that would compromise the identity of respondents. Legality. All countries have a specific national st

39、atistical and data protection legislation. Costs and risks of dissemination49保密性:微觀數(shù)據(jù)傳播的最大挑戰(zhàn)之一,是如何盡量減少任何由于披露數(shù)據(jù)而導(dǎo)致的,可能危及受訪者身份保密性的風(fēng)險。合法性:所有國家都有其特定的國家統(tǒng)計和數(shù)據(jù)保護法例。傳播的成本和風(fēng)險50It is appropriate for microdata collected for official statistical purposes to be used for statistical analysis to support research a

40、s long as confidentiality is protected.Provision of microdata should be consistent with legal and other necessary arrangements that ensure that confidentiality of the released microdata is protected.Principles - UNECEManaging Statistical Confidentiality and Microdata Access - Principles and guidelin

41、es of Good Practice, by the Conference of European Statisticians (CES) and United Nations Economic Commission for Europe (UNECE)51在確保保密性的前提下,研究者可以使用為了官方統(tǒng)計目而收集的微觀數(shù)據(jù),來進行統(tǒng)計分析并支持研究。提供微觀數(shù)據(jù)應(yīng)當(dāng)符合法律和其他必要的約定,以確保被發(fā)布的微觀數(shù)據(jù)的保密性。原則 - UNECE聯(lián)合國歐洲經(jīng)濟委員會管理統(tǒng)計保密性和微觀數(shù)據(jù)訪問 -良好實踐的原則和指導(dǎo)方針, 歐洲統(tǒng)計學(xué)家會議(CES) 與聯(lián)合國歐洲經(jīng)濟委員會(UNECE)52St

42、atistical agencies are charged with protecting the confidentiality of survey respondents.Protecting confidentiality necessitates some sort of data anonymization so that individual respondents can not be identified. Anonymization53統(tǒng)計機構(gòu)被委以為調(diào)查受訪者保密的責(zé)任。為了保密,必須采取一定的數(shù)據(jù)匿名化措施,從而使得個體受訪者不會被辨識。匿名化54Identifying

43、 variables include:Direct identifiers, which are variables such as names, addresses, or identity card numbers. They should be removed from the published dataset.Indirect identifiers, which are characteristics whose combination could lead to the re-identification of respondents (e.g., region, age, se

44、x, occupation). Such variables are needed for statistical purposes, and should not be removed from the published data files. Anonymizing the data involves determining which variables are potential identifiers and modifying the specificity of these variables to reduce the risk of re-identification to

45、 an acceptable level. The challenge is to maximize the security while minimizing the resulting information loss.Anonymization concepts55識別變量包括:直接識別符, 是諸如姓名、地址或身份證號碼的變量。這些變量應(yīng)該從被公布的數(shù)據(jù)集中刪除。間接識別符, 是一些個體特征變量,若組合在一起可重新識別受訪者(例如地區(qū)、年齡、性別、職業(yè))。這樣的變量出于統(tǒng)計目的需要,不應(yīng)該從被公布的數(shù)據(jù)文件中刪除。數(shù)據(jù)匿名化涉及確定哪些變量是潛在識別符,并修改這些變量的特征,從而將重新識

46、別的風(fēng)險降低到一個可接受的水平。當(dāng)前的挑戰(zhàn)是如何在保持最大程度安全性的同時,最大限度地減少信息損失。匿名化概念56Removing variables (e.g., detailed geographic identification)Removing records (outliers)Global recoding (e.g., from age to age groups)Top- or bottom-coding (e.g., create “65+” age category)Local suppression (replace with missing)Micro-aggregat

47、ion (e.g., for income variable)Data swappingPost-randomizationNoise additionResamplingAnonymization techniques57刪除變量(例如,詳細(xì)的地理標(biāo)識)刪除記錄(離群值)全球性重新編碼(例如,將年齡改成年齡組)頂部或底部編碼(例如,創(chuàng)建“65+”年齡組別)本地隱瞞(更換為缺失值)微聚集(例如,對于收入變量)數(shù)據(jù)替換后隨機化添加噪聲重新抽樣匿名化技術(shù)58Software: sdcMicroAn open source (R-based) packageAnonymization tools a

48、nd guidelinesTechnical guidelines/home/node/118More practical guidelines are being produced by the IHSN.NOTE :Anonymization is a complex process. It requires analytical skills and involves some arbitrary decisions. 59軟件: sdcMicro一個開放資源的(以R語言為基礎(chǔ)的)軟件包匿名化工具和指導(dǎo)原則技術(shù)指引 /home/node/118IHSN國際住戶調(diào)查網(wǎng)絡(luò)提供了更多的實際操作

49、指引。注釋 :匿名化是一個復(fù)雜的過程。它需要分析技巧并涉及到一些主觀的判定。60Formulating a microdata access policyPolicy guidelines on dissemination/home/node/12061制定一個微觀數(shù)據(jù)訪問政策傳播的政策指引/home/node/12062Data and metadata need to be made visible.Users will benefit from advanced data discovery tools, in particular on-line searchable catalogs

50、.The IHSN developed an open source application, compliant with the DDI standard, to help disseminate metadata and (optional) microdata. This application (NADA) complements the Metadata Editor.Cataloguing63數(shù)據(jù)和元數(shù)據(jù)要成為用戶可見的。用戶將受益于先進的數(shù)據(jù)發(fā)現(xiàn)工具,特別是可在線搜索的目錄。IHSN國際住戶調(diào)查網(wǎng)絡(luò)開發(fā)了一款符合DDI元數(shù)據(jù)標(biāo)準(zhǔn)的,開放資源的應(yīng)用程序,用來幫助傳播元數(shù)據(jù)和微觀數(shù)

51、據(jù)(可選)。此應(yīng)用程序(NADA)是Nesstar元數(shù)據(jù)編輯軟件的一個補充。編目64Workshop participants will upload their DDI metadata (generated during the documentation exercise) in an on-line, searchable survey catalogDissemination Exercise 65培訓(xùn)班的與會者將把他們(在記錄練習(xí)中生成)的DDI元數(shù)據(jù),上傳到一個可在線搜索的調(diào)查目錄中。傳播練習(xí)66100+ agencies in 65+ countries have starte

52、d establishing a microdata archive using IHSN toolsSurvey catalogs 67在65個以上國家的100多個機構(gòu)中,已經(jīng)開展了使用IHSN國際住戶調(diào)查網(wǎng)絡(luò)工具對微觀數(shù)據(jù)進行存檔調(diào)查目錄68Part 3 Preserving data and metadataArchiving69第3部分 保存數(shù)據(jù)和元數(shù)據(jù)存檔70Common issues include:Loss of data and metadata, because of human error, technical problems, or disasters such as

53、fire or floodData available, but on unreadable formats/media (hardware and software obsolescence)Data available, but undocumentedDocumentation only available in hard copyMultiple versions of datasets available, with no “versioning” informationIssues71常見問題包括:由于人為失誤,技術(shù)問題,或諸如火災(zāi)或水災(zāi)等災(zāi)害,造成數(shù)據(jù)和元數(shù)據(jù)的損失存在數(shù)據(jù),但格

54、式/媒介無法讀?。ㄓ布蛙浖^時)存在數(shù)據(jù), 但尚未記錄只提供硬拷貝文檔存在多個版本的數(shù)據(jù)集,但沒有 “版本管理”信息問題72Physical damage can occur to hardware and media due to:Material instability Improper storage environment (temperature, humidity, light, dust) Overuse (mainly for physical contact media) Natural disaster (fire, flood, earthquake) Infrastr

55、ucture failure (plumbing, electrical, climate control) Inadequate hardware maintenance Human error (including improper handling) Sabotage (theft, vandalism)Physical threats73有形損害可能因為以下因素,發(fā)生在硬件和媒介上:材料的不穩(wěn)定性不適當(dāng)?shù)膬Υ姝h(huán)境(溫度、濕度、光照、灰塵)過度使用(主要針對有直接接觸的媒介)自然災(zāi)害(火災(zāi)、水災(zāi)、地震)基礎(chǔ)設(shè)施故障(水暖、電氣、氣候控制)硬件維護不足人為失誤(包括處理不當(dāng))蓄意破壞(盜竊、破壞)實體威脅74A file format may be superseded by newer versions and no longer be supported. Software obsolescenceHtml 21976 1977 1978 1979 1980 1981 1982 1983 1984 1985 1986 1987 1

溫馨提示

  • 1. 本站所有資源如無特殊說明,都需要本地電腦安裝OFFICE2007和PDF閱讀器。圖紙軟件為CAD,CAXA,PROE,UG,SolidWorks等.壓縮文件請下載最新的WinRAR軟件解壓。
  • 2. 本站的文檔不包含任何第三方提供的附件圖紙等,如果需要附件,請聯(lián)系上傳者。文件的所有權(quán)益歸上傳用戶所有。
  • 3. 本站RAR壓縮包中若帶圖紙,網(wǎng)頁內(nèi)容里面會有圖紙預(yù)覽,若沒有圖紙預(yù)覽就沒有圖紙。
  • 4. 未經(jīng)權(quán)益所有人同意不得將文件中的內(nèi)容挪作商業(yè)或盈利用途。
  • 5. 人人文庫網(wǎng)僅提供信息存儲空間,僅對用戶上傳內(nèi)容的表現(xiàn)方式做保護處理,對用戶上傳分享的文檔內(nèi)容本身不做任何修改或編輯,并不能對任何下載內(nèi)容負(fù)責(zé)。
  • 6. 下載文件中如有侵權(quán)或不適當(dāng)內(nèi)容,請與我們聯(lián)系,我們立即糾正。
  • 7. 本站不保證下載資源的準(zhǔn)確性、安全性和完整性, 同時也不承擔(dān)用戶因使用這些下載資源對自己和他人造成任何形式的傷害或損失。

評論

0/150

提交評論