分布式DBMS的概念與設(shè)計(jì)計(jì)算機(jī)等專業(yè)畢業(yè)設(shè)計(jì)(論文)外文翻譯_第1頁
分布式DBMS的概念與設(shè)計(jì)計(jì)算機(jī)等專業(yè)畢業(yè)設(shè)計(jì)(論文)外文翻譯_第2頁
分布式DBMS的概念與設(shè)計(jì)計(jì)算機(jī)等專業(yè)畢業(yè)設(shè)計(jì)(論文)外文翻譯_第3頁
分布式DBMS的概念與設(shè)計(jì)計(jì)算機(jī)等專業(yè)畢業(yè)設(shè)計(jì)(論文)外文翻譯_第4頁
分布式DBMS的概念與設(shè)計(jì)計(jì)算機(jī)等專業(yè)畢業(yè)設(shè)計(jì)(論文)外文翻譯_第5頁
已閱讀5頁,還剩4頁未讀, 繼續(xù)免費(fèi)閱讀

下載本文檔

版權(quán)說明:本文檔由用戶提供并上傳,收益歸屬內(nèi)容提供方,若內(nèi)容存在侵權(quán),請進(jìn)行舉報或認(rèn)領(lǐng)

文檔簡介

1、大連交通大學(xué)2008屆本科生畢業(yè)設(shè)計(jì)(論文)外文翻譯The Concepts and Design of Distributed DBMS1. INTRODUCTION A major behind the development of database systems is the desire to integrate the operational data of an organization and to provide controlled access to the data. Although integration and controlled access may impl

2、y centralization, this is not the intention. In fact, the development of computer networks promotes a decentralized mode of work. This decentralized approach mirrors the organizational structure of many companies, which are logically distributed into divisions, departments, projects, and so on, and

3、physically distributed into offices, plants, factories, where each nit maintains its own operational data. The shareability of the data and the efficiency of data access should be improved by the development of a distributed database system that reflects this organizational structure, makes the data

4、 in all units accessible, and stores data proximate to the location where it is most frequently used. Distributed DBMSs should help resolve the islands of information problem. Databases are sometimes regarded, as electronic islands that are distinct and generally inaccessible places, like remote isl

5、ands. This may be a result of geographical separation, incompatible computer architectures, incompatible communication protocols, and so on. Integrating the databases into a logical whole may prevent this way of thinking. 2Concepts To start the discussion of distributed DBMSs, we first give a defini

6、tion of a distributed database. Distributed database: a logically interrelated collection of shared data physically distributed over a computer network. Following on from this we have the definition of distributed DBMS. Distributed DBMS: the software system that permits the management of the distrib

7、uted database and makes the distribution transparent to users.A distributed database management system consists of a single logical database that is split into a number of fragments. Each fragment is stored on one or more computers under the control of a separate DBMS, with the computers connected b

8、y a communications network. Each site is capable of independently processing user requests that require access to local data and is also capable of processing data stored on other computers in the network.Users access the distributed database via application. Applications are classified as those tha

9、t do not require data from other sites and those that do require data from other sites. We require a DBMSs to have at least one global application. A DDBMS therefore has the following characteristics:A collection of logically related shared data;The data is split into a number of fragments;Fragments

10、 may be replicated;Fragments/replicas are allocated to sites;The sites are linked by a communications network;The data at each site is under the control of a DBMS;The DBMS at each site can handle local applications, autonomously;Each DBMS participates in at least one global application;From the defi

11、nition of the DDBMS, the system is expected to make the distribution transparent to the user. Thus, the fact that a distributed database is split tinto fragments that can be stored on different computers and perhaps replicated, should be hidden from the user. The objective of transparency is to make

12、 the distributed system appear like a centralized system. This is sometimes referred to as the fundamental principle of distributed DBMSs.Advantages and Disadvantages of DDBMSsThe distribution of data and applications has potential advantages over traditional centralized database systems. Unfortunat

13、ely, there are also disadvantages. In this section, we review the advantages and disadvantages of the DDBMS.AdvantagesReflects organizational structureMany organizations are naturally distributed over several locations. For example, DreamHome has many officers in different cities. It is natural for

14、databases used in such an application to be distributed over these locations. DreamHome may keep a database at each branch office containing details of such things as the staff who work at that location, the properties that are for rent, and the clients whoown or wish to rent out these properties. T

15、he staff at a branch office will make local inquiries of the databases. The company headquarters may wish to make global inquiries involving the access of data at all or a number of branches.Improved shareability and local autonomyThe geographical distribution of an organization can be reflected in

16、the distribution of the data; users at one site can access data stored at other sties. Data can be placed at the site close to the users who normally use that data. In this way, users have local control of the data, and they can consequently establish and enforce local policies regarding the use of

17、this data. A global database administrator is responsible for the entire system. Generally, part of this responsibility is devolved to the local level, so that the local DBA can manage the local DBMS.Improved availabilityIn a centralized DBMS, a computer failure terminates the operations of the DBMS

18、. However, a failure at one site of a DBMS, or a failure of a communication link making some sites inaccessible, does not make the entire system inoperable. Distributed DBMSs are designed to continue to function despite such failures. If a single node fails, the system may be able to reroute the fai

19、led nodes requests to another site.Improved reliabilityAs data may be replicated so that it exists at more than one site, the failure of a node or a communication link does not necessarily make the data inaccessible.Improved performance As the data is located near the site of greatest demand, and gi

20、ven the inherent parallelism of distributed DBMSs, speed of database access may be better than that achievable form a remote centralized database. Furthermore, since each site handles only a part of the entire database, there may not be the same contention for CPU and I/O services as characterized b

21、y a centralized DBMS.EconomicsIn the 1960s,computing power was calculated according to the square of the costs of the equipment: three times the cost would provide nine times the power. This was known as Grouchs Law. However, it is now generally accepted that it costs much less to create a system of

22、 smaller computers with the equivalent power of a single large computer. This makes it more cost-effective for corporate divisions and departments to obtain separate computers. It is also much more cost-effective to add workstations to a network than to update a mainframe system.The second potential

23、 cost saving occurs where databases are geographically remote and the applications require access to distributed data. In such cases, owing to the relative expense of data being transmitted across the network as opposed to the cost of local access, it may be much more economical to partition the app

24、lication and perform the processing locally at each site.Modular growthIn a distributed environment, it is much easier to handle expansion. New sites can be added to the network without affecting the operations of other sites. This flexibility allows an organization to expand relatively easily. Incr

25、easing database size can usually be handled by adding processing and storage power to the network. In a centralized DBMS, a growth may entail changes to both hardware and software.ComplexityA distributed DBMS that hides the distributed nature form the user and provides an acceptable level of perform

26、ance, reliability is inherently more complex than a centralized DBMS. The fact that data replication adequately, there will be degradation in availability, reliability, and performance compared with the centralized system, and the advantages we cited above will become disvantages.CostIncreased compl

27、exity means that we can expect the procurement and maintenance costs for a DDBMS to be higher than those for a centralized DBMS. Furthermore, a distributed DBMS requires additional hardware to establish a network between sites. These are ongoing communication costs incurred with the use of this netw

28、ork. There are also additional labor costs to manage and maintain the local DBMSs and the underlying network.SecurityIn a centralized system, access to the data can be easily controlled. However, in a distributed DBMS not only does access to replicated data have to be controlled in multiple location

29、s, but the network itself has to be made secure. In the past, networks were regarded as an insecure communication medium. Although this is still partially true, significant developments have been made to make network more secure.Integrity control more difficultDatabase integrity refers to the validi

30、ty and consistency of stored data. Integrity is usually expressed in terms of constraints, which are consistency rules that the database is not permitted to violate. Enforcing integrity constraints generally requires access to a large amount of data that defines the constraint but which is not invol

31、ved in the actual update operation itself. In a distributed DBMS, the communication and processing costs that are required to enforce integrity constraints may be prohibitive. We return to this problem in Section Lack of standards Although distributed DBMSs depend on effective communication, we are

32、only now starting to see the appearance of standard communication and data access protocols. This lack of standards has significantly limited the potential of distributed DBMSs. There are also no tools or methodologies to help users convert a centralized DBMS into a distributed DBMS.Lack of experien

33、ceGeneral-purpose distributed DBMSs have not been widely accepted, although many of the protocols and problems are well understood. Consequently, we do not yet have the same level of experience in industry as we have with centralized DBMSs, For a prospective adopter of this technology, this may be a

34、 significant deterrent.Database design more complexBesides the normal difficulties of designing a centralized database, the design of a distributed database has to take account of fragmentation of data, allocation of fragments to specific sites, and data replication. 分布式DBMS的概念與設(shè)計(jì)1 介紹推動數(shù)據(jù)庫系統(tǒng)發(fā)展的一個主要因

35、素是,人們希望將一個企業(yè)的操作數(shù)據(jù)綜合起來并能提供對數(shù)據(jù)的受控訪問,經(jīng)管綜合和受控訪問意味著集中管理,但這并不是目的。實(shí)際上,計(jì)算機(jī)網(wǎng)絡(luò)的發(fā)展促進(jìn)了分散式的作業(yè)模式。這種分散的方式反映了許多公司的組織結(jié)果:在邏輯上分成多個分公司,部門,項(xiàng)目等,而在體制上分為辦公室,車間,工廠,每一個單元都維護(hù)著自己的操作數(shù)據(jù)。數(shù)據(jù)的共享和數(shù)據(jù)訪問效率的提高以來與分布式數(shù)據(jù)庫系統(tǒng)的發(fā)展,分布式數(shù)據(jù)庫反映了上述組織結(jié)構(gòu),使得每個單元的數(shù)據(jù)都是可訪問的,并將最近的數(shù)據(jù)存放于最常用的位置。分布式DBMS有助于結(jié)局信息孤島的問題,有時,數(shù)據(jù)庫被看做是孤立的,不可訪問的電子島,就像遙遠(yuǎn)的島嶼一樣。這可能是由于地理分割,計(jì)

36、算機(jī)體系結(jié)構(gòu)不兼容或通信協(xié)議不兼容原因造成的。把數(shù)據(jù)庫綜合成一個邏輯整體可能會改變這種思考的方式。2. 概念在開始討論分布式DBMS之前,先給出分布式數(shù)據(jù)庫的定義。分布式數(shù)據(jù)庫:物理上分布于計(jì)算機(jī)網(wǎng)絡(luò)中,但在邏輯上相關(guān)的共享的集合。由此可以得到分布式DBMS的定義。分布式DBMS:管理分布式數(shù)據(jù)庫并使分布性對用戶透明的軟件系統(tǒng)。分布式數(shù)據(jù)庫管理系統(tǒng)是由一個被氛圍多段的邏輯數(shù)據(jù)庫構(gòu)成的。每個段在獨(dú)立的DBMS的控制下,可以存儲在一個或多個通過通信網(wǎng)絡(luò)互聯(lián)起來的計(jì)算機(jī)上。每一個結(jié)點(diǎn)都可以獨(dú)立的處理用戶訪問本地?cái)?shù)據(jù)的請求,并且也可以處理網(wǎng)絡(luò)上其他計(jì)算機(jī)存儲的數(shù)據(jù)。用戶是通過聽用來訪問分布市數(shù)據(jù)庫的,

37、應(yīng)用又可氛圍不需要從其他結(jié)點(diǎn)獲得數(shù)據(jù)的應(yīng)用和確實(shí)需要從其他結(jié)點(diǎn)獲得數(shù)據(jù)的應(yīng)用,一般要求DDBMS至少包含一個全局應(yīng)用。因此,DDBMS應(yīng)當(dāng)具有如下特征:l 邏輯上相關(guān)的共享數(shù)據(jù)的集合。l 數(shù)據(jù)是分段的。l 段是可以復(fù)制的。l 段/副本是分配在各個結(jié)點(diǎn)上的。l 結(jié)點(diǎn)是用通信網(wǎng)絡(luò)連接起來的。l 每個結(jié)點(diǎn)的數(shù)據(jù)都由DBMS控制,l 每個結(jié)點(diǎn)的DBMS都能自主地處理本地應(yīng)用。l 每個DBMS至少參與一個全局應(yīng)用。從DDBMS的定義中可以看出,系統(tǒng)期望分布性對于用戶來講是透明的。這樣,用戶就不需要知道分布式數(shù)據(jù)庫是分段的,存儲在多個不同計(jì)算機(jī)上,并且可能被復(fù)制等細(xì)節(jié)。透明性的目的就是要使用戶使用分布式系

38、統(tǒng)如同使用集中式系統(tǒng)一樣。這常常被稱為DDBMS的基本原則。這個要求為終端用戶提供了強(qiáng)大的功能。DDBMS的優(yōu)缺點(diǎn)數(shù)據(jù)分布以及應(yīng)用分布比傳統(tǒng)的集中式數(shù)據(jù)庫具有潛在的優(yōu)勢,但也存在著不足。優(yōu)點(diǎn):反映了組織結(jié)構(gòu)許多組織都是自然的分布與各個地方。例如,DreamHome在許多城市都設(shè)有分支機(jī)構(gòu)。于是該應(yīng)用的數(shù)據(jù)庫將很自然的分布于這些不同的地方。DreamHome的每個分支機(jī)構(gòu)都有一個數(shù)據(jù)庫用來記錄該機(jī)構(gòu)的職員信息,出租房產(chǎn)信息以及房產(chǎn)所有者的信息。本地的職員可以在本地?cái)?shù)據(jù)庫上進(jìn)行本地查詢,而公司的高層則可以訪問所有分支機(jī)構(gòu)的任何數(shù)據(jù),進(jìn)行全局查詢。改進(jìn)了共享性和本地自主權(quán)數(shù)據(jù)的分布可以反映出一個組織

39、在地理上的分布,一個結(jié)點(diǎn)的用戶可以訪問其他結(jié)點(diǎn)上的數(shù)據(jù)。數(shù)據(jù)會存放在靠近經(jīng)常是同這些數(shù)據(jù)的用戶的結(jié)點(diǎn)上。這樣,擁護(hù)就可以對數(shù)據(jù)擁有本地控制權(quán),從而可以建立和執(zhí)行關(guān)于使用這些數(shù)據(jù)的本地策略。全局?jǐn)?shù)據(jù)管理員(DBA)對整個系統(tǒng)負(fù)責(zé)。通常也可以把責(zé)任部分地下放到本地級,所以本地DBA可以管理本地DBMS。改進(jìn)了可用性對于集中式DBMS,計(jì)算機(jī)的一次故障會中斷所有的DBMS操作。然而,在DDBMS系統(tǒng)中,一個結(jié)點(diǎn)的故障或通信鏈路的故障只會使某些結(jié)點(diǎn)不能被訪問,但決不會中斷整個系統(tǒng)的操作。分布式DBMS就是設(shè)計(jì)用來使系統(tǒng)在這些故障發(fā)生時仍然可以繼續(xù)工作。即使一個結(jié)點(diǎn)出現(xiàn)故障,系統(tǒng)也可以把對故障結(jié)點(diǎn)的訪問請求重定向到其他結(jié)點(diǎn)。提高了可靠性因?yàn)閿?shù)據(jù)可以被復(fù)制而存在與多個結(jié)點(diǎn)之上,所以一個結(jié)點(diǎn)的鼓掌或通信鏈路故障不會妨礙對該數(shù)據(jù)的訪問。改進(jìn)了性能因?yàn)閿?shù)據(jù)是防止在離“最大需求“最近的結(jié)點(diǎn)上的,而且由于分布式DBMS內(nèi)在的并行機(jī)制,所以訪問分布式數(shù)據(jù)庫的速度肯定比訪問遠(yuǎn)程集中式

溫馨提示

  • 1. 本站所有資源如無特殊說明,都需要本地電腦安裝OFFICE2007和PDF閱讀器。圖紙軟件為CAD,CAXA,PROE,UG,SolidWorks等.壓縮文件請下載最新的WinRAR軟件解壓。
  • 2. 本站的文檔不包含任何第三方提供的附件圖紙等,如果需要附件,請聯(lián)系上傳者。文件的所有權(quán)益歸上傳用戶所有。
  • 3. 本站RAR壓縮包中若帶圖紙,網(wǎng)頁內(nèi)容里面會有圖紙預(yù)覽,若沒有圖紙預(yù)覽就沒有圖紙。
  • 4. 未經(jīng)權(quán)益所有人同意不得將文件中的內(nèi)容挪作商業(yè)或盈利用途。
  • 5. 人人文庫網(wǎng)僅提供信息存儲空間,僅對用戶上傳內(nèi)容的表現(xiàn)方式做保護(hù)處理,對用戶上傳分享的文檔內(nèi)容本身不做任何修改或編輯,并不能對任何下載內(nèi)容負(fù)責(zé)。
  • 6. 下載文件中如有侵權(quán)或不適當(dāng)內(nèi)容,請與我們聯(lián)系,我們立即糾正。
  • 7. 本站不保證下載資源的準(zhǔn)確性、安全性和完整性, 同時也不承擔(dān)用戶因使用這些下載資源對自己和他人造成任何形式的傷害或損失。

評論

0/150

提交評論