




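With the rapid development of network, computer, and communication technologies, digital libraries have become an important source of information. Faced with the many distributed digital resources on the network [1-2], users want to find what they need through retrieval. However, because early digital libraries were built without a unified standard, their resources use different metadata formats, and it is very difficult to define a single metadata format that can describe the resources of all of them. In addition, data providers and service providers in early digital libraries were mostly connected by point-to-point protocol services, so providers outside a given agreement could not exchange metadata [3-4]. To address these problems, this paper designs a format converter between MARC and DC, two metadata formats widely used in digital libraries, and redesigns the command verbs of an interoperability platform based on the OAI-PMH protocol. Both contribute to resource sharing among digital libraries.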
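1 OAI-PMH Protocol Overview

In the digital library field [5-7], the OAI-PMH framework consists of data providers, service providers, and a registration server. A data provider is a repository that stores a large amount of metadata. A service provider harvests metadata from multiple data providers and offers value-added services such as search and browsing. The service provider sends HTTP requests to the data provider, namely the OAI-PMH command verbs, and the data provider answers each request over HTTP with an XML response. Data providers and service providers can both register with the registration server. The framework is shown in Figure 1.

2 Design and Implementation of the MARC-to-DC Format Converter

The metadata format currently used in most digital libraries is MARC, whereas an OAI-PMH data provider is required to support DC metadata [8]. To build an OAI-PMH metadata interoperability platform, MARC metadata must therefore be converted to DC metadata.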
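The conversion this section calls for can be illustrated with a small field-mapping sketch. It is not the paper's MarcDc module: the tag and subfield pairs below are assumptions that loosely follow common MARC 21 practice, the paper's own correspondence table is not reproduced here, and the names `MARC_TO_DC` and `marc_to_dc` are hypothetical.

```python
# Assumed MARC 21 (tag, subfield) -> unqualified Dublin Core element pairs.
# These are illustrative; the paper's own field correspondence may differ.
MARC_TO_DC = {
    ("245", "a"): "title",
    ("100", "a"): "creator",
    ("650", "a"): "subject",
    ("520", "a"): "description",
    ("260", "b"): "publisher",
    ("260", "c"): "date",
    ("020", "a"): "identifier",
    ("041", "a"): "language",
}


def marc_to_dc(marc_fields):
    """Convert (tag, subfield_code, value) triples into a DC dictionary.

    Every DC element maps to a list because DC elements are repeatable.
    Fields without a known mapping are skipped rather than guessed.
    """
    dc = {}
    for tag, code, value in marc_fields:
        element = MARC_TO_DC.get((tag, code))
        if element is None:
            continue
        dc.setdefault(element, []).append(value.strip())
    return dc


# A made-up record used only to exercise the function.
sample_record = [
    ("245", "a", "Digital libraries "),
    ("100", "a", "Example, Author"),
    ("650", "a", "Metadata"),
    ("650", "a", "Interoperability"),
    ("999", "z", "local note"),
]
dc_record = marc_to_dc(sample_record)
```

A table-driven mapping keeps the correspondence in one place, so it can be swapped for a different MARC dialect without touching the conversion loop.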
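2.1 Field Correspondence between MARC and DC

The DC format defines 15 elements, which fall into three groups according to the scope and category of what they describe, as shown in Table 1.

The system provides a batch item import interface, so the MarcDc module is designed to run before the batch importer. If the input is in MARC format, it is first converted by the MarcDc format converter and then handled by the batch item processing module; if the input is already in DC format, it goes directly to the batch item processing module.

3 Implementation of the OAI-PMH Interoperability Platform

3.1 Analysis of the OAI-PMH Command Verbs

To enable metadata exchange among multiple data providers and service providers in an OAI-PMH-based digital library, the command verbs of the interoperability platform were redesigned. Their main functions are as follows:

(1) GetRecord: retrieves an individual metadata record from the repository. If the metadata format named by metadataPrefix is no longer available from the repository or from the specified item, the repository may return a header whose status attribute is "deleted"; whether it does so depends on the level at which the repository tracks deletions.

(2) Identify: retrieves information about the repository. Part of the returned information is required by the OAI-PMH interoperability platform, and the repository may also use this verb to return additional descriptive information.

(3) ListIdentifiers: returns only record headers, not the records themselves. Optional arguments allow headers to be selected by set membership and datestamp. Depending on the repository's support for deletions, if a record matching the request arguments has been deleted, the returned header carries a status attribute with the value "deleted".

(4) ListMetadataFormats: retrieves the metadata formats available from the repository. An optional argument restricts the request to the formats available for a specific item.

(5) ListRecords: harvests records from the repository. Depending on the repository's support for deletions, if a record matching the request arguments has been deleted, the returned header carries a status attribute with the value "deleted"; a record returned with "deleted" status contains no metadata.

(6) ListSets: returns the set structure of the repository, which is useful for selective harvesting.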
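As an illustration of how the six verbs above appear on the wire, the following sketch builds OAI-PMH request URLs and checks the arguments that OAI-PMH 2.0 requires for each verb. The base URL `http://example.org/oai` and the helper name `build_request` are hypothetical, and optional-argument checking is deliberately left out.

```python
from urllib.parse import urlencode

# Arguments each verb requires when no resumptionToken is supplied.
REQUIRED_ARGS = {
    "GetRecord": {"identifier", "metadataPrefix"},
    "Identify": set(),
    "ListIdentifiers": {"metadataPrefix"},
    "ListMetadataFormats": set(),
    "ListRecords": {"metadataPrefix"},
    "ListSets": set(),
}


def build_request(base_url, verb, args=None):
    """Return the GET URL for one OAI-PMH request.

    `args` is a dict rather than keyword arguments because the protocol
    argument `from` is a reserved word in Python.
    """
    args = dict(args or {})
    if verb not in REQUIRED_ARGS:
        raise ValueError(f"unknown verb: {verb}")
    # In this sketch, a request that carries a resumptionToken is not
    # checked for the other required arguments.
    if "resumptionToken" not in args:
        missing = REQUIRED_ARGS[verb] - set(args)
        if missing:
            raise ValueError(f"{verb} is missing {sorted(missing)}")
    return base_url + "?" + urlencode({"verb": verb, **args})


BASE_URL = "http://example.org/oai"  # hypothetical data provider
identify_url = build_request(BASE_URL, "Identify")
records_url = build_request(
    BASE_URL, "ListRecords", {"metadataPrefix": "oai_dc", "from": "2024-01-01"}
)
```

Here `records_url` asks for all unqualified DC records changed since the given date, which is the selective harvesting the verb descriptions refer to.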
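3.2 How Data Providers and Service Providers Use the Command Verbs

Requests and responses between the data provider and the service provider are carried out with the six core verbs above. A typical deployment uses a standard Web server configured to dispatch OAI-PMH requests to the software that can process them. The interaction is shown in Figure 5 and proceeds as follows:

(1) The service provider first locates the data provider that holds the metadata it needs and obtains its unique identifier.

(2) The service provider sends ListSets and ListMetadataFormats requests to the data provider. The data provider returns the metadata it can supply that satisfies the requested conditions (for example a given format, subject, or time period). Once it has obtained the metadata, the service provider offers services to users.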
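Step (2) relies on the service provider reading the data provider's XML response. The sketch below extracts the supported metadata prefixes from a ListMetadataFormats response using Python's standard library. The sample response is made up for illustration, and the function name `supported_prefixes` is hypothetical; only the OAI-PMH 2.0 namespace and element names come from the protocol itself.

```python
import xml.etree.ElementTree as ET

# Namespace used by every OAI-PMH 2.0 response.
OAI_NS = {"oai": "http://www.openarchives.org/OAI/2.0/"}

# A made-up response from a hypothetical data provider.
SAMPLE_RESPONSE = """
<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/">
  <responseDate>2024-01-01T00:00:00Z</responseDate>
  <request verb="ListMetadataFormats">http://example.org/oai</request>
  <ListMetadataFormats>
    <metadataFormat>
      <metadataPrefix>oai_dc</metadataPrefix>
      <schema>http://www.openarchives.org/OAI/2.0/oai_dc.xsd</schema>
      <metadataNamespace>http://www.openarchives.org/OAI/2.0/oai_dc/</metadataNamespace>
    </metadataFormat>
    <metadataFormat>
      <metadataPrefix>marc21</metadataPrefix>
      <schema>http://www.loc.gov/standards/marcxml/schema/MARC21slim.xsd</schema>
      <metadataNamespace>http://www.loc.gov/MARC21/slim</metadataNamespace>
    </metadataFormat>
  </ListMetadataFormats>
</OAI-PMH>
""".strip()


def supported_prefixes(xml_text):
    """Return the metadataPrefix values listed in a ListMetadataFormats response."""
    root = ET.fromstring(xml_text)
    # A protocol-level failure is reported in an <error> element with a code.
    error = root.find("oai:error", OAI_NS)
    if error is not None:
        raise RuntimeError(f"{error.get('code')}: {error.text}")
    path = "oai:ListMetadataFormats/oai:metadataFormat/oai:metadataPrefix"
    return [element.text for element in root.findall(path, OAI_NS)]


prefixes = supported_prefixes(SAMPLE_RESPONSE)
```

A service provider could check that `oai_dc` is among the returned prefixes before issuing any harvesting request.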
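3.3 Request and Response Formats of the OAI-PMH Command Verbs

(1) Request format

3.4 Design of the Six OAI Verbs

(1) Overview design

When the data provider receives an OAI request, it must parse it. It first checks whether the request type is legal. If not, it sends an error message to the service provider; if so, it determines which of the six valid request types it is. Because the metadataPrefix argument is mandatory for ListIdentifiers, when the data provider receives a ListIdentifiers request the parser can check the request's second argument directly. If that argument is absent, the request is valid only if it contains a resumptionToken that the data provider recognizes.

Assuming the data provider can only disseminate metadata in unqualified DC, the only valid value of metadataPrefix is oai_dc. Normally the optional arguments in the request must also be parsed; for simplicity they are described here only informally. The data provider then queries the repository with SQL statements built from the received request arguments. If the query yields more records than can be returned in a single response, the data provider generates a new resumptionToken and stores the query parameters together with the cursor information. The processing flow is shown in Figure 6.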
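The parsing and paging logic of the overview design can be sketched as follows, for the ListIdentifiers case only. This is an illustration under stated assumptions, not the paper's implementation: the page size, the in-memory token store, the single-use tokens, and the names `OaiError` and `list_identifiers` are all hypothetical, and the `identifiers` list stands in for the result of the SQL query. The error codes are those defined by OAI-PMH 2.0.

```python
import uuid

VERBS = {"GetRecord", "Identify", "ListIdentifiers",
         "ListMetadataFormats", "ListRecords", "ListSets"}
SUPPORTED_PREFIXES = {"oai_dc"}  # provider disseminates unqualified DC only
PAGE_SIZE = 2                    # assumed maximum number of headers per response
_token_store = {}                # resumptionToken -> (query arguments, cursor)


class OaiError(Exception):
    """Carries an OAI-PMH error code such as badVerb or badArgument."""

    def __init__(self, code):
        super().__init__(code)
        self.code = code


def list_identifiers(params, identifiers):
    """Validate a request and return (page_of_identifiers, next_token_or_None)."""
    verb = params.get("verb")
    if verb not in VERBS:
        raise OaiError("badVerb")
    if verb != "ListIdentifiers":
        raise NotImplementedError(verb)  # other verbs are outside this sketch
    args = {key: value for key, value in params.items() if key != "verb"}

    if "resumptionToken" in args:
        if len(args) > 1:
            raise OaiError("badArgument")  # the token must be the only argument
        # Tokens are single-use here, which is a simplification.
        state = _token_store.pop(args["resumptionToken"], None)
        if state is None:
            raise OaiError("badResumptionToken")
        query, cursor = state
    else:
        if "metadataPrefix" not in args:
            raise OaiError("badArgument")
        if args["metadataPrefix"] not in SUPPORTED_PREFIXES:
            raise OaiError("cannotDisseminateFormat")
        query, cursor = args, 0

    page = identifiers[cursor:cursor + PAGE_SIZE]
    if not page:
        raise OaiError("noRecordsMatch")
    token = None
    if cursor + PAGE_SIZE < len(identifiers):
        # More results remain: remember the query and where to resume.
        token = uuid.uuid4().hex
        _token_store[token] = (query, cursor + PAGE_SIZE)
    return page, token


all_ids = [f"oai:example.org:{n}" for n in range(1, 6)]
first_page, first_token = list_identifiers(
    {"verb": "ListIdentifiers", "metadataPrefix": "oai_dc"}, all_ids
)
second_page, second_token = list_identifiers(
    {"verb": "ListIdentifiers", "resumptionToken": first_token}, all_ids
)
```

Storing the query arguments alongside the cursor lets a follow-up request carry nothing but the token, which matches the description of how the data provider resumes a partial result.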
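(2) Detailed design

The Protocol package handles the requests, responses, and information display for the six verbs. Its Date class uses java.util.Date, Calendar, or String methods to convert between and output date formats such as year, month, and day. Its Set class creates a new set from a named set and describes it in XML.

The Client package contains the client-side code. A HarvesterItinerary can be created from a URL or from the properties of a set; it describes the state of a Harvester and can save that state. The OAIConnection class describes a connection to an OAI repository that accepts a single request; the connection is based on the Response returned through doReques or through the repository's URL.

The Server package provides the server-side OAI code and exposes it as a document service. The Target class is the programming interface for an OAI service target. The TargetAdapter class defines an implementation that does nothing. The GenericTarget class offers a very simple mechanism for creating a small OAI repository that looks up and serves records held in memory, returning all records that match a set specification and a given metadata prefix. The JDBCServer class is a generic server provided for the OAI servlet. The OAIServerIfc class provides a framework for OAI: it uses the metadata prefix for DC core metadata records and handles GetRecord, Identify, ListRecords, ListSets, ListMetadataFormats, and ListIdentifiers requests. The ResumableResultSet class describes a result that, combined with a resumption token, the client can retrieve as a series of partial results.

3.5 Implementation of the Data Provider Functions

(1) Item subscription and submission

Users can subscribe by e-mail to the items they need, which lets them share the digital library's resources. To submit an item, the user first enters the descriptive metadata for the item and then uploads the source files. After the system validates the submission, the files are published on the interoperability platform for everyone to browse and download.

(2) Workflow implementation

Three group leaders are responsible for a collection's workflow, and each of them completes a different workflow step. The workflow proceeds as follows. When the collection receives a submission, a group leader, if one is assigned, chooses to accept or reject it; if no group leader is assigned, that workflow step is skipped. The second and third steps handle the submission in the same way. When the first workflow step is invoked, the task of performing that step places the submission in a "task pool"; once a member of the group takes the task, it is removed from the pool. If a submission is rejected, the system e-mails the reason to the submitter, who can revise and resubmit it. If a submission is accepted, it moves on to the next workflow step.

(3) Search and browse

End users can discover content in several ways: searching by keyword, searching by an external ID, and browsing by title or data. Search is the most basic way to discover content in a digital library system. The search and indexing module of the OAI-PMH interoperability platform is a simple API that can index new content and regenerate the index across all communities and collections. The search interface of the system is shown in Figure 7.

4 Conclusion

This paper designed a MARC-to-DC format converter and the command verbs of an OAI-PMH-based interoperability platform, addressing the problem of metadata exchange between data providers and service providers in digital libraries. It presented the correspondence between the two formats and the implementation of the converter, and completed the implementation code of the command verbs, which helps promote the use of the OAI-PMH protocol in the digital library field.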