• No results found

Peer-to-peer data storage software Specification and source code

N/A
N/A
Protected

Academic year: 2021

Share "Peer-to-peer data storage software Specification and source code"

Copied!
10
0
0

Loading.... (view fulltext now)

Full text

(1)

agreement n° 283783

Peer-to-peer data storage software

Specification and source code

CODE:

DEL-078

VERSION:

01

(2)

Authors: Michal Jakubec (AUAV) Rene Hudec (AUAV) Collaborators:

Revised by: Raquel Cedazo (UPM)

Approved by: Francisco Sanchez Moreno (UPM)

Distribution List:

(3)

Change Control

Issue

Date

Section Page

Change Description

Reference Documents

Document Name

Code

Version

R.1 Peer-to-peer live broadcasting software: spec and source code update DEL-9.17 1

(4)

Contents

1.

Purpose of the document

...

5

2.

Introduction

...

5

3.

Data flow between hosts and clients

...

6

4.

Client access

...

6

5.

Host replication

...

7

6.

Data content types

...

7

7.

Infrastructure metadata types

...

8

8.

Possible types of protocols

...

8

9.

(5)

1. Purpose of the document

This document describes the basic architecture for the future development of the GLORIA data storage solution and provides possible approaches to particular technical problems, such as implementation of an appropriate data transport protocol, exchange of content, its metadata and proper enforcement of data integrity and security.

2. Introduction

The GLORIA distributed data store will be providing reliable data storage services for astronomical content acquired and/or processed through the facilities made available by the members and participants of the GLORIA network. There are many participants being able to provide and consume multimedia content, especially in form of astronomical images and large data set acquired by private astronomical equipment owner by each member of the GLORIA network. Astronomical data has to be able to flow from each participant to the GLORIA data storage network, as well as in the opposite direction, from the GLORIA data storage, on-demand basis according to each participant’s unique needs.

According to the requirements specified above, there are two distinctive levels of the final peer-to-peer data storage solution:

• data store hosts - provided by institutional members of the GLORIA network

• data store clients - represented by regular participants of the GLORIA network A whole concept of the possible solution is illustrated in the following diagram:

Data store hosts are capable to provide plenty of storage capacity to be able to fully or, at least, partially replicate contents of the GLORIA network data store. The amount and selection of data to be replicated to the particular host can be driven by various parameters, e. g. free storage capacity, amount of client access requests, type of data to be replicated, etc.

(6)

3. Data flow between hosts and clients

According to the roles of clients and hosts described above, there are two distinctive types of connections: ● client access

● host replication

Each type of connection has to provide a distinctive set of features allowing an efficient way how to exchange the required astronomical data between each participating node. While a typical client access connection is based on a well-known client-server paradigm, already employed in many protocol applications for the internetwork environment, host replication is rather an instance of the mesh-like communication network, frequently utilized in the content delivery networks or peer-to-peer solutions.

Additionally, the GLORIA data store must be able to enforce an appropriate storage policy over its hosts, to govern various aspects of its services, such as the amount or content replicas being spread over the data store hosts to provide failover capability and to fulfil globally established data retention needs.

On the other hand, main requirements of data store client participation are to be able to upload currently acquired astronomical data to the GLORIA data store and, on a subscription basis, to be able to download the selected content back for purposes of subsequent data processing or usage.

4. Client access

There are several types of data being requested by a client, especially: ● astronomical data, in form of:

○ content payload ○ metadata attributes ● web publication content, such as:

○ web pages ○ images ○ scripts, styles ○ discussion posts, etc.

(7)

For client to be able to exchange data with the storage network, a first step is to locate an appropriate host that will server client’s requests. One of the possible solutions is to establish some well-known domain naming system (DNS) name, e. g. “track.data.gloria-project.eu”. Therefore, each client will be able to locate a host capable of supplying an up-to-date list of all hosts currently providing data storage services to clients.

It’s up to each client to select an appropriate host from the list, e.g. it could be a host with the lowest possible latency, highest available bandwidth, least workload or just being nearby the client.

After the client selects the appropriate serving host and successfully authenticates to that host with the valid user credentials, it is ready to start particular content data exchange between that client and data storage network through that host.

5. Host replication

Main principle of host-to-host replication process is depicted in the following diagram:

(8)

Each host that is planned to be participating in the storage network must have assigned a certificate uniquely proving its identity as a authorized member of the GLORIA data store solution.

Then, through the registration process, it needs to manifest its availability and various capability parameters to the other hosts. This step can be easily accomplished through a regular client-type connection towards appropriately selected host, e.g. host with the lowest possible latency, currently available bandwidth, least workload or just being nearby it.

Finally, it becomes a member of the network and the content replication process can be initiated.

6. Data content types

Basically, there are several distinctive content types being replicated between hosts and made accessible to clients, especially:

● web publication content, such as: ○ web pages

○ images ○ scripts, styles ○ discussion posts, etc. ● astronomical data, such as:

○ content payload ● metadata attributes

Although there is almost no limit on particular kind of data being stored in the GLORIA data store, the main purpose of the solution is to provide storage services for astronomical data acquired or authoritatively published by members of the GLORIA project.

7. Infrastructure metadata types

For the storage network being able to cope with diverse amounts and requirement of accessing clients and peer hosts, there is definitely needed to manage and distribute several kinds of infrastructure-related metadata:

● host availability characteristics, in form of: ○ types of service being advertised

(9)

○ preferred replication partners

○ maintenance downtime period schedules ● host performance characteristics, such as amount of:

○ connected clients ○ available bandwidth ○ free storage space ● other related metadata

Each participating host has to be responsible for publishing relevant, up-to-date metadata about itself and use those provided by other host members of the network to assure optimal performance of the GLORIA data store as a whole.

8. Possible types of protocols

For both types of interaction, client-to-host and host-to-host, there has to be an appropriate application programming interface designed and established, ideally based on standards or public recommendations so that clients and hosts running on various operating system platforms will be able to connect and participate in a GLORIA data storage ecosystem without excessive programming efforts.

From the perspective of software architecture, there are several possible protocol candidates, fulfilling the requirement described above and useful especially for the metadata exchange:

● XML (eXtensible Markup Language) Web Services protocol

○ based on HTTP/SOAP (Hyper-text Transfer Protocol, Simple Object Access Protocol) and WSDL (Web Service Description Language) specification established by W3C

● JSON-RPC protocol

○ based on HTTP protocol and JSON (JavaScript Object Notation) format for payload ● XML-RPC protocol

○ based on HTTP protocol and XML format for payload

For exchange of data store content bits, the HTTP protocol itself could be an optimal, initial choice, because of its wide implementation and availability in many operating system platforms and programming environments.

Moreover, based on practical experience with the first versions of solution, there is always a possibility to extend the set of available content transport protocols, such as BitTorrent, according to future needs of the GLORIA project and its members and participants.

9. Conclusion

The GLORIA data storage network is one of the most important parts of the project, because it allows a reception, reliable storage and proper distribution of astronomical data acquired or published by members and participants of the GLORIA project.

This document provided an overall view of the key principles and solutions to particular problems being solved during the future implementation in various programming environments and operating system platforms.

(10)

GLORIA Partners

UPM

Universidad Politécnica de Madrid SPAIN

AUAV

Astronomical Institute, Academy of Sciences of the Czech Republic CZECH REPUBLIC

CSIC

Consejo Superior de Investigaciones Científicas SPAIN

INAT & CVUT

Czech Technical University in Prague CZECH REPUBLIC

IP-ASCR

Institute of Physics of the Academy of Sciences of the Czech Republic CZECH REPUBLIC

IAC

Instituto de Astrofísica de Canarias SPAIN

INAF

Istituto Nazionale di Astrofisica ITALY

SAO

Special Astrophysical Observatory of Russian Academy of Sciences RUSSIA

UCDNUID

University College Dublin IRELAND UC University of Chile CHILE UMA University of Malaga SPAIN UOXF University of Oxford UNITED KINGDOM UNIWARSAW Uniwersytet Warszawski POLAND

References

Related documents

With the aim of re-situating modern Iranian drama within the context of international dramatic developments, the plays of Gholamhossein Sa‘edi, Akbar Radi, and

Whether or not a practitioner routinely recommended mouthguards for his/her athletically active patient was addi- tionally analyzed by the number of years one had been practicing..

The main contributions of the present study are: 1) a semiau- tomated semantic annotation of image patches based on a large collection of high-resolution land cover SAR images; 2)

While under the earlier concept, investment companies were companies which has acquired shares/securities of its own group/holding/ subsidiary companies only and such acquisition

KAL Lounge Award program allows SKYPASS members to redeem miles for access to KAL Prestige Class Lounges operated by Korean Air. · The ticket must have been issued with a Korean

Results from the stochastic production function estimates indicate that Bt corn has a strong statistically signi fi cant mean yield-increasing e ff ect, and that Bt technology signi

Abbildung 52 Select and Modify MS Project XML Export Template - Custom Field Mapping Resource Assignments.... This document gives an overview of import and export functionality