• No results found

Best Practices for Sharing Imagery using Amazon Web Services. Peter Becker

N/A
N/A
Protected

Academic year: 2021

Share "Best Practices for Sharing Imagery using Amazon Web Services. Peter Becker"

Copied!
31
0
0

Loading.... (view fulltext now)

Full text

(1)

Best Practices for Sharing Imagery

using Amazon Web Services

(2)

Making Imagery Accessible

Store massive volumes of imagery on inexpensive cloud storageUse elastic compute for image services

Focus on Imagery

- Large in Volume - Static

- Basis for visualization and Analytics

Note:

Implementation of these workflows currently require additional tools and a component that is not publically available. Details for access will be provided at the end

Objectives

(3)

What’s Changed

NAS Standalone Direct Access Enterprise Cloud NFS Http Simple/ Object Store Elasticity

(4)

Amazon Storage Options

Ephemeral (80GB)

$0 (inc. with EC2)

NFS

EBS (Elastic Block Storage) 1TB/Disk ($100/TB/Month)

Fileshare

(5)

Amazon Storage Options

Ephemeral (80GB)

$0 (inc. with EC2)

NFS

EBS (Elastic Block Storage) 1TB/Disk ($100/TB/Month)

S3

Simple Storage $30/TB/Month 99.999999999% Durability 99.99% Reliability/year ≈10-50MB/s Http

(6)

Azure Storage Options

Temporary Disk (80GB) NFS+REST Azure Files 5TB, max 60MBps/Share Azure Blobs 500TB/container max 60MBps/Blob REST

Blobs

(7)

Optimizations

Suitable data structure:

- Enable access in single request

- Metadata

- Index

- Data Tiles

Cache to ephemeral/temporary storage

Object

Store

Use Inexpensive Scalable Storage

- Minimize Number of requests - Minimize Size of requests - Minimize Repeat requests

Enable ElasticityEnable Caching

Http

(8)

File Format Considerations / Optimization

Metadata Metadata Index Metadata Bad Index Metadata Pyramid Pyramid

Enable separate caching of metadata and index Non Optimum

Access

Enables partial Access

Striped TIF Tiled TIF MRF

Index Metadata

Tiling of imagery – Enables partial access

Compression – Reduce storage and transfer – Weigh against additional computeData access complexity – Reduce subsequent requests

separ

Cache Locally

(9)

MRF (Meta Raster) Format

NASA designed format

Split raster into three (or more) files

- MRF – Includes links to Index and DataFile - Index – Simple index to tiles

- DataFile – Data file

- Auxiliary files – e.g. Aux.xml

Open Source Implemented in GDALMultiple compression options

Multiple modes: Static, Split, Cloned, Caching

MRF file is seen as a Raster Dataset in ArcGIS

MRF file and index is small and can easily be on a local file systemData can be on slower storage

Index Metadata

Pyramid

(10)

Storage Formats & Compression

Raw/Striped TIF Sequential access / No compressionTiled GeoTIF Tiled Access

- Compression Options

- None

- Lossy JPEG………...…..Fast Lossy compression 8 & 12bit - Lossless Deflate/LZW…Generic lossless compressionJPEG2000 Tiled Access

- Compression Options - Lossy……….Computationally expensive - Lossless………Computationally expensive MRF Tiled Access - Compression Options - As geoTIF – JPEG,Deflate - PNG – Generic lossless

- LERC – Controlled Lossy – Very Fast – Optimum for higher bit depth and catagorical

(11)

Converting Data

ArcGIS CopyRasters, GDAL_Translate/GDALaddo

- Convert rasters, but don’t copy associated metadata

OptimizeRasters

- Python Script (Calls ArcGIS Version of GDAL)

- From any GDAL supported format to MRF or TiledTIF - Parameters to define compression, tilesize etc

(12)

Getting Imagery to and from the Cloud

AWS command line

CloudBerry, FileZilla, ….OptimizeRasters

- Copy and conversion in parallel

- Option to creates Cloned MRF, Caching MRF - Check success and retries if necessary

Import/Export

- Ship hard drives for large projects

Note: File Names on S3 are case sensitive

(13)

Conversion and Uploading

Using OptimizeRasters

(14)

Accessing

(15)

ArcGIS Prerequisites

ArcGIS typically access files through a file systemCannot access files in S3 directly

Use MRF Driver to point to MRF file that points to data on S3MRF Driver is custom GDAL Driver for ArcGIS

Available in 10.3.1* (with issues)Best to download – See later

Note: Following can be used to Optimize Enterprise implementations as wellWorks with data on S3, but also from any local storage

(16)

ArcGIS MRF Driver

Special MRF driver for ArcGIS

Incorporated into ArcGIS 10.3.1* Desktop & Server4 Modes StaticMRF Index Metadata Datafile One Directory

(17)

ArcGIS MRF Driver

Special MRF driver for ArcGIS

Incorporated into ArcGIS 10.3.1* Desktop & Server4 Modes StaticMRF SplitMRF Index Metadata Datafile Index Metadata Fast Expensive Storage Slow Cheap Storage Datafile

(18)

ArcGIS MRF Driver

Special MRF driver for ArcGIS

Incorporated into ArcGIS 10.3.1* Desktop & Server4 Modes

StaticMRF SplitMRF ClonedMRF

Index Metadata Datafile Index Metadata Fast Expensive Storage e.g. Ephemeral Slow Cheap Storage e.g. S3 Index Metadata Datafile Cache Index Metadata VSI Curl / Http Datafile

(19)

ArcGIS MRF Driver

Special MRF driver for ArcGIS

Incorporated into ArcGIS 10.3.1* Desktop & Server4 Modes

StaticMRF SplitMRF ClonedMRF CachingMRF

Index Metadata Datafile Index Metadata Datafile Index Metadata Datafile Cache Cached File Any GDAL Readable file Index Metadata XXX File Cache VSI Curl / Http Index Metadata VSI Curl / Http

(20)

Creating Local MRF Files that Point to Source

Use OptimizeRasters

- Creates MRF files on local file system (Ephemeral or EBS) - MRF points to Index and Data Files

- Copies Auxiliary files for RasterType

- Note MRF files can have any extension (incl. .tif)

If source is MRF – Use ClonedMRF

- Adds caching optimization

If source is other format – Use CachingMRF

- Enables source to be any Image on S3 (or SAN etc)

Define Cache location as Ephemeral

Index Metadata Datafile Cache Index Metadata XXX File Cache Index Metadata Auxiliary Files

Auxiliary Files Auxiliary Files

OptimizeRastes

(21)

Creating and Using

CachingMRF

(22)

Creating and Using

CachingMRF

(23)

Creating Mosaic Dataset

Use Standard Workflow

- Create Mosaic Dataset - Add Rasters

- Define and Build Overviews - Set Properties

- Perform QC - Publish

More Advance Workflow

- Create Source Mosaic Dataset for each project

- As for Standard Workflow

- Create Derived Mosaic Dataset

For more details see: Image Management Workflows

- Resource Center landing page http://esriurl.com/6005

- Guidebook in Help System http://esriurl.com/6007

(24)

Handling Overviews

Create Overviews on Ephemeral or EBS Use OptimizeRasters to copy to S3

Delete Overviews

Use Optimize Rasters to copy back MRF filesRepair paths (if necessary)

(25)

Publishing Image Services

For initial testing use Standard Workflow

- Right click Publish as Image Service

(26)

Configuring ArcGIS Server for Imagery in AWS

Use AutoScaling Group

Use Silo configuration of ArcGIS Server (each server is its own site)

Load Balancer – User Regular Amazon Elastic Load Balancer (ELB)

- Turn On Sticky session on using Duration-Based Session Stickiness

- Listen on ports 80 (http) & 443 (https)

- Install SSL certification on port 443

- Forward Port 80 (ELB) -> 6080 (ArcGIS Server) - Forward Port 443 (EBL) -> 6443 (ArcGIS Server)

On ArcGIS server turn on http & httpsCreate Master Server

Create AMI of Master ServerConfig AMI to Autoscaling group

Scale out and scale in based on CPU Load (typically CPU Load >80% or <20% for 5min)

(27)

Publish Elastic Image Services

Prepare Base

- Delete all Cache files - ZIP MD and MRF files - Store on S3

For each machine

- Use Deployer Script / Cloud Formation

Template to Start up

- Pull Mosaic Datasets, MRF overviews,

MRF source rasters from S3 in a zip file.

Above can be automated with Cloud

Formation templates available from Professional Services

(28)

Advanced Techniques

S3 Security

- Use ACL to define Public Read

- Else use Bucket Policy to define which IPs etc. can read

Embed MRF files into Mosaic Dataset

- MRF files fully encapsulated in Mosaic Dataset

PostgreSQL

(29)

Getting Access

Currently the scripts are in Prototype state

To get access to OptimzeRasters, MRF Drivers, PPT and doc:

- Send email to -

[email protected]

- Provide GitHub ID

- Name & email

(30)

Thank you…

Please fill out the session survey in your mobile appSelect

[Best Practices for Sharing Imagery using Amazon Web Services]

in the Mobile App

- Use the Search Feature to quickly find this title

Click “Technical Workshop Survey”

Answer a few short questions and enter any comments

(31)

References

Related documents

To optimize performance, the imagery was cached following guidelines similar to those described for the Riverside Street Map map service—in other words, using the same scale

ArcGIS – Service Oriented Architecture ArcGIS for Mobile ArcGIS Explorer Rich Internet Clients ArcGIS for Desktop ArcGIS for Server Data Tier Network... Desktop Tablets

These artificial images of scores were studied by means of Analysis of Variance to check if significant differences along the storage periods appear and to compare the behaviour

ArcGIS cloud product spectrum ArcGIS Server On Private Cloud ArcGIS Server On Amazon Web Services ArcGIS Online Manage it Yourself High Low.. ArcGIS in the Cloud

1 ArcGIS Mobile •Map •Collect •Search •Sync 5 MPC 4 3 ArcGIS ArcGIS SERVER SERVER ArcGIS.com ArcGIS.com ArcGIS Desktop •Design •Author •Setup •Extract 2 Mobile GP

The results indicate this is a true interactive (moderating) relationship, wherein the interaction of gender and sexual identity leads to unique educational attainment outcomes:

decisionmaker, who is minimizing the overall social costs, to have higher emissions for the sources that contribute risk to these locations If the increase in emissions from

To assess the relationship of gender patterns of suicidal behaviour between immigrants and natives in their country of origin, we used the established rates of completed suicide for