• No results found

Visualization and Data Analysis

N/A
N/A
Protected

Academic year: 2021

Share "Visualization and Data Analysis"

Copied!
19
0
0

Loading.... (view fulltext now)

Full text

(1)

Defense Programs

Working Group Outbrief

g

Visualization and Data Analysis

James Ahrens, David Rogers, Becky Springmeyer

Eric Brugger, Cyrus Harrison, Laura Monroe,

Dino Pavlakos

Scott Klasky, Kwan-Liu Ma, Hank Childs

1

March 23-24, 2011

(2)

Defense Programs

Working Group Description

g

The scope of our working group is scientific visualization and data

analysis.

– Scientific visualization

• refers to the process of transforming scientific simulation and

experimental data into images to facilitate visual understanding

– Data analysis

• refers to the process of transforming data into an information-rich form

via mathematical or computational algorithms to promote better understanding

understanding

– Data management - shared with IONS

• refers to the process of tracking, organizing and enhancing the use of

scientific data

The purpose of our work is to enable scientific discovery and

understanding.

– Our scope includes an exascale software and hardware infrastructure

2

March 23-24, 2011

Ou scope c udes a e asca e so t a e a d a d a e ast uctu e that effectively supports visualization and data analysis.

(3)

Defense Programs

Identify current state-of-the-art

g

Production visualization tools scalably process large data

– Suite of data-parallel visualization, analysis and rendering algorithmsSuite of data parallel visualization, analysis and rendering algorithms

• Open-source tools – ParaView, Visit • Commercial tool – Ensight

ASC success story

– NNSA/ASC created the large-data scientific visualization tools in use

by other agencies (NSF, DOD) and around the worldy g ( , )

3

(4)

Defense Programs

Identify Exascale Visualization and Data Analysis

Needs

g

Visualization and Data Analysis (VDA)

Broad range of scope for VDA

– VDA as an application – VDA as a service

– VDA as a systems infrastructure

Note: like apps, VDA capabilities will require development to

exploit opportunities in evolving platforms

4

(5)

Defense Programs

1. Exascale Challenges – storing a full-range

of results for later analysis becomes

impossible due to technology trends

p

gy

g

The rate of performance improvement of rotating storage is not

keeping pace with compute.

Provisioning additional disks is a possible mitigation strategy,

however, power, cost and reliability issues will become a

significant issue.

A new in-situ exascale visualization and data analysis approach is

needed:

needed:

– Slow output to data hierarchy / Data movement = power/cost – Where will we process data?

• Different customer-driven approaches require integration at different HW

‘levels’

5

March 23-24, 2011 Workshop on R&D Challenges for HPC Simulation Environments

(6)

Defense Programs

2. Exascale Challenges - Exascale simulation

results must be distilled with quantifiable

data reduction techniques

q

g

Exascale as massive data

– Defacto data reduction techniqueDefacto data reduction technique

Visualization algorithms

Rendering massive numbers of polygons

– This puts lots of data into a single pixel, combined by the renderer

• This is a workable method but is it what the user wants?

– This approach provides the foundation for our current successes

• brute force approach that requires significant computing resources • difficult to quantify the bias of this approach

Approaches that quantifiably reduce data as it is generated need

to be explored

6

March 23-24, 2011 Workshop on R&D Challenges for HPC Simulation Environments

(7)

Defense Programs

3. Exascale Challenges - New exascale-enabled

physics approaches require corresponding new

visualization and data analysis approaches

g

Implication of exascale as massive compute

– Statistical physics approachesStatistical physics approaches

• Statistical modeling of a physical process

– Parametric studies

• record how a simulation responds in a parameter space of possibilities

– Multi-physics approaches

• simulate a linked model of different related phenomena such as a linked

physics and chemistry simulation

M lti l h

– Multi-scale approaches

• simulate phenomena at different spatial and temporal scales

Understanding and presenting both summarized and highlighted

results from multiple sources is an important technical challenge

that needs to be addressed.

7

March 23-24, 2011 Workshop on R&D Challenges for HPC Simulation Environments

(8)

Defense Programs

4. Exascale Challenges - Visualization and

data analysis approaches will need to run

efficiently on exascale platform architectures

y

p

g

– Need to take advantage of a very high degree of parallelism

– Technical challenges include achieving portability, efficiency and

integration flexibility with simulation codes

8

March 23-24, 2011 Workshop on R&D Challenges for HPC Simulation Environments

(9)

Defense Programs

1. Path Forward - New visualization and data

analysis software infrastructure

g

– When?: Run-time, Postprocessing

Required Partnership

– IONS, tools, systems – How?: Interactive, Batch

In-situ analysis within the simulation

code

– Apps for co-design

Metric

– Our success will be measured

– Run-time, (batch or interactive)

Post-processing --- advanced

query-based approach

Our success will be measured by our readiness for

applications as machine delivery milestones are met

Revolutionary approach

– Phase 1 P t t h i

Risks

• Prototype approaches in applications – Phase 2&3

• Develop and deploy

– How to do discovery science

in an in-situ world?

– Won’t find an effective

9

p p y

• Continue R&D analysis approach for

exascale applications

(10)

Defense Programs

2. Advanced quantifiable data reduction

algorithms

g

Data triage

– How do we significantly

Required Partnership

– Applied math How do we significantly reduce the data as it is generated?

• Statistical sampling

Applied math

– Apps for co-design

Metric

M f t f d t • Compression • Multi-resolution • Science-based feature extraction

– Measure of amount of data

reduced and quality of result, time extraction

Revolutionary approach

– Phase 1 P t t h

Risks

• Prototype approaches

independently and with applications

– Phase 2&3

– Won’t find an effective

analysis approach for exascale applications

10

• Develop and deploy • Continue R&D

(11)

Defense Programs

3. Visualization and data analysis techniques

to help understand advanced exascale

physics

g

p y

Visualization and Data Analysis for:

– Statistical physics approaches

Required Partnership

– Applications for co-design

– Parametric studies

– Multi-physics approaches – Multi-scale approaches

h lt f diff t t f

Applications for co design

Metric

– Ties to appropriate

application milestones

how results from different aspects of a simulation suite relate to each other

application milestones

Evolutionary/Revolutionary

Evolutionary/Revolutionary

approach

– Phase 1 P t t h

Risks

– Won’t understand output of

exascale applications

• Prototype approaches

independently and with applications

– Phase 2&3

exascale applications

11

• Develop and deploy • Continue R&D

(12)

Defense Programs

4. Implement core visualization and

data-analysis capability using a scalable parallel

infrastructure

g

– Our visualization and data

analysis solutions need to

Required Partnership

– Programming models, tools

work on the exascale

supercomputers on both swim

lanes.

Programming models, tools and applications groups

Metric

Our success will be measured

– Our success will be measured

by our readiness for

applications as machine delivery milestones are met

Evolutionary approach

– Phase 1

P t t h

Risks

– Not running on the machines

• Prototype approaches

independently and with applications

– Phase 2&3

12

• Develop and deploy • Continue R&D

(13)

Defense Programs

5. Exascale visualization and data analysis

hardware infrastructure

g

Data-intensive hardware

infrastructure for the exascale

f

Required Partnership

– HW, Systems, I/O

platform

– Memory buffers for staged

analysis and storage

HW, Systems, I/O

Metric

– Ties to appropriate machine

milestones

– Analysis-enabled storage – Large node memory portion

of the supercomputer milestones

Evolutionary/Revolutionary

approach

– Phase 1

Risks

– HW platform that makes data

analysis difficult

– Phase 1

• Prototype approaches

independently and with applications

analysis difficult

13

– Phase 2&3

• Develop and deploy • Continue R&D

(14)

Defense Programs

6. Tracking and using knowledge about the

scientific goals makes

visualization and data analysis more effective

y

g

Provenance

– Where the data comes from?

Required Partnership

– Tools, Systems, Applications – How was it generated?

– Uncertainty quantification – Workflow management and

b d

Tools, Systems, Applications for co-design

Metric

Time to solution cost beyond

Monitoring, debugging

– Supercomputer “situational

– Time to solution, cost,

improved quality p p awareness”

Revolutionary approach

– Phase 1

Risks

– Won’t know where data came

from

– Phase 1

• Prototype approaches

independently and with applications from – Inefficient use of supercomputer 14 – Phase 2&3

• Develop and deploy • Continue R&D

(15)

Defense Programs

Recommended Co-Design Strategy

g

Critical steps/activities

– Data Analysis and Movement are critical cross-cutting issuesData Analysis and Movement are critical cross cutting issues

– Develop well-defined HW needs for VDA

• In-situ and interactive data extraction could benefit from co-design – (e.g. resource management, time series analysis)

– Pilot co-design projects to help develop teams and define co-design

processes

• These can be subsets – VDA and IO working with a defined app

Working with vendors

– Well-defined communication with partners and vendors

– Visualization software vendors (e.g. Kitware, CEI), platform vendors( g , ), p

– Path Forward investments

15

(16)

Defense Programs

Recommended Co-Design Strategy

g

Role of skeleton/compact apps

– VDA mini-appsVDA mini apps

• Investigate coupling of in-situ capabilities with applications at scale

– Inform Architecture decisions by providing information on use

patterns

Concerns/suggestions

– Organization effects outcomes

• Partnering with Verification/Validation applied mathPartnering with Verification/Validation, applied math

• Overlap with other groups

– Influence we can have on HW vendors w/in timeframe

• Particularly with respect to datay p

– Co-design should have significant investment and sustained

commitment

16

(17)

Defense Programs

Big Picture Issues

g

Coordination

– VDA will be more tightly coupled with apps than in the pastVDA will be more tightly coupled with apps than in the past

– Data movement and provenance are shared responsibilities

Test beds

C t t b d i l t ti E l it

– Common test beds crucial to creating Exascale community – Specialized test beds may be necessary

Simulators

– Requirement: include adequate characterization of data movement

• Unclear if there are requirements for VDA-only simulation components

– Requirement: system-level simulation, to model various VDA use

cases

Remaining gaps

– Interactive data analysis (e.g. querying and extraction) for Exascale is

17

March 23-24, 2011

Interactive data analysis (e.g. querying and extraction) for Exascale is coupled with in-situ (what you can compute at runtime) and

computation on data (what you want to understand after the simulation)

(18)

Defense Programs

Conclusions

g

A d t

i

t d

ti

i

iti

l t

A data-oriented perspective is critical to

exascale success

18

(19)

Defense Programs

End

g

19

March 23-24, 2011 R&D Challenges for HPC Simulation Environments

References

Related documents

The motifs are reflecting the essential behaviour of the complex network and can capture the signifi- cant behaviour of the full Arabidopsis flowering model and can

Typically, this surgery does not last as long as your original device implant because the doctor will usually plug the new device into your already existing heart leads?. Will

In the Dual Antiplatelet Therapy (DAPT) Study, patients who were free from major ischemic or bleeding events at 1 year after coronary stenting (either drug-eluting [DES] or bare metal

Second service step-down The available J-Care services offerings for the product will be capped at Core and CorePlus support. The available JNSAC service offerings will be capped

In November 2010, Transcontinental announced that these acquired firms will be grouped under Transcontinental Interactive to offer customers marketing strategy and planning

Kod drugih ustavnih prava moguće je samo posredno djelovanje ustavnih prava, i to na način da je prava koja su dana samo u zakonsko uređenim odnosima među pojedincima

High AEES, Low overlap (False Negative FN): Clusters that have a high AEES and have a low (<50%) node or edge overlap indicates clusters that were not found in the

In 2008, the state procured the new Accela solution , which includes the Accela software (web portal and back office permit system), and the hosting service to run both systems.