(1)

Scientific Computing at NCEAS

Jim Regetz & Rick Reeves

National Center for Ecological Analysis & Synthesis

Winter 2011

(2)

NCEAS Scientific Computing team

Who we are. . .

Rick Reeves: Geography, spatial data processing/analysis

Jim Regetz: Ecology, statistics, numerical modeling

Where you can find us. . .

Office: Rm 301 (by the elevators)

Email: [email protected], [email protected]

Web: http://www.nceas.ucsb.edu/scicomp

(3)

NCEAS technology support areas

Analysis

Collaboration

Data & Metadata

(4)

Synthesis is an uphill battle

Discovery

Integration

Analysis

Publication

The literature is the final resting place for much of our knowledge.

Data synthesis and re-analysis are hard in a publication-driven world!

(7)

Collaboration tools and services

Tech support available on site

Networking, printing, laptops, lab computers, video conferencing, smartboards, remote collaboration

Password-protected group website

Analytical server, source code repository, and more

NCEAS Help Wiki:

(9)

Analysis support

Data discovery & acquisition

Consult with us about “base layer” data (e.g., global climate, digital terrain models)

Data management & analysis

Recommend best practices and appropriate software

Discuss statistical & analytical approaches

Lend a hand with programming, etc.

Documentation & archiving

How/where can you capture your analytical procedures?

How/where can you archive data products?

(12)

Analytical software comparison

Criteria: Flexible, Scripted, Reliable, Cross-platform, Open source, Free!

R: x x x x x x
Matlab: x x x x
SAS: x x x x
Excel:
GRASS: x x x x x x
PostGIS: x x x x x x
ArcGIS: x x x

(16)

Your responsibilities

1 Identify a Data Point of Contact for your group

2 Establish a timetable for documenting/archiving data

3 Create metadata entries in the NCEAS Data Repository for your data sets, as soon as possible!

4 Archive completed data sets in the Data Repository

NCEAS Data Repository

(20)

NCEAS Ecoinformatics program

Research

Better managing heterogeneous data

Capturing reproducible scientific workflows

Interpreting data/analyses using semantic technologies

Tools & services

Metadata standard for scientific data (EML)

Online repository for ecological data/metadata (KNB)

Advanced storage system for data/metadata (Metacat)

Desktop software for managing metadata (Morpho)

Distributed scientific workflow platform (Kepler)

(23)

Real world data vary in format, structure, semantics

(24)

Let data vary, but formalize metadata

Metadata makes data useful

Needed to assess/interpret data

Provides critical context

Owner, Abstract, Methods

Variables, Sites, Units

Ecological Metadata Language (EML)

Metadata standard: rules about what and how to say things

Developed by scientists, for scientists

Human-readable, yet machine-interpretable
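
To make "human-readable, yet machine-interpretable" concrete, here is a minimal sketch of a metadata record that a person can read and a script can check. It is not EML: the field names simply mirror the items listed on this slide, and the record contents and the `missing_fields` helper are invented for illustration.

```python
# Hypothetical field names taken from the slide's list; this is NOT EML itself.
REQUIRED_FIELDS = ("owner", "abstract", "methods", "variables", "sites", "units")


def missing_fields(record):
    """Return the required fields that are absent or empty in a metadata record."""
    return [field for field in REQUIRED_FIELDS if not record.get(field)]


# An invented example record, for illustration only.
record = {
    "owner": "Example Working Group",
    "abstract": "Plot-level plant cover, surveyed annually.",
    "methods": "Visual cover estimates in 1 m^2 quadrats.",
    "variables": ["plot_id", "year", "cover"],
    "sites": ["Site A"],
    "units": {},  # empty: units have not been documented yet
}

print(missing_fields(record))  # prints ['units']
```

A formal standard such as EML plays the same role at a much larger scale: because the structure is fixed, software can find gaps or search across many data sets without a person reading each description.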

(28)

Data documentation made easy

Basic option: Web upload

http://data.nceas.ucsb.edu

Simple online form

General metadata only

Upload data files

Advanced option: Install Morpho desktop software (free!)

Create & save detailed metadata using wizards

1 Manage your data on your own computer

2 Share with specific colleagues: set access privileges

3 Share metadata/data with entire community

(30)

Search online (http://knb.ecoinformatics.org)

Participants: NCEAS, LTER, OBFS, PISCO, ESA, UCNRS, SANParks, TERN . . . and more

(31)

Thanks for listening. . .
