• No results found

Setting the Direction for Big Data Benchmark Standards Chaitan Baru, PhD San Diego Supercomputer Center UC San Diego

N/A
N/A
Protected

Academic year: 2021

Share "Setting the Direction for Big Data Benchmark Standards Chaitan Baru, PhD San Diego Supercomputer Center UC San Diego"

Copied!
20
0
0

Loading.... (view fulltext now)

Full text

(1)

2012 SNIA Analytics and Big Data Summit. © Univ. of California San Diego. All Rights Reserved.

Setting the Direction for Big Data

Benchmark Standards

Chaitan Baru, PhD

San Diego Supercomputer Center

UC San Diego

(2)

2012 SNIA Analytics and Big Data Summit. © Univ. of California San Diego. All Rights Reserved. 2

  Industry’s first workshop on big data benchmarking

  Acknowledgements

  National Science Foundation (NSF)

  SDSC industry sponsors: Seagate, Greenplum,

NetApp, Mellanox, Brocade (host)

  WBDB2012 steering committee

 Chaitan Baru, SDSC/UC San Diego

 Milind Bhandarkar, Greenplum/EMC

 Raghunath Nambiar, Cisco

 Meikel Poess, Oracle

(3)

2012 SNIA Analytics and Big Data Summit. © Univ. of California San Diego. All Rights Reserved.

Invited Attendee Organizations

  Actian   AMD   BMMsoft   Brocade   CA Labs   Cisco   Cloudera   Convey Computer   CWI/Monet   Dell   EPFL   Facebook   Google   Greenplum 3

  San Diego Supercomputer Center

  SAS

  Scripps Research Institute

  Seagate   Shell   SNIA   Teradata Corporation   Twitter   UC Irvine   Univ. of Minnesota   Univ. of Toronto   Univ. of Washington   VMware   WhamCloud   Yahoo! •  Hewlett-Packard •  Hortonworks

•  Indiana Univ / Hathitrust

Research Foundation •  InfoSizing •  Intel •  LinkedIn •  MapR/Mahout •  Mellanox •  Microsoft •  NSF •  NetApp •  NetApp/OpenSFS •  Oracle •  Red Hat

Meeting structure: 6 x 15mts invited presentations; 35 x 5mts lightning talks. Afternoons: Structured discussion sessions

(4)

2012 SNIA Analytics and Big Data Summit. © Univ. of California San Diego. All Rights Reserved.

Topics of discussions

  Audience: Who is the audience for this benchmark?

  Application: What application should we model?

  Single benchmark spec: Is it possible to develop a single benchmark

to capture characteristics of multiple applications?

  Component vs. end-to-end benchmark. Is it possible to factor out a

set of benchmark “components”, which can be isolated and plugged into an end-to-end benchmark(s)?

  Paper and Pencil vs Implementation-based. Should the

implementation be specification-driven or implementation-driven?

  Reuse. Can we reuse existing benchmarks?

  Benchmark Data. Where do we get the data from?

  Innovation or competition? Should the benchmark be for innovation

(5)

2012 SNIA Analytics and Big Data Summit. © Univ. of California San Diego. All Rights Reserved.

Audience: Who is the primary

audience for a big data benchmark?

  Customers

 Workload should preferably be expressed in

  English

  Or, a declarative Language (unsophisticated user)

  But, not a procedural language (sophisticated user)

  Want to compare among different vendors

  Vendors

  Would like to sell machines/systems based on benchmarks

  Computer science/hardware research is also an audience

  Niche players and technologies will emerge out of academia

(6)

2012 SNIA Analytics and Big Data Summit. © Univ. of California San Diego. All Rights Reserved.

Applications: What application should

we model?

  Possibilities

  An application that somebody could donate

  An application based on empirical data

 Examples from scientific applications

  Multi-channel retailer-based application, like the

amended TPC-DS for Big Data?

 Mature schema, large scale data generator, execution rules,

audit process exists.

  “Abstraction” of an Internet-scale application, e.g.

data management at the Facebook site, with synthetic data

(7)

2012 SNIA Analytics and Big Data Summit. © Univ. of California San Diego. All Rights Reserved.

Single Benchmark vs Multiple

  Is it possible to develop a single benchmark to represent

multiple applications?

  Yes, but not desired if there is no synergy between the

benchmarks, e.g. say, at the data model level

  Synthetic Facebook application might provide context

for a single benchmark

  Click streams, data sorting/indexing, weblog

(8)

2012 SNIA Analytics and Big Data Summit. © Univ. of California San Diego. All Rights Reserved.

Component benchmark vs. end-to-end

benchmark

  Are there components that can be isolated and plugged

into an end-to-end benchmark?

  The benchmark should consist of individual components

that ultimately make up an end-to-end benchmark

  The benchmark should include a component that

extracts large data

  Many data science applications extract large data and

then visualize the output

  Opportunity for “pushing down” viz into the data

(9)

2012 SNIA Analytics and Big Data Summit. © Univ. of California San Diego. All Rights Reserved.

Paper and Pencil / Specification driven

versus Implementation driven

  Start with an implementation and develop specification at

the same time

  Some post-workshop activity has begun in this area

(10)

2012 SNIA Analytics and Big Data Summit. © Univ. of California San Diego. All Rights Reserved.

Where Do we Get the Data From?

  Downloading data is not an option

  Data needs to be generated (quickly)

  Examples of actual datasets from scientific applications

  Observational data (e.g. LSST), simulation outputs

  Using existing data generators (TPC-DS,

TPC-H)

  Data that is generic enough with good characteristics is

(11)

2012 SNIA Analytics and Big Data Summit. © Univ. of California San Diego. All Rights Reserved.

Should the benchmark be for

innovation or competition?

  Innovation and competition are not mutually exclusive

  Should be used for both

  The benchmark should be designed for competition,

such a benchmark will then also be used internally for innovation

  TPC-H is a prime example of a benchmark model that

could drive competition and innovation (if combined correctly)

(12)

2012 SNIA Analytics and Big Data Summit. © Univ. of California San Diego. All Rights Reserved. !" ‡#$%&'("" - ')*+,+-.",/00-12"3).*45617"81-5"#16.,6*2+-."$1-*),,+.9" $)18-156.*)"%-/.*+:"" - 4220;<<===>20*>-19<20*?,<,0)*<20*?,@A>A>B>0?8" ‡C4D"3/+:?"-."2-0"-8"#$%&'(E" ‡F-:/5)";"" - G-"24)-1)2+*6:":+5+2" - #),2)?"/0"2-"ABB"#H"" ‡F):-*+2D";"1-::+.9"/0?62),"" ‡F61+)2D" - 5LFKUHODWLRQDOPRGHODQGGDWDWDEOHVIDFWWDEOHV«" - I6,D"2-"6??"24)"-24)1"2=-",-/1*)," " " " !"#$%&'&$!()*+,&-.$%&'&$/01(2$

Can we reuse existing benchmarks?

  Yes, we could but we need to discuss:

  How much augmentation is necessary?   Can the benchmark data be scaled

  If the benchmark uses SQL, we should not require it

  Examples: but none of the following could be used unmodified   Statistical Workload Injector for Map Reduce (SWIM)

  GridMix3 (lots of shortcomings)

 Open source

  TPC-DS

  YCSB++ (lots of shortcomings)

  Terasort – strong sentiment for using this, perhaps as part of an “end-to-end” scenario

(13)

2012 SNIA Analytics and Big Data Summit. © Univ. of California San Diego. All Rights Reserved.

Keep in mind principles for good

benchmark design

  Self-scaling, e.g. TPC-C

  Comparability between scale factors

  Results should be comparable at different scales

  Technology agnostic (if meaningful to the application)

  Simple to run

3 TPC

+Longevity: TPC-C has carried the load for 20 years

+Comparability

‡Audit requirements and strict detailed run rules mean one can compare results published by two different entities

+Scaling

‡Results just as meaningful at the high-end of the market as at the low-end; as relevant on clusters as on single servers

- Hard and expensive to run

- No kit

- DeWitt clauses

(14)

2012 SNIA Analytics and Big Data Summit. © Univ. of California San Diego. All Rights Reserved.

Extrapolating Results

  TPC benchmarks typically run on “over-specified”

systems

  i.e. Customer installations may have less hardware

than benchmark installation (SUT)

  Big Data Benchmarking may be opposite

  May need to run benchmark on systems that are

smaller than customer installations

 Can we extrapolate?

  Scaling may be “piecewise linear”. Need to find those

inflexion points

(15)

2012 SNIA Analytics and Big Data Summit. © Univ. of California San Diego. All Rights Reserved.

Interest in results from the Workshop

  June 9, 2012:

  Summary of Workshop on Big Data Benchmarking, C. Baru, Workshop on

Architectures and Systems for Big Data, Portland, OR.

  June 22-23, 2012:

  Towards Industry Standard Benchmarks for Big Data, M. Bhandarkar, The

Extremely Large Databases Conference at Asia, June 22-23, 2012.

  August 27-31, 2012:

  Setting the Direction for Big Data Benchmark Standards, Baru, Bhandarkar,

Nambiar, Poess, Rabl, 4th Int’l TPC Technology Conference (TPCTC 2012), with

VLDB2012, Istanbul, Turkey

  September 10-13, 2012:

  Introducing the Big Data Benchmarking Community, Lightning Talk and Poster at

the 6th Extremely Large Database Conference (XLDB), Stanford Univ, Palo Alto.

  September 20, SNIA ABDS2012

  September 21, Workshop on Managing Big Data Systems (MBDS), San

Jose, CA.

(16)

2012 SNIA Analytics and Big Data Summit. © Univ. of California San Diego. All Rights Reserved.

Big Data Benchmarking Community

(BDBC)

  Biweekly phone conferences

  Group communications and coordination is facilitated

via phone calls every two weeks .

  See http://clds.sdsc.edu/bdbc/community for call-in

information

  Contact [email protected] or [email protected]

to join BDBC mailing list

(17)

2012 SNIA Analytics and Big Data Summit. © Univ. of California San Diego. All Rights Reserved.

Big Data Benchmarking Community

Participants

o  Actian o  AMD o  Argonne National o  Labs o  BMMsoft o  Brocade o  CA Labs o  Cisco o  Cloudera o  CMU o  Convey Computer o  CWI/Monet o  DataStax o  Dell o  EPFL o  Facebook o  GaTech o  Google o  Greenplum o  Hortonworks o  Hewlett-Packard o  IBM o  IndianaU/HTRF o  InfoSizing o  Intel o  Johns Hopkins U. o  LinkedIn o  MapR/Mahout o  Mellanox o  Microsoft o  SLAC o  SNIA o  Teradata o  Twitter o  Univ. of Minnesota o  UC Berkeley o  UC Irvine o  UC San Diego o  Univ of Passau o  Univ of Toronto o  Univ. of Washington o  VMware o  WhamCloud o  Yahoo ! o  … o  NetApp o  Netflix o  NIST o  NSF o  OpenSFS o  Oracle o  Ohio State U. o  Paypal o  PNNL o  Red Hat

o  San Diego Supercomputer

Center

o  SAS

o  Seagate

(18)

2012 SNIA Analytics and Big Data Summit. © Univ. of California San Diego. All Rights Reserved.

2

nd

Workshop on Big Data

Benchmarking: India

  December 17-18, 2012, Pune, India

  Hosted by Persistent Systems, India   See http://clds.ucsd.edu/wbdb2012.in   Themes:

  Application Scenarios/Use Cases   Benchmark Process

  Benchmark Metrics   Data Generation

  Format: Invited speakers; Lightning talks; Demos   We welcome sponsorship

  Current Sponsors: NSF, Persistent Systems, Seagate, Greenplum, Brocade, Mellanox

(19)

2012 SNIA Analytics and Big Data Summit. © Univ. of California San Diego. All Rights Reserved.

3

rd

Workshop on Big Data

Benchmarking: China

  June 2013, Xian, China

  Hosted by Shanxi Supercomputing Center

  See http://clds.ucsd.edu/wbdb2013.cn

  Current Sponsors: Seagate, Greenplum, Mellanox,

Brocade?

(20)

2012 SNIA Analytics and Big Data Summit. © Univ. of California San Diego. All Rights Reserved.

The Center for Large-scale Data

Systems Research: CLDS

  At the San Diego Supercomputer Center, UC San Diego

  R&D center and forum for discussions on big

data-related topics

  Benchmarking; Taxonomy of data growth; Information

value; Focus on industry segments, e.g healthcare

  Workshops; Program-level activities; Project-level

activities

  For information and sponsorship opportunities:

  Contact: Chaitan Baru, SDSC, [email protected]

  Thank you!

References

Related documents

In line with tumor metrics, gene expression analysis of EP and OT treated models showed some overlapping trends in resistance mechanisms supporting tumor response of incomplete

Jayanthakumaran and Frank (2004) test the hypothesis that trade reforms have had a positive impact on manufacturing exports, using both time series and cross-sectional data,

domestic finance to adequately fill the void left by the decline of London and the breakdown of the world financial system in the interwar period, when neither the Buenos Aires

I initially decided to write this thesis when I found an important similarity between Bourdieu and Heidegger: The concepts Dasein and world 3 have a similar structural role

In the home environment, target entities are those that are relevant for being monitored and controlled by applications such as energy management, security/safety

difficile spores could be transmitted from the farm environment to humans through a number of mechanisms including direct contact, airborne dispersal, avian, rodent or arthropod

The rationale for these cut scores is that if the lowest-scoring 15% of the national student population has consistently been found to be at severe risk in reading and math, and

These results of the empirical analyses discussed so far are all in line with the theoretical expectations: due to knowledge spillovers and innovation spillovers, clusters