
Summary of DA-PWC Performance

Data

Data for this analysis were taken from Region 5 (41870 points total) of the Fungi sequence classification at http://salsahpc.indiana.edu/millionseq/fungi2/fungi2_index.html

Dataset  # Points  # Clusters  Description
A        12579     4           Region 5 sub cluster 2 (of 10) split into 4 clusters
B        20000     10          Region 5 first 20k points split into 10 clusters
C        40000     10          Region 5 first 40k points split into 10 clusters

Parallel Libraries

Dataset  C# MPI          C# Tasks                                   Java MPI           Java Tasks
A        MPI.NET v1.0.0  Microsoft Task Parallel Library .NET v4.0  OpenMPI v1.7.5rc5  Habanero Java (HJ-lib) v0.1
B        MPI.NET v1.0.0  Microsoft Task Parallel Library .NET v4.0  OpenMPI v1.8.1     Habanero Java (HJ-lib) v0.1.1
C        MPI.NET v1.0.0  Microsoft Task Parallel Library .NET v4.0  OpenMPI v1.8.1     Habanero Java (HJ-lib) v0.1.1

Parallelisms

Total  T  P  N
1      1  1  1

2      1  1  2
       1  2  1
       2  1  1

4      1  1  4
       1  2  2
       1  4  1
       2  1  2
       2  2  1
       4  1  1

8      1  1  8
       1  2  4
       1  4  2
       1  8  1
       2  1  4
       2  2  2
       2  4  1
       4  1  2
       4  2  1
       8  1  1

16     1  1  16
       1  2  8
       1  4  4
       1  8  2
       2  1  8
       2  2  4
       2  4  2
       4  1  4
       4  2  2
       8  1  2

32     1  1  32
       1  2  16
       1  4  8
       1  8  4
       2  1  16
       2  2  8
       2  4  4
       4  1  8
       4  2  4
       8  1  4

64     1  2  32
       1  4  16
       1  8  8
       2  1  32
       2  2  16
       2  4  8
       4  1  16
       4  2  8
       8  1  8

128    1  4  32
       1  8  16
       2  2  32
       2  4  16
       4  1  32
       4  2  16
       8  1  16

256    1  8  32
       2  4  32
       4  2  32


Where,

T - # of concurrent tasks per MPI process
P - # of MPI processes per node
N - # of nodes
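The total parallelism in the table above is the product T x P x N. As a rough illustration only, the following Python sketch enumerates the power-of-two decompositions of a given total. The caps of 8 on T x P and 32 on N are assumptions inferred from the listed rows, not limits stated in this summary, and the function name `decompositions` is hypothetical.

```python
from itertools import product

# Assumed caps, inferred from the rows listed in the Parallelisms table.
MAX_TASKS_TIMES_PROCS = 8   # assumed upper bound on T * P within one node
MAX_NODES = 32              # assumed upper bound on N


def powers_of_two(limit):
    """Yield 1, 2, 4, ... up to and including limit."""
    value = 1
    while value <= limit:
        yield value
        value *= 2


def decompositions(total):
    """Return sorted (T, P, N) tuples with T * P * N == total under the assumed caps."""
    allowed_nodes = set(powers_of_two(MAX_NODES))
    combos = []
    for t, p in product(powers_of_two(MAX_TASKS_TIMES_PROCS), repeat=2):
        if t * p > MAX_TASKS_TIMES_PROCS:
            continue
        n, remainder = divmod(total, t * p)
        if remainder == 0 and n in allowed_nodes:
            combos.append((t, p, n))
    return sorted(combos)


if __name__ == "__main__":
    for total in (1, 2, 4, 8, 16, 32, 64, 128, 256):
        print(total, decompositions(total))
```

Because the caps are inferred, the enumeration may include combinations that were not run, or that are missing from the table as extracted.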

Dataset Parallelisms Tested

A 1,2,4,8,32,64,128,256

B 64,128,256

C 64,128,256

System Information

System Description

Tempest 32 nodes, each has 4 Intel Xeon E7450 CPUs at 2.40GHz with 6 cores, totaling 24 cores per node; 48 GB node memory and 20Gbps Infiniband (IB) network connection. It runs Windows Server 2008 R2 HPC Edition – version 6.1 (Build 7601: Service Pack 1).

FutureGrid


Performance Results

Dataset A – 12579 Points

[Figure: Dataset A - Performance with varying nodes for different TxP combinations. X-axis: TxP (top), Nodes (middle), Total Parallelism (bottom). Y-axis: Time (hr), 0 to 12. Series: ScaledMPINET, OMPI.]


The highlighted section of the above graph is presented below.

[Figure: Dataset A - Performance with varying nodes for different TxP combinations - Selected. X-axis: TxP (top; 1x8, 2x4, 4x2, 8x1), Nodes (middle; 1 to 32), Total Parallelism (bottom; 8 to 256). Y-axis: Time (hr), 0 to 1.6. Series: ScaledMPINET, OMPI.]

The above graph shows four intra-node parallelism patterns: 1x8, 2x4, 4x2, and 8x1. Note how 8x1 performs poorly compared to the other three in OMPI. This is due to the topology of the FutureGrid machines, where a socket contains only 4 cores. Running 1 MPI process with more than 4 tasks therefore does not yield good performance.
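The socket argument can be written as a simple check. The sketch below is illustrative only: it assumes each task occupies its own core and that a process's tasks are packed onto a single socket whenever they fit, neither of which is stated in this summary. The helper name `spans_sockets` is hypothetical; the value of 4 cores per socket is taken from the text.

```python
CORES_PER_SOCKET = 4  # FutureGrid socket size as stated in the text


def spans_sockets(tasks_per_process, cores_per_socket=CORES_PER_SOCKET):
    """Return True if one MPI process runs more tasks than a socket has cores.

    Assumes one task per core, so such a process cannot stay within one socket.
    """
    return tasks_per_process > cores_per_socket


# TxP patterns from the figure: T tasks per process, P processes per node.
patterns = [(1, 8), (2, 4), (4, 2), (8, 1)]
crossing = [(t, p) for t, p in patterns if spans_sockets(t)]

if __name__ == "__main__":
    for t, p in patterns:
        status = "crosses sockets" if spans_sockets(t) else "fits in one socket"
        print(f"{t}x{p}: {status}")
```

Under these assumptions only the 8x1 pattern is flagged, which matches the pattern the text singles out for OMPI.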


A different representation of the above graph shows the effect of TxP patterns more clearly.

[Figure: Region 5(10)_2(4), 12579 Points, 4 Clusters - Selected. X-axis: Parallelism (top), Nodes (bottom). Y-axis: Time (hr), 0 to 1.6.]


Dataset B – 20k Points


Dataset C – 40k Points

[Figure: Dataset C - Performance with increasing nodes and parallelism. X-axis: TxP (top), Nodes (middle), Total Parallelism (bottom). Y-axis: Time (hr), 0 to 25. Series: MPINET, ScaledMPINET, OMPI.]
