• No results found

Running Native Lustre* Client inside Intel Xeon Phi coprocessor

N/A
N/A
Protected

Academic year: 2021

Share "Running Native Lustre* Client inside Intel Xeon Phi coprocessor"

Copied!
23
0
0

Loading.... (view fulltext now)

Full text

(1)

Running Native Lustre* Client

inside Intel® Xeon Phi™

coprocessor

Dmitry Eremin, Zhiqi Tao and Gabriele Paciucci 08 April 2014

(2)

What is the Intel® Xeon Phi™

coprocessor?

 Up to 61 Cores and 244 Threads per coprocessor  8 GB of memory

 Gen2x16 PCI Express: up to 8 GB/s  8 memory controllers supporting

up to 16 GDDR5 channels delivering

up to 5.5 GT/s (up to 352 GB/s bandwidth)

(3)

Typical Cluster Configuration

InfiniBand Switch Ethernet Switch

(4)

Typical Cluster Configuration with

Intel® Xeon Phi™ coprocessors

InfiniBand Switch Ethernet Switch

(5)

Intel® Xeon Phi™ coprocessor

Software Architecture

 Intel® Xeon Phi coprocessors, based on

Intel® Many Integrated Core (Intel® MIC) Architecture

 Intel® MIC Software Stack

(Intel® MPSS)

The Symmetric Communication Interface (SCIF) API is

the communication backbone between the host

processors and the Intel® Xeon Phi coprocessors in a heterogeneous computing environment

(6)
(7)

mic0

11.11.11.2 mic0

10.10.10.2

Intel® Xeon Phi™ coprocessors

Static Pair Topology

 Every Intel® Xeon Phi coprocessor card is assigned to a

separate subnet known only to the host

 Host is a router for any network communications

Host eth0

12.12.12.1 10.10.10.1mic0

mic1 11.11.11.1

(8)

Intel® Xeon Phi™ coprocessors

Internal Bridge Topology

 Bridging together multiple Intel® Xeon Phi coprocessor

card virtual network interfaces, on the same host

 Host is a router for external network communications

Host mic0 mic1 br0 10.10.10.1 mic0 10.10.10.3 mic0 10.10.10.2 eth0 12.12.12.1

(9)

mic0

12.12.12.3 mic0

12.12.12.2

Intel® Xeon Phi™ coprocessors

External Bridge Topology

 Bridge the Intel® Xeon Phi coprocessor virtual

connections to a physical Ethernet device

 Independent network communications  Maximum MTU is 9000 Host eth0 mic0 mic1 br0 12.12.12.1

(10)

Data transfer over virtual Ethernet

eth0 memory MIC0

MIC1 chip memory chip CPU chip memory

 All data passed through host memory

(11)

The NFS throughput

over virtual Ethernet (MTU is 1500)

0.00 5.00 10.00 15.00 20.00 25.00 1 2 4 6 8 16 32 64 128 160 M B/ sec Threads

(12)

The NFS throughput

over virtual Ethernet (MTU is 64512)

0.00 20.00 40.00 60.00 80.00 100.00 120.00 140.00 160.00 180.00 200.00 1 2 4 6 8 16 32 64 128 160 M B/ sec Threads

(13)
(14)

The virtual IB interface in

Intel® Xeon Phi™ coprocessor

 Become available in Intel® MPSS starting with v3.1  Required to have the OFED version 1.5.4.1 InfiniBand

drivers installed

 Supports Intel® True Scale and Mellanox Fabrics

Host eth0 mic0 mic1 MIC0 mic0 MIC1 ib0 mic0 ib0 ib0

(15)

Data transfer over virtual IB

eth0 memory MIC0

MIC1 chip memory chip CPU chip memory

 RDMA transfer passed directly to RDMA device

 RDMA device hardware is shared between Linux-based

host and Intel® Xeon Phi coprocessor applications

RDMA device

(16)

The Coprocessor Operating System

(coprocessor OS)

 Based on a standard Linux kernel source code  Coprocessor OS is a minimal:

 Busybox minimal shell environment

 Linux Standard Base (LSB) Core libraries

(17)

The Lustre* 2.4 Client for

Intel® Xeon Phi™ coprocessor

 Download the SOURCE package from

"MPSS 2.1 release for Linux" section

Unpack it and then unpack the

package-full_src-k1om.tar.bz2 file in any location

 Execute the following commands in this place:

# export PATH=/usr/linux-k1om-4.7/bin:$PATH # make defconfig-miclinux

# make -C card/kernel ARCH=k1om modules_prepare # sh autogen.sh

# ./configure --with-linux=<unpacked_path>/card/kernel \ --without-o2ib \

--host=x86_64-k1om-linux --build=x86_64-pc-linux # make rpms

(18)

The Lustre* 2.5 Client for

Intel® Xeon Phi™ coprocessor

 Download the SOURCE, mpss-3.x-k1om.tar and OS

specific files from "MPSS 3.x release for Linux" section

 Unpack from them “kernel-dev-*.rpm”,

“ofed-driver-*-devel-*.rpm” and “linux-*.tar.bz2” files

 Prepare Intel® MPSS sources for Lustre* build:

# rpm2cpio kernel-dev-*.rpm | cpio –idm

# rpm2cpio ofed-driver-*-devel-*.rpm | cpio –idm # tar xjvf linux-*.tar.bz2 && cd linux-*

# cp -f ../boot/config-* .config

# cp -f ../boot/Module.symvers-* Module.symvers

# . /opt/mpss/3.x/environment-setup-k1om-mpss-linux # make ARCH=k1om silentoldconfig modules_prepare

(19)

The Lustre* 2.5 Client for

Intel® Xeon Phi™ coprocessor (cont)

 Build Lustre* sources for Intel® Xeon Phi coprocessor

with Intel® MPSS sources with virtual IB support: # . /opt/mpss/3.x/environment-setup-k1om-mpss-linux # sh autogen.sh # ./configure --with-linux=<path>/linux-2.6.38+mpss3.x \ --with-o2ib=<path>/usr/src/ofed-driver-*.el6.x86_64 \ --host=k1om-mpss-linux --build=x86_64-pc-linux # make rpms

(20)

Install & Configure the Lustre* Client

Host configuration in

/etc/modprobe.d/lustre.conf

options lnet networks=“o2ib0(ib0)”

Mounting on Host:

# mount –t lustre 8.8.8.8@o2ib:/lustrefs /mnt/lustrefs

Xeon Phi configuration in /etc/modprobe.d/lustre.conf

options lnet networks=“o2ib0(ib0)”

Mounting on Xeon Phi:

# mount.lustre 8.8.8.8@o2ib:/lustrefs /mnt/lustrefs

 Only two Lustre* RPMs should be installed on host:

lustre-client-mic-<version>.x86_64.rpm

lustre-client-mic-modules-<version>.x86_64.rpm  ssh mic0 "echo 'options lnet networks=\"o2ib0(ib0)\"' >

(21)

The Lustre* throughput

over virtual IB

0.00 200.00 400.00 600.00 800.00 1000.00 1200.00 1 2 4 6 8 16 32 64 128 160 M B/ sec Threads 280,00

(22)

Server and storage infrastructure,

networking and hardware support was supplied by

Intel’s High Performance computing Labs in Swindon (UK).

Special thanks to Jamie Wilcox and Adam Roe.

(23)

References

Related documents

Table 3 summarizes classification of identified apps by functionality; there is a lot more apps that offer informational content, relaxation and meditation than those dedicated to

• The incorporation into the Investment Grade Agreement of certain provisions which initially appeared in the optional Market Conditions Provisions, notably the provisions

AN ANALYSIS OF NETWORK AND APPLICATION RESEARCH, TESTING AND OPTIMIZATION As part of Internet.org’s mission to bring affordable internet access to the two-thirds of the world not

Termination of agreement with the Village of Nissequogue Fire District for radio dispatching services, pursuant to the attached agreement, effective June 4, 2021. RESULT:

It is therefore almost natural that at the very moment of its foundation PEGASUS, aiming to become the portal of the European higher aerospace education in front of the

Digestive enzyme activities correlated closely with food availability in copepods from Billefjorden and also in individuals from the incubation experiments at

The handbook includes clinical policies and procedures, American Speech-Language-Hearing Association (ASHA) standards and MA state and Department of Education licensing

Local Drug Generic Name: INSULIN CONCENTRATE REG HUMAN 500 UNT/ML ORDER UNIT: VI?. DISPENSE UNITS/ORDER UNITS: 1 DISPENSE UNIT: VI PRICE PER DISPENSE