An Introduction to High Performance Computing in the Department

Academic year: 2021

Ashley Ford & Chris Jewell

Department of Statistics, University of Warwick

Outline

1 Some Background

2 How is Buster used?

3 Software

What is Buster for?

Raw computing power:

Large datasets → lots of memory

Complex algorithms → fast processing

Batch processing

Ability to set your algorithm running, and get on with other work.

Interactive sessions

Manipulating data in real-time.

Cost effectiveness

High-powered centralised computing facility shared among users

System architecture

[Diagram: Internet, Frontend, Fileserver, Execution nodes]

Specifications

Frontend Node

2 x 2.33GHz Intel E5410 Quad Core processors

8GB fully buffered RAM

Storage Cluster

Lustre™ high performance filesystem

17TB RAID storage

Execution Nodes (11 machines, 108 CPUs)

5 nodes: 3.16GHz Intel X5460 Quad Core x2, 16GB FBRAM

2 nodes: 2.93GHz Intel X5570 Quad Core x2, 16GB FBRAM

5 nodes: 2.80GHz Intel X5660 Six Core x2, 48GB FBRAM

What you get

Standard user accounts provide:

Username (same as ITS username) and password

1GB fault-tolerant home storage

Expandable if required

Backed up nightly to ITS central backup service

Default 150GB storage space

Fault tolerant, but NOT backed up!

Up to 800GB scratch space, shared with other users

Files deleted automatically after 14 days
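Because scratch files are removed after 14 days, it can help to check which files are getting old. The following is a minimal sketch only: it assumes expiry is judged by modification time, and the function name `list_expiring` and the directory passed to it are hypothetical (the real scratch path is not given here).

```shell
# List regular files under a directory that have not been modified
# for more than a given number of days (default: 10).
# Assumption: the cleanup policy is based on modification time.
list_expiring() {
    local dir=$1
    local days=${2:-10}
    # -mtime +N matches files whose age in whole days is greater than N.
    find "$dir" -type f -mtime +"$days"
}
```

For example, `list_expiring /path/to/your/scratch 10` would print files that are more than 10 days old and so are approaching the 14-day limit.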

Frontend node

[Diagram: Internet, Frontend, Fileserver]

Logging in

Login is provided over an SSH (secure shell) connection

Hostname: buster.stats.warwick.ac.uk

Provides password-protected access from anywhere on the internet.

Graphical forwarding enabled (e.g. R graphs, text editors)

Clients:

Linux / Mac OS X: native ssh client
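A login from a Linux or Mac OS X terminal might look like the sketch below. The username `u1234567` is a hypothetical placeholder; `-X` is the standard OpenSSH option that requests X11 forwarding, which is what allows graphical windows to appear locally. The command is printed rather than executed so the sketch can be tried anywhere.

```shell
# Hypothetical username; on Buster it matches your ITS username.
BUSTER_USER="u1234567"
BUSTER_HOST="buster.stats.warwick.ac.uk"

# -X requests X11 forwarding so that R plots and graphical editors
# started on Buster are displayed on the local machine.
login_cmd="ssh -X ${BUSTER_USER}@${BUSTER_HOST}"

# Print the command instead of running it.
echo "$login_cmd"
```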

Accessing files

SSH

Encrypted file transfer from anywhere on the internet

Use scp (Linux/Mac OS X) or WinSCP (Windows)

Windows fileshare

Unencrypted, from within campus (still requires password, though)

Home: \\buster\<username>
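As a rough illustration of scp transfers, the sketch below builds one upload and one download command. The username, the file `data.csv`, and the use of `/storage/<username>` as the remote directory are assumptions made for the example. The commands are printed rather than run.

```shell
BUSTER_USER="u1234567"                     # hypothetical username
BUSTER_HOST="buster.stats.warwick.ac.uk"
REMOTE_DIR="/storage/${BUSTER_USER}"       # assumed remote directory layout

# Copy a local file to Buster (data.csv is a hypothetical file name).
upload_cmd="scp data.csv ${BUSTER_USER}@${BUSTER_HOST}:${REMOTE_DIR}/"

# Copy a result file from Buster into the current local directory.
download_cmd="scp ${BUSTER_USER}@${BUSTER_HOST}:${REMOTE_DIR}/R-output.pdf ."

printf '%s\n' "$upload_cmd" "$download_cmd"
```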

The module system

Software packages available via the module command

Allows versioning of software packages

Checks for conflicts between packages

Available packages: module avail

Adding a module (default version): module add R

Adding a module (specific version): module add R/2.8.1

Displaying information: module display R
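In a script it can be useful to load a pinned version and to fail gracefully when the module command is missing. This is a sketch under that assumption; the variable `module_status` is introduced purely for illustration.

```shell
# Try to load a specific R version, recording what happened.
module_status="unavailable"
if type module >/dev/null 2>&1; then
    if module add R/2.8.1; then
        module_status="loaded"
    else
        module_status="failed"
    fi
fi
echo "R module: ${module_status}"
```

Pinning the version (R/2.8.1 rather than plain R) keeps a job reproducible if the default version changes later.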

Submitting jobs

[Diagram: Internet, Frontend, Fileserver, Execution nodes]

Grid Engine

Jobs managed by Grid Engine (Sun/Oracle/Open Grid Scheduler)

Interactive jobs

Batch jobs

Submit a job from the Frontend node, and Grid Engine sends it to a free slot on an execution node

Interactive jobs

Requested via the qlogin command on the frontend node

Short jobs only

e.g. running individual commands in R or interactive Python

Pros: interactivity, graphics, quick and simple to use.

Cons: you lose your job if your connection to Buster is interrupted; lower processor scheduling priority.

Batch Jobs

Submit via a job submission script from the frontend node with qsub.

Ideal for long jobs run in batch mode

Pros:

Allows requests for multiple processors

Provides the task array facility

Saves standard output and error streams to disk

Allows you to log out and get on with something else while your job runs

High-priority processor scheduling

Cons:

You have to write a job submission script

No interactivity/graphics

Batch Jobs

Example job script - /usr/sge/examples/jobs/r-example.sh

    #!/bin/bash
    #$ -S /bin/bash
    #$ -o /storage/$USER/r-example.stdout
    #$ -e /storage/$USER/r-example.stderr
    #$ -l h_vmem=500M,h_rt=0
    . /etc/profile
    module add R
    cd /storage/$USER

    time R --vanilla << EOF
    x<-runif(100)
    pdf("R-output.pdf")
    plot(x)
    dev.off()
    EOF


Submitting Jobs

Simply:

$ qsub <path to script>

Queues

veryshort    1 hour
short        12 hours
medium       24 hours
long         48 hours
unlimited    ∞
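The queue limits above can be turned into a small helper that picks the shortest queue able to hold a job. This is only a sketch: the function name `pick_queue` and the script name `my-job.sh` are hypothetical, and it assumes jobs are directed to a queue with qsub's `-q` option, which may not be how Buster is configured.

```shell
# Print the shortest queue whose limit covers the requested
# wall-clock time, given in whole hours.
pick_queue() {
    local hours=$1
    if   [ "$hours" -le 1 ];  then echo "veryshort"
    elif [ "$hours" -le 12 ]; then echo "short"
    elif [ "$hours" -le 24 ]; then echo "medium"
    elif [ "$hours" -le 48 ]; then echo "long"
    else                           echo "unlimited"
    fi
}

# Example: a job expected to need about 20 hours.
queue=$(pick_queue 20)
echo "qsub -q ${queue} my-job.sh"
```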

Batch Jobs

Advanced options

Task arrays - instruct Grid Engine to run N instances of your algorithm

-t 1-N:5 (i.e. task IDs run from 1 up to N in steps of 5: 1, 6, 11, ...)
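To make the task numbering concrete, the sketch below lists the IDs that a range of 1-20 with step 5 would produce, and shows how a job script might use the `SGE_TASK_ID` variable that Grid Engine sets for each task. The output file naming is hypothetical.

```shell
#!/bin/bash
#$ -S /bin/bash
#$ -t 1-20:5

# Task IDs generated by "-t 1-20:5": start 1, step 5, stop at 20.
seq 1 5 20

# Grid Engine sets SGE_TASK_ID for each task of the array;
# default to 1 so the sketch also runs outside the scheduler.
task_id="${SGE_TASK_ID:-1}"

# Hypothetical per-task output file name.
outfile="result-${task_id}.csv"
echo "task ${task_id} would write ${outfile}"
```

Each task runs the same script, so the task ID is the usual way to select a different seed, parameter set, or input file per task.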

Parallel environments

For running parallel algorithms only!

Shared memory (smp) or distributed memory (mpi)

-pe <smp | mpi> n
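A shared-memory job might request slots and pass the granted count on to the program, as in this sketch. `NSLOTS` is the variable Grid Engine exports with the number of slots granted; using it to set `OMP_NUM_THREADS` assumes the program is OpenMP-based, which is an assumption made for the example.

```shell
#!/bin/bash
#$ -S /bin/bash
#$ -pe smp 4

# Number of slots granted by Grid Engine; default to 1 so the
# sketch also runs outside the scheduler.
nslots="${NSLOTS:-1}"

# OpenMP programs read OMP_NUM_THREADS to decide how many threads to start.
export OMP_NUM_THREADS="$nslots"
echo "running with ${OMP_NUM_THREADS} thread(s)"
```

Tying the thread count to `NSLOTS` keeps the job from using more cores than it was allocated.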

Monitoring jobs

Monitoring jobs:

qstat monitors job status

To see how busy the queue is: qstat -u \*

Killing jobs:

qdel deletes jobs

Requires the job number (use qstat)

To kill all your jobs at once: qdel -u <username>
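Rather than looking the job number up afterwards, it can be captured at submission time. The sketch below assumes the Grid Engine tools are installed and uses qsub's `-terse` option, which prints only the job number; it falls back to a message when the tools are missing. The variable names are illustrative.

```shell
script="/usr/sge/examples/jobs/r-example.sh"   # example job script path

if command -v qsub >/dev/null 2>&1; then
    # -terse makes qsub print just the job number.
    job_id=$(qsub -terse "$script")
    qstat -j "$job_id"    # detailed status for this one job
    qdel "$job_id"        # delete the job again
    grid_engine="present"
else
    grid_engine="absent"
    echo "Grid Engine commands not found; run this on the frontend node" >&2
fi
```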

Available Software

Applications: R, Maple, Ox, GGobi, Scilab, Octave

Scripting languages: Python (+numpy), Perl, R, J

Libraries: GSL, ATLAS, LAPACK, Boost, GNU Multiprecision, SPRNG, JAGS, ACML

Compilers: GNU Compiler Collection, Sun Java 6 SE SDK

Buster Support

Using Buster:

1 Command man pages

2 Web documentation: Dept. Homepage → Intranet → Local IT Info → Cluster

3 Forum: New Forum

4 Sysops: Phil Harvey-Smith & Simon Parkes

Help on specific software:

1 Software package documentation

2 Web documentation (FAQs etc.)

3 Mailing lists

4 Google!

Buster Forum

http://forums.warwick.ac.uk/wf → Departments → Stats → Buster

A new forum has been set up to:

1 provide hints and tips

2 share solutions: if you spend a long time finding a solution to a problem, others might benefit from your answer

3 make requests for new or upgraded software
