• No results found

Key Words: Dynamic Load Balancing, and Distributed System

N/A
N/A
Protected

Academic year: 2021

Share "Key Words: Dynamic Load Balancing, and Distributed System"

Copied!
8
0
0

Loading.... (view fulltext now)

Full text

(1)

D

D

Y

Y

N

N

A

A

M

M

I

I

C

C

R

R

O

O

T

T

A

A

T

T

I

I

N

N

G

G

L

L

O

O

A

A

D

D

B

B

A

A

L

L

A

A

N

N

C

C

I

I

N

N

G

G

A

A

L

L

G

G

O

O

R

R

I

I

T

T

H

H

M

M

I

I

N

N

D

D

I

I

S

S

T

T

R

R

I

I

B

B

U

U

T

T

E

E

D

D

S

S

Y

Y

S

S

T

T

E

E

M

M

S

S

ISSA OTOUM AL DAHOUD ALI ROSE SULEIMAN

Al-Zaytoonah University Al-Zaytoonah University Neelain University

com . yahoo @ k 2 issaotoum jo . edu . alzaytoonah @ aldahoud com . o yaho @ rosesuleiman

A

A

B

B

S

S

T

T

R

R

A

A

C

C

T

T

Load Balancing in a distributed system is an important process to reduce delays and improve response times in order to speed up applications and results. Different approaches to Load Balancing have different advantages and disadvantages. ‘Classical’ approaches to load balancing are quite good and mostly efficient, but in many circumstances, the overheads incurred from load balancing are too high and therefore become ineffective.

Dynamic Rotating Load Balancing Algorithm in Distributed Systems is proposed in this paper.

This new algorithm has much lower overheads and faster response times when compared to the classical approaches, as shown in the data obtained from the simulations done to test this approach. It is also scalable and efficient regardless of the size of the network used.

I

I

N

N

T

T

R

R

O

O

D

D

U

U

C

C

T

T

I

I

O

O

N

N

This paper is a study about dynamic load balancing in a distributed system.

It compares the various approaches to Load Balancing in distributed system, with focus on dynamic approaches in general; some processes are discussed in detail to clarify issues concerning factors contributing to load and the mechanisms used in any balancing process.

The paper explains the new algorithm and how it is applied, with discussion of the simulation process used to compare the classical approach to the new- proposed- approach.

Also it gives the results and conclusions drawn from the study and further proposals.

The tables and diagrams at the end of the study give comparisons between the various approaches and the results of the simulations done as part of the study, with references outlined after that.

Key Words: Dynamic Load Balancing, and Distributed System

1

1

-

-

D

D

Y

Y

N

N

A

A

M

M

I

I

C

C

L

L

O

O

A

A

D

D

D

D

I

I

S

S

T

T

R

R

I

I

B

B

U

U

T

T

I

I

O

O

N

N

1-1.Load distribution

Load distribution seeks to improve the performance of a distributed system, usually in terms of response time or resource availability, by allocating workload amongst a set of cooperating hosts.

This division of system load can take place statically or dynamically: 1-2. Dynamic load distribution

Dynamic load distribution is designed to overcome the problems of unknown or un-characterizable workloads, non-pervasive scheduling and runtime variation (any situation where the availability of hosts, the composition of the workload or the interaction of human beings can alter resource requirements or availability). Dynamic load distribution systems typically monitor the workload and hosts for any factors that may affect the choice of the most appropriate assignment and distribute jobs accordingly. This very difference between static and dynamic forms of load distribution is the source of the power and interest in dynamic load distribution.

The objectives of this thesis lie entirely within the domain of dynamic load balancing. For brevity, I will take the more general term of load distribution to stipulate only the dynamic form. [1, 2, 7, 11]

The Degree of Load Distribution

Load Sharing: This is the coarsest form of load distribution. Load may only be placed on idle hosts, and can be viewed as binary, where a host is either idle or busy.

(2)

Load Balancing: Where load sharing is the coarsest form of load distribution, load balancing is the finest. Load balancing attempts to ensure that the workload on each host is within a small degree (or balance criterion) of the workload present on every other host in the system.

Load Leveling: Load leveling occupies the ground between the two extremes of load sharing and load balancing. Rather than trying to obtain a strictly even distribution of load across all hosts, or simply utilizing idle hosts, load leveling seeks to avoid congestion on any one host.

Other schemes such as, MOSIX, which could be considered load balancing systems, are in fact load leveling, as the balancing phase occurs periodically.

1-3. Previous Load Distribution Taxonomies

There are numerous existing taxonomies available for the classification of load distribution, including Wang and Morris, Casavant and Kuhl and Jacqmot and Milgrom .

2

2

-

-

T

T

H

H

E

E

P

P

R

R

O

O

P

P

O

O

S

S

E

E

D

D

N

N

E

E

W

W

A

A

L

L

G

G

O

O

R

R

I

I

T

T

H

H

M

M

:

:

A network is made up of nodes connected together in a certain configuration; the configuration will not matter for our purposes here.

The nodes are arranged logically from 1 to n; where n is the total number of nodes.

We can view the network, regardless of its size, as consisting of adjacent pairs of nodes or triplets, the nodes that are nearest each other are considered as the pair. This breaks up the large network into a number of small networks, the Load Balancing can then be done within the small networks (consisting of only two or three nodes).

This grouping has the effect of both reducing the number of messages exchanged and also the physical distances between nodes as they are chosen to be adjacent. This has the effect of reducing the overhead when Load Balancing is done and thus makes for a more efficient process with faster response times and quicker task achievement.

To do the Load Balancing in a dynamic manner, we need to set the criteria for the start of Load Balancing Process, and also to have a mechanism for changing the configuration of the network each time load balancing is done.

2-1. Criteria for Load Balancing:

1) When a certain number of tasks (queue length) at the node is reached- i.e. a threshold. 2) Periodically.

2-2. Selection of nodes:

Any number of nodes can make up a group, but limiting the size to only 2 or 3 gives us the advantages of low overhead, reduction of job thrashing as well as other advantages; like scalability, robustness, stability and efficiency.

The selection of nodes, for 3 nodes per cluster, or for 2 nodes per cluster can be done using the code:

The code for the process, shown here in C++: Cycle =1

While (true)

For (i=0;i< groups ; i++)

For (j=0;j< clustersize ; j++) X=i*clustersize+j+cycle; If (x>nodes) C[i+1][j+1] = x % Nodes Else C[i+1][j+1] = x; The Load Balancing Code is placed in this area

If (++cycle>Nodes) cycle=1;

In this configuration, the load over the groups consisting of nodes I;j;k : NetLoad = int ((load I+ load j)/2) , for 2 node groups NetLoad = int ((load I+ load j+ load k)/3) , for 3 node groups

(3)

2-3.Load Balancing steps:

Step 1:

Calculation of local load:

We need to calculate local load ( i.e. load at each node): Factors of load:

Load (L) is directly proportional to :

Average queue length(Q(avg)).

Response time(tresp(avg))

Average waiting time(tw(avg))

Load (L) is inversely proportional to :

Number of nodes, n.

Mathematically:

L α (Q(avg)) * (tresp(avg))* (tw(avg)) / n

Multiplying by a constant (c) makes this an equation: L = (Q(a)) * (tresp(a))* (tw(a))*c / n

(tresp)= t release – t arrival

(tw)= t seize – t arrival

(tresp(avg))=the sum of (tresp)/ number of jobs= sum( t release – t arrival )/ nj

(tw(avg))=sum((tw)/number of jobs=sum(t seize – t arrival )/ nj

(Q(avg)) of node n= the sum of jobs at time (m)

The constants, obtained from experimental values as shown in previous studies (Zhou , Kara; 1994): * tasks arrive at nodes in a Poisson distribution manner.

* task size follows exponential distribution.

time to send message 0.00001 s (10 ms) time to receive message 0.00001 s (10 ms) time for job sending 0.00005 s (50 ms) time for job receiving 0.00005 s (50 ms) time to send result 0.00001 s (10 ms) time to receive result 0.00001 s (10 ms)

Step 2:

Calculation of total load:

1.For central load balancing approach:

Check load of all nodes and distribute load accordingly. 2.For Distributed load balancing approach:

Check tables of loads and distribute load accordingly. 3.For rotating algorithm:

Loads are calculated locally and the total load for each cluster is added and averaged; if threshold is exceeded then load is distributed by transfer of jobs to neighboring nodes in each cluster then the rotation of nodes is done. Step 3:

Trigger of load balancing: 2 mechanisms: 1) Threshold.

2) Periodically. Step 4: Response Times

Response times were calculated for different number of nodes by the simulation code for the various approaches of balancing

(4)

Considerations:

To Compare:

1) Centralized approach. 2) Distributed approach. 3) New algorithm.

We have to consider the following issues first:

The different approaches have different overheads depending on various aspects of the system. They each have advantages and disadvantages. To have a fair comparison with no bias would be quite difficult as the various aspects of the overhead and other delay factors will not be constant for each approach, but if we try to keep all the aspects of the simulations used to do the comparisons constant, apart from those that are inherently different, then the bias will be kept to a minimum.

Some variables may be applicable only to some approaches and not in others. Results of the simulation:

A simulation was constructed to do the following:

1) Perform as a network with multiple processors- i.e. consisting of n nodes. this n can be varied for the purpose of the study.

2) A set task was given equally to all approaches; the task was divided into smaller tasks in exponential distribution manner and then distributed to the nodes set up in the simulation.

3) The process of load balancing was done for each approach of simulation.

4) The results obtained were plotted as response times vs. number of nodes for all the simulation processes to compare the prospective response times.

Mathematical Considerations:

(load * Number of processors) Arrival rate =

(Required number of processors *Average execution time)

The Load is directly proportional to: Arrival rate

Mean service time ( = Total service time / number of tasks ) The Load is inversely proportional to number of processors So , mathematical expression to describe Load:

(5)

The tasks were split into jobs by the exponential distribution method.

The results obtained are given in the following graphs :

Graph1: Rotational-2nodes, Central, Distributed & No Balancing

(6)

A

A

N

N

A

A

L

L

Y

Y

S

S

I

I

S

S

O

O

F

F

R

R

E

E

S

S

U

U

L

L

T

T

S

S

:

:

The results show the following observations:

1) Plotting the results taken from the simulation for the average response times vs. the number of nodes for No Load Balancing, Central approach , Distributed approach , Rotational approach for 2 nodes per cluster we see that : The response times are better for Rotational 2 which is better than central approach which is better than distributed approach , all better than no Load Balancing.[Graph1]

2) Plotting the results taken from the simulation for the average response times vs. the number of nodes for No Load Balancing, Central approach , Distributed approach , Rotational approach for 3 nodes per cluster we see that : The response times are better for Rotational 3 which is better than central approach which is better than distributed

approach , all better than no Load Balancing.[Graph2]

Graph3: Rotational-2nodes, Rotational-3nodes & No Balancing

(7)

3) Plotting the results taken from the simulation for the average response times vs. the number of nodes for Rotational approach for 3 nodes per cluster & Rotational approach for 2 nodes per cluster we see that :

The response times are better for Rotational 3 than Rotational 2, all better than no Load Balancing.[Graph3] 4) To see the effect of increasing the number of nodes and how the behavior changes, Graph 7 shows that as the

number of nodes increase, the average response times start increasing after a certain point for the distributed approach and the central approach until eventually they reach a point where no balancing is better. While the rotational approach continues to give better response times regardless of the number of nodes used.[Graph4]

C

C

o

o

n

n

c

c

l

l

u

u

s

s

i

i

o

o

n

n

s

s

:

:

1) From the analysis of results we find that the simulations done show that the proposed new algorithm has much lower overheads and faster response times when compared to the classical approaches, as shown in the data obtained from the simulations done to test this approach.

2) It is also scalable and efficient regardless of the size of the network used: as we see that when number of nodes increases it still gives good response times, which is not found in other approaches.

3) Using this approach gives better results because there are virtually no overheads compared to classical approaches.

4) Using clusters of 3 nodes gives better performance than 2 nodes per cluster, this is related to the fact that 3 nodes are better at handling the local load than 2 nodes because of simple mathematical rules involved.

R

R

e

e

f

f

e

e

r

r

e

e

n

n

c

c

e

e

s

s

1) Al_Dahoud Ali, Giacomo Gioffi, . “Ramtha : The manager algorithm for Dynamic Load

Balancing in Distributed Systems.

2) Al_Dahoud, A. Dynamic Load Balancing in Distributed Systems, Ph.D . Thesis, National

Technical University of Ukraine, Kiev, Ukraine, 1996.

3) Andrews. Tanenbaum, “ Modern Operating Systems” , Prentice _ Hall International, Inc,

Newjersy, 1992.

4) Allan, R., Michael, L. and Miron, L. Condor Technical Summary, Computer Sciences

Department, University of Wisconsin-Madison, 1991.

5) Anna, H. Load Balancing in Distributed Systems: A Summary, Performance Evaluation

Review, Vol. 16 #2-4 February 1989, pp. 17-19.

6) Silberschatz A., Galvin. Peter B., “ Operating System Concepts” , Fifth Edition, John Wiley &

Sons , Inc., 1999 .

7) SANDEEP GUPTA , Dynamic Load Balancing , project website, internet site. Page opened,

April 2003.

8) Averill, L. and Kelton W. Simulation Modeling And Analysis, McGraw_HiII, New York,

USA, 1991.

9) Bruce, L., Hosseini S., Vairavan K. and Gregory, W. Performance Characteristics ora Load

Balancing Algorithm, Journal of Parallel and Distributed Computing, 31, 1995, pp. 159-165.

10) Chi-Chung, H. and Samuel, C. Allocating Task Interaction Graph to Processors in

Heterogeneous Networks, IEEE Transactions on Parallel and Distributed Systems, vol. 8,

No.9, September 1997, pp. 908-925.

11) Chi-Chung, H. And Samuel, C. Hydrodynamic Load Balancing, IEEE Transactions on Parallel

and Distributed Systems, Vol. 10, No.11 , November 1999, pp. 1118-1137.

12) Chi-Chung, H. And Samuel, C. Theoretical Analysis of the Heterogeneous Dynamic Load

Balancing Problem Using a Hydrodynamic Approach, Department of Computer Science, The

Hong Kong University of Science and Technology, Technical Report, March 5, 1996.

(8)

13) Chi-Chung, H. And Samuel, C. Theoretical Analysis of the Heterogeneous Dynamic Load

Balancing Problem Using a Hydrodynamic Approach, Journal of Parallel and Distributed

Computing, 43, 1997, pp. 139-146.

14) Dasgupta, P., Majumder A. and Rhattacharya, P. V_THR : An Adaptive Load Balancing

Algorithm, Journal of Parallel And Distributed Computing, 42, 1997, pp. 101-108.

15) Douglas, T., Jim, B. and Miron, L. Condor, Computer sciences Department, University of

Wisconsin, 2001.

16) Flavio, R. and Anurag, K. Adaptive Optimal Load Balancing in a Non-homogeneous

Multi-server System with a Central Job Scheduler, IEEE Transactions on Computers, vol. 39, No.10,

October 1990.

17) Herbert, K. and Andreas, W. Comparison of Dynamic Load Balancing Strategies, Lehrstuhl

fur Informatik IL 1990.

18) Jerrell, W. and Stephen, T. A Practical Approach to Dynamic Load Balancing, IEEE

Transactions on Parallel and Distributed Systems, Vol. 9, No.3, March 1998, pp. 235-248.

19) Jie, L. and Hisao, K. Optimal Static Load Balancing in Network Configurations With

Two-Way Traffic, Journal of Parallel And Distributed Computing, 23, 1994, pp. 364-375.

20) Kai, H. and DeGroot, D. Parallel Processing for Supercomputers and Artificial Intelligence,

McGraw-Hill Publishing Company, USA,1989.

21) Mohammed, Z., wei, L. and Srinivasan, P. Customized Dynamic Load Balancing for a

Network of Workstations, Journalof Parallel And Distributed Computing, 43, 1997, pp.

156-162.

22) Mourad, K. A Global Plan Policy for Coherent Cooperation in Distributed Dynamic Load

Balancing Algorithms, University of Leeds -School of Computer Studies, Research Report

Series, Report 94.21, July 1994.

23) Mourad, K. Using Dynamic Load Balancing in Distributed Information Systems, University

of Leeds -School of Computer Studies, Research Report Series, Report 94.18, May 1994.

24) Tanenbaurn, A. Distributed Operating Systems, Prentice-Hall International, New Jersey,

USA, 1995.

25) Xiaotie, D., Hai-Ning, L., Junsheng, L. And Ring, X. Competitive Analysis of Network Load

Balancing, Journal of Parallel And Distributed Computing, 40, 1997, pp. 162-172.

26) Chao-Ju, H. And Kang, S. Implementation of Decentralized Load Sharing in Networked

Workstations Using the Condor Package, Journal of Parallel And Distributed Computing,40,

1997, pp. 173-184.

References

Related documents

While mathematics majors saw a 1.5 percent increase to their median starting salary, now standing at $47,000, they also had three of their top-employing industries pay- ing

State 3.1 Phillip Lim 50s 60s 7 For All Mankind 70s A Peace Treaty ALV Collections AQ/AQ AQUA x Maddie &amp; Tae AS65 ASKA Collection ASOS ASTR Aldo Alexander McQueen Alice +

Although both low- and middle-inome ountries are globally labor abundant, the effets of trade poliy on eonomi growth and inome distribution between these ountries differ beause of

Speci cally, using state-level panel data from 1977-1994 from the U.S., we consistently nd (i) evidence of environmental regulation being endogenous when examining

The computer network system of SAFER includes a gas dispersion model, whose data processing follows a procedure established previously: First, the system determines the location of

pectrometer (whose data confirmed using numerous stability tests) was used. Table 1 summarizes the 10-minute averages of the measured values of the two systems.

Focusing then on the majority group of childless, non-pregnant women, 11 we estimate trends over calendar periods in the competing risks of entry into cohabitation and into

In contrast, the vancomycin/ b - lactam combination remains recommended for the first- line treatment of patients with severe sepsis or septic shock and of patients at high risk