UNIVERSITY OF ILLINOIS
... tik4*m*i**«»*
THIS IS TO CERTIFY THAT THE THESIS PREPARED UNDER MY SUPERVISION BY
KARL ROBERT KRAUSE
ENTITLED SIMULTANE0US s o l u t i o n o r s y s t e m s o f i n t e r l i n k e d m u l t i s t a g e d
SEPARATORS USING THE CRAY X-MP SUPERCOMPUTER
IS APPROVED BY ME AS FULFILLING THIS PART OF THE REQUIREMENTS FOR THE
DEGREE OF BACHEL0R 0 F SCIENCE IN CHEMICAL ENGINEERING
Instructor in Chtrgt
A m o v tn : ../JJrfSSfcSz
UVATS A f f n iT P A O T U P V T A V CHEMICAL ENGINEERING
Simultaneous Solution of Systems o f Interlinked Multistaged Separators Using the Cray X-M P Supercomputer
By
Karl Robert Krause
Thesis
for the
Degree o f Bachelor of Science in
Chemical Engineering
College of Liberal Arts and Sciences University o f Illinois
Urbana, Illinois
3 4 4
6
6 m1 T 8 10 10 10 14 14 15 15 15 17 17 18 1 8 2122
22
31 5 16Contents
1 I n t r o d u c t io n ...2 Sim ultaneous Solution o f Interlinked Distillation Colum ns . . . .
2.1 Equation Formulation...
2.2 Nonlinear Equation Solvers...
2.3 Linear Equation Solvers ...
3 C om parison o f Perform ance on the C Y B E R and the Cray . . . .
3.1 Differences Between CYBER and C ray F O R T R A N ...
3.2 Total Solution T im e ...
3.3 Nonlinear Equation Solvers...
3.4 Linear Equation Solvers ...
3.5 Jacobian and Function E va lu a tion s...
4 O pportunities for Im proved P e r f o r m a n c e ...
4.1 Breakdown o f CPU Time by S u b r o u tin e ... ... . . . .
4.2 Vectorization o f Linear Equation Solvers ...
4.3 Vectorization of Other S u b r o u tin e s...
4.4 Alternative Solution Strategies...
5 Acknowledgem ents ...
6 B ib lio g r a p h y ...
A A p p e n d ic e s ...
A .l Sample Program O u t p u t ...
A.2 Sample TIM ER/TALLY O u tp u t...
A .3 Subroutine Structure o f the Program s...
A.4 Total Solution Time of Each R u n ...
A .5 Performance o f Linear Equation Solvers on the CYBER ...
List of Figures
1 Generic Distillation Column Plate . . . .
2 Revision of Code to Improve Vectorization
List of Tables
1 Problem Specifications... . 7
2 Comparison of Average Solution Time ... 9
3 Comparison o f Nonlinear Equation Solvers... \[ 4 Comparison of Linear Equation Solvers ... 12
5 Comparison of Vectorizability of Linear Equation Solvers ... 12
6 Comparison of Time for Jacobian Evaluations and Function Evaluations . 13 T Breakdown of Execution Time by S u b ro u tin e ... 14
8 Total Solution Time for Each Hun for Problem I ... 23
9 Total Solution Time for Each Run for Problem 2 ... 24
10 Total Solution Time for Each Hun for Problem 3 ... 24
11 Total Solution Time for Each Run for Problem 4 ... 25
12 Total Solution Time for Each Run for Problem 5 ... 26
13 Total Solution Time for Each Run for Problem 6 ... 27
14 Total Solution Time for Each Run for Problem 7 ... 28
15 Total Solution Time for Each Run for Problem 8 ... 29
16 Total Solution Time for Each Run for Problem 9 ... 30
1
Introduction
Distillation is one o f the most widely used industrial chemical processes and the modelling o f simple distillation systems has received much attention. More complicated systems with interlinking sidestreams are frequently the most cost effective, however, hut are also more difficult to simulate. The traditional method for simple systems has been to simulate one column, use the product streams from this column as the feed streams to the second column, simulate the second column, and so on until all of the columns have been solved. This sequential modular approach works well for simple systems but is not very efficient for interlinked systems because a sidestream from one column may affect a previously solved column. As a result, a large number of iterations over all id' the columns is required to solve the system.
A more efficient solution method is to solve the entire system simultaneously. This technique works well with interlinked systems, but requires the solution o f a large number o f nonlinear equations. Each plate is modelled bv 2C ♦ I equations, where C is the number o f components. Therefore, over 1,000 nonlinear equations must be solved simultaneously to simulate a typical system o f 5 components and 100 plates. While advances in computer technology have allowed this technique to be used on computers such as the CYBER 175, the development of supercomputers is important because l) their increased memory allows larger problems to be solved and 2) their vector processing capabilities can be used to improve the efficiency o f existing programs.
The second aspect has been investigated in this report. The performance of existing programs1 on the CYBER and the Cray X-MP supercomputer has been compared. Both computers were located a* the University o f Illinois-Urhana campus. The total solution time on the Cray was less than on the CYBER because of the faster clock speed of the Cray. The programs were then run on the Cray with the vectorizAtion both on and off. Any difference in solution time was due only to the effect o f vectorization. Finally, the CPU time spent in each DO loop And subroutine was determined to identify the locations in the code where vectorization would he the most beneficial.
The efficient solution o f distillation problems is important as computer resources he come more valuable. Also, flowsheeting programs are finding increased use in industry. The distillation columns frequently require the most time to simulate accurately. Signif icant reductions in the solution time for the distillation system, therefore, will improve the overall performance o f the flowsheeting program.
2
Simultaneous Solution of Interlinked Distillation Columns
The equations that must he solved are nonlinear and successive linearization was used to solve the system. This involves approximating the nonlinear system as a linear system, solving the linear system, and using this new solution to relinearize the nonlinear system. The solution o f the linear system converges to the solution o f the nonlinear system. 2.1 Equation Formulation
The Naphtali*Samlholm equation formulation was used for most of the problems2.
In this formulation, each stage is represented bv I energy balance, (' mass balances, and
C equilibrium equations; 1 energy balance; En ( I * '^n )//n t ( I t S», )/l»» If n * \ fin t ft f n V mass balances: m ( l ) I'»i ,»r» * ( I * ‘^n)f*n,tr\ ^ n ► I M l , w In.in V equilibrium equations: Q n ,m ( ft E n ,w ^ n f*m , m ) > f* n I ( I 0 ) V n f l , »n h i / 1 n • 1 N number of plates, n I. 2. . . . N, m b 2. . . . ( ’
Figure l shows a generic plate ami delines the variables.
The equilibrium K-values ami the enthalpies are functions o f temperature. The
independent variables are NC vapor How rates, N(! liquid flow rates, an plate tem
peratures. The En, Mn>m, and Q n m are residuals which will equal zero wm;i the system
has been solved.
An alternative formula! ioi»4 involves summing the (' equilibrium equations;
dn l*n j En,inhn,tn
and using this to replace one of the equilibrium equations. This formulation was used with the BBTF linear equation solver and is indicated by BBTF*.
Figure I: Generic Distillation Column Plate including Nomenclature*
p l o t * n * I
n • place number a • component nuaber
Lq • liquid flow race from place n
■ vapor flow race from place • liquid aidescream from plate n Wn • vapor sidesereaa from place n Fn - feed rare to plate n
The term ‘solver* is actually a misnomer because the nonlinear equation solver is used to approximate the actual system with a linear system. This linear system can then he directly solved. Four different linear equation solvers were used.'*
The Newton-Raphson (\ H) method is the classic method. The nonlinear sytein is
expanded in a Taylor series about the current approximation to the solution. j\ and all
partial derivatives higher than first order are dropped. The resulting linear system can he
represented by i f ik i where .1 is the Jacobian matrix of partial derivatives,
p is the vector of variable corrections, ami / is the set of nonlinear equations. The
superscript (F) refers to the iteration number. I lie new estimate of the solution is
jr'kt n p{k’ t r u’\ The N R method converges faster than the other methods hut the Jacobian must he reevaluated at each iteration.
The Jacobian evaluation frequently requires more CPI time than solving the linear system. This is due to calls to time consuming thermodynamic property evaluation routines. The other methods, known as quasi-Newton methods, approximate or hold the Jacobian constant for several iterations. This reduces the time for each iteration but results in slower convergence and more iterations. The simplified Newton-Raphson (simp N-R) method simply holds the Jacobian constant for several iterations. The modified Broyden (M B) method updates the Jacobian without reevaluating the thermodynamic partial derivatives. Also, the linear system doesn't have to he resolved at each iteration. Schubert's (Schubert) method approximates the Jacobian in a way that preserves the sparsity o f the matrix, but the linear system must lie resolved at each iteration. The method o f Dennis and Marwil (MARWIL) method essentially updates the Jacobian in a way that makes the solution o f the linear system trivial. The drawback is that more information about the linear system is required. MARWIL is most efficient when used with the GENSOL linear equation solver.
2.3 Linear Equation Solvers
The methods of solving the linear system differ in the data structure used and in the matrix block structure they take advantage of.h The solvers used were continuous back substition (CBS), bordered-block-triangular form (BBTF). bordered-block-tridiagonal form (BBTDF1), LU factorisation (LUIP9), Harwell subroutine library method MA28 (M A28A), sparse Gaussian elimination with partial pivoting (NSPIV), and implicit back substitution (RANKI). B B TD Fl stores each block o f nonzero elements as a full matrix and is referred to as having a l*tau storage scheme, where tau is the number of nonzero elements. NSPIV, LU1P9, RANKI, and CBS have 2*tau data structures, where each nonzero element and its column number are stored along with a small array of row locator indices. A 3*tau data structure is used by MA28A. Each nonzero element is stored along with its row and column numbers.
3 Comparison of Performance on the CYBER and the Cray
The program* wet® tested on 9 problem* with 5 nonlinear equation solvers, 8 lin* ear equation solvers, and 3 thermodynamic property evaluation routines/The problem specifications are summarized in Table l and detailed in reference I.
Table I: Problem Specifications
p r o b le m r t -c o m p o n e n t * s ta g e s ♦i) n a t io n s c o lu m n s fe e d s s id e s t r e a m s . . „„„ . -■ ■ ’[...3 ... . 22 154 " 2 .... 1 3 i 2 4 45 4 0 5 2 3 l 3 4 90 8 1 0 2 3 4 10 37 m m mi 11 2 l 3 5 IS 25 77 5 2 1 l 6 6 95 1235 3 1 3 7 ! i 6 120 1560 2 3 l : 8 ! 3 3 0 0 2 1 0 0 2 2 4 !i 9 ! 3 33 0 2 3 1 0 3 1 2 :
3.1 Differences Between CYBER and Cray FORTRAN
Originally, the programs were written in FORTRAN 4 on the CYBER. The Cray uses CFT, an extended version of FORTRAN 77, which has some different syntaxes than FORTRAN 4.7 Minor changes in the program statement were necessary, Also, the syntax of the RETURNS statement was different. RETURNS allows a subroutine to return to different locations in the calling routine depending on results in the subroutine.
The CYBER does not distinguish between upper and lower case letters and the programs and data files were all upper case. The Cray uses both cases. The programs initially did not execute correctly on the Cray. The problem was found to be in the way character data was being read from the data file. The terms %LIQ’ and ‘ VAP* were used to designate liquid and vapor feed streams, respectively, but the Cray was reading *L!q’ and *VAp\ Since these are not equivalent on the ( ’ray, the programs were not calculating the stream properties correctly. The programs ran property after changing all letters to lower case in both the data and FORTRAN files. The reason that the last letter being read as lower case has not been determined.
Another surprise occured when comparing runs on the Cray with the vectorization on (Cray-on) and off (Cray-off). In most cases the solution was reached slightly faster, as expected, by Cray-on than by Cray-off. Several runs, however, took l or 2 more iterations
with the vertorization on resulting in ( ’ray on taking more CIM' time to converge. Kv«*n when both methods converged in the same number of iterations the final sums of squares were slightly (up to 10/7) different. While differences in the internal precision of the v'YBER and die Cray are expected to produce slightly different results after several hundred thousand operations, calculations on the ( ’rav should produce exactly the same result regardless of whether the vectori/ation is on or off. Slight tmmeric.il differences between matrix elements could result in different pivot selections, which could lead to a different number of iterations.
it.2 Total Solution Time
The average solution time for each problem is summarized in Table 2 for both mi merical and analytical derivatives. The total solution times for each run on the CYBER. Cray on, anti Cray-off are listed in fables 8 to Hi. The Cray offers two advantages that allow programs to execute faster. The first is a clock speed that is about 5 times faster than conventional computers. This means that each numerical operation can be performed five times faster m, the Cray. The second advantage is the ability to process large vector loops simultaneously. This is refered to as vectorizatinn. Instead of iterat ing through a DO loop to update each element o f an array or matrix, the Cray can update the entire matrix simultaneously, ( ’ode that is written to take advantage of vectoriza- tion can show a speedup of up to ten.1* Speedup is defined, in this case, as the ratio of solution times Cray-off/Crav-on. Highly vectorized programs on the Cray can show a speedup of up 50 when compared to the CYBER due to both the faster clock speed and vectorization.
The column labeled ‘ runs' in Table 2 indicates the number of combinations of equation solvers anti thermodynamic property derivatives used to solve a particular problem. Onlv runs where results were available from both the CYBER and the ( ’rav were included in the tables to make the comparison between machines valid.
Comparison between solution times for CYBER anti Crav off indicate an av rage speedup of only 2.5, varying between l.fi and 3.7 from problem to problem. This speedup is due to the faster clock speed only. One possible explanation for the less than expected increase is the memory structure of the Cray. The Cray memory is organized into 32 ‘ banks.’ Reading from a memory location in a given bank makes that bank unavailable for four clock cycles.1* A ‘ bank conflict* occurs when an unavailable memory location is accessed and the number o f bank conflicts depends on the dimensions o f the array variables. Severe bank conflicts can reduce the efficiency by up to a factor o f four. The variation in average solution times from problem to problem support the possibility of bank conflicts because each problem involves a different number o f components, stages, and variables. Therefore, the arrays will have different dimensions.
Table 2: Comparison of Average Solution Time
numerical thermodynamic property derivatives
average solution time ( s e e )
problem run CYBER ( ray off ( rav cm speedup H
1 7 G. 12 2.533 2.172 1.02 1; 2 7 23.9 8.649 7.675 1.13 | 3 7 70.4 18.907 18.788 l.oi : 1 \ HO.9 49.601 49.331 1.01 jj 5 4 105. 47.167 43.347 1.09 jj 6 2 53.6 23.016 22.494 1.02 m t 3 125. 47.410 46.084 1.03 j 8 1 51.9 24.954 24.769 1.01 9 2 68.7 32.922 31.104 1.06
average speedup from CYBER to ( ray: 2.4
analytical thermodynamic property derivatives
average solution time ( sec) ]
speedup
jjproblem run , CYBER Cray off Crayon
i 1 8 1.667 1.611 1.03 4 2 38.8 15.610 16.033 r 5 4 49.2 31.385 28.829 1.09 6 2 25.5 11.548 11.442 1.01 1 7 3 61.0 21.042 20.570 1.02 ! 8 l 21.3 7.976 7.922 i.oi i 1 L _ ® _ 2 | 37.7 14.185 13.987 1.01 !
average speedup from CYBER to Cray: 2.4
Very little speedup or cured when the vert orient ion was turned on. This was expected because the code was not written to take ad vantage o f this feature. Problem five showed the greatest improvement in solution time with the vectorization on and had the largest number of components. While the improvement was not very great, it could indicate that more loops involving the number of components were vectorized than loops that
iterate on the number of plates.
3.3 Nonlinear Equation Solvers
The nonlinear equation solvers are compared in fable 3. Again, verv little reduction
in CPE time v noticed with the vectorization on. Problems I and 9 were run using
each nonlinear equation solver with the same linear equation solver. Onlv deferences in the nonlinear equation solvers accounted for the differences in solution times. All of the runs executed faster when analytical derivatives where used but the Newton-Raphson method, which reevaluates the Jacobian at each iteration, improv'd the most. Significant improvements in the vectorization of the Jacobian and physical property evaluation subroutines could greatly improve the efficiency of this method. Schubert’s method and the Newton-Raphson method solve the problems faster than the other methods.
3.4 Linear Equation Solvers
The linear equation solvers are compared in Tables 4 and 5. The earlier results from the CYBER are not included for clarity but may be found in Table 17. The execution time for all of the linear equation solvers except CBS decreased by a factor of 4 or greater when run on the ( ray. This was significantly larger than the average decrease in total solution time. There were no significant improvements in solution time when the vectorization was turned on except for BBTDPl with a speedup o f 1.14 This routine also executed 10 to 15 times faster on the Cray without vectorization than on the CYBER. This could indicate that the data was structured to avoid bank conflicts. BBTDFl and LU1P9 showed the best overall performance.
3.5 Jacobian and Function Evaluations
Finally, neither the function nor the Jacobian evaluation routines were greatly af fected by vectorization. Table 6 compares the times for the Cray and the CYBER. The speedup was the same for both derivative methods indicating that neither method had a large degree o f vectorization. Numerical derivative calculations require about 5 times more CPU time than analytical calculations but may be easier to vectorize. These results further show that significant improvement in efficiency could be achieved by rewriting the programs to take advantage o f vectorization.
Table 3: Comparison of Nonlinear Kquation Solvers
Problem 1: 154 equations* 22 stages, 3 components CBS linear equation solver. ( 'hao-Seader correlation
nonlinear thertnodymanic total solutir, n time (sec i
equation solver
proper! v
derivatives CYBKR Cray off t ray-on speedup
N R numerical j 3. TO 1.854 1.841 1.01 M B numerical i 2.123 2.099 1.01 Simp. N R numerical i 2.275 2.245 1.01 Schubert numerical j 1.470 1.445 1.02 N-R analytical j 0.K52 0.044 1.01 M B analytical , 1.103 1.140 1.02 Simp. N-R analytical | 1.310 1.298 1.01 Schubert analytical ] 0.995 0.974 1.02
Problem 9: 2310 equations* 330 stages, 3 components LU1P9 linear equation solver,J^hao-Seader correlation
nonlinear thertnodymanic ] total solution time (sec)
equation property |
solver derivatives JJC'YBKR
’ | 05.7..
Cray-off Cray on speedup
N-R numerical 32.078 31.951 1.00 M B numerical ii 44.150 43.555 1.01 Simp. N-R numerical i _ 42.069 41.530 1.01 Schubert numerical | 71.0 33.765 30.250 1.12 N-R analytical | 25.9 9.860 9.776 1.01 M-B analytical i - 21.940 21.533 1.02 Simp. N-R analytical 19.789 19.513 1.01 Schubert analytical 1 49.4 18.509 18.197 1.02 11
Table I: Comparison of Linear Equation Solvers
! linear time to so Ive linear system for Crav-on (sec)
I equation problem number
! solver t 2 3 4 5 0 7 8 9 [ " T bs . 1:1 .52 2.9 3.1 3.0 1.9 LI'IPD .020 .048 .ioa .00 .71 .38 .32 .31 .25 RANKI ! .028 .19 .73 1.5 1.3 2.7 NSPIV : .021 .051 .74 .45 .37 .32 BBTDF1 ! .017 .057 .117 23 .33 MA28A .020 .9 2.8 BBTF* .028 .052 -GENSOL : .029
-Table 5: Comparison of Vectorizability of Linear Equation Solvers
ir linear number average CPU time to solve linear system (sec)
equation of
solver problems CYBER Cray-off Cray-on speedup
CBS | 7 2.63 1.644 1.648 LUIP9 9 1.26 0.315 0.311 1.01 RANKI 6 3.42 1.069 1.082 — NSPIV 2 0.19 0.036 0.036 1.00 BBTDF1 5 2.50 0.171 0.150 1.14 MA28A 3 4.18 1.274 1.240 1.03 BBTF* 2 3.12 0.040 0.040 1.00
Use to compare vectorizability of each linear equ.it ion solver-can’ t compare equation solvers because different
Table 6: Comparison of Time for Jacobian Evaluations and Function Evaluations
thermodynamic property derivatives
average CPU time for evaluation (sec)
CYBER Cray-off Cray-on speedup
Jacobian numerical 6.01 2.296 2.241 1.02 1 evaluation analytical 1.49 0.515 0.502 1.03 function evaluation both 0.30 0.120 0.117 1.03 1 13
4 Opportunities for Improved Performance
The results clearly indicate that the programs are not written for efficient execution o f the Cray. The execution time was broken done at the subroutine and the DO loop level by using the TIM ER/TALLY utility. The results are promising in that a large portion of the execution time is spent in a small area of the code. While it is impossible to vectorize every DO loop, vectorization of small portions o f the code could result in large speedups.
4.1 Breakdown of CTO Time by Subroutine
Problem 4 was run using the TIMER/TALLY utility and the results are summarized in Table 7. The highest percentage of time was spent in the linear equation solver subroutines regardless o f the nonlinear equation solver used. The percentage increased, up to 91% for CBS, when analytical thermodynamic derivatives were used because less tune was spent evaluating the Jacobian. Further analysis at the DO loop level indicated that over 80% o f the total solution time was being spent in one DO loop within the subroutine CBSLUl. If this loop can be vectorized to achieve a speedup of 10 within the loop, the overall program will show a speedup of up to 8.
Table 7: Breakdown of Execution Time by Subroutine
Problem 4 run with Cray-on
j nonlinear linear | equation equation | solver solver percentage o f t thermodynamic numerical ime in subroutine >roperty derivatives analytical | N-R NSPIV | Schubert LUIP9 1 NSPIV1 28% VLElO 4% LU1P2 64% SCHUB 2% NSPIV 1 69% JACOB2 6% LUIP2 74% SCHUB 3% ! | N-R CBS 1 CBSLUl 60% VLElO 2% CBSLUl 91% [ JACOB2 1% | Schubert RANKI 1 RANKI 81% VLElO 1% RANKI 90% ; SPK2 1% | M-B BBTDFl l 1 SOLVE2 20% INVERT 5% SOLVE2 23% ’ INVERT 9% .
less than 2% o f all operations were vectorized j|
number that were scalar. Less than 2% of the operations were vectorised. This is not surprising considering the above results.
1.2 Vectorization o f Linear Kquation Solvers
The possibilitv of vectorizing the CBSLCl subroutine was examined. The portion of code where the majority of the time was being spent is shown in the top of f igure 2, Only the innermost loop can be vectorized and a loop with an IF statement in it can not be vectorized. Also, a loop can not .alter an array value that would be used in a latter iteration if the loop was not vectorized. The innermost loop, |)() .V20 . . . . in the original code was not vectorized for these and other reasons.
The total solution time was reduced bv 2 seconds by rewriting the rode as three separate loops ami using a temporary array. The loop with label 527 in the bottom
portion of Figure 2 is vectorized, hut the loop with label f»>5 still is not. \ more
advanced understanding of the intricacies of vectorization would probably show that the whole loop can he vectorized, but this small modification confirms that the programs can he more efficient.
4.3 Vectorization o f Other Subroutines
The thermodynamic property subroutines are called many times during execution, particularly when the N-R nonlinear equation solver is used. These would be likely candidates for further study. A point of diminishing returns is reached quickly, however, because very little time is spent in most o f the subroutines. Completely vectorizing a
subroutine that only takes \% of the total time will only result in a speedup of about
l.l.
4.4 Alternative Solution Strategies
Perhaps the most promising approach would be an entirely different solution strategy. A more tfllcent algorithm written with vectorization in mind might give better results than patching up programs that were designed for scalar operations. Also, the virtually unlimited memory capacity o f the Cray could be utilized in a way that allows for an efficient vectorized algorithm.
Figure 2: Revision of Code to Improve Vectorization
er
Subroutine CBSLCl whs revised to improve the vectorization of one 1)0 loop. Ov
80% of the total solution time was spent in this subroutine when problem t
with the Newton-Raphson nonlinear and the CBS linear equation solvers a
derivatives.
* . r <1^)5 sef
Original Code: 1% of all operations were vectorized, solution time I '* -'''
IF (NSPK .EQ. 0) G OTO 540 DO 530 NS = 1. NSPK JJP CLi(JSPK(NS)) DO 520 JJ - JJF, JJL IF (A (JP + JJ) .EQ. 0.) GOTO 520 NOP NOP + 1
IF( A(JJP+JJ) .EQ. 0.) NFILL - NFILL r 1
A(JJP f JJ) A(JJP -J J ) • A(JJP + II)*A(JP -JJ )
520 CONTINUE
530 CONTINUE
n(*7
Modified Code: 4% o f all operations were vectorized, solution time -DO 523 JJ = JJF, JJL 523 ATEMP(JJ) = 0. IF (NSPK .EQ. 0) G OTO 540 DO 530 NS 1. NSPK JJP ^ CLI(JSPK(NS)) IF (A (J J P t ll) .EQ. 0.) GOTO 530 DO 525 JJ JJF, JJL IF (A (JP +JJ) .EQ. 0.) GOTO 525 NOP ^ NOP t- 1
IF (A (JJP+JJ) .EQ. 0.) NFILL = NFILL + l ATEMP(JJ) = A (J J P + Il)*A (J P + J J ) 525 CONTINUE DO 527 JJ = JJF, JJL A(JJP+JJ) -r A (J J P fJ J ) • ATEMP(JJ) 527 ATEMP(JJ) ^ 0. 530 CONTINUE
5
Acknowledgements
The guidance and patience of Or. M.A. Stadtherr was essential throughout this project. The members of Professor Stadtherr’* research group were also very helpful. Finally, the inspiration and motivational support provided bv Julie Staley was great I v appreciated.
6 Bibliography
The following references were used in this report:
1. Malachowski, Mike., ’ Simultaneous Solution of Multistaged Interlinked
Separators,' Ph.D. Thesis, I niv. of Illinois. Frbana. Illinois (1983)
2. Ibid, pp. 37-41. 3. Ibid, p. 28.
I. Ibid, p. 43. 5. Ibid, pp. 86-95. 6. Ibid, pp. 59-86.
7. ('ray Research Inc., ’ Cray 1 and Cray X-MP Computer Systems. | ()R- TRAN ((-F T ) Reference Manual,' revision J 02 (1984), p. I I.
8. Natio i il Center for Supercornputing Applications, T s e r’s Ciuide,' version 1.0, (January, 1986), p. 10-7.
9. Ibid, p. 10 9.
r«* C » t Ci l* — . w l tM ' £ R C O fl C 'C tt X* 4 t~,
A
Appendices
A .I Sample Program O utput - 1 •: J ■,
'■ ; f • J ■; i i j ; v k t ) i * * * * a m i? * rt to • » C ♦ 2 : ** E -d if h i G « •“ J i ** i 3 t j u &. So % ft Z 3 ft . ftZl 3 C C •to X C Jflp *f • ft > -■ _ c Z 3 D ^ * «0 * 2 •> 2 v 5og W k. * ft k* C j ; o 22 ~ 2 4 4-3 c - l * K - _ 0 ' . ' £ £ < «, £ * 4 * ’* T m ;c £ I k ’ e c o 1 t 4 4 J 0 0 3 m u i f o o t u n £ ** 2 £ C ft 2 S l f C . to , . I * ^ r : j i z C V v 2 #fc* 9 ( I C 0 0 0 ri i O V 0 J u c H 3 |£ 2 * 4 * 3 1 1 Ito l.y t/ •4 3 *+■ *■ c f *• J £ W 'to 0 0 © V-J ^ * * - 4 3 I 0 c * to ^ ■i» T I * -v ^ c 5 2 ■* »'■ L ll ft . • . - .1 * “ * * ■= !J « l c ft t * r C i : 3 2 i c < o C to c to J ft O . •. r— - _ 0 •• * m .* f* (i • 4 i c c c o ; c j O O 0 o c o : c c « c o c w ♦ ♦ * * * * UWUU'J'. • • o c e © e c ^ O $ 0 6 J 0 ft 5s J* '* l* y J to to ♦< < ■ ^ c c je o o o II » (. Ii 1 c o o o s ; ’ ; e « c s c c t* r ^ © *•■ n . . . c rococo© ,w u4£ c I I *■ w - *5 u %c •f % *; U v J ,c :t ■/ z H M► « 'r -** * sI I IM ,«* ft* c 3,4 C i j M c rf 7 ** M i r* : - *- ** - •• 4* wr ♦* r w1 . U J L t/ i; l v * ^ r L' 1' n U■4 - T f ** KZ1Z ** r- M u*4m£. ‘4 i l ? * u 23 i t * * to. I * *i» - t’ 1 J C £>■ W4. * ! «i * ff*.to • > c c 2i 1* *1 ij ♦ * j* ♦: t' C : l : r , **•fl0 tu.z r: C-, W s4 ^ 1 4. » 4 J ' j j j 0C 00 c V* r* * * *- * * » / ^ ** 4 , • w4 ft » • *; ;*'J w t j j •*k - t -■ - 4 , * t *t 0 rt0t it r I * *'*l >•. ^ u* 4*U u* Vw 4 *•*- •H12li ? n V rl t 3 411 ' ' * & * * - 2: *« - - . c U0 L* 0 V*5 u ft. * *, -** i i - ^ . s* 1 “wi £ -* V- ! - to' ‘ t to v> c to ■ O tol C* t to to to to •> to < I ( , to ft y ft g w o 0 o c o c - * 4 < U t ft « N fc» C «* 4 to m to O C 0 O 6 C fs #. K ► K » COOCS'. to*. * * I I I to bUldlilkiu - o u c o ; »• 0 * * r. o © •<!>»•<♦ « * < V *•t ft i ? M < < c ‘ o c i 9 C C •_'«***• t to to * n fti f> ft :& go o & o u * i i i i i 5 § t* rt 4 ft -*« # ft. < 5 *? . « r ** > ... 0 •; c c 0 0 0 C 0 0 * C o o r , , to, , ., y < r C 0 0 0 0 0 c :; *_ V > f C C ) 4 * * } ft W W to. * 1 1 0 0 6 0 0 0 . . , o < ». : r ? c to : < ^ < j, • • •« 4 0 . i ., £ to 0 0 0 0 ■j V * * ft) i i y » 1 { ; . » ♦ to > , j i j M V U ** » to* • u t i : ~ z • / - :
fit * 0 r r . ; 1 > S 2
%
c
» A c V ■ » 2 l/
*
z
c / • * s? ?
* w * w*
C
, T;:
jj
I
I
* V
M 3 *i
* »
? c .* ; cr * - u o f i t I * * .r f ? ** 98 , r r m r r : 0 : C * i p . ' i * , ;9
„I
t *• • • V > s J V > -r . - * r , A . « - -4 ' / - 1 c r ’ < .‘ C -» X X 1 3 * t ix ■ , i> i ” , T t i 3 * • H r * * -▼ * M * 7 n C 4 C 7 * ■ T * * > - - K r -2 b r, ( M 2 5 3 -a i n r -/ * / * I r i " , « « r * r M * \ ( y M * i H (* i i c c -m * r f c c C r » H * I ni o 1 * m 0 0 0 * * ? ■ c 0 0 S i -? 0 * 4 > p t - C : * $ * N e w ■ i a 2 n p 0 * z 7 9 * 3 ; r :f r n w S i 0 „3
a a t r * t H h no
r z s * 0 9 0 ■• 0 i n**
**
**
55
55
55
rt f\ * y {•< * ! » x u n ? ‘ i I t * 1 f*l I * ; r r t • * JII*
H
im
J8 5I JJ 3 2 i 5 5 i al se s: fi ri "i f: 3 : ; 5' ’ J * • t w ♦ . -SiJ SS iH55
55
55
rr fr rr 00 00 00 hH Vb t it t m *4 * i H *4 rr rr rr ti r v ll 4 - i i . -Mn *
w
* n
b il lpi*
**
**
n
u
n
a •»!
o r 5 «o d
■ a«
t Hs
r 53 g!5 00 00 9**
**
*
90 9 *3 H M H N f * t X X -4 55 55 ? Ji ll * ss s a H W H H * 4 i j 4 4 4 m M m nI '
i 2
S
|
m y * U r g o* Ti*
mm
mu
*
*o 35 35 M M H M 1 * X X X M * II t t J I 2 ? f -f h - N M « N | < -4 - - 4 -4 M H M H??
??
,?
* *0^
< fl o
- r -H W H * z z zz ss iimt
H H H M ?* *« rr rr * N H W C 7 i * ♦ 0 !? ^ M M t H a < 4 »• o ►* H -4 i§ 0 H « •, > ! fi K !j * 0 c z o 00 00 00 0 (, , ■ ' c v o c oo oo oo r h H H c c 0 0 0 o 0 o * 0 r c o 0 o o V ' 0 M oo oo oo * # V * ^ s ^ f* O CO JC O 0 0 ' * " c ^ o ^ 3 r t l 4 > • « t 5 p t V ‘ 0fT af iA G C f~ cm iN v e n se 7t C O TH-tfT F O R J N V E R '. I O N m M O S O L l J T l 'j N s O . H C 4 a « ,r Y T » K O R M O T R r < & Z C 4 J A L % F OR L t N i . » « R F O U A I I O N S O L V E R * & . 0 3 3 4 » * * * i* 4 t < » 3 * « . l j « ? i .Z * ? < !» »; '» X E t i RORH O E R J C M T H m IJO V E C T O R SC V ' . i; 12 < » T | « i i s r * 2 0 3 7 r O I 4 ? f F S S * 7C4 4 M O * S M Of S O L U T I O N V C '. 'T O R - 30 1 1 f T ! 7 1 5 0 1 5 7 3 8 W » 2 8 H I f E l E * H • v t r t * : t z t v . t c * ' o c -* O l W ’f > C A S O U t iR C V * O . f f i o ; a Z « 9 4 H 2 3 1 E ♦ O i S ’ K f * r > ; z t c . • »* ♦ * » i« T » c o . c ♦ S 'A l t t R C S -O . l Z ! C 6 0 0 7 F I a 1 O F 4 o ; r i ► O F S Tt. |» 1 7 1 1 i . . « . -I 6 N O . «. .. 1 T S E T C O M C ^ X I K R l -. . l o t 4 K f .f t E V i. H T I f if e • ' O H _» * • • O S * ! i * f* E V u c l I i . T l O W ■ < R » • »* C . . T T * ? X . * R '. ‘ tO t-U r t E f o n I N V E P M C A «T .C H 1 (1 1 IT N -O . H * H S E C O N D S • i T i .F - M . H . i . . I C J O r . O i S iL • -» C »S ■= • 0 . l £ ; 6 S 9 l 2 : ( i > « 4 7 F E 4 w l r t n e * O R ' : -T r r . i . *► '. t c c i c * t s * m » i* . t < » v Ct ECONtc. i • r. r , n c ' i N u n s f R .» T T "• € . ^ O * * >». ' O S I n N E V m I O u T j ( > N < > n U P C t t 'l E * S I C O NC 't T i n e F O R lM * y C R ' I O U * 4 1 4 0 '. O L u r i i . ( N 3 O . H E A S E C O N D S rt ” H S I T E = - O . l O O O i ' » O l S U M C IE S J U R U E O b 0 . 7 S t l B E 1 7 7 1 7 0 « » E ' » X l i n e c o p s ^ c r s m * C m u h l a ' i o n -o . „ i o v * ie :c o * «* r* * » I T E R m T I O N N l M B t 'N *♦ T I M E F O R J m < : O B I h N C V n O U f T l U N OR l l R D n T t » k -4 H ^ S f r C O N C t f c T X M E F O R I N V E R S I O N A N D ‘ .O L M I I O N c 0 . * » t 7 S E C O N D S S T r R S I Z E -O . t O O O C » C 1 SU M C W S O U m P .C S a O . T B R 'F A X E N A f t S i « E > 0 * T J f t C r O T . S T f R ‘. I Z r C tr t C i E t T i t C u -0 . 1 0 7 S E C O N D S L H H > < r I O N N II M B i' H r i “ tc f O f * j * « o r i » « N e v m l u u t i ' i n cm u r d m t e -* i . a h ? s e c o n d s t i n e f o r I N V E R S I O N n NO S O L U T I O N ^ O . H S 2 S E C O N D S S T F .R S I Z E * O . t O O O E E O l SU M O F S Q U A R E S * O . Z S « O g 4 9 1 £ H E 7 4 E • 1 H T I N E F O R S T C R S I Z E C m l C M L A I I O N « O . i O S S E C O N D S S O L U T I O N H n S S E E N A T T A I N E D s i « o r s c r m i c s m o . ^ s s o s * s t X '* r ? v e -i s x s l e s s t h a n c o n v e r o c n * : e c r i t e r i o n o f t o t a l N U N t C R o r I T E R M T I O N S s S N U « * E R O F U m C O S I a N E V A L U A T I O N S ? ** C N F C U T I O N T I N E * l * H O . S E C O N D S O . I 0* 7
O A A A ^ . A A A / V A A A v v v v v v v v v V V o c o o c c o o o o o o S J fr fr fr x i^ x jf r fM h s Is ! S 'Q O tf ^ t ^ O v j^ O K 9 * ^ * * W * 4 W 9 > * '4 > * > * ‘ 91 * * 0 W -F **■ O t d ** £ * y K ! i* i r , 1 t ; ^ C v 0 v ? V i :. * l ^ o o o o o c o o o o o o o c o o o c o o c o o o o o o o o c o o o o o o o o o o o o o o o o o o v O O O C * ■* O v o o o ? i! 4 -. 5 N t * O v S t !y 4 ? “ s 4 8« | H * 9f r-F *f cO -F N * “ X C V : »• * * * X * * f * V s « e 0 f» i i !K !( !e 2 < !' s s : c r -’ -i r z j ^ r ’v T| ^ * fi ^ 0 9 H ? rj 3 >q c H f ! M c 2 M 2 * i S 0 ■* •! 7 *“ 2 f * 2 9 -F J* f > «% 5 *-» " 5 2
o
o
o
o
o
o
o
o
o
o
o
o
o
o o o o o o o o o o o o o ?3 » * h - O • £ 9 > i U H .3 >4 K 9 9 * « F si W -F y f -> O f *> si & f cf O U ? y 9 S *2 >, 4 '> 4 0 0 0 ? v 4 * + > l-^ v» O F O O O > iO a (h O O O O O O O O O O O O O o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o u O o X O ^ O X H ? ! 0 ^ r ^ S 'J O s , o > 0 » * N > !f r O U‘ * M * 9 ? •> y 2* 9 > 4 0 f> < ^ i ? ft * < m < r < w r < r c n s r n ~ y . r e -< r rs Z * 4 C S 9 h j i? f ij pi n o m tr p £ * ; ? “ < ^ -• ?j ^ -■ 0 0 o C 0 A 0 o o A 0 o O o o o o O o o o o o O o o 0 o si > 4^ 4 V *4 >1 *> V *V •4 .*> *4 * *4 3^ ? i s» © -F si-F -F *4 H -ilk . © * s *o >4 * s! 9 * » I* •F » w *> >* S» ©* m si I fu > f* 0 * u > 0 O -4 A V «s 0 i-i 0 0 n n IO ii o O K ? A -✓ •v i 0 O 0 0 o o o O o o c 0 o o o 0 o 0 o o A V 0 o o o o aV A V o 0 o A A A V V V o A V 0 /% A >* V A V A A A o 0 o o o o o o O 0 O © 6 6 0 o O c c A o A A At *• A **A A 0 * ? -> 0 — A r* x «* A A. c o » f tf| o N 1* m -f U n * 0 «! F O * ■ ‘«L 4 i*. 0* V -F s X IR 14 1 < W < M !*! 5* T > ,»* f d x™* V' «l * : r* r-. 2* h ft "f? ■v C C A x 5 «* F £5 5~ ~ -' M -«*i : i-:-r* -« r T ~ 3* — T-: X A V H* w* u: X i n ty. f" if! U4 U w ~ w K ?-> V : K't i - V ^ X X .•fr i?*r i « • * 1 : - t :« rv -v -n V A — S '.v.^ *0 A V » f > » f 4 * * ? > » # » > > > • 4 » * • > % O 0 o c o o O 0 O O O 0 O O 0 A 1* M »* « ■*■ t * i* >* • * 0* -— k.l Kr V; r* w !H 2 f> w !3 i> %\ V .Kt : K" : f; V :?! ??? ! i} *f| i n fn v *V v? O 0 V ; Vt f i Kt K‘ r H : ?! T- J A K* w . :*r X if 4; x! IS C n ir, 9 v : =n ■ :■* j \ »* “ “ O * • H *» .V ,-•» f *4 v * 4 * ■>• \ W w w X’ X V v* f.l \ y 0 X if! If } *" * .» O O C 0 0 O *j f.*i si s r*. 2 II A V w 9* i ii^5 •n U ( a 14 ^ «-t i \r; K> f.) w U ■F Vi ?> s i4 -v A V »• 0 o -.— *■* }* : • !» ‘ *1 « F *F * 0--4 W *v c o y ; * ' < *4 C !\V X r* •4 i-. X v !r;if r.i i £>1 o f-t « J| >4 f /ic y e y $ ^ iii u o i * 4 4 0 ^ 9 > * ih * * 4i -0 *• ) m uk 4; X? o 9 w 4} :S I ** ? >n y 3 ^ O * r -> w -F h -A a* s ?■ « F ** ' S o o u ? tr vt ^C* £? '2 i. ; Si !3 * J I* i: n a Vi v > H -0 ^ n % i23 S S £ 4 ; 'C * *! u •4 si s 9 o * "4 i i-•v * * *; w 0 i.. •C !Js ,'x X X ■4 *• % : : in 1 tno >.i«fVS hi on 3“ i no ix co o * *! -f i H ? > O i z o d. tc -f r 4 4 I-i • *4 —i X r i 9 9 T | M 9 r * 5 ^ ^ 3 M 30 •j 5» t TJ * 5 * 9 2 1 0 0 9 2 c O d. i 4 M c i n n i i H "I T K tO 2 1 - 9 h n c 0 0 9 *4 -j iH c o H n r r c J u W i a :Jt ?< r o c r* r ; -: < c W i! 1 2 -4 o r z 4# 2 •t ! %a ZS * i w? ?5 5 i 2 i i > * ? -<? ? • v I I * t* . r -3 f O i 'i O N O ^ O C O ^ ? ■> I-i I-i ** jj O i O y ^ O o “ o o o ^ o c n r: -f IA.3 Subroutine Structure of the Programs
The program itself is SIM25 and is stored in the file »im290. The program performs some initialization and then calls the subroutine MAIN located in the file lcsim28i. Main then calls the following subroutines:
routine: tile: routine: file: routine: file:
DATA Ic28x VSCALK lc28x PRINTF Ic28x
SC'HUBO Ic28x Sf'HFB lc28x BH Ic28x
BHA lr28x SIMP lc28x MARWIL Ic28g5
JACOBS Ic28b JACOB3 Ic28b JACOBI Ir28b
PRINT1 le28x STORE3 Ic28 x ST0RE2 Ic28x
TALLY1 lcoriler TALLY3 Ic28x RSf'ALE Ic28x
FSC’ALE Ic28x JACOB lc28b BBTFO Ic28gfi
CiENSOL Ic28g5 RANK! tcrank CBSLU Iccbslu
BBTF Ic28g4 MA28AA Ic28x M28*BB lc28c
LUIP9 lclu 1 p9 NSPIVC lcnspivl SOLVEO Ic28g6
STORE4 Ic28x BBTFP Ic28x S0LVE3 Ic28g4
MA28CC Ic28x SOLVECi Ic28g5 STEP Ir28x
PRINTR Ic28x RESIDi lc28i RESIDO l<-28i
The thermodynamic function routines for C’hao-Seader correlation with numerical deriva tives are stored in Ic21d4 and with analytical derivatives in !c21d. The Uniquac routines are stored in lcuniq. Most o f the subroutines listed above call other subroutines located in the same hie. The modified version of the subroutine CBSLUl is stored in vcbs. Data files are stored as lcdXXX where XXX denotes the problem and linear equation solver used.
A.4 Total Solution Time o f Each Run
The following tables list the total solution time for each run on the Cray. Data from the CYBER is included where available.
Table 8: Total Solution Time for Each Run for Problem l
Chao-Sender correlation with numerical derivatives
I nonlinear linear f total OPT time (sec)
equat ion equation !
solver solver CYBER
4.00 Crayon ('ray-off J Schubert BHTDFl 3.686 N-R GENSOL 10.5 3.568 3.577 i Schuli*; ? MA28A 8.38 1.436 1.474 | N R RANK1 3.79 1.808 1.816 | N R CBS 3.70 1.841 1.854 j N R LCIP9 3.76 1.763 1.771 j N-R NSPIV — 1.769 1.783 ! MARWIL CiENSOL 8.70 3.202 3.220 M B RANKI — 2.069 2.096 M B LUIP9 — 2.013 2.036 M B CBS — 2.245 2.275 Simp N R CBS - 2.245 2.275 Schubert CBS 2.96 1.445 1.476
Chao-Sender correlation with analytical derivatives
nonlinear linear total CPU time (sec)
equation equation
solver solver CYBER Cray-on Cray-off
Schubert BBTDFl — 4.665 5.050 N-R GENSOL — 2.379 2.377 Schubert MA28A — 0.960 0.994 N-R RANKI — 0.614 0.618 N-R CBS — 0.644 0.652 N-R LUIP9 — 0.570 0.575 N-R NSPIV — 0.576 0.581 MARWIL GENSOL — 2.482 2.490 M B RANKI ...— 1.116 1.140 M-B LUIP9 — 1.039 1.073 M-B CBS — * 1.140 1.163 Simp N-R CBS — 1.298 1.316 Schubert CBS — 0.974 0.995
Table 9: Total Solution Time for Each Run for Problem 2
Uniquac correlation with numerical derivatives
! nonlinear linear total CPU time (sec)
| equation equation | i
■ solver solver ! CYBER Cray on Crav-off
| Schubert BBTDFl f 18.4 13.291 16.150 ! | Schubert MA28A 53.7 16.695 18.381 Simp N-R BBTF* 1 13.2 4.220 4.115 N R CBS 13.8 3.077 2.970 ! Schubert LIUP9 13.7 2.828 2.825 j N R NSP1V !1 2.716 2.605 MARWII. (JENSOL 34.9 8.971 8.859 | | Schubert RANK! ; 19.3 4.640 4.641 j
Table 10: Total Solution Time for Each Run for Problem 3
Uniquac correlation with numerical derivatives nonlinear equation solver linear equation solver
total CPU time (sec)
CYBER Crayon Cray-off
Schubert LU1P9 40.4 8.704 9.389 N R CBS 47.6 12.101 11.736 N-R RANKI 60.0 13.829 13.713 MARWIL GENSOL 197.2 50.866 51.030 Simp N-R BBTF* 48.4 16.867 16.412 Simp N-R BBTDFl 37.2 14.219 15.986 Sit° r N-R MA28A 61.7 14.927 14.984
Table 11: Total Solution lime for Each Run for Problem 4
Chao*Header correlation with numerical derivatives
nonlinear linear total CPC time (sec)
equation equation
solver solver ^ CYBER
63.5 Cray-on Cray-off ) Schubert L I I P 9 18.741 17.980 1 N-R CBS 55.1 23.877 24.408 ;i Schubert RANKI 103.4 31.433 29.473 jl Schubert BBTDFl 101.5 123.273 126.633 |
Chao-Sender correlation with analytical derivatives nonlinear equation solver linear equation solver
total CPU time (sec)
CYBER C rayon Cray-off
Schubert LU1P9 109.7 42.171 43.888
N-R CBS 29.3 15.995 15.865
Schubert RANKI — 23.737 24.992
Table 12: Total Solution Time for Each Run for Problem 5
Chao-Seader correlation with numerical derivatives
nonlinear linear total CPU time (sec)
equation solver
equation
solver i CYBER Cray-on
! i f'rav-off N-R CBS ' 1 0 9 . 7 12.171 43.888 N R NSPIV 14.290 15.381 N R LU1P9 89.3 22.832 2-1.525 J N-R RANKI 107.1 27.366 29.161 N R BBTDF1 115.2 81.019 91.092 j
Chao-Sender correlation with analytical derivatives nonlinear equation solver linear equation solver
total CPU time (see)
CYBER Cray on Cray-off |
N-R CBS 53.4 27.768 28.095 !
N-R NSPIV - 5.303 5.478 !
N-R LUIP9 33.2 8.338 8.619
N-R RANKI 50.9 13.019 13.580
Table 13: lota! Solution Time for Each Hun for Problem b
Chao-Sender correlation with numerical derivatives
nonlinear linear total CPC time (sec)
equation equation
solver solver CYBER Cray-on Cray-off
N R NSP!V 15.881 16.402
N R CBS 62.8 29.459 29.952
N R LU1P9 44.4 15.528 16.079
( hao-Seader correlation with analytical derivatives
nonlinear linear total CPU time i(sec)
equal ion solver
equation
solver CYBER ( ’ray on (>ay-of
N R NSPIV 4.829 5.003
N-R CBS 34.5 18.379 18.428
N-R LC1P9 16.4 4.508 4.670
Fablf 11: I'otal Solution Time for Each Hun for Problem
Chao Header correlation with numerical derivatives
nonlinear linear ! total CPC time ( sec)
equation equation
solver solver j CYBER ( ’ray-on Cray-off ^
N R NSPIV 1I 34.952 35,957 i
N R CBS |! 112.5 47.909 48.997
N R o 'i p ? ) ! 95.6 34.469 35.724
N-R RANK! 1 167.1 55.874 57.510 |
Chao-Header correlation with analytical derivatives
nonlinear linear total CPC time (sec)
equat ion equation
solver solver CYBER Crayon Cray-off j.
N R NSPIV - 9.327 9.642 “
N R C’BS 47.9 22.345 22.721 jj
N-R LC1P9 31.7 8.929 9.279 jj
Tablt* 15: Total Solution Time for Each Run for Problem 8
Chao-Header correlation with numerical derivatives
nonlinear linear total CPC time (sec)
equation equation
j solver solver CYBER Cray on Cray off i
f N R ' SPIV ' 24.836 24.97'! \
J N R M ' IPO 5L9 24.769 24.954 |
Chao-Seader correlation with analytical derivatives
nonlinear linear total CPC time (sec)
equation equation i
solver solver CYBER Cray-on Crav-off
N R NSPIV 7.1)82 8.070 1
Table 16: Total Solution Time for Bach Run for Problem 0
Chao-Header correlation with numerical derivatives
| nonlinear linear total CPU time (sec)
equation
1 solver
equation
solver CYBER Cray-on ( ’ray-off
f N R LU1P9 65.7 31.951 32.078
| Simp N- R LU1P9 - - 11.530 42.069
| M B MUP9 - 43.555 44.150
| Schubert LIMP*) 71.6 30.256 33.765
( ’ hao* Seader c or relation with analytical der i vat i ves
nonlinear linear total CPU time (sec)
equation equation
solver solver CYBER Crayon Cray-off
N R LU1P9 25.9 9.776 9.860
Simp N-R H U P9 — 19.513 19.789
M-B LU1P9 21.553 21.940
A.5 Performance of Linear Lt|uation Solvers on the i ' \ B L R
The results from reference I are aiven in the I able below.
Table 17: Solution Times for Linear Reflation Solvers on the t ’Y B K R
LINEAR SOLUTION LINEAR SOLUTION
EQUATION TIME EQUATION TIME
SOLVER (SECONDS) SOLVER (SECONDS)
PROBLEM 1 CBS .07 PROBLEM 2 LU1P9 .21
EQUATIONS LU1P9 .09 A05 EQUATIONS CBS .21
22 STAGES RANK I . 10 A5 STAGES NSPIV .27
3 COMPONENTS NSPIV .11 A COMPONENTS BBTDF2 .56
BBTDF1 .16 BBTDF1 .59 BBTDF2 . 17 RANKI .73 BBTF* .18 THOMAS .8* THOMAS .26 BBTF* .97 BBTF .36 BBTF 1.A7 MA28A .67 MA28A 3.08
PROBLEM 3 LU1P9 . A6 PROBLEM A LU1P9 2 . A A
810 EQUATIONS CBS .71 777 EQUATIONS BBTDF2 3.18
90 STAGES BBTDF2 1.13 37 STAGES BBTDF1 A.A2
A COMPONENTS BBT0F1 1.21 10 COMPONENTS RANKI A. A]
THOMAS 1.6A CBS A.73
RANKI 2.25 THOMAS 10.23
BBTF* 5.26
BBTF 6.07
MA28A 8.78
PROBLEM 5 LU1P9 2.81 PROBLEM 6 LU1P9 1.A8
775 EQUATIONS BBTDF2 3*62 1235 EQUATIONS BBTDF2 A.23
25 STAGES RANKI A.86 95 STAGES CBS A.55
15 COMPONENTS BBT0F1 6.10 6 COMPONENTS BBTBF1 5.01
CBS 6.11 THOMAS 6.12
RANKI 6.91
PROBLEM 7 LU1P9 1.3* PROBLEM 8 LU1P9 1.3*
1560 EQUATIONS CBS 2.0A 2100 EQUATIONS B8TDF1 2.19
120 STAGES BBTDF2 3.A5 300 STAGES BBTBF2 2.35
6 COMPONENTS BBTDF1 A.03 3 COMPONENTS
RANKI 8.1A PROBLEM 9 LU1P9 1.13 2310 EQUATIONS BBTDF1 2.A2 330 STAGES THOMAS 3.01 3 COMPONENTS
SI