Three Dimensional Simulation of Time-Dependent Scramjet Isolator/Combustor Flowfields Implemented on Parallel Architectures

(1)

McDaniel, Keith S. Three Dimensional Simulation of Time-Dependent Scramjet Isolator

/Combustor Flowelds Implemented on Parallel Architectures, ( Under the directions of

Dr. J. R. Edwards)

The development of a parallel Navier-Stokes solver for computing time-dependent,

three-dimensional reacting owelds within scramjet (supersonic combusting ramjet)

en-gines is presented in this work. The algorithm combines low-diusion upwinding

meth-ods, time accurate implicit integration techniques, and domain decomposition strategies

to yield an eective approach for large-scale simulations. The algorithm is mapped to a

distributedmemoryIBMSP-2architecture andasharedmemoryCompaqES-40

architec-ture using the MPI-1 message-passing standard. Two and three-dimensional simulations

oftime-dependenthydrogenfuelinjectionintoa modelscramjet isolator/combustor

con-guration at two equivalence ratios are performed. These simulations are used to gain

knowledge of engine operability, inlet performance, isolator performance, fuel air mixing,

ame holding, mode transition, and engine unstart. Results for an injection at a ratio

of 0.29 show qualitative agreement with experiment for the two-dimensional case, but

re-vealed a slow progression toward engine unstart for the three-dimensional case. Injection

at an equivalence ratio of 0.61 resulted in engine unstart for both two-dimensional and

three-dimensional cases. Engine unstart for the three-dimensional case occurs as a

re-sponse tothe formation and growth of large pockets of reversed owalong the combustor

side wall. These structures develop at an incipient pressure above 154 kPa and result in

signicant blockage of the core ow, additionalcompression, and chemicalreactionwithin

(2)

Architectures

by

Keith Scott McDaniel

A thesis submitted tothe Graduate Faculty of

North Carolina State University

in partialfulllmentof the

requirementsfor the Degree of

Master of Science

Department of Mechanical and Aerospace Engineering

Raleigh, North Carolina

December 19,2000

Approved By:

(3)

I dedicate this thesis to my grandmother, Emily C. Dotson, and my brother,

Don Mark McDaniel, who have stood behind me through many years of hard work.

Their support of my goalsand aspirations have kept me strong. Without them, my road

(4)

Biography

Keith Scott McDaniel was born March 18, 1975 in Winston-Salem, North Carolina. He

grewup nearLibertyAirportinWinston-Salemwherehisfascinationwithaircraftandthe

driving forces behind them sprang from. After graduation from Carver High School, he

began his collegiateactivities at North CarolinaState University. He received aBachelor

ofScience inAerospace EngineeringinMay of1998 and iscurrently attainingaMaster of

Science in Aerospace Engineering from the same university. Keith willbegin employment

(5)

Acknowledgements

First,I wanttogivelovingthankstomymother,DonnaFayMcDaniel, andfather, Danny

Jay McDaniel, for they have given me a life full of opportunities. I thank Dr. Jack

Edwards for being a teacher and an advisor during my studies at North Carolina State

University. I acknowledgethe AirForceOÆceof Scientic Research(AFOSR) forsupport

of the current research activities through AFOSR GRANTF499620-98-1-0238,monitored

by Dr. Len Sakell. I also want to thank all my friends and schoolmates for all the help

(6)

Contents

1 Introduction 1

2 Governing Equations 4

2.1 Reference Equations . . . 4

2.2 Averaging . . . 7

2.2.1 Reynolds Averaging. . . 8

2.2.2 FavreAveraging. . . 9

2.3 ChemicallyReacting Equation Set. . . 11

2.3.1 Chemical Kinetics. . . 13

3 Numerical Formulation 18 3.1 Turbulent Closure . . . 18

3.2 Upwinding . . . 20

3.3 Time Advancement . . . 23

4 Multiblock Parallel Implementation 25 4.1 Algorithm . . . 25

4.2 ParallelPerformance . . . 28

5 Application 32 5.1 FlowModel . . . 32

5.2 Boundary Conditions . . . 33

5.3 Fuel Injection . . . 34

(7)

6.2.1 EquivalenceRatio 0.29 . . . 39

6.2.2 EquivalenceRatio 0.61 . . . 42

7 Conclusions 45

8 Figures 47

References 73

(8)

List of Tables

1 Abridged JachimowskiChemistry Model . . . 16

2 Balakrishnan etal. ChemistryModel . . . 17

3 Execution Times onScramjet . . . 30

4 Execution Times onIBM SP . . . 31

(9)

List of Figures

1 Upwindingstencil ( -direction ) . . . 47

2 Block decomposition of domain . . . 48

3 Parallelalgorithmow chart . . . 48

4 NAL isolator / combustor conguration . . . 49

5 NAL fuel strut . . . 49

6 NAL isolator / combustor congurationgrid, (every secondpoint shown) . 50 7 Hydrogen mass ow rate normalized by equivalence ratio versus time . . . 50

8 Floweld contour plots , =0:0 . . . 51

9 Wallpressure atseveral characteristic times =0:29 . . . 52

10 Floweld contour plots , =0:29 . . . 53

11 Wallpressure atseveral characteristic times , =0:61 . . . 54

12 Mach number contour plot atseveral time levels, =0:61 . . . 55

13 Static pressure at several time levels, =0:61 . . . 56

14 OH contours atseveral time levels, =0:61 . . . 57

15 Centerline Machnumberversus X, =0:61 . . . 58

16 U-velocity contours ( m s )near fuel strut ( t= 13.25 ms, =0:29 ) . . . 59

17 U-velocity contours ( m s ),=0:29 . . . 60

18 Static pressure (Pa)contours, =0:29) . . . 61

19 Temperature contours (K), =0:29 . . . 62

20 Hydrogen mass fraction, =0:29) . . . 63

21 Hydroxyl mass fraction contours, =0:29) . . . 64

22 Wallpressure atseveral characteristic times , =0:29 . . . 65

23 Time averaged wallpressure ,=0:29 . . . 65

24 U-velocity contours ( m s ) near fuel strut ( t= 0.25 ms, =0:61 ) . . . 66

(10)

26 Temperature contours (K), =0:61 . . . 68

27 Static pressure (Pa)contours, =0:61 . . . 69

28 Hydrogen mass fraction, =0:61 . . . 70

29 Hydroxyl mass fraction contours, =0:61 . . . 71

30 Wallpressure atseveral time levels ,=0:61 . . . 72

(11)

List of Symbols

A Pre-exponential factor

~

A ~

G Combined inviscidand viscous uxJacobians

^

A System Jacobian matrix

a Speed of sound

b Reaction order

C

p

CoeÆcient of specic heats at constantpressure

~ c

j

Species Concentration

D Diusion coeÆcient

e

t

Total internal energy

E,F,G Flux vectors (x, y, z,directions respectively)

E Inviscidux component (x -direction)

Ep Parallel eÆciency based ontime

E

v

Viscous ux component (x -direction)

F Inviscidux component (y -direction)

F

T

Time averaged owvariable

F

v

Viscous ux component (y -direction)

f Instantaneous owvariable

G Inviscidux component (z-direction)

G

v

Viscous ux component (z-direction)

~

H Combined Jacobianmatrix at point ijk

h Stagnation enthalpy per unit mass

h o

f

m

Heat of formationfor species m

J Jacobian

J

mj

Diusion Flux Tensor

k

f ;k

b

Forward and backward reactionequilibriumconstants

k Turbulent kinetic energy(section 3)

k Specic reactionrate (section2.3.1)

l

k

Kolmogorov length scale

l

mfp

Mean freepath length scale

M Mach number(section 3)

M Collision partner(section 2.3.1)

M

j

Molecular weightfor species j

Pr Prandtl number

p Pressure

q

j

Heat uxvector

q

t

j

Reynolds heat ux vector u 0 0

i h

0 0

(12)

R Universal gas constant

R R

f ;R R

b

Forward and backward reactionrates

S Source vector

Sp Parallel speedup

Sc Schmidt number

t Time

T Temperature

T

a

Activation Temperature

T Integral time scale

TB Third body eÆciencies

T

1

Turbulentintegraltime scale

T

2

Mean property variationtime scale

W Primitivevariable vector

U Conservativevariablecolumn vector

u Velocity component instreamwise direction

u

j

Velocity vector with componentsu, v, w

v Velocity component inwallnormal direction

v 0

m;n

StoichiometriccoeÆcients for reactant m inreactionn

v 00

m;n

StoichiometriccoeÆcient for product m inreaction n

w Velocity component inspanwise direction

x Cartesian coordinate(generallystreamwise component)

Y

m

Species mass fraction

y Cartesian coordinate(generallynormal component)

z Cartesian coordinate(generallyspanwise component)

_ w

m

Species production rate

Q

j

Turbulentheat ux tensor

T

ij

Turbulentstress tensor

U;V;W Contravariant velocity components

V

j

Species diusion velocity vector

U m

;V m

;W m

Species contravariant diusion velocitycomponents

Y

mj

Turbulentspecies mass uxcomponent

Greek Letters:

Æ

ij

Kronecker Delta

Density

;; Computationalcoordinate

; Arbitrary variables

Thermal conductivity

t

Turbulentthermal conductivity

Molecular viscosity

(13)

! Specic dissipation rate

ij

Laminarstress tensor

kinematicviscosity

Subscripts:

1 Freestream quantity

o Stagnation (total)quantity

i;j;k Spatialcoordinates, cellindices

L;R Left and RightStates

m Chemical species

n Time level orchemicalreaction

t Totalor turbulentquantity

mix Mixture quantity

v Viscous quantity

1

2

Interface data

Superscripts:

0

Reynolds uctuating component

00

Favre uctuating component

c Convective component

p Pressure component

Accents:

Averagedvariable

~ Favre-averaged component

^ Linear system ux Jacobianin computationalspace

Abbreviations:

CFD ComputationalFluidDynamics

CS Control Surface

CV Control Volume

DNS DirectNumerical Simulation

HXLV Hyper-X LaunchVehicle

HXR V Hyper-X Research Vehicle

LES Large EddySimulation

R ANS Reynolds AveragedNavier StokesEquations

NACA National Aeronautics and Space Administration

NAL National Aerospace Laboratory

NASA National Aeronautics and Space Administration

NASP National AeroSpace Plane

NS Number of species

NR Number of reactions

PNS Parabolized Navier-Stokes

(14)

1 Introduction

The scramjet( supersonic combusting ramjet) engineis a key toairbreathing hypersonic

technology. ThistypeofpropulsiondevicehasbeenstudiedbyNACA/NASAfornearly60

years [1]. A review of the scramjet technology in the 1980's indicates a rm fundamental

basis for the performance potential of dual mode scramjets. Dual mode scramjets are

similar to conventional ramjets but do not contain a second minimal area. This allows

both supersonic (scramjet) and subsonic (ramjet) combustion modes. For example, a

tactical missile(an expendable, lowcost, lowweight, xed geometrydevice) must operate

as a ramjet at low ight Mach numbers and as a scramjet at high ight Mach numbers

(M > 6:5). If adequate combustor-inlet isolation is provided, the scramjet will operate

in a ramjet mode with slightly lower eÆciencies than that of a conventional ramjet [2].

Knowledge of engine operability,inlet performance, isolator performance, fuel air mixing,

ame holding, combustion eÆciency, mode transition, and engine unstart is crucial for

scramjet applications. A large amount of scramjet combustion experimental data was

collected in ground based test facilities during the National Aero Space Plane (NASP)

Program. After termination of NASP in 1995, the focus of NASA Langley shifted to the

Hyper-X program. The Hyper-X program will demonstrate the free-ight operation of

an airframe-integrated, hydrogen-fueled, scramjet owpath in atmospheric ight at Mach

7 and 10 by 2002 [3]. \One shot" test experiments, such as Hyper - X, and scramjet

experiments in ground-based facilitiesare limited in the amount of oweld information

which can be recorded.

When astage inadevelopment cycle is reached wherewind tunnelor ight testingis

required, computationaltechniques can beextremelyuseful asadiagnostic tooltoexplain

certainunexplainedowphenomenaobservedinmeasureddata. Thisisparticulartruefor

(15)

performance. ThepotentialofCFDtechniquesasadiagnostictoolisdirectlyrelatedtothe

fact that computational solutions provide complete ow eld data, whereas wind tunnel

tests generally measure global characteristics and surface data. In the past, less general

equation sets have been used for the simulation of scramjet owelds. An example is the

use of the Parabolized Navier-Stokes equations (PNS). For supersonic ow, the

Navier-Stokesequationscanbemadeparabolicinthestreamwisedirections byassumingthat the

streamwise viscous terms are negligible and by a special treatment of the pressure in the

subsonicnearwallregionoftheboundarylater. Thesteadyequationsarethensolvedusing

astreamwisemarchingnite-dierencescheme[4]. Theseschemesare eÆcientforasteady

state solution butare only accurate if asupersoniccore ow exitsand are invalidfor time

dependent simulations. The solution of the complete Navier-Stokesequations isnecessary

tocomputescramjetcombustionowswere largesubsonic orseparatedregionsofowmay

exist. Theseregionsofowmayoccurduringatransitionfromscramjettoramjet modeof

operationand duringunsteady modes ofoperation, suchastime dependent fuelinjection.

Also, these subsonic regions as well as shock / boundary layer interactions play a major

role indetermining the inlet ow structure and the resultinginlet performance.

An understanding of dynamic modes of scramjet operation requires, at a minimum,

time-dependent solutions of the Favre-Averaged Navier-Stokes equations, extended for

chemicallyreactingows. Thisequation system can capture complex owstructures such

as expansion fans, boundary layers, shear layers, reaction fronts, subsonic separated-ow

regions, and shock-boundary layer interactions. The use of Favre-Averaging for an

un-steady meanowimpliesthe averagingof theNavier-StokesequationsoveratimeTmuch

larger than the longest turbulent time scale but smaller than the time scale characteristic

of the bulk uid motion. This separates the turbulent uctuating time scales from the

(16)

of operation and to explain results obtained from ground-based experiments. One such

mode of operation is the time dependent injection of fuel into the combustor. Time

de-pendentfuelinjectionleadstoconcerns ofowpathstability,scramjet/ramjettransition,

ame stabilization, and engine unstart. This directly corresponds to the proposed Hyper

-X Launch Vehicle (HXLV) ight test sequence. The HXLV will separate from a B-52

and will be boosted to the predetermined stage separation point. The Hyper-X Research

Vehicle(HXRV)willthenseparatefromtheHXLV.Aftera5secondunpoweredcontrolled

ight, sequenced fuel injection will commence [5]. Simulation of these dynamic modes of

engineoperationrequirescompleteoweldcalculationsofthecombustorandinletto

cap-ture the complextimedependentowstructures discussedpreviously. Thisrequiresof the

order of 1:0x10 6

grid points for 3-D resolution of a model scramjet combustor, assuming

that all eects of turbulence on the unsteady mean ow are modeled. Noting a solution

of the reactingoweld using a 7 species 7 reaction mechanism and the k ! turbulence

modelwillrequirethesolutionof14simultaneousequationsateverygridpoint,asolution

of roughly 14 million simultaneous equations is required at every time level. Obviously,

a parallel algorithm is needed to solve a problem of this size. This work develops a

par-allel Navier-Stokes solver for conducting three-dimensionalsimulations of time-dependent

scramjet owelds.

Sections2-3present the governing Navier-Stokesequations,their closureusing

turbu-lencemodelingconcepts,theirdiscretizationusinglow-diusionupwindmethods,andtheir

integration using time-accurate implicit methods. An approach for parallelizingthe

algo-rithm using a domain-decomposition / message-passing paradigm is discussed in section

4. The application of the algorithmto a model scramjet isolator / combustor

congura-tion tested atthe National Aerospace Lab, Japan [6]is discussed inSection 5,and results

(17)

2 Governing Equations

2.1 Reference Equations

This section will state the governing uid dynamic equations for viscous, unsteady

com-pressibleows. Theseequationsare extended toaccountfor niteratechemicalreactions.

The extended equations are then Favre-Averaged to model the eects of all turbulent

uctuations on the mean ow.

The compressible Navier-Stokes equations in Cartesian coordinates without body

forces orexternal heat additioncan bewritten as[7]:

@ @t () + @ @x j (u j

)=0 (2.1)

@ @t (Y m ) + @ @x j (Y m u j )= @ @x j (J mj )+w_

m (2.2) @ @t (u i ) + @ @x j (u i u j )= @ @x j (pÆ ij )+ @ @x j ( ij ) (2.3) @ @t (e t ) + @ @x j (h t u j )= @ @x j [q j + ji u i ] (2.4)

The diusion ux tensor J

mj

, diusion velocities V

j

, laminar stress tensor

ij

, and the

laminar heat ux vector q

j

are given by:

J mj = m V (m) j (2.5) V (m) j = D Y m @(Y m ) @x j (2.6) ij = @u i @x j + @u j @x i 1 2 Æ ij @u k @x k q j = @T @x j NS X m=1 h m Y m V j (2.7)

whereisthelaminarviscosity,isthethermalconductivity,DisthediusioncoeÆcient.

and Y

m =

m

(18)

The total enthalpy and energy are dened as [8]: h t = h mix + u i u i 2 e t = h t p (2.8) whereh mix

isthe mixturestatic enthalpy

The general eects of gravitational forces and radiative heat transfer are neglected

in this formulation, and thermal equilibrium at a common temperature is assumed. The

thermodynamicstate ofthegasmixtureisspeciedbythe followingauxiliaryrelations[9]:

= NS X m=1 m (2.9) Y m = m (2.10)

p = R

mix T (2.11) R mix = R NS X m=1 Y m M m (2.12) h mix = NS X m=1 h m Y m (2.13) h m = h o fm + Z T T ref C p m

(T)dT (2.14)

a 2 = C p mix C pmix R mix p (2.15) D = 1 Sc (2.16) C p mix = NS X m=1 Y m C p m

(T) (2.17)

Mixture values for the specic gas constant R

mix

are dened in equation 2.12, where

R is the universal gas constant, M

m

is the molecular weight of species m, The specic

heat at constant pressure for species m is C

pm

. Equation 2.11 denes the state equation

for a mixture of ideal gases. The species enthalpy h o

m

is equal to the species enthalpy of

formationh o

plustheintegralofC

pm

fromT

ref

(19)

the mixtureenthalpy. Curve-t polynomialsfromMcBrideetal[10]are usedforobtaining

C

p

m

as a function of temperature. The molecular viscosity and thermal conductivity

are determined fromSutherland's viscosity lawwhile Wilke's law[11] isused todetermine

mixture values. The molecular diusion coeÆcient D is modeled using Fick's law. By

assuminga constantSchmidt number of0:5for allspecies, one diusioncoeÆcientcan be

dened as inequation 2.16.

The four governing equations represented (equations 2.1-2.4) in vector form are the

ConservationofMass,ConservationofMomentum(Newton's2ndLaw),andthe

Conserva-tion of Energy (1st Law of Thermodynamics)laws appliedtoan arbitrarycontrolvolume

(CV). They are referred to as the Continuity, Momentum, and Energy equations

respec-tively. Therst terminthe continuityequationisthe rateofincrease of themass perunit

volume in the CV, and the second term represents the rate of mass ux per unit volume

passing through thecontrolsurface (CS).The rst terminthe momentumequationis the

rate ofofincrease ofmomentumperunitvolumeintheCVand thesecondtermrepresents

the rateofmomentumlossperunitvolumethroughtheCSduetoconvectionandstresson

the uidinternaltothe CV.Therst termintheenergy equationistherate ofincrease of

total energy per unit volume inthe CV and the second termrepresents the rate of energy

lost toconvection ,conduction,and dissipation through the CS. The equationsformulated

above can bewritten inCartesian vector form as

@U

@t +

@(E)

@x +

@(F )

@y +

@(G)

@z

=S (2.18)

If equation 2.1 is replaced by the auxiliary equation 2.9, then solution of the species

(20)

written as

E=E E

v = 0 B B B B B B B B B @ 1 u . . . NS u u 2 +p vu wu h t u 1 C C C C C C C C C A 0 B B B B B B B B B @ 1 U (1) . . . NS U (NS) xx yx zx u xx +v xy +w xz +q x 1 C C C C C C C C C A (2.19) 2.2 Averaging

Turbulentreacting owelds atlarge Reynolds numbers are inherentlythree dimensional

and cover alarge spectrumof turbulent lengthand time scales [12]. The smallestof these

scales are theKolmogorov scales. One ofthe premisesofKolmogorov'sequilibriumtheory

is that the smaller eddies are in a state at which the rate of their receiving energy from

the larger eddies is nearly equal to the rate at which the smallest eddies dissipate energy

to heat. The smallest turbulent scales can be shown to be acontinuum phenomenon. For

example, the ratio of the Kolmogorov length scale tothe mean free path of airmoving65

MPH over a driver's windowat 68 o

Fis approximately 72.

l

mfp

1:810 4

inch

2:510 6

inch

72 (2.20)

Therefore the Kolmogorov length scale is much bigger than the mean free path of air,

which inturn, is typically10 times the moleculardiameter.

To resolve all the turbulent length scales of a high Reynolds number ow using the

discretized Navier-Stokes equationswould requireextremely small spatialgrid spacing. In

addition,large integration timesare needed toresolve the average oweld over the time

scales. There have been several approaches used for solving the turbulent Navier-Stokes

equations. One method, known as Direct Numerical Simulation (DNS), is to solve the

governing equations directly. DNS attempts to resolve all important scales down to the

(21)

Anothermethod isLargeEddy Simulation(LES),whichattempts toresolveonly the

largereddiesintheoweldwhilemodelingtheeectsofunresolved(subgrid)eddies. LES

is alsoquite expensive for high Reynolds number ows.

Another method is to model the eects of all turbulent uctuations on the mean

ow properties. In this the governing equations are averaged over a time T that is much

larger than all the turbulent uctuating time scales T

1

. Forunsteady ows, such as those

investigated inthis work, the averaged time T must be smaller than the time scale of the

slowmean property variationsT

2

. Therefore, the turbulent uctuationsare averagedover

time T, while maintaining the time dependent mean property variations in the ow eld.

This implies that a large separation of time scales between the mean property variations

and the turbulent uctuationsmust existin the ow eld forthis approach to be valid.

2.2.1 Reynolds Averaging

The following sectionspresent Reynolds and Favre-averagingtechniques as appliedto the

Navier-Stokes equations. The averaging concepts introduced by Reynolds [13] generally

assume a wide variety of forms. The type most often used in CFD is time averaging.

The following is a derivation of the Reynolds Averaged Navier-Stokes (RANS)equations.

Although the RANSequations are derived fromtime averaging,the formof the equations

generalizes totime,spatial,and ensembleaveraging. The timeaveragedowvariablescan

beexpressed as :

f(x

i

)= lim

T!1 1

T Z

t+T

t f

i (x

i

;t)dt (2.21)

where f is the instantaneous ow variable. This variable can be decomposed into mean

and uctuatingcomponents,

f =f +f 0

(2.22)

(22)

ofturbulentuctuationsbutsmallerthanthesmallestperiodofslowpropertyvariationsin

the meanoweld. Thistypeofdecompositionresultsinthetimeaverageofauctuating

componentbeing equaltozero. Italsoresultsinthe time averageof theproducts ofmean

componentsto be equaltothe productsthemselves.

= ; (2.23)

0

= 0

=0 (2.24)

Equations 2.23-2.24 are known as the Reynolds postulates. They do not imply however,

thatthe timeaverageoftheproductsofuctuatingcomponentsareequaltozero[14]. The

average of the productof two uctuatingproperties is:

= +

0

+ 0

= +

0

+ 0

+ 0 0

= +

0 0

(2.25)

2.2.2 Favre Averaging

Density and temperature uctuations must be taken into account for compressible ow

elds. A density weighting procedure (Favre Averaging)isused tosimplifythe derivation

of the time averagedequations [13]. The Favre(mass) averaged componentis dened as

~

f(x

i )=

1

lim

t!1 1

t Z

t+t

t

f(x

i

;t)dt= f

(2.26)

The instantaneous function f can now be decomposed intoFavre mean and Favre

uctu-atingcomponents as:

(23)

Favre averaging is employed by decomposing p, , q

i

in the normal Reynolds averaging

fashion and decomposing the remainingvariables using the Favre decomposition.

= +

0

u

j =u~

j +u 00 j u (m) j =u~

(m) j +u (m) 00 j

p = p+p 0 ij = ij + 0 ij T = ~

T +T 00 (2.28) q j = q j +q 0 j Y m = ~ Y m +Y 00 m _ w m =w_

m +w_

0

m

e

t

= ~e

t +e 0 0 t h t = ~ h t +h 0 0 t

The equations are then mass-averaged using the denition of the Favre averaged

compo-nent inequation2.26. Usingthis typeof decomposition,the compressibleFavre-Averaged

Navier-StokesEquations (RANS) extended for species mass transportcan be written as

@ @t () + @ @x j (~u j

)=0 (2.29)

@ @t ~ Y m + @ @x j ~ Y m ~ u j = @ @x j h m ~ V (m) j +Y mj i

+w_

m (2.30) @ @t (~u i ) + @ @x j (~u i ~ u j )= @ @x j (pÆ ij )+ @ @x j [ ij +T ij ] (2.31) @ @t (~e t ) + @ @x j ~ h t ~ u j = @ @x j q j +u~

i ji +Q j (2.32) Y mj = Y 0 0 m u 00 j (2.33) T ij = u 00 i u 00 j (2.34) Q j = u 00 j h 00 t ~ u i T ji =q t j u i T ji (2.35) ~e t = NS X m=1 ~ Y m ~ h m ~ T p+ 1 2 NS X j=1 ~ u j ~ u j (2.36) ~

(24)

The averagedlaminar stress tensor,heat ux, and diusion velocities are given by:

ij

=

@u~

i @x j + ~ @u j @x i ! 1 2 Æ ij

@u~

k @x k q j = @ ~ T @x j NS X m=1 ~ h m ~ Y m ~ V (m) j ~ V (m) j = 1 ~ Y m D @ ~ Y m @x j (2.38)

2.3 Chemically Reacting Equation Set

TheFavreaveragedequationsetdescribedaboveistransformedintocurvilinearcoordinates

by the steady transformation[12]:

= (x;y;z)

= (x;y;z) (2.39)

= (x;y;z)

to yieldthe following equationset:

@U @t + @ ^ E ^ E v @ + @ ^ F ^ F v @ + @ ^ G ^ G v @

=S (2.40)

The conservative variable vector U and the source term vector S are:

U = 1 J 0 B B B B B B B B B @ 1 . . . NS ~u ~v w~ ~e 1 C C C C C C C C C A

; S =

(25)

The inviscidux vectors are given by: ^ E = 1 J 0 B B B B B B B B B @ U . . . NS U

~uU +

x p

~vU +

y p

wU~ +

z p ~ h t U 1 C C C C C C C C C A ; ^ F = 1 J 0 B B B B B B B B B @ V . . . NS V

~uV+

x p

~vV+

y p

wV~ +

z p ~ h t V 1 C C C C C C C C C A ; ^ G= 1 J 0 B B B B B B B B B @ W . . . NS W

~uW+

x p

~vW+

y p

wW~ +

z p ~ h t W 1 C C C C C C C C C A (2.42)

The viscous uxvectors are given by:

^ E v = 1 J 0 B B B B B B B B B B B B B @ 1 U (1) . . . NS U (NS) x ( xx +T xx )+ y ( xy +T xy )+ z ( xz +T xz ) x ( yx +T yx )+ y ( yy +T yy )+ z ( yz +T yz ) x ( zx +T zx )+ y ( zy +T zy )+ z ( xz +T zz ) x (~u xx +~v

xy +w~

xz +q x +Q x ) + y ~ u yx +v~

yy +w~

yz +q y +Q y + z (~u zx +v~

zy +w~

zz +q z +Q z ) 1 C C C C C C C C C C C C C A (2.43) ^ F v = 1 J 0 B B B B B B B B B B B B B @ 1 V (1) . . . NS V (NS) x ( xx +T xx )+ y ( xy +T xy )+ z ( xz +T xz ) x ( yx +T yx )+ y ( yy +T yy )+ z ( yz +T yz ) x ( zx +T zx )+ y ( zy +T zy )+ z ( xz +T zz ) x (~u xx +v~

xy +w~

xz +q x +Q x ) + y ~ u yx +~v

yy +w~

yz +q y +Q y + z (~u zx +~v

zy +w~

zz +q z +Q z ) 1 C C C C C C C C C C C C C A (2.44) ^ Q v = 1 J 0 B B B B B B B B B B B B B @ 1 W (1) . . . NS W ( NS) x ( xx +T xx )+ y ( xy +T xy )+ z ( xz +T xz ) x ( yx +T yx )+ y ( yy +T yy )+ z ( yz +T yz ) x ( zx +T zx )+ y ( zy +T zy )+ z ( xz +T zz ) x (~u xx +v~

xy +w~

xz +q x +Q x ) + y ~ u yx +~v

yy +w~

yz +q y +Q y + z (~u zx +v~

zy +w~

zz

+q +Q

(26)

, where w_

m

is the averaged species production rate for species m, U;V;W are the

con-travariant velocity components and U m

;V m

;W m

are the contravariant diusion velocities

forspecies m dened by:

U x ~ u+ y ~ v+ z ~ w V x ~ u+ y ~ v+ z ~ w (2.46) W x ~ u+ y ~ v+ z ~ w U m x ~ V m 1 + y ~ V m 2 + z ~ V m 3 V m x ~ V m 1 + y ~ V m 2 + z ~ V m 3 (2.47) W m x ~ V m 1 + y ~ V m 2 + z ~ V m 3

and J isthe Jacobianof the coordinate transformation.

2.3.1 Chemical Kinetics

The chemical mass productionrate perunit volume w_

m

for species m is given by the law

ofmass action:

_ w m = NR X n=1 h v 00 mn v 0 mn i (R R fn R R bn ) = NR X n=1 h v 00 mn v 0 mn i k f n NS Y j=1 (~c j ) v 0 j k b n NS Y j=1 ( ~c

j ) v 0 0 j !! (2.48) and v 0 j

isthe stoichiometriccoeÆcient of the reactants,v 00

j

is the stoichiometriccoeÆcient

of the products, NR is the total number of reactions involved, NS isthe total number of

speciesinvolved, R R

fn

andR R

bn

are theforward andbackward reactionratesrespectively

forreactionn, where ~c

j = j M j

isthe molardensity for species j. The specic reactionrate

k is writtenbelowinArrhenius form:

(27)

where A is the pre-exponential constant, and T

a

is the activation temperature [15]. The

variables A;b; andT

a

are found in Tables 1 and 2 for the two reaction mechanisms

con-sidered in this study. The reaction mechanisms are composed of two types of reactions,

exchangeand thirdbody. Theexchangereactiondescribestheinterdependentforwardand

backward reactions between products and reactants. Forthe followingexchange reaction:

A+B *

)C+D (2.50)

The forward and backward reactionrates are given by the lawof mass actionas:

R R

f

= k

f (T)c~

v 0 A A ~ c v 0 B B R R b = k b (T)c~

v 0 0 C C ~ c v 0 0 D D (2.51)

Third body reactions occur when two species react in the presence of a collision partner

M. The law of mass action can be rewritten for a third body reaction by considering

thatathird bodyreactionisactuallyNS reactionswhere the reactionrate depend onthe

collisionpartnerM.

_ w m = NR X n h v 00 mn v 0 mn i NS X i=1 (R R f ni R R b ni ) ! (2.52)

Forthe followingthird body reaction

AB +M *

)A+B +M (2.53)

the third body reactionrate R

n

for reactionn is

R n = NS X i=1 (R R fni R R bni ) (2.54)

and the forward and backward reactionrate in equation 2.54 are

(28)

If one of the specic reaction rate coeÆcients is chosen as a reference value, denoted by

k 0

f

n

then the third body eÆciency can be written as the ratio of the particular species

coeÆcientA

f

ni

tothe reference coeÆcient A 0 f n (TB) i = A f ni A 0 f n (2.57)

Usingthis relation,the forward and backward specic reactionrates can bewritten as

k f ni = k 0 fn (TB) i k b ni = k 0 bn (TB) i (2.58)

Using equation 2.58 in equation 2.55 and 2.56, then the third body reaction rate can be

writteninterms of the third body eÆciencies as

R n = k 0 fn ~ c v 0 AB AB k 0 bn ~ c v 0 A A ~ c v 0 B B NS X i=1 (TB) i ~ c v 0 M i M i (2.59)

The 7 species 7 hydrogen-oxidation reaction mechanism by Eklund [16] is used for

the chemistry modeling in the 3-D scramjet combustor simulations. This involves six

reactingspecies, H

2 , O

2 , H

2

O,OH,H,O,plus inertN

2

,thatinteractthrough aseven-step

reactionmechanism. This reactionmechanism waschosen forthe 3-Dscramjetsimulation

to maintain a minimal problem size. Also as indicated by the 2-D results, the ame is

stabilizedatthe fuel injector exit and therefore the inclusionof ignitionchemistry should

provide littlevariationin the amedownstream of the nozzle exit. This should have little

eect onthe thermal choking of the combustor.

The 9species 21reactionmechanismproposed by Balakrishnan, etal. [17] isused to

model the hydrogen oxidation for the 2-D scramjet combustor simulations. This involves

the same reaction species as in the reaction mechanism by Eklund plus H

2 O 2 and HO 2 .

These radicals have been shown to important during ignition of the hydrogen-oxygen

(29)

Table 1: AbridgedJachimowski ChemistryModel

# Reaction A b T

a

1: H + OH+ M *

)H

2

O +M .221E + 23 -2.0 0.0

2: H +H + M *

)H

2

+ M .730E + 18 -1.0 0.0

3: H

2 +O

2 *

)

OH +OH .170E + 14 0.0 24157.0

4: H +O

2 *

)OH +O .120E + 18 -0.91 8310.5

5: OH+ H

2 *

)H

2

O+ H .220E + 14 0.0 2591.8

6: O+ H

2 *

)OH + H .506E + 05 2.67 3165.6

7: OH +OH *

) H

2

O+ O .630E + 13 0.0 548.6

TBEs: H

2

O 16.0, H

2

2.5, and 1.0 forall other species

A, b,and T

a

are given inunits of moles ,cm,s,K, and cal/mole,

k=AT b

e (

Ta

T )

Modelingproceduresthat accountfortheeects ofturbulentuctuationsonchemical

reactionratesarediÆculttodevelopduetothelargenumberofunclosedcorrelationswhich

arise. Two alternativemethodsdiscussed by Roy[12],the probability density formulation

(PDF)andEddyDissipationConcept(EDC),wereshowntohavedecienciesincalculating

supersonic combustion ames. The more attractive (PDF) method also had negligible

eects for calculating attached supersonic ames. Therefore, the above formulation for

the species production rates is evaluated using mean quantities. The eects of turbulent

uctuationson rate coeÆcientsand onmolar density are neglected, thus

k

f

(T) ! k

f T

(2.60)

c

j (

j

) ! c

j (

j

) (2.61)

(30)

high-Table 2: Balakrishnan et al. ChemistryModel

# Reaction A b E

1 a

H + OH +M *

)H

2

O+ M 2.2e22 -2.0 0.0

2 b

H +H +M *

)H

2

+ M 1.e18 -1.0 0.0

3 a

H + O+ M *

)

OH +M 6.2e16 -0.6 0.0

4 a

O +O +M *

) O

2

+ M 6.17e15 -0.5 0.0

5 a

H + O

2 + M

*

)HO

2

+ M 6.76e19 -1.42 0.0

6 c H 2 O 2 +M *

) OH+ OH +M 1.2e17 0.0 45500.0

7 O+ OH +M *

) HO

2

+ M 1.0e16 0.0 0.0

8 H +O

2 *

)OH + O 3.52e16 -0.7 17070

9 OH +H

2 *

) H

2

O+ H 1.17e9 1.3 3626

10 O +H

2 *

)

OH+ H 5.06e4 2.67 6290

11 OH + OH

*

) H

2

O+ O .70e14 0.0 56767

12 HO 2 +H * ) H 2 +O 2

4.28e13 0.0 1411.0

13 HO

2 + H

*

)OH + OH 1.70e14 0.0 874.0

14 HO 2 + O * ) O 2

+ OH 2.00e13 0.0 0.0

15 HO 2 + OH * ) H 2

O+ O

2

2.89e13 0.0 -497.0

16 HO 2 + HO 2 * ) H 2 O 2 +O 2

3.02e12 0.0 1390.0

17 H +H

2 O 2 * ) H 2 +HO 2

4.79e13 0.0 7950.0

18 H +H

2 O

2 *

)OH +H

2

O 1.00e13 0.0 3590.0

19 H 2 O 2 + H * )H 2

O+ OH 7.08e12 0.0 1430.0

20 H +HO

2 *

)O +H

2

O 3.10e13 0.0 1720.0

21 H

2 +O

2 *

)OH + OH 1.70e13 0.0 44780.0

a

TBEs: H

2

O12.0, H

2 2.5

b

TBEs: H

2

O 6.5, O

2 0.5, N 2 0.4 c TBEs: H 2

O 15.0, H

2 2.5

A, b, and E are given in unitsof moles ,cm,s,K, and cal/mole,

b

( E

(31)

3 Numerical Formulation

3.1 Turbulent Closure

The process of averaging the Navier-Stokes equations results inextra terms that must be

modeled. These termsarise because of the nonlinearityof the equations. By investigation

of equations 2.33 -2.35, three unknown terms appear due to the averaging. These include

the Reynolds shear stress T

ij

, Reynolds heat ux q

t

j

, and the turbulent species diusion

Y

mj

. Inordertoobtainturbulentclosure,modelsforeachofthe termsmust bedeveloped.

Mostturbulencemodelsinusetodayemploythe Boussinesqeddy-viscosity approximation

fortheReynoldsstresstensor,whichassumesthattheReynoldsstressbehavesinamanner

similarto the laminarstress tensor [18]:

T

ij =

T

@u~

i

@x

j +

@u~

j @x i 2 3 Æ ij @u~

k

@x

k

(3.3.1)

The Reynolds heat ux is modeled using a gradient-diusion approximation, yielding an

expression similar toFourier's law:

q j t = t @ ~ T @x j NS X m=1 ~ h m ~ Y m ~ V (m) tj (3.3.2) t = T C p Pr t (3.3.3) ~ V (m) t j = 1 ~ Y m D t @ ~ Y m @x j (3.3.4) D t = t Sc t (3.3.5)

The turbulent species diusion is modeled by a gradientdiusion modelas

Y mj =Y 00 m u 00 j = T Sc t @ ~ Y m @x j (3.3.6)

The closure problem is reduced to specication of the eddy viscosity. If we assume that

the eddy viscosity can be computed in a similar fashion as the molecular viscosity, then

the eddy viscosity is alsodependent ona velocity scale and lengthscale.

(32)

whereu

mix andl

mix

denotevelocityandlengthscales,respectively,thatareassociatedwith

the mixing of turbulent eddies. These scales must be provided by the turbulence model.

Theturbulencemodelusedinthisworkisthezonaltwoequationk !modeldevelopedby

Menter[19]. ThismodelisacombinationoftheWilcoxk !andthestandardk models.

The k ! model is used in the laminar sublayer of the boundary layer and logarithmic

regionofthe boundarylayer. Thismodelhas beenshown tobesuperior tothek model

inadverse pressure gradientows and incompressibleows. The zonalmodelapproaches

a k model in the outer parts of the boundary layer and in free -shear regions. This

reduces the sensitivity of the calculation to free-stream turbulence quantities and allows

reasonablepredictions of free-shearows. The transitionfromk ! tok is facilitated

byablendingfunction F

1

,whichiszero inthe wake regionandfree shearlayerand onein

the wallregion. This modelis writtenbelowas:

t = a 1 k max(a 1 !;F 2 ) (3.3.8) @ @t (k)+ @ @x j (ku j ) = ij @u i @x j !k+ @ @x j (+ k t ) @k @x j (3.3.9) @ @t (!)+ @ @x j (!u j ) = t ij @u i @x j ! 2 + @ @x j (+ ! t ) @! @x j

+ 2(1 F

1 ) 2 1 ! @k @x j @! @x j (3.3.10) Let 1

represent anyconstantinthe originalk ! and

2

anyconstantinthe transformed

k model. Then is dened by the blendingfunction:

=F

1

+(1 F

1 ) 2 (3.3.11) The 1

constants of the (Wilcox) k ! modelare :

k1 = 0:85; !1 =0:5; 1

= :0750; a

1

=0:31 (3.3.12)

= 0:09; =0:41;

(33)

The constants

2

of the standard (k )are:

k2

= 1:0;

!2

=0;856;

2

=0:0828; a

1

=0:31 (3.3.14)

= 0:09; =0:41;

2 = 2 !2 2 p (3.3.15)

The blendingfunctions are dened by Menter as:

F

1

= tanh arg 4

1

; where (3.3.16)

arg 1 = min " max p k 0:09!y ; 500 y 2 ! ! ; 4 !2 k CD k! y 2 #

; and (3.3.17)

F

2

= tanh arg 2

2

; where (3.3.18)

arg

2

= max 2 p k 0:09!y ; 500 y 2 ! ! (3.3.19) where, CD kw = max 2 ! 2 1 ! @k @x j @! @x j (3.3.20)

Equation3.3.8satisesBradshaw's assumption[19]thatthe shearstressisproportionalto

theturbulentkineticenergyintheboundarylayer,whilepreservingtheoriginalformulation

that

t

=k=! for the rest of the ow.

3.2 Upwinding

The invisciduxes inthe Navier-Stokesequations are discretized using the LowDiusion

Flux Splitting Scheme (LDFSS) of Edwards [20]. This scheme attempts to combine the

accuracy of ux dierence splitting methods with the numerical eÆciency of ux vector

splitting methods. The LDFSS algorithm provides accurate resolution of stationary and

moving contact discontinuities while preserving solution monotonicity and entropy

satis-faction. The LDFSS algorithmrequires O(N) operationsfor N equations compared with

O(N 2

) operations of ux dierence schemes. [21] For clarity, the algorithm is applied

to the three dimensional instantaneous equations in the direction. Similar results are

(34)

advection and pressure components, as ^ E = ^ E c + ^ E p = jrj J UE c +pE p

; where U = 1 jrj ( x u+ y v+ z w) (3.3.21)

is the normalized contravariantvelocity in the direction,

E c = 2 6 6 6 6 6 6 6 6 6 6 6 6 6 6 6 4 Y 1 Y 2 . . . Y NS u v w h t k ! 3 7 7 7 7 7 7 7 7 7 7 7 7 7 7 7 5 ; ~ E p = 2 6 6 6 6 6 6 6 6 6 6 6 6 6 6 6 4 0 0 . . . 0 x y z 0 0 0 3 7 7 7 7 7 7 7 7 7 7 7 7 7 7 7 5 (3.3.22) x = x jrj ; y = y jrj z = z jrj (3.3.23)

The upwinding is performed at the cell interfaces, denoted by 1

2

in Figure 1, using the

propertiesatthetheleft(L)andtheright(R)states. Thediscretizationstoresinformation

atthenodecellvertex,withthemeshcellsdenotedbydashedlines. Themetricderivatives

areevaluatedatthecellinterfaces. TheLDFSSschemeappliedtothecellinterfacesresults

intheinvisciduxbeingsplitintoconvectiveandpressurecomponents,withtheconvective

uxwritten as

E c 1+ 1 2 = jrj J L a1 2 C + ~ E c L + R a1 2 C ~ E c R (3.3.24)

and the pressure ux dened by

^ E p 1 2 = jrj J ~ E p 1 2 D + L p L +D R p R (3.3.25)

where a1

2

is the interface sound speed, written as

(35)

and C

and D

are functions of the Mach number, dened as

D + = + L (1+ L ) L P + L (3.3.27) D = R (1+ R ) R P R (3.3.28) C + = + L (1+ L )M L L M + L L R ~ M + (3.3.29) C = + R (1+ R )M R R M R + L R ~ M (3.3.30) where + L = 1 2

[1+sign(1:0; M

L )] (3.3.31) + R = 1 2

[1 sign(1:0; M

R

)] (3.3.32)

L

= max[0:0; 1:0 int(jM

L

j)] (3.3.33)

R

= max[0:0; 1:0 int(jM

R j)] (3.3.34) M + L = 1 4 (M L +1) 2 (3.3.35) M R = 1 4 (M R 1) 2 (3.3.36) P + L = 1 4 (M L +1) 2 (2 M L ) (3.3.37) P R = 1 4 (M R 1) 2 (2 M R ) (3.3.38) ~ M + = 1 p L p R p L +p R +2 jp L p R j p L M 1 2 (3.3.39) ~ M = 1+ p L p R p L +p R 2 jp L p R j p L M 1 2 (3.3.40) M1 2 = 1 4 r 1 2 (M 2 L +M 2 L ) 1:0 ! 2 (3.3.41)

and the left and right state Mach numbers are dened according to the interface sound

(36)

In the denitions of ~ M + and ~

M (Equations 3.3.39-3.3.40), the rst term involving the

pressuresprovidesmonotonecapturing ofdiscontinuities[12], whilethe secondterm

coun-teractsthe \carbuncle"-typeodd-even decoupling discussed in Reference [22]

3.3 Time Advancement

The3-D code uses athree-pointbackward Euler timeadvancementprocedurethat results

ina large sparselinear system, formulatedfor aparticular grid pointas:

" 3 2 M t + ~ A @ ~ S @ ~ W # n ijk ~ W ijk + ~ B n ijk ~ W i;j 1;k + ~ C n ijk ~ W i;j+1;k + ~ D n ijk ~ W i;j;k 1 + ~ E n ijk ~ W i;j;k+1 + ~ F n ijk ~ W i 1;j;k + ~ G n ijk ~ W i+1;j;k = ~ R n ijk + 1 2 ~ U n ijk ~ U n 1 ijk t (3.3.43) where ~ A ~

GarefunctionoftheinviscidandviscousuxJacobians,asoutlinedinReference

[23]. M is the transformation matrix @ ~ U @ ~ W , @ ~ S @ ~ W

is the Jacobian of the source term vector,

and t isthe physical time step.

The basicimplicit operatoris formed by orderingequation 3.3.43overthe gridnodes

and incorporatingimplicitrepresentations for the boundaryconditions [12]. The resulting

system Jacobian matrix is denoted as ^

A. This system is approximately solved using a

symmetric planar Gauss-Seidel (SPGS) algorithm. Denoting h 3 2 M t + ~ A @ ~ S @ ~ W i as ~ H, a

planarGauss-Seidelpartitioning[12,20,24,25]isdenedbyrstgroupingmatrixelements

correspondingto ani= constantplane as D

D= h ~ H+ ~ B + ~ C+ ~ D+ ~ E i (3.3.44)

and groupingo-planeelementsinto L and U respectively, where

L = ~

F (3.3.45)

(37)

Inthis formulationallmatrixelementscomposingofD,L,andU arestoredoverthe entire

mesh. The partitioning is then dened by

(D+L) n D n 1

(D+U) n ~ W n+1 = ~ R n + 1 2 ~ U n ~ U n 1 t (3.3.47)

This Gauss-Seidel partitioning is an approximation to the matrix ^

A, with factorization

error LD 1

UÆW. The factorization is nearly exact for streamwise supersonic ows, but

thefactorization errorincreases asthethe numberof nonzeroelementsinUincreases [25].

This meansthat asthe amount of streamwise subsonic ow increases, the performance of

the algorithm will degrade. The SPGS algorithm is implemented in two steps,a forward

and backward step in the direction:

forward sweep: ~ W n+ 1 2 i =(D n i ) 1 h ~ R n i L n i 1 ~ W n+ 1 2 i 1 i (3.3.48) backward sweep: ~ W n+1 i = ~ W n+ 1 2 i (D n i ) 1 U n i+1 ~ W n+1 i+1 (3.3.49)

The approximate inversion of the block pentadiagonal matrix D

i

at each i index is

ac-complished using an Alternating Direction-Implicit algorithm developed in earlier works

(38)

4 Multiblock Parallel Implementation

The calculation of ows within complex domains using structured grids usually requires

some type of domain-decomposition strategy. Domain decomposition involves the

parti-tioningof acomplexdomainintosimplersubdomains,whichmaybesolvedinsequence or

inparallel. The following subsections discuss the domaindecomposition strategy used for

time-dependent, 3-D scramjet calculations. The parallel implementation of the solution

algorithmis alsodiscussed. A combineddecomposition isused overthe three dimensional

domain. The combined decomposition contains a primary decomposition in the and

directions and a secondary decomposition in the direction. The Multiplicative Schwarz

method[26] is used to solve the primary decompositions. MultiplicativeSchwarz methods

solve each subdomain problem in sequence by using the most recent articial boundary

data from the adjacent blocks. The Additive Schwarz method is used to solve the

sec-ondary decompositions. Additive Schwarz methods solve each subdomain problem using

the lastarticial boundary data from the adjacent blocks. Parallelismcan be achieved in

the secondary direction because of the use of the Additive Schwarz algorithm, where no

dependence onupdateddatafromadjacentdomainsexists. Therefore, theprimary

decom-position is calculated in a serial fashion, while the secondary decomposition is calculated

inparallel. The 2-Dcalculations use onlythe primarydecomposition andwere run onthe

Cray T-90,avectorarchitecture. The3-Dcalculationswere runinparallelonthe IBM-SP

attheNorth CarolinaSuperComputingCenter(NCSC)andonaCompaqES-40atNorth

CarolinaState University (NCSU).

4.1 Algorithm

The development of this algorithmwas governed by the ow modelbeing used. The time

advancementschemediscussed earliermaybewrittenforthe entire domainasthelinear

system ~

A n

W n+1

= R

n

, where ^

A is Jacobian matrix of the nonlinear system, ~

(39)

thecorrection vector,and R isthe residual vector. The system can bedecomposed overa

set of subdomainsas follows.

^ A = 0 B B @ ^ A 1 ^ A n1 ::: ::: ::: ::: ^ A n mm ^ A mm 1 C C A (4.4.1) where ^ A k

denotes elements of the Jacobian corresponding strictly to subdomain k and

^

A

n

k

denotes elements that connect the solution within block k to that within other

blocks. Byremovingthe explicitdependence onthe ^

A

n

k

elements,asuccessive iteration

procedure for solving the system can be used, with the boundary data updated through

thecalculationof theresidualvectorR on

k

. Consideringatwoblock decompositionthe

MultiplicativeSchwarz algorithmcan be written asa two step process.

W n+ 1 2 W n + ^ A 1 1 0 0 0 R n (4.4.2) W n+1 W n+ 1 2 + 0 0 0 ^ A 1 2 R n+ 1 2 (4.4.3)

The n + 1

2

notation denotes the updating of information from adjacent to subdomain

interfaces. This updated information is incorporated into the calculation of the residual

operator as soon as it becomes available. The matrix inverse indicated in 4.4.2 and 4.4.3

actually represents the application of the planar Gauss-Seidel procedure 3.3.47 on each

subdomain. If an Additive Schwarz type decomposition is also applied in the block then

theresultisafourblockdecompositionwithahybridMultiplicativeandAdditiveSchwarz

scheme. Equation 4.4.2-4.4.3 becomes:

W n+ 1 2 W n + 0 B B @ 0 B B @ ^ A 1 1;1

0 0 0

0 0 0 0

1 C C A + 0 B B @

0 0 0 0

0 ^ A 1 1;2 0 0

0 0 0 0

1 C C A 1 C C A R n (4.4.4) W n+1 W n+ 1 2 + 0 B B @ 0 B B @

0 0 0 0

0 0 ^ A 1 2;1 0

0 0 0 0

1 C C A + 0 B B @

0 0 0 0

(40)

This is a two step Multiplicative Schwarz method with Additive Schwarz sub-steps. This

requires the update of the residual vector after each Multiplicative step for the primary

decomposition while only updating the residual vector for the secondary decomposition

afteralltheMultiplicativeSchwarzstepsarecomplete. ThisallowseachMultiplicativestep

tobeen solved in parallel. Fornominally two dimensional problems with a uniform mesh

inthe thirddimension, itisconvenient toapply AdditiveSchwarz inthe uniformdirection

and to apply Multiplicative Schwarz decomposition in the non-uniform directions. The

abovemethod cangeneralized forthe actualScramjet combustorgeometrydecomposition

as shown in Figure 2. The primary decomposition is in the and directions while the

secondarydecompositionisinthedirection. Figure3illustratesthesolutionprocedurefor

the combined Multiplicativeand Additive Schwarz methods for two processors. Consider

the,, decompositionofblocks1through 5inFigure2,withreferencetotheowchart

inFigure3. Eachprocessorreceivesandecompositionforblocks1-5,wheretheseblocks

aredecomposedintheand directions. Theneachinnerlooponthe owchartbeginson

block 1 and sequences through to block 5i.e. (m=1...mb). First,each processor allocates

theprimitivevector W

m

overthesubdomainforthe currentblock, includingthe articial

boundaries. Then the residual vector R

m

is formed for the subdomain. The system of

linear equations ^

A

m W

n+1

m

= R

m

is then approximately solved for the update vector

W n+1

m

. After the interior points are updated, a serial transfer of the articial boundary

informationto adjacent blocks in the and directions satisfying [W

m \W

m ]

; takes

place. After all the blocks are sequenced through, a parallel data transfer updates the

solutionvariableson the articialboundaries inthe direction, satisfying

W

m \W

m

takesplace. Therefore,theparallelalgorithmisindependentofthetimeintegrationmethod

usedtosolvethesystem. Theequationsetin4.4.2-4.4.3isdependentontheproperupdate

(41)

vector R

The parallel transfers were carried out by using the Message Passing Interface[27 ]

(MPI-1)standard. A sampleprogramof the MPI block transferis given in theAppendix.

A buer is used to store the data being transfered for the secondary decomposition. This

buer is sent using blocking, combined send-receive commands in the direction in the

appendix. Two waves of send-receive are performed to transfer data for the two articial

facesateachinterface. Thistypeofmessagepassingonlycompleteswhenalltheprocessors

involved in the transfer have posted their individual send-receive calls. This ensures that

data is transferred at the right time level of the integration and also prevents system

buering that could lead to slow performance as the system could be overloaded with

buered data. This type of message passing also prevents deadlock as waves of data are

transferedfor eachface.

Thearticialboundaryinformationoneach planeismappedintoanonedimensional

array before the parallel transfer to adjacent blocks. This allows an explicit packing and

unpacking of the buer, instead of using implicit derived data types generated by MPI.

Derived data types may be dependent onthe system being used, as dierent systems use

dierent memory layouts and sizes. The explicit buering is system-independent, but

requires ((

max

+6)(

max

+6)6) operations to pack and unpack the buer. The time to

perform these operations is negligible compared with the time required to perform one

iterationof the ow solver.

4.2 Parallel Performance

(42)

S(1;p) = t

1

t

p

(4.4.6)

E(1;p) = t

1

pt

p

(4.4.7)

(4.4.8)

The speedup S in equation 4.4.6 will not exceed the number of processors (p) used in a

particularcalculation, asindicatedby the SpeedupFolklore Theorem[28]. Ifthe speedup

S is greater than the number of processes, then a faster serial algorithm can be made

be simulating the parallel one, provided that it can be simulated. In the case where

cache performance gets better with smaller subdomains via domain decomposition, the

serial program can not simulate the parallel program and therefore super-linear speedup

is observed. The eÆciency E in equation 4.4.7 is simply a measure of the time cost

optimization,described belowas:

If E(1;p)<1 then the parallel algorithmisnot time cost optimal

If E(1;p)=1 then the parallel algorithmistime cost optimal,

provided that the sequential algorithmistime optimal

If E(1;p)>1 then afaster sequential algorithmcan be obtained

by simulatingthe parallel one

(4.4.9)

Tables3through4providethetimedatafortheparallelalgorithmappliedtoaLargeEddy

Simulation(LES) calculationona4-processorCompaqES-40 sharedmemoryarchitecture

and an IBM-SP distributed memory architecture. The LES calculation on 33x65x65 and

65x65x65 grids is that of a plane Poiseuille channel ow. This was used to validate the

parallelinterface boundary transfers and to gain knowledge of parallel performance. The

timesT1, T2, T3, are real,cpu, and system timesinunits of seconds. The 33x65x65node

casesolvedfortwentyiterationsisshowninTable3. Theexpectedtrendthatincreasingthe

(43)

isincreased,asthecompileronlyoptimizestheserialportionofthealgorithm. Thetimeto

perform twenty iterationsusingfour processorstook343 seconds withno optimization. A

71percent decrease intime was accomplishedby using alevelfour compileroptimization,

butthisonlyresultedina25percentdecreaseintheeÆciencyofthealgorithmascompared

tothe eÆciency of the case withoutoptimization. Using the Speedup Folklore theorem of

4.4.9,the most optimalcondition for the current algorithmis with fourprocesses and full

optimization, with a speedup of 3.04. This algorithm did not have the highest speedup

forthe fourprocessor cases, but had the highest speedup forthe most time optimalserial

program.

Mostoftheperformancelossisdirectresultoftheparallelalgorithmrequiringanorder

of O (

max

max (

max

+p 1)) 2

operations. As the domain is broken into up smaller

segments,thentheoveralldomainsizeincreasesbyp 1planesofdata. Withanoverlap

of 6 planes of data between domains, each iterationrequires 6p planes of data transfer

and anadditionalcomputationof p 1 planes of data.

T1 T2 T3 node % n options speedup eÆciency

1250.0 1249.75 .37 99 1 -O0 1.0 1.00

623.0 622.41 1.31 199 2 -O0 2.01 1.01

343.0 341.92 2.25 399 4 -O0 3.64 0.91

750.0 749.61 .77 99 1 -O1 1.00 1.00

400.0 396.74 6.58 199 2 -O1 1.88 0.94

215.0 214.01 4.31 399 4 -O1 3.49 0.87

342.0 342.16 .39 99 1 -O2 1.00 1.00

171.0 171.42 .80 199 2 -O2 2.00 1.00

107.0 106.62 2.64 398 4 -O2 3.20 0.80

309.0 309.01 0.40 99 1 -O3 1.00 1.00

193.0 189.72 6.68 199 2 -O3 1.60 0.80

101.0 100.64 1.29 399 4 -O3 3.06 0.77

295.0 294.75 0.40 99 1 -O4 -fast 1.00 1.00

97.0 96.62 2.78 399 4 -O4 -fast 3.04 0.77

(44)

IBMSPattheNorth CarolinaSupercomputingcenter for 1000iterationruns. These runs

were calculated using level 3 optimizationon IBM SP, which is comparable tothe level 4

optimizationon the Compaq ES40. The speedup and eÆciency results here demonstrate

thatincreases in thenumberof subdomainsand the attendant communication and

execu-tioncosts degradethe programperformance forprocessornumbers largerthan eight. This

can be explained by the fact that the ratio of the number of computational planes to

communication planes is 5

6

, and the problem size has increased by 21 percent. The extra

iterationwork canbe avoidedby usingacell-centered nitevolumediscretization,but the

communication costs donot disappear.

T1 T2 T3 n speedup eÆciency comment

21258.58 21222.52 24.30 1 1.0 1.0 LES(274625 nodes)

13153.38 13129.44 15.43 2 1.616 .808 LES(274625 nodes)

5690.53 5542.18 11.40 4 3.7357 .934 LES(274625 nodes)

3091.07 2989.94 5.15 8 6.8774 .860 LES(274625 nodes)

1898.43 1792.48 3.44 16 11.198 .700 LES(274625 nodes)

(45)

5 Application

5.1 Flow Model

The National Aerospace Lab (NAL) nominally two-dimensional scramjet isolator /

com-bustorconguration issimulatedin this work [29]. This congurationas shown in Figure

4 consists of a wedge shaped fuel strut on the centerline of the combustor. The strut

wedge angle is5 o

and the isolator length is0.140 m. The total lengthof the combustor is

0.540mwith the strut placed0.043 mbehindthe inowplane. Pressure taps were placed

10mm apart on the centerline of the top wall for the rst 0.340 m and 0.020 m apart

for the rest of the combustor. The fuel strut shown in Figure 5 is 94.3 mm wide in the

spanwise direction. The injector throat was designed to maintain supersonic injection of

the fuel stream. The 5 degree wedge angle was determined by Masuya[30 ] to not induce

boundarylayerseparation onthe ductwallduetotheimpingingshockfromthefuelstrut.

Thefuel strutis precededbythe inlet sectionthat receivesvitiated airfromthe upstream

combustor. The inow conditions are shown in Table 5. The scramjet combustor inlet

FlowProperty Experiment Simulation

T

o

2214 2214

h

o

2.79 2.79

p

o

(kPa) 1012 1012

M

1

2.44 2.43

p

1

59.0 59.117

T

1

1258 1258

v

1

1753 1752.14

m

O2

0.333 0.333

1

1.28 1.2845

R

1

321.0 322.8

Y

O

2

0.2615 0.2615

Y

N2

0.5288 0.5288

Y

H

2 O

0.2097 0.2097

(46)

conditions were obtained from from the exhaust gas of a Vitiated Air Generator (VAG).

Thistypeoftestgas containsalargewater massfractionof0.2097asseeninTable 5. The

nominal Mach number and stagnation temperature are 2.45 and 2214 respectively,

lead-ingto astatic temperature that is well above the auto-ignition temperature for hydrogen

combustion. Thethree-dimensionalblockdecompositionisshowninFigure2-6. The

nom-inallytwo-dimensional geometry of the NAL combustor is modeled precisely in the and

directions and is extended in the direction. Block 1 contains 17x41x33 points, block

2 contains 17x9x33 points, block 3 contains 49x41x33 points, block 4 contains 61x33x33

points,and block 5 contains 133x73x33points,for a total of 481,173 points. Due to

com-putermemoryrestrictions, the resolution inthe and directions isa fourthof that used

in the 2-D simulations of Reference [6]. The two-dimensional grid is shown in Figure 6

withevery othergrid pointshown. Mesh clustering to allsolid surfaces isused tocapture

viscous layergradients. Also the grid wasstretched inthe directiontoresolvethe shock

structurearoundthe wedgeleadingedge,the Prandtl-Meyerexpansionnear thecenterline

strut,the recirculation regionbehind the fuelstrut, and the free shearlayer that develops

aftof the fuelstrut.

5.2 Boundary Conditions

Both two and three dimensional turbulent channel ow calculations were performed to

obtain the inow data for the simulations. All ow variables are extrapolated from the

interiorofthe combustorsectiontothe outowplaneof thecombustorusing secondorder

extrapolation. This methodofextrapolationisquestionablewherethermalchokingresults

and the combustor is subsonic, but without a knowledge of the back pressure, no better

method can be found. No-slip and adiabatic wall conditions were applied to all solid

(47)

decomposition discussed in section 4.3 introduces interface boundaries between adjacent

blocks. The ow variables along interface boundaries in the direction are not updated

during the Multiplicative Schwarz solution steps, while the interface boundaries in the

and directions are updated from data contained on adjacent blocks. The interface

boundaries are updated using adjacent interface data during the parallel transfer.

5.3 Fuel Injection

Allcaseswere initializedwithaconvergedsolutionofthe baselinecaseinwhichthe

equiv-alence ratio () is set 0.0. This corresponds to no hydrogen fuel injection. The fuel ow

rate was ramped over time to match the equivalence ratio for each case, then maintained

forthedurationof the simulation. Thefuelowrate through the geometricthroatA

t was

calculatedfrom:

_ m

H2

A

t =

m_

O2

A

t M

H2

M

O

2

=51:388 (5.5.1)

The fuelow rate was ramped using the following procedure:

_ m

H2

A

t

=min

1:0; t

:5t

c

_ m

H2

A

t

(5.5.2)

where t

c

is the ow-through time based on the free-stream velocity. The fuel is ramped

over onehalfof aow-through time(approximately0.00015seconds). Afterthis time, the

fuel ow rate isheld constant. The conditionsat the subsonic inowboundary of the fuel

(48)

6 Results and Discussion

6.1 Two Dimensional Results

Threefuelequivalenceratiocasesof0.0(nofuelinjection),0.29,and0.61weresimulatedin

two dimensions. The =0:0and =0:29 cases are nominally steady, whilethe =0:61

case resulted in a unsteady unstart. All reactive ow simulations were initialized from a

converged =0:0 solution.

The Machnumber, temperature, and pressure contours are shown inFigure8 for the

=0:0case. This case isuseful inasareference pointfor describingchangesdue tomass

injectionandcombustionoffuelforthehigherequivalenceratiocases. Theboundarylayer

thickness isapproximately5.5mmthickandincreasesthroughthe ductduetofrictionand

shock boundary layer interactions to about 12 mm at the end of the combustor section.

A small separation region results from the interaction of the bow shock formed ahead

of the fuel strut with the top wall boundary layer, and a second separation zone results

from the recompression shock that forms behind the expansion fan aft of the fuel strut.

A large recirculation region exists behind the fuel strut. In this symmetric case, upper

and lower supersonic cores result in the combustor section, with a subsonic mixinglayer

between them. Temperature increases can be observed near the adiabatic walls, behind

the recompression shocks, and in the separation zones. A maximum static temperature

of 2250 K is observed in the stagnation region at the bow of the fuel strut. The pressure

contours illustrate the complex shock system in the isolator and combustor regions. The

obliqueshocksystem/expansionfaninteractionscontinue throughoutthe combustorwith

a total of six shock reections in the combustor. The total pressure ratio Pe

P1

along the

centerline is .4635. This ratio is slightly less than the .47911 total pressure ratio that

would result from a normal shock in the duct at the free-stream Mach number of 2.44.

(49)

rise across the recompression shock promoteame stabilization forthe higherequivalence

ratios. Thisowsolutionandits3-Danalogueisusedastheinitialconditionforthehigher

equivalence ratio cases.

Figure 9 displays surface pressure distribution for the = 0:0 and = 0:29 cases

at several characteristic time levels. A steady scramjet mode of combustion is obtained

after 22.66t

c

characteristic times (t

c

= :314 ms). For the = 0:0 cases, reasonably good

agreement with experimental data is found throughout the combustor except near the

impingementpointofthebowshockontheupperboundarylayer. Thepeakpressureriseis

signicantlyunderpredicted,andthesubsequentwaveinteractionsarelesspronouncedthan

shown in the experimental data. These discrepancies may result from several numerical

factors, such as insuÆcient grid resolution in the impingement region, the discretization

scheme's relatively low order of accuracy near discontinuities, and the possible tendency

of the turbulence modelto \over-separate" the boundary layer inresponse to the adverse

pressuregradient. Theexperimentaldata shows thatthe actualcombustionprocess inthe

combustor inuences the ow inthe isolatorsection to alarge extent, indicatingthat the

shocktrainestablishedtocontainthecombustion-inducedpressurerisepropagatesintothe

isolatorregionand isstabilizedthere. This isnotseen hereinthe twodimensionalresults,

but is observed for the three dimensional case as discussed later. The two dimensional

resultsalsoshowatransitiontosubsonicowthroughasuccessiveweakeningoftheoblique

shock structure at after x = 200mm, rather than a continuing oblique shock pattern

throughout the nozzle as observed in the experiment. The hydroxyl mass fraction (OH),

Machnumber, temperature, andpressure contours areshown inFigure10forthe =0:29

case. Contour levels for Mach number and temperature are restricted to jMj < 1 and

T >2000 respectively to better distinguish regionsof subsonic owand regions of strong

(50)

thatis stabilizedinthe recirculation regiondiscussed earlier. The gradualpressure rise in

thecombustorisdue tothe outward deectionof thereactiveshear layer, which decreases

the eective ow area occupied by the core uid. Small regions of high-temperature,

low-momentum uid occur on the upper boundary layer due to shock / boundary layer

interactions. The temperature in the mixingregion reaches a maximum value of around

2800Kelvin. Acomparisonofthehydroxyl contoursandtheMachnumbercontoursshows

that chemical reactions occur in the supersonic region of the ow. The majority of the

combustion occurs between the fuel strut and x = 200mm meters plane, as indicated by

the largelevelsof hydroxyl inthat region. This region isalso the expansionregion, which

willbecomea more signicantpart of the ow for the higherequivalence ratio cases. The

ame structure shown is typical of diusion ames, where the maximum concentrations

of hydroxyl radical delineate the ame front and correspond to the regions of highest

temperature.

Figures 11 - 15 illustrate the response of the isolator / combustor oweld to

time-dependentinjectionofhydrogenat=0:61. InFigure11wallpressure distributionsalong

the top wall of the scramjet combustor at several time levels are compared with

experi-mental pressure data time averaged over 40 ms. Three stages of the unsteady response

are observed for this case. The initial stage occurs before t < 4:63t

c

, where the injection

process raises the static pressure of the uid within the injector from around 50 kPa to

nearly 300 kPa, resulting in choked ow at the nozzle throat. The expansion behind the

fuel strut is nearly eliminated while the recompression shock is strengthened due to the

displacement eect of the hydrogen fuel mass injection (t = 1:54t

c

). OH contours shown

inthe close up viewof the injector region(Figure 14) reveal that the initialstages of fuel

-airmixingoccurthrough the growth of the vorticalstructures generated through

Kelvin-Helmholtzmechanisms. Thisresultsinthe intermittentpulsingofthe fuelowratebefore