• No results found

Scalable algorithms for adaptive mesh refinement with arbitrary element types

N/A
N/A
Protected

Academic year: 2021

Share "Scalable algorithms for adaptive mesh refinement with arbitrary element types"

Copied!
110
0
0

Loading.... (view fulltext now)

Full text

(1)

with arbitrary element types

Johannes Holke (German Aerospace Center DLR, Cologne)

(2)

www.DLR.de •Folie 2>30.11.2018>Johannes Holke•Scalable AMR>30.11.2018

Who am i and why am i here?

– 2014 Master of Science, Mathematics, University of Bonn

– 2014-2018 PhD under Carsten Burstedde, Institute for Numerical Simulation, Bonn – Since 2018 at DLR, Simulation and Software TechnologyŠHigh-performance

Computing

(3)

www.DLR.de •Folie 3>30.11.2018>Johannes Holke•Scalable AMR>30.11.2018

(4)

www.DLR.de •Folie 3>30.11.2018>Johannes Holke•Scalable AMR>30.11.2018

Who am i and why am i here?

(5)

Outline

Adaptive Meshes

Trees and space-filling curves The TM-curve

Arbitrary element types Recursive search Numerical results

(6)

www.DLR.de •Folie 5>30.11.2018>Johannes Holke•Scalable AMR>30.11.2018

Adaptive Meshes

(7)
(8)

www.DLR.de •Folie 7>30.11.2018>Johannes Holke•Scalable AMR>30.11.2018

Adaptive Meshes

Adaptive Refinement

Only refine the mesh where needed.

The same computational error with less elements • Mesh management becomes more complicated

(9)

Dynamical AMR

(10)

www.DLR.de •Folie 9>30.11.2018>Johannes Holke•Scalable AMR>30.11.2018

AMR Algorithms

• Input: Coarse mesh • New • Adapt • Balance • Partition • Ghost Coarse Mesh

(11)

AMR Algorithms

• Input: Coarse mesh • New • Adapt • Balance • Partition • Ghost New

(12)

www.DLR.de •Folie 9>30.11.2018>Johannes Holke•Scalable AMR>30.11.2018

AMR Algorithms

Input: Coarse mesh • New • AdaptBalance • Partition • Ghost Adapt + Balance

(13)

AMR Algorithms

Input: Coarse mesh • New • AdaptBalance • Partition • Ghost Partition

(14)

www.DLR.de •Folie 9>30.11.2018>Johannes Holke•Scalable AMR>30.11.2018

AMR Algorithms

Input: Coarse mesh • New • AdaptBalance • Partition • Ghost Repeat

(15)

Trees and space-filling curves

(16)

www.DLR.de •Folie 10>30.11.2018>Johannes Holke•Scalable AMR>30.11.2018

Trees and space-filling curves

(17)

www.DLR.de •Folie 11>30.11.2018>Johannes Holke•Scalable AMR>30.11.2018

Unstructured meshes

Arbitrary connectivity. All element neighbors and grid coordinates need to be stored explicitly.

• Works with all element shapes−→hybrid meshes

• Large memory footprint

Long runtimes of AMR algorithms

(Re-)Partition

Often with graph based methods (parMETIS/SCOTCH). 1.000 - 10.000 elements/s per processa

(18)

www.DLR.de •Folie 11>30.11.2018>Johannes Holke•Scalable AMR>30.11.2018

Unstructured meshes

Arbitrary connectivity. All element neighbors and grid coordinates need to be stored explicitly.

Full geometric flexibility

• Works with all element shapes−→hybrid meshes

• Large memory footprint

Long runtimes of AMR algorithms

(Re-)Partition

Often with graph based methods (parMETIS/SCOTCH). 1.000 - 10.000 elements/s per processa

(19)

www.DLR.de •Folie 11>30.11.2018>Johannes Holke•Scalable AMR>30.11.2018

Unstructured meshes

Arbitrary connectivity. All element neighbors and grid coordinates need to be stored explicitly.

Full geometric flexibility

• Works with all element shapes−→hybrid meshes

• Large memory footprint

Long runtimes of AMR algorithms

Often with graph based methods (parMETIS/SCOTCH). 1.000 - 10.000 elements/s per processa

(20)

www.DLR.de •Folie 11>30.11.2018>Johannes Holke•Scalable AMR>30.11.2018

Unstructured meshes

Arbitrary connectivity. All element neighbors and grid coordinates need to be stored explicitly.

Full geometric flexibility

• Works with all element shapes−→hybrid meshes

• Large memory footprint

Long runtimes of AMR algorithms

(Re-)Partition

Often with graph based methods (parMETIS/SCOTCH). 1.000 - 10.000 elements/s per processa

(21)

Tree based AMR

(22)

www.DLR.de •Folie 12>30.11.2018>Johannes Holke•Scalable AMR>30.11.2018

Tree based AMR

(23)

Tree based AMR

Idea: The geometrical resolution needed is often much coarser than the numerical resolution.

Left geometry: 11k spheres, 383 million tetrahedra Right adaptive refinement: 167 billion tetrahedra

(24)

www.DLR.de •Folie 12>30.11.2018>Johannes Holke•Scalable AMR>30.11.2018

Tree based AMR

Idea: The geometrical resolution needed is often much coarser than the numerical resolution.

(25)

www.DLR.de •Folie 13>30.11.2018>Johannes Holke•Scalable AMR>30.11.2018

Tree based AMR

(26)

www.DLR.de •Folie 13>30.11.2018>Johannes Holke•Scalable AMR>30.11.2018

Tree based AMR

Start with (Unstructured) input mesh modelling the geometry

(27)

Tree based AMR

Start with (Unstructured) input mesh modelling the geometry

Use (structured) refinemnet rule within every coarse mesh cell.

(28)

www.DLR.de •Folie 14>30.11.2018>Johannes Holke•Scalable AMR>30.11.2018

Space-filling curves (SFC)

Linear order of the leaves of each tree.

k 0 k 1

p 0 p1 p 1 p2

k0

k1

k0 k1

(29)

Space-filling curves (SFC)

Linear order of the leaves of each tree.

k 0 k 1

p 0 p1 p 1 p2

k0

k1

k0 k1

(30)

www.DLR.de •Folie 15>30.11.2018>Johannes Holke•Scalable AMR>30.11.2018

Two meshes: Coarse mesh that describes the geometry. Fine mesh for the computation.

Each coarse mesh cell is a refinement tree.

(31)

Two meshes: Coarse mesh that describes the geometry. Fine mesh for the computation. Each coarse mesh cell is a refinement tree.

(32)

www.DLR.de •Folie 16>30.11.2018>Johannes Holke•Scalable AMR>30.11.2018

Trees and space-filling curves

Properties

Geometric flexibility

Works with all shapes that have an SFC−→Hybrid meshes are possible

• Low memory footprint(Coordinates and neighbors per tree) • Shorter runtimes and better scalability

(Re-)Partition

Unstructureda 1.000 - 10.000 Elements/s per prozess

Tree basedb 700.000 - 5,000.000 • Cache efficient

Recursive search

(33)

www.DLR.de •Folie 16>30.11.2018>Johannes Holke•Scalable AMR>30.11.2018

Trees and space-filling curves

Properties

Geometric flexibility

Works with all shapes that have an SFC−→Hybrid meshes are possible

• Low memory footprint(Coordinates and neighbors per tree) • Shorter runtimes and better scalability

(Re-)Partition

Unstructureda 1.000 - 10.000 Elements/s per prozess

• Cache efficient

Recursive search

• Possibly unconnected partitions

(34)

www.DLR.de •Folie 16>30.11.2018>Johannes Holke•Scalable AMR>30.11.2018

Trees and space-filling curves

Properties

Geometric flexibility

Works with all shapes that have an SFC−→Hybrid meshes are possible

• Low memory footprint(Coordinates and neighbors per tree) • Shorter runtimes and better scalability

(Re-)Partition

Unstructureda 1.000 - 10.000 Elements/s per prozess

Tree basedb 700.000 - 5,000.000

• Cache efficient

Recursive search

• Possibly unconnected partitions

aSmith, Rasquin, Shephard et. Al.Application specific mesh partition improvement.2015 bBurstedde, Holke,Coarse mesh partitioning for tree-based AMR, 2017

(35)

Trees and space-filling curves

Properties

Geometric flexibility

Works with all shapes that have an SFC−→Hybrid meshes are possible

• Low memory footprint(Coordinates and neighbors per tree) • Shorter runtimes and better scalability

(Re-)Partition

Unstructureda 1.000 - 10.000 Elements/s per prozess

Tree basedb 700.000 - 5,000.000 • Cache efficient

Recursive search

Possibly unconnected partitions

aSmith, Rasquin, Shephard et. Al.Application specific mesh partition improvement.2015 bBurstedde, Holke,Coarse mesh partitioning for tree-based AMR, 2017

(36)

www.DLR.de •Folie 17>30.11.2018>Johannes Holke•Scalable AMR>30.11.2018

(37)

Interlude

Hanging nodes?

To resolve hanging nodes one could either 1. Interpolate in the solver routines.

(38)

www.DLR.de •Folie 18>30.11.2018>Johannes Holke•Scalable AMR>30.11.2018

Interlude

Hanging nodes?

To resolve hanging nodes one could either 1. Interpolate in the solver routines.

(39)

Interlude

Hanging nodes?

To resolve hanging nodes one could either 1. Interpolate in the solver routines.

(40)

www.DLR.de •Folie 19>30.11.2018>Johannes Holke•Scalable AMR>30.11.2018

Existing frameworks

Tree based AMR libraries

Quad/Hex, Peano curve -Peano

• Quad/Hex, Morton curve -p4est(Burstedde et. al) • Triangles, Sierpinski curve -sam(oa)2

(41)

Existing frameworks

Tree based AMR libraries

Quad/Hex, Peano curve -Peano

• Quad/Hex, Morton curve -p4est(Burstedde et. al) • Triangles, Sierpinski curve -sam(oa)2

(42)

www.DLR.de •Folie 19>30.11.2018>Johannes Holke•Scalable AMR>30.11.2018

Existing frameworks

Tree based AMR libraries

Quad/Hex, Peano curve -Peano

• Quad/Hex, Morton curve -p4est(Burstedde et. al) • Triangles, Sierpinski curve -sam(oa)2

(43)

Existing frameworks

Tree based AMR libraries

Quad/Hex, Peano curve -Peano

• Quad/Hex, Morton curve -p4est(Burstedde et. al) • Triangles, Sierpinski curve -sam(oa)2

Tetrahedra/triangles -t8code • Hybrid - ?

• Prisms - ? • Pyramids - ?

(44)

www.DLR.de •Folie 19>30.11.2018>Johannes Holke•Scalable AMR>30.11.2018

Existing frameworks

Tree based AMR libraries

Quad/Hex, Peano curve -Peano

• Quad/Hex, Morton curve -p4est(Burstedde et. al) • Triangles, Sierpinski curve -sam(oa)2

Tetrahedra/triangles -t8code • Hybrid - ? • Prisms - ? • Pyramids - ?                          t8codea github.com/holke/t8code

(45)

www.DLR.de •Folie 20>30.11.2018>Johannes Holke•Scalable AMR>30.11.2018

The Morton curve

Morton curve

The Z-curve for quadrilateral/hexahedral meshes is constructed via the Morton code (Morton 1966). It is the bitwise interleaving of an elements anchor coordinates.

0 6 24

8

Fix a maximum refinement levelL, i.e.L= 4.

Morton code ofQ: x(Q) = 6 = (0110)2 y(Q) = 8 = (1000)2 `(Q) = 3 ⇒m(Q) = (10010100)2 = (2 1 1 0)4 = 148

(46)

www.DLR.de •Folie 20>30.11.2018>Johannes Holke•Scalable AMR>30.11.2018

The Morton curve

Morton curve

The Z-curve for quadrilateral/hexahedral meshes is constructed via the Morton code (Morton 1966). It is the bitwise interleaving of an elements anchor coordinates.

0 6 24

8 24

Fix a maximum refinement levelL, i.e.L= 4.

Morton code ofQ: x(Q) = 6 = (0110)2 y(Q) = 8 = (1000)2 `(Q) = 3 ⇒m(Q) = (10010100)2 = (2 1 1 0)4 = 148

(47)

The Morton curve

Morton curve

The Z-curve for quadrilateral/hexahedral meshes is constructed via the Morton code (Morton 1966). It is the bitwise interleaving of an elements anchor coordinates.

0 6 24

8 24

Fix a maximum refinement levelL, i.e.L= 4.

Morton code ofQ: x(Q) = 6 = (0110)2 y(Q) = 8 = (1000)2 `(Q) = 3 ⇒m(Q) = (10010100)2 = (2 1 1 0)4 = 148

(48)

www.DLR.de •Folie 21>30.11.2018>Johannes Holke•Scalable AMR>30.11.2018

The Morton curve

Using the Morton index, many low-level algorithms can be computed efficiently. Example:

Computing face-neighbors (within tree)

Add or subtracth= 2L−`from the appropriate coordinate.

(x,y) (x+h,y)

(49)

The Morton curve

Using the Morton index, many low-level algorithms can be computed efficiently. Example:

Computing face-neighbors (within tree)

Add or subtracth= 2L−`from the appropriate coordinate.

(x,y) (x+h,y)

(50)

www.DLR.de •Folie 22>30.11.2018>Johannes Holke•Scalable AMR>30.11.2018

The Morton curve

Advantageous features of the Morton code

• Computable in a fast manner via bitwise interleaving. • Easy to implement.

• Memory efficient

• storage per element: coordinates of one node, level (4d+ 1 Bytes). • Children, parent, face-neighbor, etc.

computed in constant time. • 2D and 3D follow the same logic.

(51)

www.DLR.de •Folie 23>30.11.2018>Johannes Holke•Scalable AMR>30.11.2018 The TM-curve Y 0 2L 2L 0 S0 S1 X

We embed this triangle in the square [0,2L]2,

which is divided along its diagonal; obtaining a second triangleS1.

(52)

www.DLR.de •Folie 23>30.11.2018>Johannes Holke•Scalable AMR>30.11.2018 The TM-curve Y 0 2L 2L 0 X Bey’s observation:

When refiningS0, each occurring triangleT is

equivalent toS0orS1.

Definition

(53)

www.DLR.de •Folie 23>30.11.2018>Johannes Holke•Scalable AMR>30.11.2018 The TM-curve Y 0 2L 2L 0 X Bey’s observation:

When refiningS0, each occurring triangleT is

equivalent toS0orS1.

(54)

www.DLR.de •Folie 23>30.11.2018>Johannes Holke•Scalable AMR>30.11.2018 The TM-curve Y 0 2L 2L 0 X Bey’s observation:

When refiningS0, each occurring triangleT is

equivalent toS0orS1.

Definition

(55)

www.DLR.de •Folie 23>30.11.2018>Johannes Holke•Scalable AMR>30.11.2018 The TM-curve Y 0 2L 2L 0 X

Furthermore, there is also an underlying quadrilateral mesh.

(56)

www.DLR.de •Folie 23>30.11.2018>Johannes Holke•Scalable AMR>30.11.2018 The TM-curve Y 0 2L 2L 0 X

A triangleTin a refinement is uniquely identified by the coordinates of one node plus its levelplus its type.

Definition

(57)

The TM-curve

The same constructions applies in 3d. Here we have six different types.

X Y Z S0 S1 S2 S3 S4 S5

Each tetrahedronTin a refinement ofS0is

(58)

www.DLR.de •Folie 24>30.11.2018>Johannes Holke•Scalable AMR>30.11.2018

The TM-curve

The same constructions applies in 3d. Here we have six different types.

X

Y Z

Each tetrahedronTin a refinement ofS0is

(59)

The TM-curve

The same constructions applies in 3d. Here we have six different types.

X

Y Z

Each tetrahedronTin a refinement ofS0is

equivalent to one ofS0, . . . ,S5.

It is uniquely identified by the coordinates of one node plus its levelplus its type.

(60)

www.DLR.de •Folie 25>30.11.2018>Johannes Holke•Scalable AMR>30.11.2018

The TM-curve

Definition

The tetrahedral Morton index of a simplexTis contructed by interleaving (z,)y,xcoordinates of the anchor node ofTwithB.

(yL−1, . . . ,y0)2 (xL−1, . . . ,x0)2 (bL−1, . . . ,b0)4

(61)
(62)

www.DLR.de •Folie 27>30.11.2018>Johannes Holke•Scalable AMR>30.11.2018

Computing face-neighbors, an example

Z

Y X

(63)

Computing face-neighbors, an example Z Y X N0 N2 N3 N1

Computation of N0

Addhtoz-coordinate, change type to 1.

(64)

www.DLR.de •Folie 27>30.11.2018>Johannes Holke•Scalable AMR>30.11.2018

Computing face-neighbors, an example

Z Y X N0 N2 N3 N1

Computation of N1

Change type to 0.

(65)

Computing face-neighbors, an example Z Y X N0 N2 N3 N1

Computation of N2

Change type to 4.

(66)

www.DLR.de •Folie 27>30.11.2018>Johannes Holke•Scalable AMR>30.11.2018

Computing face-neighbors, an example

Z Y X N0 N2 N3 N1

Computation of N3

Subtracthfromy-coordinate, change type to 3.

(67)

www.DLR.de •Folie 28>30.11.2018>Johannes Holke•Scalable AMR>30.11.2018

Computing face-neighbors

We do this once for each type and obtain a look-up table.

(68)

www.DLR.de •Folie 28>30.11.2018>Johannes Holke•Scalable AMR>30.11.2018

Computing face-neighbors

We do this once for each type and obtain a look-up table.

Computing face-neighbors is as easy as looking up the values in the table.

(69)

The TM-curve

Advantageous features of the tetrahedral Morton code

• Computable in a fast manner via bitwise interleaving. • Easy to implement.

• Memory efficient

• storage per element: coordinates of one node, level (4d+ 1 Bytes). • Children, parent, face-neighbor, etc.

computed in constant time. • 2D and 3D follow the same logic.

(70)

www.DLR.de •Folie 30>30.11.2018>Johannes Holke•Scalable AMR>30.11.2018

Tree based AMR libraries

Quad/Hex, Peano curve -Peano

• Quad/Hex, Morton curve -p4est,t8code

Triangles, Sierpinski curve -sam(oa)2 • Tetrahedra/triangles -t8code • Hybrid - ? • Prismen - ? • Pyramiden - ?                          t8code

(71)

Arbitrary element types

Core algorithms (high-level)

New • Adapt • Partition • Ghost • Balance

(72)

www.DLR.de •Folie 31>30.11.2018>Johannes Holke•Scalable AMR>30.11.2018

Arbitrary element types

Core algorithms (high-level)

NewAdapt • Partition • Ghost • Balance

low-level

• element refine • element parent • element id

• element face neighbor .

. .

(73)

Arbitrary element types

Core algorithms (high-level)

NewAdapt • Partition • Ghost • Balance

low-level

• element refine • element parent • element id

• element face neighbor .

. . • Decouple high-level and low-level algorithms

Define API for element-local (low-level) functions

(74)

www.DLR.de •Folie 32>30.11.2018>Johannes Holke•Scalable AMR>30.11.2018

Example: Face-neighbors across tree boundaries

K G G0 K0 E E0 x y z x y z

Avoid couplings

Hex↔Prism Prism↔Tet Quad↔Tri . . .

(75)

Example: Face-neighbors across tree boundaries K G G0 K0 E E0 x y z x y z

Avoid couplings

Hex↔Prism Prism↔Tet Quad↔Tri . . .

(76)

www.DLR.de •Folie 33>30.11.2018>Johannes Holke•Scalable AMR>30.11.2018

Example: Face-neighbors across tree boundaries

K G G0 K0 E E0 x y z x y z

(77)

Example: Face-neighbors across tree boundaries K G G0 K0 E E0 x y z x y z K G F x y z

(78)

www.DLR.de •Folie 33>30.11.2018>Johannes Holke•Scalable AMR>30.11.2018

Example: Face-neighbors across tree boundaries

K G G0 K0 E E0 x y z x y z K G F x y z

1: Build 2D boundary element

G G0 F F0 x y x y 2: Transform coordinates

(79)

Example: Face-neighbors across tree boundaries K G G0 K0 E E0 x y z x y z K G F x y z

1: Build 2D boundary element

G G0 F F0 x y x y G0 K0 x y z F0

(80)

www.DLR.de •Folie 34>30.11.2018>Johannes Holke•Scalable AMR>30.11.2018

Arbitrary element types

Do whatever you want

The decoupling of high- and low-level functions allows the user to implement its own refinement pattern and SFC.

Sources: M. Bader: Space-Filling Curves,

(81)

www.DLR.de •Folie 35>30.11.2018>Johannes Holke•Scalable AMR>30.11.2018

search

search

Example:N particles in the domain. For each, find the containing element and execute a callback function.

With recursive top-down search in a tree, we can handle all conditions (i.e. points) and elements in one run. And, we can exclude complete subtrees from the search.

(82)

www.DLR.de •Folie 35>30.11.2018>Johannes Holke•Scalable AMR>30.11.2018

search

search

Example:N particles in the domain. For each, find the containing element and execute a callback function.

More general: Find elements that match a certain condition, and excute a callback.

With recursive top-down search in a tree, we can handle all conditions (i.e. points) and elements in one run. And, we can exclude complete subtrees from the search.

(83)

search

search

Example:N particles in the domain. For each, find the containing element and execute a callback function.

More general: Find elements that match a certain condition, and excute a callback.

With recursive top-down search in a tree, we can handle all conditions (i.e. points) and elements in one run. And, we can exclude complete subtrees from the search.

(84)

www.DLR.de •Folie 36>30.11.2018>Johannes Holke•Scalable AMR>30.11.2018

(85)
(86)

www.DLR.de •Folie 36>30.11.2018>Johannes Holke•Scalable AMR>30.11.2018

(87)
(88)

www.DLR.de •Folie 36>30.11.2018>Johannes Holke•Scalable AMR>30.11.2018

(89)
(90)

www.DLR.de •Folie 36>30.11.2018>Johannes Holke•Scalable AMR>30.11.2018

(91)
(92)

www.DLR.de •Folie 36>30.11.2018>Johannes Holke•Scalable AMR>30.11.2018

(93)
(94)

www.DLR.de •Folie 36>30.11.2018>Johannes Holke•Scalable AMR>30.11.2018

(95)
(96)

www.DLR.de •Folie 36>30.11.2018>Johannes Holke•Scalable AMR>30.11.2018

(97)
(98)

www.DLR.de •Folie 37>30.11.2018>Johannes Holke•Scalable AMR>30.11.2018

search

Example:

Ghost

Find all elements at our domain boundary. Communicate them with the neighboring process. tetrahedra

uniform adaptive

Level 9 8 4 8–10 7–9 3–5

elements/proc 786,432 98,304 24 1,015,808 126,976 31

ghosts/proc 32,704 8,160 30 31,604 8,137 56

Ghost[s] 129.6 16.19 5.93e-3 167.94 20.88 8.10e-3

Ghostwithsearch[s] 7.41 1.75 5.01e-3 7.08 1.69 8.12e-3

(99)
(100)

www.DLR.de •Folie 39>30.11.2018>Johannes Holke•Scalable AMR>30.11.2018

Strong scaling, JUQUEEN

0.01 0.1 1 10 8192 16384 32768 64536 131072 Run time [s]

Number of MPI ranks

Ghost

C·G/P

Partition

2 billion tetrahedra

(101)

Weak scaling, JUQUEEN 1 3 6 10 8192 65536 458752 Run time [s]

Number of MPI ranks

Ghost (tet) Ghost (hex) ideal scaling 50 80 90 100 8192 65536 458752 Efficiency [%]

Number of MPI ranks

Ghost eff. (tet) Ghost eff. (hex)

Ghostwith∼233k Elements/process (Tet),∼310k elements/process (Hex). Largest mesh: 162 billion Hexaedra, 108 billion tetrahedra

(102)

www.DLR.de •Folie 41>30.11.2018>Johannes Holke•Scalable AMR>30.11.2018

Large scale mesh partition

Coarse mesh: 325 million Tets Fine mesh: 167 billion Tets

(103)

Large scale mesh partition

Coarse mesh partition on 458,752 MPI ranks

t mesh size trees (ghosts) sent shared trees run time [s]

1 324,766,336 704 (2444) 280,339 0.207

2 324,766,336 708 (2456) 281,694 0.204

3 324,766,336 707 (2458) 281,900 0.204

Fine mesh partition on 458,752 MPI ranks t mesh size elements sent run time [s]

1 167,625,595,829 362,863 0.522

2 167,709,936,554 364,778 0.578

3 167,841,392,949 365,322 0.567

Tabelle:Run times for coarse mesh and fine mesh partition for the brick with holes on 458,752 MPI ranks. Forest level 3 to 4 (365k elements/proc).

(104)

www.DLR.de •Folie 43>30.11.2018>Johannes Holke•Scalable AMR>30.11.2018

(105)
(106)

www.DLR.de •Folie 44>30.11.2018>Johannes Holke•Scalable AMR>30.11.2018

(107)
(108)

www.DLR.de •Folie 46>30.11.2018>Johannes Holke•Scalable AMR>30.11.2018

Strong scaling, JUQUEEN

7.5 15 30 45 75 131072 262144 458752 Run time [s]

Number of MPI ranks

AMR (w/o Ripple-Balance) FV Solver Total (w/o Ripple-Balance) Ideal scaling

(109)

www.DLR.de •Folie 47>30.11.2018>Johannes Holke•Scalable AMR>30.11.2018

Conclusion

• Constructed and investigated new SFC for tets and triangles • Develop and evaluate new massively scaling algorithms (t8code) • New method of coarse mesh partitioning

• Easily extentable with new element types/SFCs (prisms in a Bachelor’s thesis) • First application of tree based AMR with tetrahedra and hybrid meshes. • Seegithub.com/holke/t8code, GPL v2

Wish list

Currently limited to face-connectivity • Balanceis a bottleneck

• Pyramids • ...

(110)

www.DLR.de •Folie 47>30.11.2018>Johannes Holke•Scalable AMR>30.11.2018

Conclusion

• Constructed and investigated new SFC for tets and triangles • Develop and evaluate new massively scaling algorithms (t8code) • New method of coarse mesh partitioning

• Easily extentable with new element types/SFCs (prisms in a Bachelor’s thesis) • First application of tree based AMR with tetrahedra and hybrid meshes. • Seegithub.com/holke/t8code, GPL v2

Wish list

Currently limited to face-connectivity • Balanceis a bottleneck

• Pyramids • ...

References

Related documents

 The specific activities that will be provided on this track are two-fold and include direct service at the University School on campus with grades pre-k to 12 and service in

A Helicopter Emergency Medical Service (HEMS) or air ambulance is essentially a flying emergency department with on-board medical personnel who have both the

When regimes are not known with certainty, the European-style option value is a weighted average of the two conditional option values at the seed node, where initial regime

Higher cash value and guaranteed death benefit premiums than Lincoln LifeGuarantee UL Guaranteed Issue availability Premium relief feature. 1035 exchange

Huang Di asked Qi Bo, saying, "The way of acupuncture must penetrate the twelve major channels and that which is at their ends and beginnings, the collateral channels and how

The goal of this design and development study was to develop and validate a model to guide instructors in the implementation of LA tools to support academically at- risk students

Figure 3 Sagittal (a) T1-weighted, (b) T1-weighted after gadolinium-diethylenetriamine pentacetic acid (Gd-DTPA) injection, (c) T2-weighted, (d) short T1 inversion recovery, and

For example, if most individuals have strong preferences for organ donation and families observe an uninformative indicator of donation preferences, a presumed consent law may create