Title: Ocean Circulation: Sink or Swim
 1COMPUTATIONAL SCIENCES BROWN BAG SEMINAR
Ocean Circulation Sink or Swim
Danny Thorne
UK Department of Computer Science
Tuesday, April 17, 2001
Supported in part by Sandia National Laboratory, 
Lawrence Livermore National Laboratory, and the 
National Science Foundation. Nuke em if they 
cant take a joke. 
 2Craig C. Douglas University of Kentucky and Yale 
University 
Gundolf Haase Johannes KeplerUniversity-Linz
- Mohamed Iskandarani 
 - University of Miami
 
Danny Thorne University of Kentucky and Sandia 
National Laboratory
Michael Sheetz University of Kentucky 
 3Outline
This talk chronicles the
- Past 
 - Background 
 - Single Layer 
 - Multiple Layer 
 - Present 
 - Subgrid Structuring Techniques 
 - Simultaneous Iteration 
 - Lumped Schur Complement 
 - Future 
 - C 
 - 3D 
 - Non-Hydrostatic
 
of the Spectral Element Ocean Model (SEOM). 
 4(No Transcript) 
 5(No Transcript) 
 6Difficulties of Ocean Modeling
- Stratification 
 - Rotation 
 - Complicated basins 
 - Small aspect ratio (depth ltlt L) 
 - Turbulence (jets, eddies, fronts) 
 - Multiple time and length scales 
 - Poorly known I.C. and B.C. 
 
  7Advantages of Spectral Elements
- h-p finite element method 
 - Dual convergence paths 
 - Geometric Flexibility 
 - Unstructured grids 
 - Accuracy 
 - Low numerical dispersion and dissipation errors 
 - Scalability 
 - Dense Computational Kernels, O(KN3) 
 - Sparse neighbor-neighbor communications
 
  8(No Transcript) 
 9Unstructured Grids
- Complicated ocean basins 
 - No masking 
 - Local (elemental) mapping only 
 - Multiscale Simulations 
 - Increase resolution where needed 
 - Regional-Global simulations possible in single 
setting  - Avoid open boundary conditions
 
  10North East Pacific Grid 
 11Grid Spacing (km) 
 12Elements  Collocation points 
 13Disadvantages(TANSTAAFL!)
- Stricter CFL stability than FD 
 - Gibbs oscillations near fronts 
 - Global conservation only 
 - More CPU intensive than FD/grid point
 
  14SEOM Versions and Applications
- Single Layer 
 - No stratification effects 
 - Solution is the depth-mean flow 
 - 1.5 layer (wind circulation/abyssal flow) 
 - Global long period tides 
 - Estuarine modeling 
 - Multiple Layers 
 - Vertical coordinate is density (isopycnal) 
 - Vertical stack of single layers 
 - Wind driven circulation, 2-5 layers (soon up 
to16)  
  15Motivation for Layered SEOM
- Mathematically simpler than SEOM-3D 
 - Computationally simpler and faster 
 - No cross isopycnal diffusion 
 - No pressure gradient errors 
 - Baroclinic processes possible with 2 layers 
 - Eddy resolving simulations can be produced 
 -  relatively easily and cheaply
 
  16Current Work
/ TODO Outline / 
 17Filtering
- Each layer has to solve 
 -  , filtered vorticity 
 -  , filtered divergence field 
 - The filtering is done by series expansion and the 
Boyd-Vandeven filter in each spectral element.  - Solve on each of the 5 layers 
 and  
  18So, What Is the Problem?
-  Symmetric, positive definite matrix 
 -  Not a M-matrix 
 -  Really, really huge
 
Many parallel solvers available
but
Memory requirements vs. solution time 
 19Spectral Element
- Gauss-Lobatto discretization 
 - Element is the support of inner node f.e. basis 
functions 
- Inner nodes 
 - Boundary nodes 
 -  consisting of Edge nodes 
 -  Vertex nodes
 
  20System of Equations
- Spectral element discretization 
 - Solve 10 times the system of equations 
 -  Block structure 
 - where 
 - and the are 
symmetric. 
  21Schur Complement-cg
- Solve Laplacian equations by Schur Complement 
Conjugate Gradient (cg)  - Preconditioner 
 - Take advantage of the structure of the spectral 
elements 
  22Factor matrix
- Factorization of 
 - Schur complement 
 - Matrices are stored.
 
  23Schur Complement and Basis Transformation
- Defining the exact harmonic basis (EHB) 
transformation  - the Schur complement can be reinterpreted as 
 - i.e., Galerkin approach. 
 
  24Schur Complement cg
  25Schur Complement Preconditioner I
- Again, we can factor 
 such that  - BUT (with j 
counter of elements/edges/)  
  26Schur Complement Preconditioner II
- Replace with 
 -   linear interpolation from vertices 
onto an edge j  
  27Schur Complement Preconditioner III
- Calculate element-wise 
 - Approximate by 
 -  is on edge j 
Dryja  - Derive directly 
by symbolic methods  - Bramble/Pasciak/Schatz 
 
  28Schur Complement-pcg
  29Vertex Node System
-  is equivalent to a 
 -  (non-constant) 9-point 
stencil  - Solve directly (gather on one processor) 
 - Combine with parallel AMG (PEBBLES) 
 - Special cache optimized and parallel AMG/MG for 
9-point stencil () 
  30Memory Requirements
- Laplacian in 2D 
 - Small example 99 elements, 5146 nodes 
 - M  O(nelem) 
 - M(Schur-cg)  2.35 MB 
 - M(Schur-cg,pc)  2.36 MB
 
  31Memory requirements
99 elements, 5146 nodes 
 32Simultaneous Iterations
Filtering requires in each time step
5 layers
Solve 10 Laplacian equations at once! Less 
memory accesses, better cache usage 
 33Improvements by Sim. Solving
2.05 faster on R10k 1.75 faster on R14k
Filtering only 15 WCPU 
 3470 of Computing Time
is spent in solving
Note 
 35System of Equations
Diagonal matrix 
 36Schur Complement System (Pressure)
Solve via precond. cg... 
 37Solving of Pressure System
Solve via pcg
-  No matrices stored ! 
 -  gt Only matrix-vector 
operation available. 
  38Preconditioning of Pressure System
-  New preconditioner (Lumped Schur Complement)
 
  39Further Improvement by Lumped Schur Complement
Speedup 1.28 on R10k 1.22 onR14k 
 40Future of SEOM
-  SEOM 
 -  SEOM is currently a big, bad Fortran 90 code. 
 -  Fortran is a dying language. 
 -  Fortran 90 should be avoided at all costs. 
 -  BYON
 
-  Distributed Preprocessing Stage 
 -  Vanishing Layer Thickness 
 -  Non-Hydrostatic Primitive Equations 
 -  3D Continuous Stratification 
 -  Integration of PEBBLES, a parallel algebraic 
 -  multigrid solver from Linz.
 
  41SEOM Versions and Applications, Contd
- 3D Continuous Stratification 
 - density is a dependent variable 
 - low order SE discretization in vertical 
 - terrain following 
 - gravitational adjustment 
 - overflow 
 - flow rectification in a laboratory canyon 
 - test problems available on marine.rutgers.edu/po
 
  42Thank You