Original Articles

Hierarchical hybrid grids: achieving TERAFLOP performance on large scale finite element simulations

B. Bergen, CCS-2 Continuum Dynamics, Los Alamos National Laboratory, Los Alamos, NM, USA. Correspondence: [email protected]

G. Wellein, Regionales Rechenzentrum Erlangen, Universität Erlangen, Erlangen, Germany

F. Hülsemann, EDF R&D, Departement SINETICS, Clamart Cedex, France

U. Rüde, Friedrich-Alexander-Universität, Erlangen, Germany

Abstract

The design of the hierarchical hybrid grids (HHG) framework is motivated by the desire to achieve high performance on large-scale, parallel, finite element simulations on supercomputers. Realizing this goal requires careful analysis of the low-level, computationally intensive algorithms used in implementing the library. This analysis is primarily concerned with identifying and removing bottlenecks that limit the serial performance of multigrid component algorithms such as smoothing and residual error calculation. To aid in this investigation, two metrics have been developed: the balance metric (BM) and the loads per miss metric (LPMM). Each metric makes assumptions about how data structures and algorithms interact with the memory subsystems and processors of the architectures on which they are implemented. Applying the metrics generates performance predictions that can be compared with measured results to determine the actual characteristics of an algorithm and data structure combination on a given platform. This information can then be used to increase performance.

In this paper, we first present an overview of the HHG framework. Next, we introduce the details of the two performance metrics. These metrics are then applied to three different data structures used to implement a Gauß–Seidel smoothing algorithm. Performance results and an interpretation of the underlying interactions of the data structures with several relevant supercomputing architectures are given. Finally, we present a brief discussion of some performance results of the HHG framework, followed by some concluding remarks.
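For readers unfamiliar with the smoother mentioned above, the following is a minimal sketch of one lexicographic Gauß–Seidel sweep and the matching residual calculation. It is not the HHG implementation and does not reproduce any of the three data structures studied in the paper. It assumes a 5-point finite difference stencil for the Poisson problem on a uniform two-dimensional grid stored as a plain list of lists; the function names `gauss_seidel_sweep` and `residual_norm` are hypothetical and chosen only for illustration.

```python
def gauss_seidel_sweep(u, f, h):
    """Perform one in-place lexicographic Gauss-Seidel sweep.

    Assumed model problem: -Laplace(u) = f, 5-point stencil, mesh width h.
    u and f are lists of lists of equal shape; the first and last rows and
    columns of u hold fixed boundary values and are never modified.
    """
    n_rows, n_cols = len(u), len(u[0])
    for i in range(1, n_rows - 1):
        for j in range(1, n_cols - 1):
            # u[i-1][j] and u[i][j-1] were already updated in this sweep;
            # u[i+1][j] and u[i][j+1] still hold values from the last sweep.
            u[i][j] = 0.25 * (
                u[i - 1][j] + u[i + 1][j] + u[i][j - 1] + u[i][j + 1]
                + h * h * f[i][j]
            )
    return u


def residual_norm(u, f, h):
    """Return the maximum norm of the residual f - A*u over interior points."""
    r_max = 0.0
    for i in range(1, len(u) - 1):
        for j in range(1, len(u[0]) - 1):
            a_u = (
                4.0 * u[i][j]
                - u[i - 1][j] - u[i + 1][j] - u[i][j - 1] - u[i][j + 1]
            ) / (h * h)
            r_max = max(r_max, abs(f[i][j] - a_u))
    return r_max
```

Each interior update reads five values of `u` and one of `f` and writes one value of `u`, so the order in which the grid is traversed and the layout of `u` in memory largely determine how many of those reads are served from cache. That dependence of memory traffic on data layout is the kind of effect the two metrics are meant to capture.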

Notes

^†† As we will see, it is important to consider not only what algorithm is being analyzed, but also how that algorithm is implemented, since this affects the way in which data are accessed.

^‡‡ The current HHG implementation is designed to accommodate three-dimensional grids. The two-dimensional example is included because it is easier to visualize.

^¶¶ The example given here is only one possibility for counting this metric. If we were to make different assumptions about the cache behavior of the algorithm, we would get a different count.

^§§ http://www.sgi.com/products/software/histx

^∥∥ Recall that all three algorithms have the same spatial locality by design.
