Hierarchical hybrid grids: achieving TERAFLOP performance on large scale finite element simulations

Pages 311-329 | Received 08 Nov 2006, Accepted 20 Feb 2007, Published online: 06 Apr 2009

