References
- Barrett, R., Berry, M., Chan, T.F., Demmel, J., Donato, J., Dongarra, J., Eijkhout, V., Pozo, R. and Romine, C. 1994. Templates for the Solution of Linear Systems: Building Blocks for Iterative Methods, 2nd ed., Philadelphia, PA: SIAM.
- Bergen, B. and Hülsemann, F. 2004. Hierarchical hybrid grids: data structures and core algorithms for multigrid. Numerical Linear Algebra with Applications, 11(2–3): 279–291. March–April.
- Bergen, B., Hülsemann, F. and Rüde, U. 2005. “Is 1.7 × 10^10 unknowns the largest finite element system that can be solved today?”. In Proceedings of the ACM/IEEE Supercomputing'05 Conference (SC05), Seattle, Washington, November.
- Callahan, D., Cocke, J. and Kennedy, K. 1988. Estimating interlock and improving balance for pipelined architectures. Journal of Parallel and Distributed Computing, 5: 334–358.
- Davis, K., Hoisie, A., Johnson, G., Kerbyson, D.J., Lang, M., Pakin, S. and Petrini, F. 2004. “A performance and scalability analysis of the BlueGene/L architecture”. In ACM/IEEE SC2004, Pittsburgh, PA, November 10–16. Available from http://hpc.pnl.gov/people/fabrizio/papers/sc04.pdf
- Douglas, C.C., Hu, J., Kowarschik, M., Rüde, U. and Weiß, C. 2000. Cache optimization for structured and unstructured grid multigrid. Electronic Transactions on Numerical Analysis (ETNA), 10: 21–40. February.
- Hülsemann, F., Kowarschik, M., Mohr, M. and Rüde, U. 2005. “Parallel geometric multigrid”. In Numerical Solution of Partial Differential Equations on Parallel Computers, Edited by: Bruaset, A.M. and Tveito, A. Berlin: Springer-Verlag. Volume 51 of Lecture Notes in Computational Science and Engineering, chapter 5, pages 165–208.
- Kowarschik, M. and Weiß, C. 2003. “An overview of cache optimization techniques and cache-aware numerical algorithms”. In Algorithms for Memory Hierarchies: Advanced Lectures, Edited by: Meyer, U., Sanders, P. and Sibeyn, J. Berlin: Springer-Verlag. Volume 2625 of Lecture Notes in Computer Science, pages 213–232.
- Tamaki, Y., Sukegawa, N., Ito, M., Tanaka, Y., Fukagawa, M., Sumimoto, T. and Ioki, N. 1999. “Node architecture and performance evaluation of the Hitachi super technical server SR8000”. In 12th International Conference on Parallel and Distributed Computing Systems, 487–493.
- Trottenberg, U., Oosterlee, C. and Schüller, A. 2001. Multigrid, New York: Academic Press.
- Weiß, C., Karl, W., Kowarschik, M. and Rüde, U. 1999. “Memory characteristics of iterative methods”. In Proceedings of the ACM/IEEE Supercomputing'99 Conference (SC99), Portland, Oregon, November.
- Wellein, G., Hager, G., Basermann, A. and Fehske, H. 2003. “Fast sparse matrix-vector multiplication for tflops computers”. In High Performance Computing for Computational Science: VECPAR2002, LNCS 2565, Edited by: Palma, J.M.L.M. 287–301. Berlin, Heidelberg: Springer-Verlag.