References
- S. Balay, W. Gropp, L.C. McInnes, and B. Smith, PETSc 2.0 users manual, Tech. Rep. ANL-95/11, Argonne National Laboratory, 1996
- Barrachina, S., Castillo, M., Igual, F.D., Mayo, R. and Quintana-Ortí, E.S. 2008. “Evaluation and tuning of the level 3 CUBLAS for graphics processors”. In 9th IEEE International Workshop on Parallel and Distributed Scientific and Engineering Computing (PDSEC'08).
- S. Barrachina, M. Castillo, F.D. Igual, R. Mayo, and E.S. Quintana-Ortí, FLAG@lab: an M-script API for linear algebra operations on graphics processors. FLAME Working Note #30, Tech. Rep. ICC 01-02-2008, Depto. de Ingenieria y Ciencia de Computadores, Universidad Jaume I, Spain, 2008
- Barrachina, S., Castillo, M., Igual, F.D., Mayo, R. and Quintana-Ortí, E.S. 2008. “Solving dense linear systems on graphics processors”. In Euro-Par'08, Lecture Notes in Computer Science, 5168: 739–748. Berlin: Springer.
- Bientinesi, P., Gunnels, J.A., Myers, M.E., Quintana-Ortí, E.S. and van de Geijn, R.A. 2005. The science of deriving dense linear algebra algorithms. ACM Trans. Math. Softw., 31(1): 1–26.
- Bientinesi, P., Quintana-Ortí, E.S. and van de Geijn, R.A. 2005. Representing linear algebra algorithms in code: The FLAME application programming interfaces. ACM Trans. Math. Softw., 31(1): 27–59.
- Cwik, T., van de Geijn, R. and Patterson, J. 1994. The application of parallel computation to integral equation models of electromagnetic scattering. J. Opt. Soc. Am. A, 11(4): 1538–1545.
- E.F. D'Azevedo and J.J. Dongarra, The design and implementation of the parallel out-of-core ScaLAPACK LU, QR, and Cholesky factorization routines, LAPACK Working Note 118 CS-97-247, University of Tennessee, Knoxville, 1997
- Demkowicz, L., Karafiat, A. and Oden, J.T. 1992. Solution of elastic scattering problems in linear acoustics using h-p boundary element method. Comp. Meths. Appl. Mech. Engrg., 101: 251–282.
- C. Edwards, P. Geng, A. Patra, and R. van de Geijn, Parallel matrix distributions: have we been doing it all wrong?, Tech. Rep. TR-95-40, Department of Computer Sciences, The University of Texas at Austin, Austin, 1995
- Gunter, B.C., Reiley, W.C. and van de Geijn, R.A. 2001. “Parallel out-of-core Cholesky and QR factorizations with POOCLAPACK”. In Proceedings of the 15th International Parallel and Distributed Processing Symposium (IPDPS), Washington, DC: IEEE Computer Society.
- Joffrain, T., Quintana-Ortí, E.S. and van de Geijn, R.A. 2005. “Rapid development of high-performance out-of-core solvers”. In Proceedings of PARA 2004, number 3732 in LNCS, 413–422. Berlin: Springer-Verlag.
- Quintana-Ortí, E.S. and van de Geijn, R. 2009. Updating an LU factorization with pivoting. ACM Trans. Math. Softw., to appear.
- W.C. Reiley and R.A. van de Geijn, POOCLAPACK: Parallel Out-of-Core Linear Algebra Package, Tech. Rep. CS-TR-99-33, Department of Computer Sciences, The University of Texas at Austin, Austin, 1999
- Toledo, S. and Gustavson, F.G. 1996. “The design and implementation of SOLAR, a portable library for scalable out-of-core linear algebra computation”. In Proceedings of IOPADS'96.
- van de Geijn, R.A. 1997. Using PLAPACK: Parallel Linear Algebra Package, Boston, MA: The MIT Press.