Search in:

International Journal of Parallel, Emergent and Distributed Systems Volume 24, 2009 - Issue 3

Submit an article Journal homepage

352

Views

CrossRef citations to date

Altmetric

Original Articles

Concurrent number cruncher: a GPU implementation of a general sparse linear solver

Luc Buatois ENSG/CRPG, Gocad Research Group, Nancy University, Rue du Doyen Roubault, BP40 54501, Vandoeuvre-les-Nancy, France; INRIA Lorraine, ALICE, BP 239 – 54506, Vandoeuvre-les-Nancy Cedex, FranceCorrespondence[email protected]

Guillaume Caumon ENSG/CRPG, Gocad Research Group, Nancy University, Rue du Doyen Roubault, BP40 54501, Vandoeuvre-les-Nancy, France

Bruno Lévy INRIA Lorraine, ALICE, BP 239 – 54506, Vandoeuvre-les-Nancy Cedex, France

Pages 205-223 | Received 10 Sep 2007, Accepted 27 Jun 2008, Published online: 02 Jun 2009

Cite this article
https://doi.org/10.1080/17445760802337010

Full Article
Figures & data
References
Citations
Metrics
Reprints & Permissions

References

AMD, AMD Core Math Library (ACML), http://www.developer.amd.com/acml.jsp
Google Scholar
Barrett , R. 1994 . Templates for the Solution of Linear Systems: Building Blocks For Iterative Methods , 2nd ed. , Philadelphia : SIAM .
Google Scholar
Bolz , J. 2003 . Sparse matrix solvers on the GPU: conjugate gradients and multigrid . ACM Trans. Graph. (TOG) , 22 : 917 – 924 .
Web of Science ®Google Scholar
M. Botsch, D. Bommes, and L. Kobbelt, Efficient linear system solvers for mesh processing, IMA Conference on Mathematics of Surfaces XI, Lecture Notes in Computer Science (LNCS) 3604 (2005), pp. 62–83
Google Scholar
L. Buatois, G. Caumon, and B. Lévy, Concurrent Number Cruncher: An efficient sparse linear solver on the GPU, High Performance Computation Conference (HPCC'07) Lecture Notes in Computer Science (LNCS), 2007
Google Scholar
I. Buck, K. Fatahalian, and P. Hanrahan, GPUBench: Evaluating GPU performance for numerical and scientific applications, in Proceedings of the ACM Workshop on General-purpose Computing on Graphics Processors, 2004
Google Scholar
Buck , I. 2004 . Brook for GPUs: stream computing on graphics hardware . ACM Trans. Graph. (TOG) , 23 : 777 – 786 .
Web of Science ®Google Scholar
E. Cuthill and J. McKee, Reducing the bandwidth of sparse symmetric matrices, in Proceedings of the 24th National Conference (1969), pp. 157–172
Google Scholar
K. Fatahalian, J. Sugerman, and P. Hanrahan, Understanding the efficiency of GPU algorithms for matrix–matrix multiplication, HWWS '04 In Proceedings of the ACM SIGGRAPH/EUROGRAPHICS Conference on Graphics Hardware (2004), pp. 133–137
Google Scholar
Fernando , R. and Kilgard , M. 2003 . The Cg Tutorial: The Definitive Guide to Programmable Real-time Graphics , Boston : Addison-Wesley Longman Publishing Co., Inc .
Google Scholar
Floater , M.S. and Hormann , K. 2005 . “ Surface parameterization: a tutorial and survey ” . In Multiresolution in Geometric Modelling , Edited by: Dodgson , N.A. , Floater , M.S. and Sabin , M.A. 157 – 186 . Heidelberg : Springer-Verlag .
Google Scholar
Galoppo , N. 2005 . LU-GPU: Efficient algorithms for solving dense linear systems on graphics hardware . : 3 In Proceedings of the 2005 ACM/IEEE Conference on Supercomputing (SC)
Google Scholar
N. Gibbs, W. Poole, and P. Stockmeyer, An algorithm for reducing the bandwidth and profile of a sparse matrix, Technical Report, College of William and Mary Williamsbourg, VA, Department of Mathematics, 1974
Google Scholar
D. Göddeke, R. Strzodka, and S. Turek, Accelerating double precision FEM simulations with GPUs, Proceedings of the ASIM 2005 – 18th Symposium on Simulation Technique, 2005
Google Scholar
Turek , S. 2007 . Performance and accuracy of hardware-oriented native-, emulated- and mixed-precision solvers in FEM simulations . Int. J. Parallel, Emergent Distrib. Syst. , 22 : 221 – 256 .
Google Scholar
GPGPU, General-Purpose computation on GPUs, www.gpgpu.org (http://www.gpgpu.org)
Google Scholar
Hestenes , M. and Stiefel , E. 1952 . Methods of conjugate gradients for solving linear systems . J. Res. Nat. Bur. Stand. , 49 : 409 – 436 .
Google Scholar
INTEL Math Kernel Library (MKL), www.intel.com/software/products/mkl (http://www.intel.com/software/products/mkl)
Google Scholar
INTEL, Math Kernel Library (MKL) – LINPACK SMP benchmark package, www.intel.com/cd/software/products/asmo-na/eng/266857.htm (http://www.intel.com/cd/software/products/asmo-na/eng/266857.htm)
Google Scholar
J. Jung and D. O'Leary, Cholesky decomposition and linear programming on a GPU, Workshop on Edge Computing Using New Commodity Architectures (EDGE), 2006
Google Scholar
Krüger , J. and Westermann , R. 2003 . Linear algebra operators for GPU implementation of numerical algorithms . ACM Trans. Graph. (TOG) , 22 : 908 – 916 .
Web of Science ®Google Scholar
B. Lévy, Numerical methods for digital geometry processing, Israel Korea Bi-National Conference, 2005
Google Scholar
LévyB., et al., Least squares conformal maps for automatic texture atlas generation, ACM SIGGRAPH'02, San-Antonio, Texas, USA, 2002
Google Scholar
Mallet , J. 1992 . Discrete smooth interpolation (DSI) . Comput. Aided Des. , 24 : 263 – 270 .
Web of Science ®Google Scholar
McCool , M. and DuToit , S. 2004 . Metaprogramming GPUs with Sh , Wellesley : AK Peters .
Google Scholar
Microsoft, Direct3d reference, http://www.msdn.microsoft.com
Google Scholar
A. Nealen et al., Laplacian mesh optimization, in Proc. ACM GRAPHITE 2006, pp. 381–389
Google Scholar
NVIDIA CUDA (Compute Unified Device Architecture), (2006), http://www.developer.nvidia.com/object/cuda.html
Google Scholar
Peercy , M. , Segal , M. and Gerstmann , D. 2006 . A performance-oriented data-parallel virtual machine for GPUs . ACM SIGGRAPH'06 ,
Google Scholar
Rost , R. 2004 . OpenGL Shading Language , Reading : Addison-Wesley Professional .
Google Scholar
M. Segal and K. Akeley, The OpenGL graphics system: A specification, version 2.0 (2004), www.opengl.org (http://www.opengl.org)
Google Scholar
J. Shewchuk, An introduction to the conjugate gradient method without the agonizing pain, Technical Report, CMU School of Computer Science, (1994), ftp://www.warp.cs.cmu.edu/quake-papers/painless-conjugate-gradient.ps (ftp://ftp://www.warp.cs.cmu.edu/quake-papers/painless-conjugate-gradient.ps)
Google Scholar
O. Sorkine and D. Cohen-Or, Least-squares meshes, Proc. Shape Model. Int. (2004), pp. 191–199
Google Scholar
R. Strzodka and D. Göddeke, Pipelined mixed precision algorithms on FPGAs for fast and accurate PDE solvers from low precision components, Proceedings of the 14th Annual IEEE Symposium on Field-Programmable Custom Computing Machines (FCCM'06) 2006, pp. 259–270
Google Scholar

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Order Reprints Request Corporate Permissions

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

Request Academic Permissions

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.

Share icon
Back to Top

Related research

People also read lists articles that other readers of this article have read.

Recommended articles lists articles that we recommend and is powered by our AI driven recommendation engine.

Cited by lists all citing articles based on Crossref citations.
Articles with the Crossref icon will open in a new tab.

People also read
Recommended articles
Cited by

To cite this article:

Reference style: APA Chicago Harvard

Citation copied to clipboard

Reference styles above use APA (6th edition), Chicago (16th edition) & Harvard (10th edition)

Download citation

Download a citation file in RIS format that can be imported by citation management software including EndNote, ProCite, RefWorks and Reference Manager.

Choose format: RIS BibTex RefWorks Direct Export

Choose options: Citation Citation & abstract Citation & references

Concurrent number cruncher: a GPU implementation of a general sparse linear solver

References

Information for

Open access

Opportunities

Help and information

Your download is now in progress and you may close this window

Login or register to access this feature

Concurrent number cruncher: a GPU implementation of a general sparse linear solver

References

Reprints and Corporate Permissions

Academic Permissions

Related research

To cite this article:

Download citation

Information for

Open access

Opportunities

Help and information

Keep up to date