428
Views
8
CrossRef citations to date
0
Altmetric
Original Articles

Acceleration of iterative Navier-Stokes solvers on graphics processing units

, , , &
Pages 201-209 | Received 09 Jan 2013, Accepted 01 May 2013, Published online: 12 Jul 2013

References

  • Barrett , R. , Berry , M. , Chan , T. F. , Demmel , J. , Donato , J. , Dongarra , J. , Eijkhout , V. , Pozo , R. , Romine , C. and van der Vorst , H. 1994 . Templates for the Solution of Linear Systems: Building Blocks for Iterative Methods. , 2nd ed , Philadelphia , PA : SIAM .
  • Bell , N. , Dalton , S. and Olson , L. 2011 . Exposing Fine-Grained Parallelism in Algebraic Multigrid Methods , Santa Clara , CA : NVIDIA Corporation . NVIDIA Technical Report NVR-2011-002
  • Bell , N. and Garland , M. 2009 . “ Implementing Sparse Matrix-Vector Multiplication on Throughput-Oriented Processors ” . In SC’09: Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis , 1 – 11 . Portland , OR : ACM .
  • Cohen , J. M. and Molemaker , M. J. 2010 . “ A Fast Double Precision CFD Code Using CUDA ” . In Parallel Computational Fluid Dynamics. Recent Advances and Future Directions , 414 – 429 . Lancaster , PA : DEStech .
  • Corrigan , A. , Camelli , F. , Löhner , R. and Mut , F. 2012 . Semi-Automatic Porting of a Large-Scale Fortran CFD Code to GPUs . International Journal for Numerical Methods in Fluids , 69 ( 2 ) : 314 – 331 . doi: 10.1002/fld.2560
  • Corrigan , A. and Löhner , R. 2011 . Semi-Automatic Porting of a Large-Scale CFD Code to Multi-Graphics Processing Unit Clusters . International Journal for Numerical Methods in Fluids , 69 ( 11 ) : 1 – 11 .
  • Elsen , E. , LeGresley , P. and Darve , E. 2008 . Large Calculation of the Flow Over a Hypersonic Vehicle Using a GPU . Journal of Computational Physics , 227 ( 24 ) : 10148 – 10161 . doi: 10.1016/j.jcp.2008.08.023
  • Farber , R. 2011 . CUDA Application Design and Development, 1 , Waltham , MA : Morgan Kaufmann .
  • Ferziger , J. H. and Perić , M. 2002 . Computational Methods for Fluid Dynamics , 3rd. ed , Berlin : Springer .
  • Garland , M. and Kirk , D. B. 2010 . Understanding Throughput-Oriented Architectures . Communications of the ACM , 53 ( 11 ) : 58 – 66 . doi: 10.1145/1839676.1839694
  • Geveler , M. , Ribbrock , D. , Göddeke , D. , Zajac , P. and Turek , S. 2012 . “ Towards a Complete FEM-Based Simulation Toolkit on GPUs: Unstructured Grid Finite Element Geometric Multigrid Solvers With Strong Smoothers Based on Sparse Approximate Inverses ” . In Computers & Fluids http://dx.doi.org/10.1016/j.compfluid.2012.01.025
  • Helfenstein , R. and Koko , J. 2012 . Parallel Preconditioned Conjugate Gradient Algorithm on GPU . Journal of Computational and Applied Mathematics , 236 ( 15 ) : 3584 – 3590 . doi: 10.1016/j.cam.2011.04.025
  • Jasak , H. 1996 . “ Error Analysis and Estimation for the Finite Volume Method with Applications to Fluid Flows ” . In PhD diss. , London : Imperial College .
  • Knoll , D. and Keyes , D. 2004 . Jacobian-Free Newton-Krylov Methods: A Survey of Approaches and Applications . Journal of Computational Physics , 193 ( 2 ) : 357 – 397 . doi: 10.1016/j.jcp.2003.08.010
  • Lee , V. W. , Kim , C. , Chhugani , J. , Deisher , M. , Kim , D. , Nguyen , A. D. Satish , N. 2010 . Debunking the 100X GPU vs. CPU Myth: An Evaluation of Throughput Computing on CPU and GPU . SIGARCH Computer Architecture News , 38 ( 3 ) : 451 – 460 . doi: 10.1145/1816038.1816021
  • Lefebvre , M. , Guillen , P. , Le , M. and Basdevant , C. 2012 . Optimizing 2D and 3D Structured Euler CFD Solvers on Graphical Processing Units . Computers & Fluids , 70 : 136 – 147 . doi: 10.1016/j.compfluid.2012.09.013
  • Malecha , Z. , Mirosław , Ł. , Tomczak , T. , Koza , Z. , Matyka , M. , Tarnawski , W. and Szczerba , D. 2011 . GPU-Based Simulation of 3D Blood Flow in Abdominal Aorta Using OpenFOAM . Archives of Mechanics , 63 ( 2 ) : 137 – 161 .
  • Naumov , M. 2011 . “ Incomplete-LU and Cholesky Preconditioned Iterative Methods Using CUSPARSE and CUBLAS ” . In Technical report , Santa Clara , CA : Nvidia .
  • Phillips , E. H. , Davis , R. L. and Owens , J. D. 2010 . “ Unsteady Turbulent Simulations on a Cluster of Graphics Processors ” . In Paper presented at the proceedings of the 40th AIAA Fluid Dynamics Conference, AIAA 2010-5036 Chicago , IL June 28–July 1
  • Phillips , E. , Zhang , Y. , Davis , R. L. and Owens , J. D. 2009 . “ Rapid Aerodynamic Performance Prediction on a Cluster of Graphics Processing Units ” . In Paper presented at the 47th AIAA Aerospace Sciences Meeting, Paper No. AIAA 2009–565 Orlando , FL January 5–8
  • Rusche , H. 2002 . “ Computational Fluid Dynamics of Dispersed Two-Phase Flows at High Phase Fractions ” . In PhD diss. , London : Imperial College .
  • Salvadore , F. , Bernardini , M. and Botti , M. 2013 . GPU Accelerated Flow Solver for Direct Numerical Simulation of Turbulent Flows . Journal of Computational Physics , 235 : 129 – 142 . doi: 10.1016/j.jcp.2012.10.012
  • Shankar , P. N. and Deshpande , M. D. 2000 . Fluid Mechanics in the Driven Cavity . Annual Review of Fluid Mechanics , 32 ( 1 ) : 93 – 136 . doi: 10.1146/annurev.fluid.32.1.93
  • Tölke , J. and Krafczyk , M. 2008 . TeraFLOP Computing on a Desktop PC with GPUs for 3D CFD . International Journal of Computational Fluid Dynamics , 22 ( 7 ) : 443 – 456 . doi: 10.1080/10618560802238275
  • Waltz , J. 2013 . Performance of a Three-Dimensional Unstructured Mesh Compressible Flow Solver on NVIDIA Fermi-Class Graphics Processing Unit Hardware . International Journal for Numerical Methods in Fluids , 72 ( 2 ) : 259 – 268 . doi: 10.1002/fld.3744
  • Weller , H. G. , Tabor , G. , Jasak , H. and Fureby , C. 1998 . A Tensorial Approach to Computational Continuum Mechanics Using Object-Oriented Techniques . Computers in Physics , 12 ( 6 ) : 620 – 631 . doi: 10.1063/1.168744
  • Womersley , J. R. 1955 . Method for the Calculation of Velocity, Rate of Flow and Viscous Drag in Arteries When the Pressure Gradient is Known . Journal of Physiology , 127 ( 3 ) : 553 – 563 .
  • Zaspel , P. and Griebel , M. 2012 . “ Solving Incompressible Two-Phase Flows on Multi-GPU Clusters ” . In Computers & Fluids http://dx.doi.org/10.1016/j.compfluid.2012.01.021

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.