195
Views
20
CrossRef citations to date
0
Altmetric
Section A

Towards a parallel component in a GPU–CUDA environment: a case study with the L-BFGS Harwell routine

, , , &
Pages 59-76 | Received 26 Mar 2013, Accepted 24 Feb 2014, Published online: 22 May 2014

References

  • L. Antonelli, L. Carracciuolo, M. Ceccarelli, L. D'Amore, A. Murli, Total variation regularization for edge preserving 3D SPECT imaging in high performance computing environments, in Computational Science – ICCS 2002, International Conference Amsterdam, The Netherlands, April 21–24, 2002 Proceedings, Part II, Lecture Notes in Computer Science, Vol. 2330, Springer, Berlin, Heidelberg, 2002, pp. 171–180.
  • ANSI/IEEE 754-1985, American National Standard – IEEE Standard for Binary Floating-Point Arithmetic, American National Standards Institute, Inc., New York, 1985.
  • V. Boccia, L. D'Amore, M.R. Guarracino, and G. Laccetti, A grid enabled PSE for medical imaging: Experiences on MedIGrid, Proceedings – IEEE Symposium on Computer-Based Medical Systems, 2005, pp. 529–536.
  • L. Carracciuolo, L. D'Amore, and A. Murli, Towards a parallel component for imaging in PETSc programming environment: A case study in 3-D echocardiography. Parallel Comput. 32(1) (2006), pp. 67–83. doi: 10.1016/j.parco.2005.09.001
  • P. Caruso, G. Laccetti, and M. Lapegna, A performance contract system in a grid enabling, component based programming environment, Lecture Notes in Comput. Sci. 3470 (2005), pp. 9820–9992.
  • S. Cook, CUDA Programming: A Developer's Guide to Parallel Computing with GPUs, Morgan Kaufmann Publishers, Waltham, MA, 2012.
  • L. D'Amore, R. Arcucci, L. Marcellino, and A. Murli, HPC computation issues of the incremental 3D variational data assimilation scheme in OceanVar software, J. Numer. Anal. Ind. Appl. Math. 7(3–4) (2012), pp. 91–105.
  • L. D'Amore, V. Mele, L. Marcellino, and D. Romano, Deconvolution of 3D fluorescence microscopy images using graphics processing units, in Parallel Processing and Applied Mathematics, 9th International Conference, PPAM 2011, Torun, Poland, September 11–14, 2011. Revised Selected Papers, Part I, Lecture Notes in Computer Science, Vol. 7203, Springer, Berlin, Heidelberg, 2012, pp. 690–699.
  • L. D'Amore, D. Casaburi, A. Galletti, L. Marcellino, and A. Murli, Integration of emerging computer technologies for an efficient image sequences analysis, Integr. Comput.-Aided Eng. 18(4) (2011), pp. 365–378.
  • J. Delgado, J. Gazolla, E. Clua, and S.M. Sadjadi, A case study on porting scientific applications to GPU/CUDA, J. Comput. Interdiscip. Sci. 2 (2011), pp. 3–11.
  • J.E. Dennis and J.J. More, Quasi-Newton methods, motivation and theory, SIAM Rev. 19 (1977), pp. 46–89. doi: 10.1137/1019005
  • G. Giunta, R. Montella, G. Laccetti, F. Isaila, and F.J. Garcia Blas, A GPU accelerated high performance cloud computing infrastructure for grid computing based virtual environmental laboratory, in Advances in Grid Computing, Dr. Zoran Constantinescu, ed., In Tech Europe, Janeza Trdine, Rijeka, Croatia, 2011, pp. 121–146.
  • F. Gregoretti, G. Laccetti, A. Murli, G. Oliva, and U. Scafuri, MGF: A grid-enabled MPI library, Future Gener. Comput. Syst. 24(2) (2008), pp. 158–165. doi: 10.1016/j.future.2007.03.009
  • Harwell Subroutine Library, Release 10 (1990). Advanced Computing Department, AEA Industrial Technology, Harwell Laboratory, Oxfordshire, 1990.
  • D.B. Kirk and W.W. Hwu, Programming Massively Parallel Processors, Morgan Kaufmann, Burlington, MA, 2010.
  • D.C. Liu and J. Nocedal, On the limited memory BFGS method for large scale optimization, Math. Program. 45 (1989), pp. 503–528. doi: 10.1007/BF01589116
  • L. Maddalena, A. Petrosino, and G. Laccetti, A fusion-based approach to digital movie restoration, Pattern Recognit. 42(7) (2009), pp. 1485–1495. doi: 10.1016/j.patcog.2008.10.026
  • R. Montella, G. Coviello, G. Giunta, G. Laccetti, F. Isaila, and F.J. Garcia Blas, A general-purpose virtualization service for HPC on cloud computing: An application to GPUs, in Parallel Processing and Applied Mathematics, 9th International Conference, PPAM 2011, Torun, Poland, September 11–14, 2011. Revised Selected Papers, Part I, Lecture Notes in Computer Science, Vol. 7203, Springer Berlin, Heidelberg, 2012, pp. 740–749.
  • A. Murli, Lezioni di calcolo parallelo, Liguori Editore Srl, Naples, Italy, 2006.
  • A. Murli, V. Boccia, L. Carracciuolo, L. D'Amore, G. Laccetti, and M. Lapegna, Monitoring and migration of a PETSc-based parallel application for medical imaging in a grid computing PSE, IFIP Int. Fed. Inf. Process. 239 (2007), pp. 421–432.
  • NVIDIA, NVIDIA CUDA Compute Unified Device Architecture – Programming Guide Version 1.1, NVIDIA Corporation, Santa Clara, CA, 2007.
  • NVIDIA, CUBLAS, CUFFT, CUSPARSE, CURAND Libraries, NVIDIA Corporation, 2009. Available at http://www.nvidia.com/.
  • NVIDIA, CUDA Occupancy Calculator, 2008. Available at http://developer.down-load.nvidia.com/compute/cuda/CUDA_Occupancy_calculator.xls.
  • NVIDIA, CUDA Visual Profiler, 2009. Available at http://developer.download.nvidia.com/compute/cuda/2_2/toolkit/docs/cudaprof_1.2_readme.html.
  • Scalable Parallel Programming with CUDA, in ACM Queue, Volume 6, 2008. Available at http://mags.acm.org/queue/20080304/?u1=texterity.
  • C. Zhu, R.H. Byrd, P. Lu, and J. Nocedal, Algorithm 778: L-BFGS-B: Fortran subroutines for large-scale bound constrained optimization, ACM Trans. Math. Softw. 23 (1997), pp. 550–560. doi: 10.1145/279232.279236

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.