72
Views
0
CrossRef citations to date
0
Altmetric
Articles

Accelerating Dense Matrix Computations with Effective Workload Partitioning on Heterogeneous Architectures

ORCID Icon, &
Pages 613-626 | Published online: 11 Jun 2018
 

ABSTRACT

The emergence of High Performance Computing (HPC) has enabled the researchers to perform large scientific computations efficiently and quickly. But as the heterogeneity of the processing units of the HPC systems increased, the utilization of all the resources became an issue. Fully harnessing the power of these systems requires efficient division of work across all the processing units. This solves the issue of under-utilization of resources and improves performance of the application. In this research work, we present a dynamic approach to workload partitioning that obtains the optimal workload partition and schedules them to processing units for parallel processing. Our workload partitioning technique is able to respond automatically to performance variation to provide good performance, it requires very negligible training and is implemented as a library. Performance results show that our dynamic approach is better than static and linear approach. By running the Dense Matrix-Matrix Multiplication kernel library by our proposed method on both CPU and Graphics Processing Unit (GPU) in parallel, we obtain average speedups from 2.69× to 3.14× over CPU and 1.12× to 1.38× over GPU. We used our method on multi-GPUs for which we obtain average speedups of 5.31× over CPU and 1.91× over single GPU.

ACKNOWLEDGEMENTS

The authors would like to thank the CUDA Center of Excellence at IIT Bombay, India for providing access to their HPC system and the anonymous reviewers for their helpful comments.

DISCLOSURE STATEMENT

No potential conflict of interest was reported by the authors.

Additional information

Notes on contributors

Mohsin Khan

Mohsin Khan received B.E. and M.Tech Degrees from Visvesvaraya Technological University, India in 2011 and 2013, respectively. He is currently pursuing Ph.D. from Visvesvaraya Technological University, with the research area as High Performance Computing. He has published six papers in international journals and conferences. His research interests include Parallel Computing, Heterogeneous Computing, Source to Source code Auto Parallelization.

Waseem Ahmed

Waseem Ahmed is a professor at the HKBK College of Engineering, Visvesvaraya Technological University, India. He received his B.E. degree (1995) from R.V College of Engineering, Bangalore University, India, MSc degree (1999) from University of Houston, United States of America, and Ph.D. degree (2008) from Curtin University of Technology, Australia. He has authored/co-authored over 28 papers in international journals, transactions and conferences. He has also authored a book in the field of Embedded Systems. He is a reviewer of seven international peer reviewed journals, transactions and conferences. He has been the active organizing member of various FDPs, workshops, and conferences. His research interests include Embedded Systems, High Performance Computing, and Data Mining. Email: [email protected]

Touseef M. Golandaz

Touseef M. Golandaz has completed B.E. (2014) from SECAB College of Engineering, India and M.Tech (2016) from HKBK College of Engineering, India. His area of interest includes ethical hacking, compiler design, GPGPU Computing, and Artificial Intelligence. His goal is to become a professor and an ethical hacker for which he is looking for options for Ph.D. research in the related fields. Email: [email protected]

Log in via your institution

Log in to Taylor & Francis Online

PDF download + Online access

  • 48 hours access to article PDF & online version
  • Article PDF can be downloaded
  • Article PDF can be printed
USD 61.00 Add to cart

Issue Purchase

  • 30 days online access to complete issue
  • Article PDFs can be downloaded
  • Article PDFs can be printed
USD 100.00 Add to cart

* Local tax will be added as applicable

Related Research

People also read lists articles that other readers of this article have read.

Recommended articles lists articles that we recommend and is powered by our AI driven recommendation engine.

Cited by lists all citing articles based on Crossref citations.
Articles with the Crossref icon will open in a new tab.