745
Views
35
CrossRef citations to date
0
Altmetric
Original Articles

Programming models and systems for Big Data analysis

, ORCID Icon &
Pages 632-652 | Received 06 Oct 2017, Accepted 26 Dec 2017, Published online: 05 Jan 2018

References

  • Marx V . Biology: the big challenges of big data. Nature. 2013;498:255–260.
  • Belcastro L , Marozzo F , Talia D , et al. Using scalable data mining for predicting flight delays. ACM Trans Intell Syst Technol. 2016;8:5:1–5:20.
  • Murdoch TB , Detsky AS . The inevitable application of big data to health care. JAMA. 2013;309:1351–1352.
  • John Walker S . Big data: a revolution that will transform how we live, work, and think. Inter J Ads. 2014;33(1):181–183. DOI:10.2501/IJA-33-1-181-183
  • Belcastro L , Marozzo F , Talia D , et al. Big data analysis on clouds. In: Zomaya A , Sakr S , editors. Handbook of big data technologies. Springer; 2017. p. 101–142.
  • Talia D , Trunfio P , Marozzo F . Data analysis in the cloud. Elsevier; 2015. ISBN 978-0-12-802881-0.
  • Skillicorn DB , Talia D . Models and languages for parallel computation. ACM Comput Surv. 1998;30:123–169.
  • Wadkar S , Siddalingaiah M , Venner J . Pro Apache Hadoop. Apress; 2014.
  • Flynn MJ . Some computer organizations and their effectiveness. IEEE Trans Comput. 1972;100:948–960.
  • Bux M , Leser U . Parallelization in scientific workflow management systems. CoRR. 2013;abs/1303.7195. Available from: http://arxiv.org/abs/1303.7195
  • Dean J , Ghemawat S . Mapreduce: simplified data processing on large clusters. Commun ACM. 2008;51:107–113.
  • Marozzo F , Talia D , Trunfio P . P2p-mapreduce: parallel data processing in dynamic cloud environments. J Comput Syst Sci. 2012;78:1382–1402.
  • Xin RS , Rosen J , Zaharia M , et al. Shark: SQL and rich analytics at scale. In: Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data, SIGMOD ’13, New York, NY, USA; New York, NY, USA: ACM; 2013. p. 13–24.
  • LaSalle D , Karypis G . Mpi for big data: New tricks for an old dog. Parallel Comput. 2014;40:754–767.
  • Reyes-Ortiz JL , Oneto L , Anguita D . Big data analytics in the cloud: Spark on hadoop vs mpi/openmp on beowulf. Proc Comput Sci. 2015;53:121–130.
  • Liang F , Lu X . Accelerating iterative big data computing through mpi. J Comput Sci Technol. 2015;30:283–294.
  • Gropp W , Lusk E , Skjellum A . Using MPI: portable parallel programming with the message-passing interface. Vol. 1. MIT press; 1999.
  • Geist A , Gropp W , Huss-Lederman S , et al. MPI-2: extending the message-passing interface. In: Bougé L , Fraigniaud P , Mignotte A , et al. , editors. Euro-Par’96 Parallel Processing. Berlin: Springer; 1996. p. 128–135.
  • Valiant LG . A bridging model for parallel computation. Commun ACM. 1990;33:103–111.
  • Malewicz G , Austern MH , Bik AJ , et al. Pregel: a system for large-scale graph processing. In: Proceedings of the 2010 ACM SIGMOD International Conference on Management of data. Indianapolis (IN): ACM; 2010. p. 135–146.
  • Ching A , Edunov S , Kabiljo M , et al . One trillion edges: graph processing at facebook-scale. Proc VLDB Endowment. 2015;8:1804–1815.
  • Wang Z , Bao Y , Gu Y , et al . A BSP-based parallel iterative processing system with multiple partition strategies for big graphs. In: IEEE International Congress on in Big Data (BigData Congress). Santa Clara (CA): IEEE; 2013. p. 173–180.
  • Siddique K , Akhtar Z , Yoon EJ , et al . Apache hama: an emerging bulk synchronous parallel computing framework for big data applications. IEEE Access. 2016;4:8879–8887.
  • Talia D , Trunfio P . Service-oriented distributed knowledge discovery. Chapman and Hall/CRC; 2012.
  • Van Der Aalst WMP , Ter Hofstede AHM , Kiepuszewski B , et al . Workflow patterns. Distrib Parallel Databases. 2003;14:5–51. DOI:10.1023/A:1022883727209
  • Wilde M , Hategan M , Wozniak JM , et al . Swift: a language for distributed parallel scripting. Parallel Comput. 2011;37:633–652.
  • Wozniak JM , Wilde M , Foster IT . Language features for scalable distributed-memory dataflow computing. In: Fourth Workshop on Data-flow Execution Models for Extreme Scale Computing (DFM). Edmonton: IEEE; 2014. p. 50–53.
  • Giardine B , Riemer C , Hardison RC , et al . Galaxy: a platform for interactive large-scale genome analysis. Genome Res. 2005;15:1451–1455.
  • Lordan F , Tejedor E , Ejarque J , et al. Servicess: an interoperable programming framework for the cloud. J Grid Comput. 2014;12:67–91.
  • Tejedor E , Becerra Y , Alomar G , et al . Pycompss: parallel computational workflows in python. Int J High Perform Comput Appl. 2017;31:66–82.
  • Marozzo F , Talia D , Trunfio P . A workflow management system for scalable data mining on clouds. IEEE Trans Serv Comput. 2016. DOI:10.1109/TSC.2016.2589243
  • Marozzo F , Talia D , Trunfio P . Js4cloud: script-based workflow programming for scalable data analysis on cloud platforms. Concurrency Comput: Pract Exp. 2015;27:5214–5237.
  • Agapito G , Cannataro M , Guzzi PH , et al . Cloud4SNP: distributed analysis of SNP microarray data on the cloud. Proceedings of the ACM Conference on Bioinformatics, Computational Biology and Biomedical Informatics 2013 (ACM BCB 2013); September; Washington, DC, USA: ACM Press; 2013. p. 468–475. ISBN 978-1-4503-2434-2.
  • Altomare A , Cesario E , Comito C , et al. Trajectory pattern mining for urban computing in the cloud. Trans Parallel Distrib Syst (IEEE TPDS). 2017;28:586–599. ISSN:1045–9219.
  • Abramova V , Bernardino J , Furtado P . Which nosql database? A performance overview. Open J Databases. 2014;1:17–24.
  • Cattell R . Scalable sql and nosql data stores. ACM SIGMOD Record. 2011;39:12–27.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.