55
Views
5
CrossRef citations to date
0
Altmetric
Articles

Parameter tuning of big data platforms for performance optimization

&

References

  • Gantz, John, and David Reinsel , The digital universe in 2020: Big data, bigger digital shadows, and biggest growth in the far east, IDC iView: IDC Analyze the future 2007(2012),pp 1-16.
  • IBM Big Data & Analytics Hub, The Four V’s of Big Data.
  • Apache Hadoop, https://hadoop.apache.org/
  • Apache Spark, https://spark.apache.org/
  • Apache Storm, http://storm.apache.org/
  • Wang, Guolu, Jungang Xu, and Ben He, A novel method for tuning configuration parameters of spark based on machine learning IEEE 18th International Conference on High Performance Computing and Communications, pp. 586-593. IEEE, 2016.
  • Trotter, Michael, Guyue Liu, and Timothy Wood, Into the Storm: Descrying Optimal Configurations Using Genetic Algorithms and Bayesian Optimization, 2017 IEEE 2nd International Workshops on Foundations and Applications of Self* Systems (FAS* W), pp. 175-180.
  • Krushnaraj Kamtekar, Raj Jain, (1991), Performance Modeling of Big-Data, The Art of Computer Systems Performance Analysis: Techniquesfor Experimental Design, Measurement, Simulation, and Modeling, WileyInterscience, New York,ISBN:0471503363
  • H.V. Jagadish, Alexandros Labrinidis (2012), Challenges and Opportunities with Big Data, ACM,Vol.5,No.12,pp.2022-23.
  • Chen Xiang, Yi Liang, Guang-Rui Li, Cheng Chen, and Si-Yu Liu. Optimizing Performance of Hadoop with Parameter Tuning. In ITM Web of Conferences, vol. 12, p. 03040. EDP Sciences, 2017.
  • Hua, Xingcheng, Michael C. Huang, and Peng Liu., Hadoop Configuration Tuning with Ensemble Modeling and Metaheuristic Optimization. IEEE Access 6 (2018): 44161-44174. doi: 10.1109/ACCESS.2018.2857852
  • Khaleel, Ali, and Hamed Al-Raweshidy. Optimization of Computing and Networking Resources of a Hadoop Cluster Based on Software Defined Network. IEEE Access 6 (2018): 61351-61365 doi: 10.1109/ACCESS.2018.2876385
  • Balaji Palanisamy, Aameek Singh, Ling Liu, (2015), Cost-Effective Resource Provisioning for MapReduce in a Cloud, IEEE transactions On Parallel and Distributed Systems,Vol.26,No.5,pp.1265-1279 doi: 10.1109/TPDS.2014.2320498
  • Vanita Jain, Rishabh Kapoor, Shashwat Gulyani & Arun Kumar Dubey (2019). Categorization of spam images and identification of controversial images on mobile phones using machine learning and predictive learning, Journal of Discrete Mathematical Sciences and Cryptography, 22:2, 293-307 doi: 10.1080/09720529.2019.1582863
  • Jamshidi, Pooyan, and Giuliano Casale, An uncertainty-aware approach to optimal configuration of stream processing systems, In 2016 IEEE 24th International Symposium on Modeling, Analysis and Simulation of Computer and Telecommunication Systems (MASCOTS), pp. 39-48. IEEE, 2016.
  • Hadoop Documentation, https://hadoop.apache.org/docs/r2.7.3/
  • SparkConfiguration, https://spark.apache.org/docs/2.4.0/configuration.html
  • Storm Default Configuration, https://github.com/apache/storm/blob/1.1.x-branch/conf/defaults.yaml
  • NCSS: Statistical Software, https://www.ncss.com/
  • Weka 3: Data Mining Software in Jahttps://www.cs.waikato.ac.nz/∼ml/weka/
  • RapidMiner, https://rapidminer.com/
  • Joshi, Shrinivas B., Apache hadoop performance-tuning methodolo­gies and best practices, In Proceedings of the 3rd acm/spec international conference on performance engineering, pp. 241-242. ACM, 2012.
  • Heger, Dominique, Hadoop performance tuning-a pragmatic & iterative approach, CMG Journal 4 (2013): 97-113.
  • Igiri, C. P., Singh, Y., & Poonia, R. C. (2019). An improved chaotic-based African buffalo optimisation algorithm. International Journal of Innovative Computing and Applications, 10(3/4), 147. doi: 10.1504/IJICA.2019.103375
  • Samadi, Yassir, Mostapha Zbakh, and Claude Tadonki, Comparative study between Hadoop and Spark based on Hibench benchmarks, In 2016 2nd International Conference on Cloud Computing Technol­ogies and Application (CloudTech), pp. 267-275. IEEE, 2016.
  • Zookeeper, https://zookeeper.apache.org
  • Tanuja Pattanshetti, Vahida Attar, Performance Features of Big Data Platforms and Service Level Agreements based on Software Quality Metrics using Feature Selection Algorithms, Journal of Advanced Research in Dynamical and Control Systems, Volume No.11, Issue no. 11, Pages: 1199-1205.
  • Tanuja Pattanshetti, Vahida Attar, Performance Optimization of Big Data Applications using Parameter Tuning of Data Platform Features through Feature Selection Techniques (Accepted - In press).

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.