217
Views
4
CrossRef citations to date
0
Altmetric
Articles

A New Adaptive Fault Tolerant Framework in the Cloud

ORCID Icon, , ORCID Icon, ORCID Icon & ORCID Icon

References

  • M. Armbrust et al., “Above the clouds: A berkeley view of cloud computing,” EECS Department, University of California, Berkeley, Tech. Rep. UCB/EECS-2009-28, Feb. 2009.
  • S. S. Gill and R. Buyya, “Failure management for reliable cloud computing: A taxonomy, model, and future directions,” Comput. Sci. Eng., Vol. 22, pp. 52–63, Apr. 2020.
  • B. Nicolae and F. Cappello, “BlobCR: Virtual disk based checkpoint-restart for HPC applications on IaaS clouds,” J. Parallel Distrib. Comput., Vol. 73, pp. 698–711, May 2013.
  • Y. Tian, J. Tian and N. Li, “Cloud reliability and efficiency improvement via failure risk based proactive actions,” J. Syst. Softw., Vol. 163, p. 110524, May 2020.
  • T. Chalermarrewong, T. Achalakul and S. C. W. See, “The design of a fault management framework for cloud,” in 2012 9th International Conference on Electrical Engineering/Electronics, Computer, Telecommunications and Information Technology, Phetchaburi, Thailand, 2012, pp. 1–4.
  • I. P. Egwutuoha, S. Chen, D. Levy, B. Selic and R. Calvo, “Cost-oriented proactive fault tolerance approach to high performance computing (hpc) in the cloud,” Int. J. Parallel, Emergent Distrib. Syst., Vol. 29, pp. 363–378, Jan. 2014.
  • A. Tikotekar, G. Vallee, T. Naughton, S. L. Scott and C. Leangsuksun, “Evaluation of fault-tolerant policies using simulation,” in IEEE International Conference on Cluster Computing, ICCC, Austin, TX, USA, 2007, pp. 303–311.
  • A. Ganesh, M. Sandhya and S. Shankar, “A study on fault tolerance methods in cloud computing,” in IEEE International Advance Computing Conference (IACC), Gurgaon, India, 2014, pp. 844–849.
  • F. Machida, M. Kawato and Y. Maeno, “Redundant virtual machine placement for fault-tolerant consolidated server clusters,” in Proceedings of the 2010 IEEE/IFIP Network Operations and Management Symposium, NOMS 2010, Osaka, Japan, 2010, pp. 32–39.
  • A. Beloglazov, J. Abawajy and R. Buyya, “Energy-aware resource allocation heuristics for efficient management of data centers for Cloud computing,” Future Gener. Comput. Syst., Vol. 28, pp. 755–768, May 2012.
  • A. Rawat, R. Sushil and A. Agarwal, “Review of fault tolerance frameworks in the cloud,” Int. J. Inf. Syst. Mod. Des. (IJISMD), Vol. 11, pp. 79–99, Sep. 2020.
  • E. N. M. Elnozahy, L. Alvisi, Y. M. Wang and D. B. Johnson, “A survey of rollback-recovery protocols in message-passing systems,” ACM Comput. Surv., Vol. 34, pp. 375–408, Sep. 2002.
  • Í. Goiri, F. Julià, J. Guitart and J. Torres, “Checkpoint-based fault-tolerant infrastructure for virtualized service providers,” in IEEE Network Operations and Management Symposium -- NOMS 2010, Osaka, Japan, 2010, pp. 455–462.
  • S. Yi, D. Kondo and A. Andrzejak, “Reducing costs of spot instances via checkpointing in the Amazon Elastic Compute Cloud,” in IEEE 3rd International Conference on Cloud Computing, CLOUD 2010, Miami, FL, USA, 2010, pp. 236–243.
  • M. Zhang, H. Jin, X. Shi and S. Wu, “Virtcft: A transparent vm-level fault-tolerant system for virtual clusters,” in IEEE 16th International Conference on Parallel and Distributed Systems, Shanghai, China, 2010, pp. 147–154.
  • N. Limrungsi, J. Zhao, Y. Xiang, T. Lan, H. H. Huang and S. Subramaniam, “Providing reliability as an elastic service in cloud computing,” in IEEE International Conference on Communications, Ottawa, ON, Canada, 2012, pp. 2912–2917.
  • J. Guitart, M. Macias, K. Djemame, T. Kirkham, M. Jiang and D. Armstrong, “Risk-driven proactive fault-tolerant operation of IaaS providers,” in Proceedings of the International Conference on Cloud Computing Technology and Science, CloudCom, Vol. 1, Bristol, United Kingdom, 2013, pp. 427–432.
  • A. Zhou, S. Wang, Z. Zheng, C. Hsu, M. R. Lyu and F. Yang, “On cloud service reliability enhancement with optimal resource usage,” IEEE Trans. Cloud Comput., Vol. 4, pp. 452–466, Dec. 2016.
  • J. P. A. Neto, D. M. Pianto and C. G. Ralha, “MULTS: A multi-cloud fault-tolerant architecture to manage transient servers in cloud computing,” J. Syst. Archit., Vol. 101, pp. 2–44 2019.
  • B. Ray, A. Saha, S. Khatua and S. Roy, “Proactive fault-tolerance technique to enhance reliability of cloud service in cloud federation environment,” IEEE Trans. Cloud Comput., pp. 1–8, Jan. 2020.
  • R. Jhawar, V. Piuri and M. Santambrogio, “Fault tolerance management in cloud computing: A system-level perspective,” IEEE Syst. J., Vol. 7, pp. 288–297, Jun. 2013.
  • M. Vardhan, N. Jain, S. Mishra and D. S. Kushwaha, “A demand based fault tolerant file replication model for clouds,” in Proceedings of the CUBE International Information Technology Conference on – CUBE '12, Pune, India, 2012, pp. 561–566.
  • J. Liu, S. Wang, A. Zhou, S. A. P. Kumar, F. Yang and R. Buyya, “Using proactive fault-tolerance approach to enhance cloud service reliability,” IEEE Trans. Cloud Comput., Vol. 6, pp. 1191–1202, Oct. 2018.
  • S. Dawei, C. Guiran, M. Changsheng and W. Xingwei, “Analyzing, modeling and evaluating dynamic adaptive fault tolerance strategies in cloud computing environments,” J. Supercomput., Vol. 66, pp. 193–228, Oct. 2013.
  • A. Mohammed, “A framework for providing a hybrid fault tolerance in cloud computing,” in Proceding of Science and Information Conference, London, UK, 2015, pp. 844–849.
  • P. Padmakumari, A. Umamakeswari and M. Akshaya, “Hybrid fault tolerant scheme to manage VM failure in the cloud,” Indian J. Sci. Technol., Vol. 9, pp. 1–5, Sep. 2016.
  • X. Chen and J. Jian-Hui, “A method of virtual machine placement for fault-tolerant cloud applications,” Intell. Autom. Soft Comput., Vol. 8587, pp. 1–11, Mar. 2016.
  • A. Rawat, R. Sushil, A. Agarwal and A. Sikander, “A new approach for vm failure prediction using stochastic model in cloud,” IETE J. Res., pp. 1–8, Oct. 2018.
  • R. Buyya, R. Ranjan and R. N. Calheiros, “Modeling and simulation of scalable cloud computing environments and the cloudsim toolkit: Challenges and opportunities, ” in 2009 International Conference on High Performance Computing Simulation, Leipzig, Germany, 2009, pp. 1–11.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.