2,032
Views
4
CrossRef citations to date
0
Altmetric
Research Article

Anomaly Detection Model for Predicting Hard Disk Drive Failures

ORCID Icon, ORCID Icon & ORCID Icon
Pages 549-566 | Received 17 Feb 2019, Accepted 19 Mar 2021, Published online: 11 May 2021

References

  • Agarwal, V., C. Bhattacharyya, T. Niranjan, and S. Susarla. 2009. Discovering rules from disk events for predicting hard drive failures, in Proceedings of the International Conference on Machine Learning and Applications, Miami Beach, Florida, USA, DOI: 10.1109/ICMLA.2009.62
  • Appuswamy, R., R. Borovica, G. Graefe, and A. Ailamaki. 2017. The five minute rule thirty years later and its impact on the storage hierarchy, in Proceedings of the 7th International Workshop on Accelerating Analytics and Data Management Systems Using Modern Processor and Storage Architectures, Munich, Germany, 1–8
  • Box, G., and D. Cox. 1964. An analysis of transformations. Journal of the Royal Statistical Society 26 (2):211–52. 1964.
  • Cheng, S., and M. Pecht. 2012. Using cross-validation for model parameter selection of sequential probability ratio test, expert systems with applications. An International Journal Archive 39 (9):8467–73. doi:10.1016/j.eswa.2012.01.172.
  • Chou, Y. M., A. Polansky, and R. Mason. 1998. Transforming non-normal data to normality in statistical process control. Journal of Quality Technology 30 (2):133–41. doi:10.1080/00224065.1998.11979832.
  • Goldszmidt, M. 2012. Finding soon-to-fail disks in a haystack, in Proceedings of the 4th USENIX conference on Hot Topics in Storage and File Systems, Boston, MA, USA,
  • Guyon, I., J. Weston, S. Barnhill, and V. Vapnik. 2002. Gene selection for cancer classification using support vector machines. Machine Learning 46(1/3):389–422. 1–3. doi:10.1023/A:1012487302797.
  • Hamerly, G., and C. Elkan. 2001. Bayesian approaches to failure prediction for disk drives, in Proceedings of the Eighteenth International Conference on Machine Learning, San Francisco, CA, USA, 202–09,
  • Hughes, G., J. Murray, K. Kreutz-Delgado, and C. Elkan. 2002. Improved disk-drive failure warnings. IEEE Transactions on Reliability 51 (3):350–57. doi:10.1109/TR.2002.802886.
  • Kohavi, R. 1995. A Study of cross-validation and bootstrapping for accuracy estimation and model selection, in Proceedings of the 14th international joint conference on Artificial intelligence - 2, Montreal, Quebec, Canada,1137–43
  • Mahdisoltani, F., I. Stefanovici, and B. Schroeder. 2017. Improving storage system reliability with proactive error prediction, 2017 USENIX Annual Technical Conference (USENIX ATC ’17). July 12–14, 2017,Santa Clara, CA, USA, ISBN 978-1-931971-38-6, pp. 391-402
  • Murray, J., G. Hughes, and K. Kreutz-Delgado. 2003. Hard drive failure prediction using non-parametric statistical methods, in Proceedings of ICANN/ICONIP 2003, Istanbul, Turkey
  • Murray, J., G. Hughes, and K. Kreutz-Delgado. 2005. Machine learning methods for predicting failures in hard drives: a multiple-instance application. The Journal of Machine Learning Research 6:783–816.
  • Pitakrat, T. A., V. Hoorn, and L. Grunske. 2013. A comparison of machine learning algorithms for proactive hard disk drive failure detection in Proceedings of the 4th international ACM Sigsoft symposium on Architecting critical systems, Vancouver, British Columbia, Canada, 1–10, ISBN: 978-1-4503-2123-5, DOI: 10.1145/2465470.2465473
  • Qian, J., S. Skelton, J. Moore, and H. Jiang. 2015. P3: Priority based proactive prediction for soon-to-fail disks, in Proceedings of the International Conference on Networking, Architecture and Storage (NAS), Boston, MA, USA, DOI: 10.1109/NAS.2015.7255224
  • Queiroz, L., F. Rodrigues, J. P. P. Pordeus Gomes, F. T. Brito, I. Chaves, M. Paula, M. Salvador, and J. Machado. 2017. A fault detection method for hard disk drives based on mixture of gaussians and nonparametric statistics. IEEE Transactions on Industrial Informatics 13 (2):542–50. doi:10.1109/TII.2016.2619180.
  • Schroeder, B., and G. Gibson. 2007. Disk failures in the real world: What does an MTTF of 1,000,000 hours mean to you?, in Proceedings of the 5th USENIX Conference on File and Storage Technologies (FAST 2007), San Jose, USA, 1–16
  • Tan, Y., and X. Gu. 2010. On predictability of system anomalies in real world, in Proceedings of the International Symposium on Modeling, Analysis & Simulation of Computer and Telecommunication Systems (MASCOTS), Miami Beach, Florida, USA, DOI: 10.1109/MASCOTS.2010.22
  • Wang, Y., E. Ma, T. Chow, and K. L. Tsui. 2014. A two-step parametric method for failure prediction in hard disk drives. IEEE Transactions on Industrial Informatics 10 (1):419–30. doi:10.1109/TII.2013.2264060.
  • Wang, Y., Q. Miao, E. Ma, K. L. Tsui, and M. Pecht. 2013. Online anomaly detection for hard disk drives based on mahalanobis distance. IEEE Transactions on Reliability 62 (1):136–45. doi:10.1109/TR.2013.2241204.
  • Wang, Y., Q. Miao, and M. Pecht. 2011. Health monitoring of hard disk drive based on Mahalanobis distance, in Proceedings of the Prognostics and System Health Management Conference, Shenzhen, China, DOI: 10.1109/PHM.2011.5939558
  • Xu, Y., K. Sui, E. Yao, H. Zhang, Q. Lin, Y. Dang, P. Li, K. Jiang, W. Zhang, J. G. Lou, et al. 2018. Improving service availability of cloud systems by predicting disk error, Proceedings of the 2018 USENIX Conference on Usenix Annual Technical Conference, Boston, MA, USA. 481–493.
  • Zhao, Y., X. Liu, S. Gan, and W. Zheng. 2010. Predicting disk failures with HMM- and HSMM-based approaches, in Proceedings of 10th Industrial Conference on Data Mining, ICDM 2010: Advances in Data Mining. Applications and Theoretical Aspects, Berlin, Germany, 390–404
  • Zhu, B., G. Wang, X. Liu, D. Hu, S. Lin, and J. Ma. 2013. Proactive drive failure prediction for large scale storage systems, in Proceedings of the 29th Symposium on Mass Storage Systems and Technologies (MSST), Long Beach, CA, USA, DOI 10.1109/MSST.2013.6558427

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.