450
Views
12
CrossRef citations to date
0
Altmetric
Articles

An Improved Fuzzy K-Nearest Neighbor Algorithm for Imbalanced Data using Adaptive Approach

&

REFERENCES

  • J. Han , J. Pei , and M. Kamber , Data Mining: Concepts and Techniques . San Francisco, CA: Elsevier, 2011.
  • H. Patel and D. Rajput , “Data mining applications in present scenario: A review,” Int. J. Soft Comput. , Vol. 6, pp. 136–42, Apr. 2011.
  • J. Friedman , T. Hastie , and R. Tibshirani , The Elements of Statistical Learning Vol. 1: Springer Series in Statistics . Berlin : Springer, 2001.
  • X. Wu , V. Kumar , J. R. Quinlan , J. Ghosh , Q. Yang , H. Motoda , et al. , “Top 10 algorithms in data mining,” Knowl. Inform. Syst. , Vol. 14, pp. 1–37, Jan. 2008.
  • T. Cover and P. Hart , “Nearest neighbor pattern classification,” IEEE Trans. Inform. Theory , Vol. 13, pp. 21–7, Jan. 1967.
  • G. Loizou and S. J. Maybank , “The nearest neighbor and the bayes error rates,” IEEE Trans. Pattern Anal. Mach. Intel. , Vol. 9, pp. 254–62, Feb. 1987.
  • T. Yang , L. Cao , and C. Zhang , “A novel prototype reduction method for the K-nearest neighbor algorithm with K≥ 1,” in Pacific-Asia Conference on Knowledge Discovery and Data Mining , Springer, Berlin, Heidelberg, 2010, pp. 89–100.
  • K. Q. Weinberger and L. K. Saul , “Distance metric learning for large margin nearest neighbor classification,” J. Mach. Learn. Res. , Vol. 10, pp. 207–44, Feb. 2009.
  • R. Min , D. A. Stanley , Z. Yuan , A. Bonner , and Z. Zhang , “A deep non-linear feature mapping for large-margin knn classification,” in 2009 Ninth IEEE International Conference on Data Mining , 2009, Miami, pp. 357–66.
  • R. J. Samworth , “Optimal weighted nearest neighbour classifiers,” Ann. Stat. , Vol. 40, pp. 2733–63, 2012.
  • L. A. Zadeh , “Fuzzy sets,” Inform. Control , Vol. 8, pp. 338–53, Jun. 1965.
  • Y. Sun , A. K. Wong , and M. S. Kamel , “Classification of imbalanced data: A review,” Int. J. Pattern Recog. Artif. Intel. , Vol. 23, pp. 687–719, Jun. 2009.
  • H. He and E. A. Garcia , “Learning from imbalanced data,” IEEE Trans. Knowl. Data Eng. , Vol. 21, pp. 1263–84, Sept. 2009.
  • Q. Yang and X. Wu , “10 challenging problems in data mining research,” Int. J. Inform. Technol. Decis. Making , Vol. 5, pp. 597–604, Dec. 2006.
  • J. M. Benítez , N. García-Pedrajas , and F. Herrera , “Special issue on “new trends in data mining” NTDM,” Knowl.-Based Syst. , Vol. 25, pp. 1–2, Feb. 2012.
  • T. Raeder , G. Forman , and N. V. Chawla , “Learning from imbalanced data: Evaluation matters,” in Data Mining: Foundations and Intelligent Paradigms , Springer, Berlin, Heidelberg, 2012, pp. 315–31.
  • P. K. Chan and S. J. Stolfo , “Toward scalable learning with non-uniform class and cost distributions: A case study in credit card fraud detection,” in KDD , New York, 1998, pp. 164–8.
  • R. Pavón , R. Laza , M. Reboiro-Jato , and F. Fdez-Riverola , “Assessing the impact of class-imbalanced data for classifying relevant/irrelevant medline documents,” in 5th International Conference on Practical Applications of Computational Biology & Bioinformatics (PACBB 2011) , Spain, 2011, pp. 345–53.
  • R. B. Rao , S. Krishnan , and R. S. Niculescu , “Data mining for improved cardiac care,” ACM SIGKDD Explor. Newsl. , Vol. 8, pp. 3–10, Jun. 2006.
  • X.-C. Li , W.-J. Mao , D. Zeng , P. Su , and F.-Y. Wang , “Performance evaluation of machine learning methods in cultural modeling,” J. Comput. Sci. Technol. , Vol. 24, pp. 1010–17, Nov. 2009.
  • G. E. Batista , R. C. Prati , and M. C. Monard , “A study of the behavior of several methods for balancing machine learning training data,” ACM Sigkdd Explor. Newsl. , Vol. 6, pp. 20–9, Jun. 2004.
  • B. Zadrozny and C. Elkan , “Learning and making decisions when costs and probabilities are both unknown,” in Proceedings of the Seventh ACM SIGKDD International Conference on Knowledge Discovery and Data Mining , San Francisco, CA, 2001, pp. 204–13.
  • R. C. Prati , G. E. Batista , and D. F. Silva , “Class imbalance revisited: A new experimental setup to assess the performance of treatment methods,” Knowl. Inform. Syst. , Vol. 45, pp. 247–70, Oct. 2015.
  • W. Liu and S. Chawla , “Class confidence weighted knn algorithms for imbalanced data sets,” in Pacific-Asia Conference on Knowledge Discovery and Data Mining , Shenzhen, China, 2011, pp. 345–56.
  • H. Dubey and V. Pudi , “Class based weighted k-nearest neighbor over imbalance dataset,” in Pacific-Asia Conference on Knowledge Discovery and Data Mining , Gold Coast, Australia, 2013, pp. 305–16.
  • S. Ando , “Classifying imbalanced data in distance-based feature space,” Knowl. Inform. Syst. , Vol. 46, pp. 707–30, Mar. 2016.
  • E. Kriminger , J. C. Principe , and C. Lakshminarayan , “Nearest neighbor distributions for imbalanced classification,” in The 2012 International Joint Conference on Neural Networks (IJCNN) , Brisbane, Australia, 2012, pp. 1–5.
  • N. Chen , A. Chen , and B. Ribeiro , “Influence of class distribution on cost-sensitive learning: A case study of bankruptcy analysis,” Intell. Data Anal. , Vol. 17, pp. 423–37, May 2013.
  • N. Tomašev and D. Mladenić , “Class imbalance and the curse of minority hubs,” Knowl.-Based Syst. , Vol. 53, pp. 157–72, Nov. 2013.
  • D. Ryu , J.-I. Jang , and J. Baik , “A hybrid instance selection using nearest-neighbor for cross-project defect prediction,” J. Comput. Sci. Technol. , Vol. 30, pp. 969–80, Sep. 2015.
  • A. Fernández , M. J. del Jesus , and F. Herrera , “On the influence of an adaptive inference system in fuzzy rule based classification systems for imbalanced data-sets,” Expert Syst. Appl. , Vol. 36, pp. 9805–12, Aug. 2009.
  • H. Han and B. Mao , “Fuzzy-rough k-nearest neighbor algorithm for imbalanced data sets learning,” in 2010 Seventh International Conference on Fuzzy Systems and Knowledge Discovery (FSKD) , Yantai, China, 2010, pp. 1286–90.
  • C. Liu , L. Cao , and S. Y. Philip , “Coupled fuzzy k-nearest neighbors classification of imbalanced non-IID categorical data,” in 2014 International Joint Conference on Neural Networks (IJCNN) , Beijing, China, 2014, pp. 1122–9.
  • E. Ramentol , S. Vluymans , N. Verbiest , Y. Caballero , R. Bello , C. Cornelis , et al. , “IFROWANN: Imbalanced fuzzy-rough ordered weighted average nearest neighbor classification,” IEEE Trans. Fuzzy Syst. , Vol. 23, pp. 1622–37, Oct. 2015.
  • J. Alcalá , A. Fernández , J. Luengo , J. Derrac , S. García , L. Sánchez , et al. , “Keel data-mining software tool: Data set repository, integration of algorithms and experimental analysis framework,” J. Mult.-Valued Logic Soft Comput. , Vol. 17, pp. 255–87, 2010.
  • J. M. Keller , M. R. Gray , and J. A. Givens , “A fuzzy k-nearest neighbor algorithm,” IEEE Trans. Syst., Man, Cybern. , Vol. 15, pp. 580–5, Jul.–Aug. 1985.
  • L. Baoli , L. Qin , and Y. Shiwen , “An adaptive k-nearest neighbor text categorization strategy,” ACM Trans. Asian Lang. Inform. Proces. (TALIP) , Vol. 3, pp. 215–26, Dec. 2004.
  • E. Fix and J. L. Hodges Jr , Discriminatory Analysis-Nonparametric Discrimination: Consistency Properties , Technical Report 4, Randolph Field, TX: US Air Force, School of Aviation Medicine, 1951.
  • E. Fix and J. L. Hodges , “Discriminatory analysis. Nonparametric discrimination: Consistency properties,” Int. Stat. Rev./Rev. Internationale de Statistique , Vol. 57, pp. 238–47, Dec. 1989.
  • D. Dua and E. Karra Taniskidou , UCI Machine Learning Repository . Irvine, CA: University of California, School of Information and Computer Science, 2017. Available: http://archive.ics.uci.edu/ml
  • S. Tan , “Neighbor-weighted k-nearest neighbor for unbalanced text corpus,” Expert Syst. Appl. , Vol. 28, pp. 667–71, May 2005.
  • H. Patel and G. Thakur , “A hybrid weighted nearest neighbor approach to mine imbalanced data,” in Proceedings of the International Conference on Data Mining (DMIN) , Las Vegas, NV, 2016, p. 106.
  • H. Patel and G. Thakur , “Classification of imbalanced data using a modified fuzzy-neighbor weighted approach,” Int. J. Intell. Eng. Syst. , Vol. 10, pp. 56–64, 2017.
  • J. Derrac , S. García , D. Molina , and F. Herrera , “A practical tutorial on the use of nonparametric statistical tests as a methodology for comparing evolutionary and swarm intelligence algorithms,” Swarm Evol. Comput. , Vol. 1, pp. 3–18, Mar. 2011.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.