28
Views
6
CrossRef citations to date
0
Altmetric
Original Articles

Equalizing imbalanced imprecise datasets for genetic fuzzy classifiers

Pages 276-296 | Received 14 Nov 2010, Accepted 01 Jun 2011, Published online: 23 Apr 2012

References

  • Alcalá-Fdez , J. , Sánchez , L. , Garćia , S. , Jesus , M.J. , Ventura , S. , Garrell , J.M. , Otero , J. , Romero , C. , Bacardit , J. , Rivas , J.M. , Fernández , J.C. and Herrera , F. 2009 . KEEL: A Software Tool to Assess Evolutionary Algorithms to Data Mining Problems . Soft Computing , 13 ( 3 ) : 307 – 318 .
  • Alcalá-Fdez , J. , Fernandez , A. , Luengo , J. , Derrac , J. , Garćia , S. , Sánchez , L. and Herrera , F. 2011 . KEEL Data-Mining Software Tool: Data Set Repository, Integration of Algorithms and Experimental Analysis Framework . Journal of Multiple-Valued Logic and Soft Computing , 17 ( 2-3 ) : 255 – 287 .
  • Batista , G. , Prati , R. and Monard , M. 2004 . A study of the behaviour of several methods for balancing machine learning training data . SIGKDD Explorations , 6 ( 1 ) : 20 – 29 .
  • Berger J. Statistical decision theory and Bayesian Analysis . Springer-Verlag .
  • Bradley , A.P. 1997 . The use of the area under the ROC curve in the evaluation ofmachine learning algorithms . Pattern Recognition , 30 ( 7 ) : 1145 – 1159 .
  • Casillas , J. and Carse , B. 2009 . Genetic Fuzzy Systems: Recent Developments and Future Directions . Soft Computing , 13 ( 3 ) : 417 – 418 .
  • Chawla , N.V. , Bowyer , K.W. , Hall , L.O. and Kegelmeyer , W.P. 2002 . SMOTE: Synthetic minority over-sampling technique . Journal of Artificial Intelligent Research , 16 : 321 – 357 .
  • Chawla , N.V. , Japkowicz , N. and Kolcz , A. 2004 . Editorial: Special issue on learning from imbalanced data sets . SIGKDD Explorations , 6 ( 1 ) : 1 – 6 .
  • Chen , S.H. 1985 . Operations on fuzzy numbers with function principal . Tamkang Journal of Management Sciences , 6 ( 1 ) : 13 – 25 .
  • Cheng , C.H. 1998 . A new approach for ranking fuzzy numbers by distance method . Fuzzy Sets and Systems , 95 : 307 – 317 .
  • Chen , S. J. and Chen , S. M. 2003 . A new method for handling multicriteria fuzzy decision making problems using FN-IOWA operators . Cybernetics and Systems , 34 : 109 – 137 .
  • Chen , S. J. and Chen , S. M. 2007 . Fuzzy risk analysis based on the ranking of generalized trapezoidal fuzzy numbers . Applied Intelligence , 26 ( 1 ) : 1 – 11 .
  • Chen , S. H. 1999 . “ Ranking generalized fuzzy number with graded mean integration ” . In Proceedings of the eighth international fuzzy systems association world congress 2 899 – 902 .
  • Crockett K. , Bandar Z. , O'Shea J. On producing balanced fuzzy decision tree classifiers . IEEE Internat. Conf. on Fuzzy Systems , 1756 – 1762 2006 .
  • Chu , T. C. and Tsao , C. T. 2002 . Ranking fuzzy numbers with an area between the centroid point and original point . Computers and Mathematics with Applications , 43 : 111 – 117 .
  • Dmochowski , J.P. , Sajda , P. and Parra , L.C. 2010 . Maximum Likelihood in Cost-Sensitive Learning: Model Specification, Approximators, and Upper Bounds . Journal of Machine Learning Research , 11 : 3313 – 3332 .
  • Elkan C. The Foundations of Cost-Sensitive Learning . Proceedings of the IJCAI-01 , 973 – 978 2001 .
  • Fawcett , T. and Provost , F.J. 1997 . Adaptive fraud detection . Data Mining Knowledge Discovery , 1 ( 3 ) : 291 – 316 .
  • Fernández , A. , Garcia , S. , del Jesús , M.J. and Herrera , F. 2008 . A study of the behaviour of linguistic fuzzy rule based classification system in the framework of imbalanced data-sets . Fuzzy Sets and Systems , 159 : 2378 – 2398 .
  • Fernández , A. , del Jesús , M.J. and Herrera , F. 2009 . On the influence of an adaptive inference system in fuzzy rule based classification systems for imbalanced data-sets . Expert Systems with Applications , 36 : 9805 – 9812 .
  • Fernández , A. , del Jesús , M.J. and Herrera , F. 2009 . Hierarchical fuzzy rule based classification systems with genetic rule selection for imbalanced data-sets . International Journal of Approximate Reasoning , 50 : 561 – 577 .
  • Fernández , A. , del Jesús , M.J. and Herrera , F. 2010 . On the 2-tuples based genetic tuning performance for fuzzy rule based classification systems in imbalanced data-sets . Information Sciences , 180 ( 8 ) : 1268 – 1291 .
  • Hart , P. 1968 . The condensed nearest neighbor rule . IEEE Trans. Inform. Theory , 14 : 515 – 516 .
  • He , H. and Garcia , E.A. 2009 . Learning from Imbalanced Data . IEEE Transactions on Knowledge and Data Engineering , 21 ( 9 ) : 1263 – 1284 .
  • Japkowicz , N. and Stephen , S. 2002 . The class imbalance problem: a systematic study . Intelligent Data Anal. , 6 ( 5 ) : 429 – 450 .
  • Jain R. Decision-making in the presence of fuzzy variables , IEEE Trans. Systems Man and Cybernet. SMC-6 , 698 – 703 1976 .
  • Jain , R. 1978 . A procedure for multi-aspect decision making using fuzzy sets . Internat. J. Systems Sci. , 8 : 1 – 7 .
  • Kilic , K. , Uncu , O. and Türksen , I.B. 2007 . Comparison of different strategies of utilizing fuzzy clustering in structure identification . Information Sciences , 177 ( 23 ) : 5153 – 5162 .
  • Kubat M. , Holte R. , Matwin S. Learning when Negative Examples Abound . European Conference on Machine Learning 1997 .
  • Kubat M. , Matwin S. Addressing the curse of imbalanced training sets: one-sided selection . Internat. Conf. Machine Learning , 170 – 186 1997 .
  • Laurikkala J. Improving identification of difficult small classes by balancing class distribution . T.R. A-2001-2 , University of Tampere 2001 .
  • Liang C. , Wu J. , Zhang J. Ranking indices and rules for fuzzy numbers based on gravity center point . World Congress on Intelligent Control and Automation , Dalian , , China 2006 .
  • Mazurowski , M. , Habas , P. , Zurada , J. , Lo , J. , Baker , J. and Tourassi , G. 2008 . Training neural network classifiers for medical decision making: The effects of imbalanced datasets on classification performance . Neural Networks , 21 ( 2-3 ) : 427 – 436 .
  • Murakami S. , Maeda S. , Imamura S. Fuzzy decision analysis on the development of centralized regional energy control system , IFAC Symp. on Fuzzy Inform. Knowledge Representation and Decision Anal ., 363 – 368 1983 .
  • Nojima , Y. , Alcalá , R. , Ishibuchi , H. , Herrera , F. Special issue on evolutionary fuzzy systems . Soft Computing . doi: 10.1007/s00500-010-0663-2
  • Palacios , A. , Couso , I. and Sánchez , L. 2011 . Future performance modeling in athletism with low quality data-based GFSs . Journal of Multivalued Logic and Soft Computing , 17 ( 2-3 ) : 207 – 228 .
  • Palacios A. , Sánchez L. , Couso I. Diagnosis of dyslexia from vague data with Genetic Fuzzy Systems 51 , 993 – 1009 2010 .
  • Peng , X. and King , I. 2008 . Robust BMPM training based on second-order cone programming and its application in medical diagnosis . Neural Networks , 21 ( 2-3 ) : 450 – 457 .
  • Phua , C. , Alahakoon , D. and Lee , V. 2004 . Minority report in fraud detection: classification of skewed data . SIGKDD Explorations Newsletter , 6 ( 1 ) : 50 – 59 .
  • Provost , F. and Fawcett , T. 1998 . “ Robust classification systems for imprecise enviroments ” . In Proc. AAAI 706 – 713 .
  • Ramli , N. and Mohamad , D. 2009 . A comparative analysis of centroid methods in ranking fuzzy numbers . European Journal of Scientific Research , 28 ( 3 ) : 492 – 501 .
  • Sánchez , L. , Otero , J. and Couso , I. 2009 . Obtaining linguistic fuzzy rule-based regression models from imprecise data with multiobjective genetic algorithms . Soft Computing , 13 ( 5 ) : 467 – 479 .
  • Shieh , B.S. 2007 . An approach to centroids of fuzzy numbers . International Journal of Fuzzy Systems , 9 ( 1 ) : 51 – 54 .
  • Soler V. , Cerquides J. , Sabria J. , Roig J. , Prim M. Imbalanced datasets classification by fuzzy rule extraction and genetic algorithms . IEEE Internat. Conf. Data Mining –Workshops , 330 – 336 2006 .
  • Sun , Y. , Wong , A.K.C. and Kemel , M. 2009 . Classification of imbalanced data: A review . International Journal of Pattern Recognition and Artificial Intelligence , 23 ( 4 ) : 687 – 719 .
  • Tomek , I. 1976 . Two modifications of cnn . IEEE Trans. Systems Man Comm. , 6 : 769 – 772 .
  • Visa S. , Ralescu A. Learning imbalanced and overlapping classes using fuzzy sets . Internat. Conf. Machine Learning –Workshop on Learning from Imbalanced Datasets II 2003 .
  • Visa S. , Ralescu A. The effect of imbalanced data class distribution on fuzzy classifiers–experimental study . IEEE Internat. Conf. on Fuzzy Systems , 749 – 754 2005 .
  • Wang , Y.J. and Lee , H.S. 2008 . The revised method of ranking fuzzy numbers with an area between the centroid and original points . Computers and Mathematics with Applications , 55 : 2033 – 2042 .
  • Weiss , G. 2004 . Mining with rarity: a unifying framework. SIGKDD Explorations , 6 ( 1 ) : 7 – 19 .
  • Wilson , D.R. 1972 . Asymptotic properties of nearest neighbour rules using edited data . IEEE Trans. Systems Man Comm. , 2 ( 3 ) : 408 – 421 .
  • Xu , L. , Chow , M. and Taylor , L. 2007 . Power distribution fault cause identification with imbalanced data using the data mining-based fuzzy classification e-algorithm . IEEE Trans. Power Systems , 22 ( 1 ) : 164 – 171 .

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.