Search in:

Advanced search

IETE Journal of Research Volume 65, 2019 - Issue 6

Submit an article Journal homepage

450

Views

CrossRef citations to date

Altmetric

Articles

An Improved Fuzzy K-Nearest Neighbor Algorithm for Imbalanced Data using Adaptive Approach

Harshita Patel Department of Computer Applications, Maulana Azad National Institute of Technology , Bhopal, IndiaView further author information

G. S. Thakur Department of Computer Applications, Maulana Azad National Institute of Technology , Bhopal, IndiaView further author information

Pages 780-789 | Published online: 06 May 2018

Cite this article
https://doi.org/10.1080/03772063.2018.1462109
CrossMark

Full Article
Figures & data
References
Citations
Metrics
Reprints & Permissions

REFERENCES

J. Han , J. Pei , and M. Kamber , Data Mining: Concepts and Techniques . San Francisco, CA: Elsevier, 2011.
Google Scholar
H. Patel and D. Rajput , “Data mining applications in present scenario: A review,” Int. J. Soft Comput. , Vol. 6, pp. 136–42, Apr. 2011.
Google Scholar
J. Friedman , T. Hastie , and R. Tibshirani , The Elements of Statistical Learning Vol. 1: Springer Series in Statistics . Berlin : Springer, 2001.
Google Scholar
X. Wu , V. Kumar , J. R. Quinlan , J. Ghosh , Q. Yang , H. Motoda , et al. , “Top 10 algorithms in data mining,” Knowl. Inform. Syst. , Vol. 14, pp. 1–37, Jan. 2008.
Web of Science ®Google Scholar
T. Cover and P. Hart , “Nearest neighbor pattern classification,” IEEE Trans. Inform. Theory , Vol. 13, pp. 21–7, Jan. 1967.
Web of Science ®Google Scholar
G. Loizou and S. J. Maybank , “The nearest neighbor and the bayes error rates,” IEEE Trans. Pattern Anal. Mach. Intel. , Vol. 9, pp. 254–62, Feb. 1987.
PubMed Web of Science ®Google Scholar
T. Yang , L. Cao , and C. Zhang , “A novel prototype reduction method for the K-nearest neighbor algorithm with K≥ 1,” in Pacific-Asia Conference on Knowledge Discovery and Data Mining , Springer, Berlin, Heidelberg, 2010, pp. 89–100.
Google Scholar
K. Q. Weinberger and L. K. Saul , “Distance metric learning for large margin nearest neighbor classification,” J. Mach. Learn. Res. , Vol. 10, pp. 207–44, Feb. 2009.
Web of Science ®Google Scholar
R. Min , D. A. Stanley , Z. Yuan , A. Bonner , and Z. Zhang , “A deep non-linear feature mapping for large-margin knn classification,” in 2009 Ninth IEEE International Conference on Data Mining , 2009, Miami, pp. 357–66.
Google Scholar
R. J. Samworth , “Optimal weighted nearest neighbour classifiers,” Ann. Stat. , Vol. 40, pp. 2733–63, 2012.
Web of Science ®Google Scholar
L. A. Zadeh , “Fuzzy sets,” Inform. Control , Vol. 8, pp. 338–53, Jun. 1965.
Web of Science ®Google Scholar
Y. Sun , A. K. Wong , and M. S. Kamel , “Classification of imbalanced data: A review,” Int. J. Pattern Recog. Artif. Intel. , Vol. 23, pp. 687–719, Jun. 2009.
Web of Science ®Google Scholar
H. He and E. A. Garcia , “Learning from imbalanced data,” IEEE Trans. Knowl. Data Eng. , Vol. 21, pp. 1263–84, Sept. 2009.
Web of Science ®Google Scholar
Q. Yang and X. Wu , “10 challenging problems in data mining research,” Int. J. Inform. Technol. Decis. Making , Vol. 5, pp. 597–604, Dec. 2006.
Web of Science ®Google Scholar
J. M. Benítez , N. García-Pedrajas , and F. Herrera , “Special issue on “new trends in data mining” NTDM,” Knowl.-Based Syst. , Vol. 25, pp. 1–2, Feb. 2012.
Web of Science ®Google Scholar
T. Raeder , G. Forman , and N. V. Chawla , “Learning from imbalanced data: Evaluation matters,” in Data Mining: Foundations and Intelligent Paradigms , Springer, Berlin, Heidelberg, 2012, pp. 315–31.
Google Scholar
P. K. Chan and S. J. Stolfo , “Toward scalable learning with non-uniform class and cost distributions: A case study in credit card fraud detection,” in KDD , New York, 1998, pp. 164–8.
Google Scholar
R. Pavón , R. Laza , M. Reboiro-Jato , and F. Fdez-Riverola , “Assessing the impact of class-imbalanced data for classifying relevant/irrelevant medline documents,” in 5th International Conference on Practical Applications of Computational Biology & Bioinformatics (PACBB 2011) , Spain, 2011, pp. 345–53.
Google Scholar
R. B. Rao , S. Krishnan , and R. S. Niculescu , “Data mining for improved cardiac care,” ACM SIGKDD Explor. Newsl. , Vol. 8, pp. 3–10, Jun. 2006.
Google Scholar
X.-C. Li , W.-J. Mao , D. Zeng , P. Su , and F.-Y. Wang , “Performance evaluation of machine learning methods in cultural modeling,” J. Comput. Sci. Technol. , Vol. 24, pp. 1010–17, Nov. 2009.
Web of Science ®Google Scholar
G. E. Batista , R. C. Prati , and M. C. Monard , “A study of the behavior of several methods for balancing machine learning training data,” ACM Sigkdd Explor. Newsl. , Vol. 6, pp. 20–9, Jun. 2004.
Google Scholar
B. Zadrozny and C. Elkan , “Learning and making decisions when costs and probabilities are both unknown,” in Proceedings of the Seventh ACM SIGKDD International Conference on Knowledge Discovery and Data Mining , San Francisco, CA, 2001, pp. 204–13.
Google Scholar
R. C. Prati , G. E. Batista , and D. F. Silva , “Class imbalance revisited: A new experimental setup to assess the performance of treatment methods,” Knowl. Inform. Syst. , Vol. 45, pp. 247–70, Oct. 2015.
Web of Science ®Google Scholar
W. Liu and S. Chawla , “Class confidence weighted knn algorithms for imbalanced data sets,” in Pacific-Asia Conference on Knowledge Discovery and Data Mining , Shenzhen, China, 2011, pp. 345–56.
Google Scholar
H. Dubey and V. Pudi , “Class based weighted k-nearest neighbor over imbalance dataset,” in Pacific-Asia Conference on Knowledge Discovery and Data Mining , Gold Coast, Australia, 2013, pp. 305–16.
Google Scholar
S. Ando , “Classifying imbalanced data in distance-based feature space,” Knowl. Inform. Syst. , Vol. 46, pp. 707–30, Mar. 2016.
Web of Science ®Google Scholar
E. Kriminger , J. C. Principe , and C. Lakshminarayan , “Nearest neighbor distributions for imbalanced classification,” in The 2012 International Joint Conference on Neural Networks (IJCNN) , Brisbane, Australia, 2012, pp. 1–5.
Google Scholar
N. Chen , A. Chen , and B. Ribeiro , “Influence of class distribution on cost-sensitive learning: A case study of bankruptcy analysis,” Intell. Data Anal. , Vol. 17, pp. 423–37, May 2013.
Web of Science ®Google Scholar
N. Tomašev and D. Mladenić , “Class imbalance and the curse of minority hubs,” Knowl.-Based Syst. , Vol. 53, pp. 157–72, Nov. 2013.
Web of Science ®Google Scholar
D. Ryu , J.-I. Jang , and J. Baik , “A hybrid instance selection using nearest-neighbor for cross-project defect prediction,” J. Comput. Sci. Technol. , Vol. 30, pp. 969–80, Sep. 2015.
Web of Science ®Google Scholar
A. Fernández , M. J. del Jesus , and F. Herrera , “On the influence of an adaptive inference system in fuzzy rule based classification systems for imbalanced data-sets,” Expert Syst. Appl. , Vol. 36, pp. 9805–12, Aug. 2009.
Web of Science ®Google Scholar
H. Han and B. Mao , “Fuzzy-rough k-nearest neighbor algorithm for imbalanced data sets learning,” in 2010 Seventh International Conference on Fuzzy Systems and Knowledge Discovery (FSKD) , Yantai, China, 2010, pp. 1286–90.
Google Scholar
C. Liu , L. Cao , and S. Y. Philip , “Coupled fuzzy k-nearest neighbors classification of imbalanced non-IID categorical data,” in 2014 International Joint Conference on Neural Networks (IJCNN) , Beijing, China, 2014, pp. 1122–9.
Google Scholar
E. Ramentol , S. Vluymans , N. Verbiest , Y. Caballero , R. Bello , C. Cornelis , et al. , “IFROWANN: Imbalanced fuzzy-rough ordered weighted average nearest neighbor classification,” IEEE Trans. Fuzzy Syst. , Vol. 23, pp. 1622–37, Oct. 2015.
Web of Science ®Google Scholar
J. Alcalá , A. Fernández , J. Luengo , J. Derrac , S. García , L. Sánchez , et al. , “Keel data-mining software tool: Data set repository, integration of algorithms and experimental analysis framework,” J. Mult.-Valued Logic Soft Comput. , Vol. 17, pp. 255–87, 2010.
Web of Science ®Google Scholar
J. M. Keller , M. R. Gray , and J. A. Givens , “A fuzzy k-nearest neighbor algorithm,” IEEE Trans. Syst., Man, Cybern. , Vol. 15, pp. 580–5, Jul.–Aug. 1985.
Web of Science ®Google Scholar
L. Baoli , L. Qin , and Y. Shiwen , “An adaptive k-nearest neighbor text categorization strategy,” ACM Trans. Asian Lang. Inform. Proces. (TALIP) , Vol. 3, pp. 215–26, Dec. 2004.
Google Scholar
E. Fix and J. L. Hodges Jr , Discriminatory Analysis-Nonparametric Discrimination: Consistency Properties , Technical Report 4, Randolph Field, TX: US Air Force, School of Aviation Medicine, 1951.
Google Scholar
E. Fix and J. L. Hodges , “Discriminatory analysis. Nonparametric discrimination: Consistency properties,” Int. Stat. Rev./Rev. Internationale de Statistique , Vol. 57, pp. 238–47, Dec. 1989.
Web of Science ®Google Scholar
D. Dua and E. Karra Taniskidou , UCI Machine Learning Repository . Irvine, CA: University of California, School of Information and Computer Science, 2017. Available: http://archive.ics.uci.edu/ml
Google Scholar
S. Tan , “Neighbor-weighted k-nearest neighbor for unbalanced text corpus,” Expert Syst. Appl. , Vol. 28, pp. 667–71, May 2005.
Web of Science ®Google Scholar
H. Patel and G. Thakur , “A hybrid weighted nearest neighbor approach to mine imbalanced data,” in Proceedings of the International Conference on Data Mining (DMIN) , Las Vegas, NV, 2016, p. 106.
Google Scholar
H. Patel and G. Thakur , “Classification of imbalanced data using a modified fuzzy-neighbor weighted approach,” Int. J. Intell. Eng. Syst. , Vol. 10, pp. 56–64, 2017.
Google Scholar
J. Derrac , S. García , D. Molina , and F. Herrera , “A practical tutorial on the use of nonparametric statistical tests as a methodology for comparing evolutionary and swarm intelligence algorithms,” Swarm Evol. Comput. , Vol. 1, pp. 3–18, Mar. 2011.
Web of Science ®Google Scholar

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Order Reprints Request Corporate Permissions

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

Request Academic Permissions

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.

Share icon
Back to Top

Related research

People also read lists articles that other readers of this article have read.

Recommended articles lists articles that we recommend and is powered by our AI driven recommendation engine.

Cited by lists all citing articles based on Crossref citations.
Articles with the Crossref icon will open in a new tab.

People also read
Recommended articles
Cited by

To cite this article:

Reference style: APA Chicago Harvard

Citation copied to clipboard

Reference styles above use APA (6th edition), Chicago (16th edition) & Harvard (10th edition)

Download citation

Download a citation file in RIS format that can be imported by citation management software including EndNote, ProCite, RefWorks and Reference Manager.

Choose format: RIS BibTex RefWorks Direct Export

Choose options: Citation Citation & abstract Citation & references

An Improved Fuzzy K-Nearest Neighbor Algorithm for Imbalanced Data using Adaptive Approach

REFERENCES

Information for

Open access

Opportunities

Help and information

Your download is now in progress and you may close this window

Login or register to access this feature

An Improved Fuzzy K-Nearest Neighbor Algorithm for Imbalanced Data using Adaptive Approach

REFERENCES

Reprints and Corporate Permissions

Academic Permissions

Related research

To cite this article:

Download citation

Information for

Open access

Opportunities

Help and information

Keep up to date