DU Hong-le, ZHANG Yan. A Classification Algorithm for Imbalanced Dataset of Sample Density[J]. Journal of Xihua University (Natural Science Edition), 2015, 34(5): 16-23, 74. DOI: 10.3969/j.issn.1673-159X.2015.05.003

A Classification Algorithm for Imbalanced Dataset of Sample Density

  • To reduce classifier overfitting and improve classification performance on imbalanced data, a new algorithm based on sample density is proposed. First, it computes the density of each sample and of each class. Next, it determines the number of clusters from the relationship between the sample densities of the classes. It then clusters the majority-class samples with the K-means algorithm, using that number of clusters. The cluster centers are treated as new samples, and a new training dataset is built from these samples together with the minority-class samples. The decision function is obtained by training on this new dataset. The method can alleviate the class-imbalance problem and improve the classification performance of SVM. Experiments on an artificial dataset and six UCI datasets show that the algorithm is effective for imbalanced datasets, especially for minority-class samples.
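
The resampling step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not give the density formula or the rule that derives the cluster count from it, so the count `k` is simply passed in as a parameter here. The function names `kmeans_centers` and `rebalance`, and the label convention (0 for majority, 1 for minority), are hypothetical choices made for illustration.

```python
import random
from math import dist


def kmeans_centers(points, k, n_iter=50, seed=0):
    """Plain Lloyd's k-means; returns k cluster centers as lists of floats."""
    rng = random.Random(seed)
    # Initialise the centers with k distinct samples drawn at random.
    centers = [list(p) for p in rng.sample(points, k)]
    for _ in range(n_iter):
        # Assign every point to its nearest center.
        clusters = [[] for _ in range(k)]
        for p in points:
            nearest = min(range(k), key=lambda j: dist(p, centers[j]))
            clusters[nearest].append(p)
        # Move each center to the mean of its cluster
        # (an empty cluster keeps its previous center).
        new_centers = [
            [sum(col) / len(c) for col in zip(*c)] if c else centers[j]
            for j, c in enumerate(clusters)
        ]
        if new_centers == centers:
            break
        centers = new_centers
    return centers


def rebalance(majority, minority, k, seed=0):
    """Replace the majority class by k cluster centers; keep the minority class.

    Assumption: k is supplied by the caller. The paper derives it from
    sample densities, which this sketch does not reproduce.
    """
    centers = kmeans_centers(majority, k, seed=seed)
    X = centers + [list(p) for p in minority]
    y = [0] * len(centers) + [1] * len(minority)
    return X, y


# Toy data: 200 majority samples and 10 minority samples in 2-D.
rng = random.Random(1)
majority = [(rng.gauss(0, 1), rng.gauss(0, 1)) for _ in range(200)]
minority = [(rng.gauss(4, 1), rng.gauss(4, 1)) for _ in range(10)]

X, y = rebalance(majority, minority, k=len(minority))
print(len(X), sum(y))  # 20 samples in total, 10 of them minority
```

The balanced `X` and `y` could then be handed to any SVM trainer (for example scikit-learn's `SVC`) to obtain the decision function. Setting `k` equal to the minority-class size is only one plausible choice for the demonstration.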