Issues of Class Imbalance in Classification of Binary Data: A Review

dc.contributor.authorAderoju, S.A.
dc.contributor.authorJolayemi, E.T.
dc.date.accessioned2023-07-27T08:38:46Z
dc.date.available2023-07-27T08:38:46Z
dc.date.issued2019
dc.description.abstractHandling classification issues of class imbalance data has gained attentions of researchers in the last few years. Class imbalance problem evolves when one of two classes has more sample than the other class. The class with more sample is called major class while the other one is referred to as minor class. The most classification or predicting models are more focusing on classifying or predicting the major class correctly, ignoring the minor class. In this paper, various data pre-processing approaches to improve accuracy of the models were reviewed with application to terminated pregnancy data. The data were extracted from the 2013 Nigeria Demographic and Health Survey (NDHS). The response variable is “terminated pregnancy” (asking women of reproductive age whether they have ever experienced terminated pregnancy or not), which has two possible classes (“YES” or “NO”) that exhibited class imbalanced. The major class (“NO”) is 86.82% (of the sample) representing Nigerian women of age 15 – 49 years who had never experience terminated pregnancy while the other category (minor class) is 13.18%. Hence, different resampling techniques were exploited to handle the problem and to improve the model performance. Synthetic Minority Oversampling Technique (SMOTE) improved the model best among the resampling techniques considered. The following socio-demographic factors: age, age at first birth, residential area, region, education level of women were significantly associated with having terminated pregnancy in Nigeria.
dc.identifier.citationAderoju, S.A. and Jolayemi, E.T. (2019). Issues of Class Imbalance in Classification of Binary Data: A Review, International Journal of Data Science and Analysis. Vol. 5, No. 6, 2019, pp. 123-127.
dc.identifier.doi10.11648/j.ijdsa.20190506.13
dc.identifier.issn2575-1883
dc.identifier.urihttps://kwasuspace.kwasu.edu.ng/handle/123456789/742
dc.language.isoen
dc.publisherInternational Journal of Data Science and Analysis
dc.relation.ispartofInternational Journal of Data Science and Analysis
dc.titleIssues of Class Imbalance in Classification of Binary Data: A Review
dc.typejournal-article
oaire.citation.issue6
oaire.citation.volume5
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
2 2019_Issues of Class Imbalance in Classification of Binary Data.pdf
Size:
411.54 KB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed to upon submission
Description: