A novel approach to outliers removal in a noisy numeric dataset for efficient Mining.

dc.contributor.authorAjiboye, A. R, Adewole, K. S., Babatunde, R. S. and Oladipo, I. D. (2016):
dc.date.accessioned2024-07-24T09:31:34Z
dc.date.available2024-07-24T09:31:34Z
dc.date.issued2016-02-16
dc.description.abstractData pre-processing is a key task in the data mining process. The task generally consumes the largest portion of the total data engineering effort while unveiling useful patterns from datasets. Basically, data mining is about fitting descriptive or predictive models from data. However, the presence of outlier sometimes reduces the reliability of the models created. It is, therefore, essential to have raw data properly pre-processed before exploring them for mining. In this paper, an algorithm that detects and removes outliers in a numeric dataset is proposed. In order to establish the effectiveness of the proposed algorithm, the clean data obtained through the implementation of the proposed approach is used to create a prediction model. Similarly, the clean data obtained through the use of one of the existing techniques is also used to create a prediction model. Each of the models created is simulated using a set of untrained data and the error associated with each model is measured. The resulting outputs from the two approaches reveal that, the prediction model created using the output from the proposed algorithm has an error of 0.38, while the prediction model created using the cleaned data from the clustering method gives an error of 0.61. Comparison of the errors associated with the models created using the two approaches shows that, the proposed algorithm is suitable for cleaning numeric dataset. The results of the experiment also unveils that, the proposed approach is efficient and can be used as an alternative technique to other existing cleaning methods.
dc.identifier.citationAjiboye, A. R, Adewole, K. S., Babatunde, R. S. and Oladipo, I. D. (2016):
dc.identifier.urihttps://www.iljcsit.com.ng/index.php/ILJCSIT/article/view/11
dc.identifier.urihttps://kwasuspace.kwasu.edu.ng/handle/123456789/1770
dc.language.isoen
dc.publisherPublished by Faculty of Computing and Information Systems, University of Ilorin.
dc.titleA novel approach to outliers removal in a noisy numeric dataset for efficient Mining.
dc.typeArticle
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
17_DrAji_Adewole_and_I_Published.pdf
Size:
358.31 KB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed to upon submission
Description: