A Hybridized Feature Extraction Model for Offline Yorùbá Document Recognition

dc.contributor.authorJumoke F. Ajao a*, Rafiu M. Isiaka a and Ronke S. Babatunde
dc.date.accessioned2024-10-22T09:42:11Z
dc.date.available2024-10-22T09:42:11Z
dc.date.issued2023-03-22
dc.description.abstractDocument recognition is required to convert handwritten and text documents into digital equivalents, making them more easily accessible and convenient to store. This study combined feature extraction techniques for recognizing Yorùbá documents in an effort to preserve the cultural values and heritages of the Yorùbá people. Ten Yorùbá documents were acquired from Kwara State University’s Library, and ten indigenous literate writers wrote the handwritten version of the documents. These were digitized using HP Scanjet300 and pre-processed. The pre-processed image served as input to the Local Binary Pattern, Speeded-Up-Robust-Features and Histogram of Gradient. The combined extracted feature vectors were input into the Genetic Algorithm. The reduced feature vector was fed into Support Vector Machine. A 10-folds cross-validation was used to train the model: LBP-GA, SURF-GA, HOG-GA, LBP-SURF-GA, HOG-SURF-GA, LBP-HOG-GA and LBP-HOG-SURF-GA. LBP-HOG-SURF-GA for Yorùbá printed text gave 90.0% precision, 90.3% accuracy and 15.5% FPR. LBP-HOG-SURF-GA for Handwritten Yorùbá document showed 80.9% precision, 82.6% accuracy and 20.4% (FPR) LBP-HOG-SURF-GA for CEDAR gave 98.0% precision, 98.4% accuracy and 2.6% FPR. LBP-HOG-SURF-GA for MNIST gave 99% precision, 99.5% accuracy, 99.0% and 1.1% FPR. The results of the hybridized feature extractions (LBP-HOG-SURF) demonstrated that the proposed work improves significantly on the various classification metrics.
dc.identifier.citationJumoke F. Ajao, Rafiu M. Isiaka and Ronke S. Babatunde
dc.identifier.issn2581-8260
dc.identifier.urihttps://kwasuspace.kwasu.edu.ng/handle/123456789/2493
dc.language.isoen
dc.publisherAsian Journal of Research in Computer Science
dc.titleA Hybridized Feature Extraction Model for Offline Yorùbá Document Recognition
dc.typeArticle
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
27_YorubaDocumentRec_Ajao1542023AJRCOS99254.pdf
Size:
1.33 MB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed to upon submission
Description: