Browsing by Author "Garba M. K."
Now showing 1 - 4 of 4
Results Per Page
Sort Options
- ItemA Test Procedure for Ordered Hypothesis of Population Proportions Against a Control(Turkiye Klinikleri Journal of Biostatistics, 2016) Yahya W. B.; Olaniran O. R.; Garba M. K.; Oloyede I.; Banjoko A. W.; Dauda K. A.; Olorede K. O.Objective: This paper aims to present a novel procedure for testing a set of population proportions against an ordered alternative with a control. Material and Methods: The distribution of the test statistic for the proposed test was determined theoretically and through Monte-Carlo experiments. The efficiency of the proposed test method was compared with the classical Chi-square test of homogeneity of population proportions using their empirical Type I error rates and powers at various sample sizes. Results: The new test statistic that was developed for testing a set of population proportions against an ordered alternative with a control was found to have a Chi-square distribution with non-integer values degrees of freedom v that depend on the number of population groups k being compared. Table of values of v for comparing up to 26 population groups was constructed while an expression was developed to determine v for cases where k > 26. Further results showed that the new test method is capable of detecting the superiority of a treatment, for instance a new drug type, over some of the existing ones in situations where only the qualitative data on users' preferences of all the available treatments (drug types) are available. The new test method was found to be relatively more powerful and consistent at estimating the nominal Type I error rates (α), especially at smaller sample sizes than the classical Chi-square test of homogeneity of population proportions. Conclusion: Conclusion: The new test method proposed here could find applications in pharmacology where a newly developed drug might be expected to be more preferred by users than some of the existing ones. This kind of test problem can equally exist in medicine, engineering and humanities in situations where only the qualitative data on users' preferences of a set of treatments or systems are available.
- ItemEfficient Support Vector Machine Classification of Diffused Large B-Cell Lymphoma and Follicular Lymphoma mRNA Tissue Samples(Annals Computer Science Series, 2015) 11. Banjoko A. W.; Yahya W. B.; Garba M. K.; Olaniran O. R.; Dauda K. A.; Olorede K. O.In this study, an efficient Support Vector Machine (SVM) algorithm that incorporates feature selection procedure for efficient identification and selection of gene biomarkers that are predictive of Diffuse Large B–Cell Lymphoma (DLBCL) and Follicular Lymphoma (FL) cancer tumor samples is presented. The data employed were published real life microarray cancer data that contained 7,129 gene expression profiles measured on 77 biological samples that comprised 58 DLBCL and 19 FL tissue samples. The dimension reduction approach of the Welch statistic was employed at the feature selection phase of the SVM algorithm. The cost and kernel parameters of the SVM model were tuned over a 10–fold cross-validation to improve the efficiency of the SVM classifier. The entire sample was randomly partitioned into 95% training and 5% test samples. The SVM classifier was trained using Monte Carlo Cross validation approach with 1000 replications. The performance of this classifier was assessed on the test samples using misclassification error rate (MER) and other performance measures. The results showed that the SVM classifier is quite efficient by yielding very high prediction accuracy of the tumor samples with fewer differentially expressed genes. The selected gene biomarkers in this work can be subjected to further clinical screening for proper determination of their biological relationship with DLBCL and FL tumour sub groups. However, more studies with large samples might be needed in future to validate the results from this work.
- ItemMulticlass Feature Selection and Classification with Support Vector Machine in Genomic Study(Professional Statisticians Society of Nigeria (PSSN), 2017) Banjoko A. W.; Yahya W. B.; Garba M. K.; Olaniran O. R.; Amusa L. B.; Gatta N. F.; Dauda K. A.; Olorede K. O.This study proposes an efficient Support Vector Machine (SVM) algorithm for feature selection and classification of multiclass response group in high dimensional (microarray) data. The Feature selection stage of the algorithm employed the F-statistic of the ANOVA–like testing scheme at some chosen family-wise-error-rate (FWER) to control for the detection of some false positive features. In a 10-fold cross validation, the hyper-parameters of the SVM were tuned to determine the appropriate kernel using one-versus-all approach. The entire simulated dataset was randomly partitioned into 95% training and 5% test sets with the SVM classifier built on the training sets while its prediction accuracy on the response class was assessed on the test sets over 1000 Monte-Carlo cross-validation (MCCV) runs. The classification results of the proposed classifier were assessed using the Misclassification Error Rates (MERs) and other performance indices. Results from the Monte-Carlo study showed that the proposed SVM classifier was quite efficient by yielding high prediction accuracy of the response groups with fewer differentially expressed features than when all the features were employed for classification. The performance of this new method on some published cancer data sets shall be examined vis-à-vis other state-of-the-earth machine learning methods in future works.
- ItemOn the Approximation of Pareto Distribution to Exponential Distribution Using the Gini Coefficient of Inequality(Professional Statisticians Society of Nigeria (PSSN), 2017) Yahya W.B.; Garba M. K.; Amidu L; Olorede, K. O.; Gatta, N. F.; Amusa L. B.Pareto proposed that income and wealth distribution obeys a universal power law valid for all times and countries, but subsequent studies have often disputed this position. Some even argued there is indeed no Pareto Law and that it should be entirely discarded in studies on distribution of wealth or resources. Many other probability distributions have been proposed such as log normal, exponential, gamma and two other forms by Pareto himself. Using data on imported goods from the National Bureau of Statistics as a case of distribution of wealth in Nigeria, we demonstrated that the distribution of money spent on importation in Nigeria also follow exponential distribution using the Gini coefficient which is a measure of inequality (degree of concentration) of a variable in the distribution of resources. Simulation studies were carried out at different sizes of items (or households) and varying values of the shape parameter and we compare how close the Gini coefficients of the exponential distribution approximate those obtained from the Pareto data as a credible alternative to Pareto distribution.