Stroke Treatment Prediction Using Features Selection Methods and Machine Learning Classifiers - 07/12/22

Doi : 10.1016/j.irbm.2022.02.002

I. Chourib ^a,^b,^{^⁎} , G. Guillard ^c, I.R. Farah ^a, B. Solaiman ^b
^a STICODE Departement, RIADI laboratory of National School of Computer Sciences, Manouba, Tunisia
^b MATHSTIC Departement, ITI laboratory of IMT Atlantique, Brest, France
^c Intradys, Brest, France

^⁎Corresponding author.

Graphical abstract

Il testo completo di questo articolo è disponibile in PDF.

Highlights

•	Find the most suitable couple: Feature Selection method-Machine Learning method.
•	Select the most relevant primitives from a large set of attributes.
•	Predict the appropriate treatment for a patient suffering of stroke disease.

Il testo completo di questo articolo è disponibile in PDF.

Abstract

Objectives

Feature selection in data sets is an important task allowing to alleviate various machine learning and data mining issues. The main objectives of a feature selection method consist on building simpler and more understandable classifier models in order to improve the data mining and processing performances. Therefore, a comparative evaluation of the Chi-square method, recursive feature elimination method, and tree-based method (using Random Forest) used on the three common machine learning methods (K-Nearest Neighbor, naïve Bayesian classifier and decision tree classifier) are performed to select the most relevant primitives from a large set of attributes. Furthermore, determining the most suitable couple (i.e., feature selection method-machine learning method) that provides the best performance is performed.

Materials and methods

In this paper, an overview of the most common feature selection techniques is first provided: the Chi-Square method, the Recursive Feature Elimination method (RFE) and the tree-based method (using Random Forest). A comparative evaluation of the improvement (brought by such feature selection methods) to the three common machine learning methods (K- Nearest Neighbor, naïve Bayesian classifier and decision tree classifier) are performed. For evaluation purposes, the following measures: micro-F1, accuracy and root mean square error are used on the stroke disease data set.

Results

The obtained results show that the proposed approach (i.e., Tree Based Method using Random Forest, TBM-RF, decision tree classifier, DTC) provides accuracy higher than 85%, F1-score higher than 88%, thus, better than the KNN and NB using the Chi-Square, RFE and TBM-RF methods.

Conclusion

This study shows that the couple - Tree Based Method using Random Forest (TBM-RF) decision tree classifier successfully and efficiently contributes to find the most relevant features and to predict and classify patient suffering of stroke disease.”

Il testo completo di questo articolo è disponibile in PDF.

Keywords : Stroke disease, Feature selection, Data mining, Decision tree classifier, Naive Bayes, K-nearest neighbor, Recursive feature elimination, Tree-based model, Chi-square

Mappa

Introduction

Materials and methods

Feature selection models

Feature selection techniques

Machine learning algorithms

Human and animal rights

Funding

Author contributions

CRediT authorship contribution statement

Esportazione

Vol 43 - N° 6

P. 678-686 - dicembre 2022 Ritorno al numero

Articolo precedente

Effect of Botulinum Toxin on Spasticity Level Assessed by Tonic Stretch Reflex Threshold: A Feasibility Pilot Study
A.R. Rezende, I.A. Marques, C.M. Alves, J.S.M. Shinosaki, E.L. Martins Naves

| Articolo seguente

Implication of the Nature and Texturation of Silicone Surfaces on the Grafting of PolyNaSS, a Bioactive Polymer
M. Lam, C. Falentin-Daudré

Benvenuto su EM|consulte, il riferimento dei professionisti della salute.
L'accesso al testo integrale di questo articolo richiede un abbonamento.

Già abbonato a @@106933@@ rivista ?

connettersi o creare un account

Stroke Treatment Prediction Using Features Selection Methods and Machine Learning Classifiers - 07/12/22

Graphical abstract

Highlights

Abstract

Objectives

Materials and methods

Results

Conclusion

Mappa

Citazioni Export

File

Contenuto

Il mio account

Aide & support

Dichiarazione CNIL