Linkage Detection of Features that Cause Stroke using Feyn Qlattice Machine Learning Model

Purwono Purwono; Alfian Ma'arif; Iis Setiawan Mangku Negara; Wahyu Rahmaniar; Jihad Rahmawan

doi:10.26555/jiteki.v7i3.22237

Authors

Purwono Purwono Universitas Harapan Bangsa http://orcid.org/0000-0002-7357-0405
Alfian Ma'arif Universitas Ahmad Dahlan http://orcid.org/0000-0002-3482-971X
Iis Setiawan Mangku Negara Universitas Harapan Bangsa
Wahyu Rahmaniar National Taipei University of Technology, Taiwan http://orcid.org/0000-0002-6902-5455
Jihad Rahmawan Iwate Prefectural University

DOI:

https://doi.org/10.26555/jiteki.v7i3.22237

Keywords:

Stroke, Machine Learning, Qlattice, Predictor, Ehr

Abstract

Stroke is a disease caused by brain tissue damage because of blockage in the cerebrovascular system that disrupts body sensory and motoric systems Stroke disease is one of the highest death cause in the world. Data collection from Electronic Health Records (EHR) is increasing and has been included in the health service big data. It can be processed and analyzed using machine learning to determine the risk group of stroke disease. Machine learning can be used as a predictor of stroke causes, while the predictor clarifies the influence of each cause factor of the disease. Our contribution in this research is to evaluate Feyn Qlattice machine learning models to detect the influence of stroke disease's main cause features. We attempt to obtain a correlation between features of the stroke disease, especially on the gender as a feature, whether any other features can influence the gender feature. This research utilizes 4908 data of the disease predictor using the Feyn Qlattice model. The result implies that gender highly impacts age and hypertension on stroke disease causes. Autorun in Feyn Qlattice model was run with ten epochs, resulting in 17596 test models at 57s. Query string parameter that was focused on age and hypertension features resulted in 1245 models at 4s. An increase of accuracy was found in training metrics from 0.723 to 0.732 and in testing metrics from 0.695 to 0.708. Evaluation results showed that the model is reasonably good as a predictor of stroke disease, indicated with blue lines of AUC in training and testing metrics close to ROC's left side peak curve.

References

J. Liu, Y. Sun, J. Ma, J. Tu, Y. Deng, P. He, R. Li, F. Hu, H. Huang, X. Zhou, and S. Xu, â€œAnalysis of main risk factors causing stroke in Shanxi Province based on machine learning models,â€ Informatics Med. Unlocked, vol. 26, p. 100712, 2021. https://doi.org/10.1016/j.imu.2021.100712

W. C. Chen, M. Y. Hsiao, and T. G. Wang, â€œPrognostic factors of functional outcome in post-acute stroke in the rehabilitation unit,â€ Journal of Formosan Medical Association, 2021. https://doi.org/10.1016/j.jfma.2021.07.009

J. D. Perkins, S. S. Wilkins, S. Kamran, and A. Shuaib, â€œPost-traumatic stress disorder and its association with stroke and stroke risk factors: A literature review,â€ Neurobiol. Stress, vol. 14, no. 100332, pp. 1â€“14, 2021. https://doi.org/10.1016/j.ynstr.2021.100332

A. Tjan, I. G. R. Widiana, E. D. Martadiani, I. M. D. P. Ayusta, M. W. Asih, and F. P. Sitanggang, â€œCarotid artery stiffness measured by strain elastography ultrasound is a stroke risk factor,â€ Clin. Epidemiol. Glob. Heal., vol. 12, no. May, pp. 1â€“5, 2021. https://doi.org/10.1016/j.cegh.2021.100850

O. Ookeditse, T. R. Motswakadikgwa, K. K. Ookeditse, G. Masilo, Y. Bogatsu, B. C. Lekobe, M. Mosepele, H. Schirmer, and S. H. Johnsen â€œHealthcare professionalsâ€™ knowledge of modifiable stroke risk factors: A cross-sectional questionnaire survey in greater Gaborone, Botswana,â€ eNeurologicalSci, vol. 25, no. 100365, pp. 1â€“6, 2021. https://doi.org/10.1016/j.ensci.2021.100365

F. Khennou, Y. I. Khamlichi, and N. E. H. Chaoui, â€œImproving the use of big data analytics within electronic health records: A case study based OpenEHR,â€ in Procedia Computer Science, 2018, vol. 127, pp. 60â€“68. https://doi.org/10.1016/j.procs.2018.01.098

M. Tavana, â€œTransforming healthcare one byte at a time in the world of big data,â€ Healthc. Anal., vol. 1, p. 100003, 2021. https://doi.org/10.1016/j.health.2021.100003

Y. Yang, X. Zheng, W. Guo, X. Liu, and V. Chang, â€œPrivacy-preserving fusion of IoT and big data for e-health,â€ Futur. Gener. Comput. Syst., vol. 86, pp. 1437â€“1455, 2018. https://doi.org/10.1016/j.future.2018.01.003

Beata Butryn, I. Chomiak-Orsa, K. Hauke, M. Pondel, Agnieszka, and Siennicka, â€œApplication of Machine Learning in medical data analysis illustrated with an example of association rules,â€ in Procedia Computer Science, 2021, vol. 192, pp. 3134â€“3143. https://doi.org/10.1016/j.procs.2021.09.086

J. Waring, C. Lindvall, and R. Umeton, â€œAutomated machine learning: Review of the state-of-the-art and opportunities for healthcare,â€ Artif. Intell. Med., vol. 104, no. October, p. 101822, 2020. https://doi.org/10.1016/j.artmed.2020.101822

K. Kosteva, T. Wu, Y. Wang, K. Chaudhuri, and C. Tanislav, â€œPredicting the risk of stroke in patients with late-onset epilepsy: A machine learning approach,â€ Epilepsy Behav., vol. 122, p. 108211, 2021. https://doi.org/10.1016/j.yebeh.2021.108211

L. Velagapudi, N. Mouchtouris, M. P. Baldassari, D. Nauheim, O. Khanna, F. A. Saiegh, N. Herial, M. R. Gooch, S. Tjoumakaris, R. H. Rosenwasser, and P. Jabbour, â€œDiscrepancies in Stroke Distribution and Dataset Origin in Machine Learning for Stroke,â€ J. Stroke Cerebrovasc. Dis., vol. 30, no. 7, p. 105832, 2021. https://doi.org/10.1016/j.jstrokecerebrovasdis.2021.105832

H. Zhu, L. Jiang, H. Zhang, L. Luo, Y. Chen, and Y. Chen, â€œAn automatic machine learning approach for ischemic stroke onset time identification based on DWI and FLAIR imaging,â€ NeuroImage Clin., vol. 31, p. 102744, 2021. https://doi.org/10.1016/j.nicl.2021.102744

A. Jamthikar, D. Gupta, N. N. Khanna, L. Saba, J. R. Laird, and J. S. Suri, â€œCardiovascular/stroke risk prevention: A new machine learning framework integrating carotid ultrasound image-based phenotypes and its harmonics with conventional risk factors,â€ Indian Heart J., vol. 72, no. 4, pp. 258â€“264, 2020. https://doi.org/10.1016/j.ihj.2020.06.004

Abzu, â€œThe QLattice is a radical new machine learning model,â€ 2020. https://www.abzu.ai/qlattice (accessed Oct. 06, 2021).

V. A. Bharadi, â€œQLattice Environment and Feyn QGraph Modelsâ€”A New Perspective Toward Deep Learning,â€ in Emerging Technologies for Healthcare, pp. 69â€“92 2021. https://doi.org/10.1002/9781119792345.ch3

Fedesoriano, â€œStroke Dataset,â€ 2020. https://www.kaggle.com/fedesoriano/stroke-prediction-dataset (accessed Oct. 06, 2021).

G. Y. Wong, F. H.F.Leung, and Sai-HoLing, â€œA hybrid evolutionary preprocessing method for imbalanced datasets,â€ Information Sciences, vol. 454â€“455, pp. 161â€“177, 2018. https://doi.org/10.1016/j.ins.2018.04.068

K. StÃ¶ger, D. Schneeberger, P. Kieseberg, and A. Holzinger, â€œLegal aspects of data cleansing in medical AI,â€ Comput. Law Secur. Rev., vol. 42, pp. 1â€“13, 2021. https://doi.org/10.1016/j.clsr.2021.105587

S. Sachan, F. Almaghrabi, J.-B. Yang, and D.-L. Xu, â€œEvidential reasoning for preprocessing uncertain categorical data for trustworthy decisions: An application on healthcare and finance,â€ Expert Syst. Appl., vol. 185, 2021. https://doi.org/10.1016/j.eswa.2021.115597

O. A. Olabanjo, B. S. Aribisala, M. Mazzara, and A. S. Wusu, â€œAn ensemble machine learning model for the prediction of danger zones: Towards a global counter-terrorism,â€ Soft Comput. Lett., vol. 3, p. 100020, 2021. https://doi.org/10.1016/j.socl.2021.100020

S. Gnat, â€œImpact of Categorical Variables Encoding on Property Mass Valuation,â€ in Procedia Computer Science, 2021, vol. 192, pp. 3542â€“3550. https://doi.org/10.1016/j.procs.2021.09.127

K. Pawluszek-Filipiak and A. Borkowski, â€œOn the importance of train-test split ratio of datasets in automatic landslide detection by supervised classification,â€ Remote Sens., vol. 12, no. 18, 2020. https://doi.org/10.3390/rs12183054

A. RÃ¡cz, D. Bajusz, and K. HÃ©berger, â€œEffect of dataset size and train/test split ratios in qsar/qspr multiclass classification,â€ Molecules, vol. 26, no. 4, pp. 1â€“16, 2021. https://doi.org/10.3390/molecules26041111

G. Sambasivam and G. D. Opiyo, â€œA predictive machine learning application in agriculture: Cassava disease detection and classification with imbalanced dataset using convolutional neural networks,â€ Egypt. Informatics J., vol. 22, no. 1, pp. 27â€“34, 2021. https://doi.org/10.1016/j.eij.2020.02.007

H. Seo, S. Back, S. Lee, D. Park, T. Kim, and K. Lee, â€œIntra- and inter-epoch temporal context network (IITNet) using sub-epoch features for automatic sleep scoring on raw single-channel EEG,â€ Biomed. Signal Process. Control, vol. 61, p. 102037, 2020. https://doi.org/10.1016/j.bspc.2020.102037

A. Luque, A. Carrasco, A. MartÃn, and A. de las Heras, â€œThe impact of class imbalance in classification performance metrics based on the binary confusion matrix,â€ Pattern Recognit., vol. 91, pp. 216â€“231, 2019. https://doi.org/10.1016/j.patcog.2019.02.023

K. R. Singh, K. P. Neethu, K. Madhurekaa, A. Harita, and P. Mohan, â€œParallel SVM model for forest fire prediction,â€ Soft Comput. Lett., vol. 3, p. 100014, 2021. https://doi.org/10.1016/j.socl.2021.100014

W. Rahmaniar, W.-J. Wang, C.-W. Chiu, and N.L. Hakim â€œReal-Time Bi-Directional People Counting Using an RGB-D Cameraâ€, Sensors Review, vol. 41, no. 4, pp. 341-349, 2021.

K. Gajowniczek and T. ZÄ…bkowski, â€œImbTreeAUC: An R package for building classification trees using the area under the ROC curve (AUC) on imbalanced datasets,â€ SoftwareX, vol. 15, p. 100755, 2021. https://doi.org/10.1016/j.softx.2021.100755

S. Yang and G. Berdine, â€œThe receiver operating characteristic (ROC) curve,â€ Southwest Respir. Crit. Care Chronicles, vol. 5, no. 19, p. 34, 2017. https://doi.org/10.12746/swrccc.v5i19.391

T. C. F. Polo and H. A. Miot, â€œUse of roc curves in clinical and experimental studies,â€ J. Vasc. Bras., vol. 19, pp. 1â€“4, 2020. https://doi.org/10.1590/1677-5449.200186

About the Journal	Journal Policies	Author	Information
Focus and Scope Editorial Board Reviewer Open Access Policy Sponsorships Contact Us Google Scholar Most Cited Paper	Publication Ethics Peer Review Process Review Guideline Archiving Advertising	Author Guidelines Online Submission Publication Charge / Fee Plagiarism Policy Article Withdrawal	For Readers For Authors Journal History For Editor For Reviewer

Linkage Detection of Features that Cause Stroke using Feyn Qlattice Machine Learning Model

Authors

DOI:

Keywords:

Abstract

References

Downloads

Published

How to Cite

Issue

Section

License

Similar Articles

Most read articles by the same author(s)

special_links

journal_metrics

current_indexing

journal_template_2

Make a Submission

sinta_certificate

visitor_country

visitors

Information