iGWO-RF: an Improved Grey Wolfed Optimization for Random Forest Hyperparameter Optimization to Identification Breast Cancer

Authors

  • Elvaro Islami Muryadi Khon Kaen University
  • Irianna Futri Khon Kaen University
  • Dimas Chaerul Ekty Saputra Khon Kaen University

DOI:

https://doi.org/10.26555/jiteki.v10i4.29300

Keywords:

Breast Cancer Prediction, Random Forest, Hyperparameter Optimization, Grey Wolf Optimization, Improved Grey Wolf Optimization, Classification

Abstract

The study focuses on improving the accuracy of breast cancer diagnosis by enhancing the predictive capabilities of a Random Forest model. This is achieved by utilizing an improved Grey Wolf Optimization algorithm for hyperparameter optimization. The main objectives are to enhance early detection, increase diagnostic precision, and reduce computational demands in clinical workflows. The work utilizes the Improved Grey Wolf Optimization (iGWO) algorithm to tune the hyperparameters of a Random Forest (RF) model, thereby improving its accuracy in diagnosing breast cancer. The methodology encompasses several steps, including data preparation, model training using iGWO-enhanced RF, performance evaluation compared to traditional methods, and validation using clinical datasets to confirm the reliability and effectiveness of the approach. The iGWO-RF model demonstrated exceptional performance in diagnosing breast cancer, achieving an accuracy of 96.4%, precision of 96.4%, recall of 98.0%, F1-score of 97.2%, and ROC-AUC of 0.988. The findings of iGWO-RF outperform those of SVM, original RF, Naive Bayes, and KNN models, indicating that iGWO-RF is effective in optimizing hyperparameters to improve prediction accuracy. The iGWO-RF model greatly enhances the accuracy and efficiency of breast cancer diagnosis, surpassing conventional models. Integrating iGWO-RF into clinical workflows is advised to improve early identification and patient outcomes. Additional investigation should focus on the utilization of this technology in various medical datasets and circumstances, highlighting its potential in a wide range of healthcare environments.

Author Biographies

Elvaro Islami Muryadi, Khon Kaen University

Department of Community, Occupational, and Family Medicine

Irianna Futri, Khon Kaen University

Department of International Technology and Innovation Management

Dimas Chaerul Ekty Saputra, Khon Kaen University

Department of Computer Science and Information Technology

References

[1] M. Arnold et al., “Current and future burden of breast cancer: Global statistics for 2020 and 2040,” Breast, vol. 66, pp. 15–23, Sep. 2022, https://doi.org/10.1016/j.breast.2022.08.010.

[2] M. Zubair, S. Wang, and N. Ali, “Advanced Approaches to Breast Cancer Classification and Diagnosis,” Front. Pharmacol., vol. 11, p. 632079, Feb. 2021, https://doi.org/10.3389/fphar.2020.632079.

[3] R. Sharma, “Global, regional, national burden of breast cancer in 185 countries: evidence from GLOBOCAN 2018,” Breast Cancer Res Treat, vol. 187, no. 2, pp. 557–567, Jun. 2021, https://doi.org/10.1007/s10549-020-06083-6.

[4] M. Tariq, S. Iqbal, H. Ayesha, I. Abbas, K. T. Ahmad, and M. F. K. Niazi, “Medical image based breast cancer diagnosis: State of the art and future directions,” Expert Systems with Applications, vol. 167, p. 114095, Apr. 2021, https://doi.org/10.1016/j.eswa.2020.114095.

[5] M. Amgad et al., “NuCLS: A scalable crowdsourcing approach and dataset for nucleus classification and segmentation in breast cancer,” GigaScience, vol. 11, p. giac037, May 2022, https://doi.org/10.1093/gigascience/giac037.

[6] D. C. E. Saputra, Y. Maulana, E. Faristasari, A. Ma”arif, and I. Suwarno, “Machine Learning Performance Analysis for Classification of Medical Specialties,” in Proceeding of the 3rd International Conference on Electronics, Biomedical Engineering, and Health Informatics, vol. 1008, T. pp. 513–528,2023, https://doi.org/10.1007/978-981-99-0248-4_34.

[7] D. A. Ragab, O. Attallah, M. Sharkas, J. Ren, and S. Marshall, “A framework for breast cancer classification using Multi-DCNNs,” Computers in Biology and Medicine, vol. 131, p. 104245, Apr. 2021, https://doi.org/10.1016/j.compbiomed.2021.104245.

[8] B. Smolarz, A. Z. Nowak, and H. Romanowicz, “Breast Cancer—Epidemiology, Classification, Pathogenesis and Treatment (Review of Literature),” Cancers, vol. 14, no. 10, p. 2569, May 2022, https://doi.org/10.3390/cancers14102569.

[9] C. Chakraborty, A. Kishor, and J. J. P. C. Rodrigues, “Novel Enhanced-Grey Wolf Optimization hybrid machine learning technique for biomedical data computation,” Computers and Electrical Engineering, vol. 99, p. 107778, Apr. 2022, https://doi.org/10.1016/j.compeleceng.2022.107778.

[10] S. Sundaramurthy and P. Jayavel, “A hybrid Grey Wolf Optimization and Particle Swarm Optimization with C4.5 approach for prediction of Rheumatoid Arthritis,” Applied Soft Computing, vol. 94, p. 106500, Sep. 2020, https://doi.org/10.1016/j.asoc.2020.106500.

[11] A. Bilal, G. Sun, S. Mazhar, and A. Imran, “Improved Grey Wolf Optimization-Based Feature Selection and Classification Using CNN for Diabetic Retinopathy Detection,” in Evolutionary Computing and Mobile Sustainable Networks, vol. 116, pp. 1–14, 2022, https://doi.org/10.1007/978-981-16-9605-3_1.

[12] X. Wang, Z. Li, H. Kang, Y. Huang, and D. Gai, “Medical Image Segmentation using PCNN based on Multi-feature Grey Wolf Optimizer Bionic Algorithm,” J Bionic Eng, vol. 18, no. 3, pp. 711–720, May 2021, https://doi.org/10.1007/s42235-021-0049-4.

[13] R. Ahmadi, G. Ekbatanifard, and P. Bayat, “A Modified Grey Wolf Optimizer Based Data Clustering Algorithm,” Applied Artificial Intelligence, vol. 35, no. 1, pp. 63–79, Jan. 2021, https://doi.org/10.1080/08839514.2020.1842109.

[14] D. M. Belete and M. D. Huchaiah, “Grid search in hyperparameter optimization of machine learning models for prediction of HIV/AIDS test results,” International Journal of Computers and Applications, vol. 44, no. 9, pp. 875–886, Sep. 2022, https://doi.org/10.1080/1206212X.2021.1974663.

[15] A. G. Gad, “Particle Swarm Optimization Algorithm and Its Applications: A Systematic Review,” Arch Computat Methods Eng, vol. 29, no. 5, pp. 2531–2561, Aug. 2022, https://doi.org/10.1007/s11831-021-09694-4.

[16] “What Is Cancer? - NCI”. Accessed: Jun. 01, 2024. [Online]. Available: https://www.cancer.gov/about-cancer/understanding/what-is-cancer.

[17] Halodoc, “Kanker Payudara - Gejala, Penyebab, dan Pengobatan,” halodoc. Accessed: Jun. 01, 2024. [Online]. Available: https://www.halodoc.com/kesehatan/kanker-payudara.

[18] “Sekretariat Nasional ASEAN – Indonesia,” Accessed: Jun. 01, 2024. [Online]. Available: https://setnasasean.id/news/read/jumlah-kasus-dan-kematian-akibat-kanker-payudara-di-indonesia-tertinggi-di-asean.

[19] “Globocan 2018 Latest global cancer data,” Accessed: Jun. 01, 2024. [Online]. Available: https://www.iarc.who.int/infographics/globocan-2018-latest-global-cancer-data.

[20] “Tips Mengurangi Risiko Penyakit Kanker Payudara,” Accessed: Jun. 02, 2024. [Online]. Available: https://dinkes.jakarta.go.id/berita/read/tips-mengurangi-risiko-penyakit-kanker-payudara.

[21] D. U. H. Putro, A. R. I. Darmayanti, R. Tandiola, and K. Aulawi, “Pengendalian Infeksi pada Pasien Kanker: Literature Review,” Jurnal Kesehatan Vokasional, vol. 8, no. 1, Art. no. 1, Apr. 2023, https://doi.org/10.22146/jkesvo.67677.

[22] W. Gautama, “Breast Cancer in Indonesia in 2022: 30 Years of Marching in Place,” Indonesian Journal of Cancer, vol. 16, p. 1, Apr. 2022, https://doi.org/10.33371/ijoc.v16i1.920

[23] M. P. Ningrum and R. S. R. Rahayu, “Determinan Kejadian Kanker Payudara pada Wanita Usia Subur (15-49 Tahun),” Indonesian Journal of Public Health and Nutrition, vol. 1, no. 3, Art. no. 3, Nov. 2021, https://doi.org/10.15294/ijphn.v1i3.46094.

[24] “Pembuatan Aplikasi Deteksi Dini Kanker Payudara Berbasis Android,” DRPM. Accessed: Jun. 02, 2024. [Online]. Available: https://drpm.unpad.ac.id/pembuatan-aplikasi-deteksi-dini-kanker-payudara-berbasis-android/.

[25] S. Mirjalili, S. M. Mirjalili, and A. Lewis, “Grey Wolf Optimizer,” Advances in Engineering Software, vol. 69, pp. 46–61, Mar. 2014, https://doi.org/10.1016/j.advengsoft.2013.12.007.

[26] S. N. Makhadmeh et al., “Recent Advances in Grey Wolf Optimizer, its Versions and Applications: Review,” IEEE Access, vol. 12, pp. 22991–23028, 2024, https://doi.org/10.1109/ACCESS.2023.3304889.

[27] H. Faris, I. Aljarah, M. A. Al-Betar, and S. Mirjalili, “Grey wolf optimizer: a review of recent variants and applications,” Neural Comput & Applic, vol. 30, no. 2, pp. 413–435, Jul. 2018, https://doi.org/10.1007/s00521-017-3272-5.

[28] S. Al Afghani Edsa and K. Sunat, “Hybridization of Modified Grey Wolf Optimizer and Dragonfly for Feature Selection,” in Data Science and Artificial Intelligence, vol. 1942, pp. 35–42, 2023, https://doi.org/10.1007/978-981-99-7969-1_3.

[29] D. C. E. Saputra, K. Sunat, and T. Ratnaningsih, “A New Artificial Intelligence Approach Using Extreme Learning Machine as the Potentially Effective Model to Predict and Analyze the Diagnosis of Anemia,” Healthcare, vol. 11, no. 5, p. 697, Feb. 2023, https://doi.org/10.3390/healthcare11050697.

[30] O. Köksoy, “Multiresponse robust design: Mean square error (MSE) criterion,” Applied Mathematics and Computation, vol. 175, no. 2, pp. 1716–1729, Apr. 2006, https://doi.org/10.1016/j.amc.2005.09.016.

[31] M. Ćalasan, S. H. E. Abdel Aleem, and A. F. Zobaa, “On the root mean square error (RMSE) calculation for parameter estimation of photovoltaic models: A novel exact analytical solution based on Lambert W function,” Energy Conversion and Management, vol. 210, p. 112716, Apr. 2020, https://doi.org/10.1016/j.enconman.2020.112716.

[32] J. Muschelli, “ROC and AUC with a Binary Predictor: a Potentially Misleading Metric,” J Classif, vol. 37, no. 3, pp. 696–708, Oct. 2020, https://doi.org/10.1007/s00357-019-09345-1.

[33] D. Chicco and G. Jurman, “The Matthews correlation coefficient (MCC) should replace the ROC AUC as the standard metric for assessing binary classification,” BioData Mining, vol. 16, no. 1, p. 4, Feb. 2023, https://doi.org/10.1186/s13040-023-00322-4.

[34] A. J. Bowers and X. Zhou, “Receiver Operating Characteristic (ROC) Area Under the Curve (AUC): A Diagnostic Measure for Evaluating the Accuracy of Predictors of Education Outcomes,” Journal of Education for Students Placed at Risk (JESPAR), vol. 24, no. 1, pp. 20–46, Jan. 2019, https://doi.org/10.1080/10824669.2018.1523734.

[35] “Deteksi Kanker Bisa lewat Aplikasi – Library”. Accessed: Jun. 02, 2024. [Online]. Available: https://www.ciputra.ac.id/library/deteksi-kanker-bisa-lewat-aplikasi/.

[36] M. Saldaña-Téllez, S. Meneses-Navarro, L. Cano-Garduño, and K. Unger-Saldaña, “Barriers and facilitators for breast cancer early diagnosis in an indigenous community in Mexico: voices of otomí women,” BMC Womens Health, vol. 24, p. 33, Jan. 2024, https://doi.org/10.1186/s12905-023-02875-2.

[37] M. T. R. Hamid, N. A. Mumin, S. A. Hamid, and K. Rahmat, “Application of Artificial Intelligence (AI) System in Opportunistic Screening and Diagnostic Population in a Middle-income Nation,” Current Medical Imaging, vol. 20, no. 1, pp. 1–10, 2024, https://doi.org/10.2174/0115734056280191231207052903.

[38] L.-A. Dang et al., “Impact of artificial intelligence in breast cancer screening with mammography,” Breast Cancer, vol. 29, no. 6, pp. 967–977, 2022, https://doi.org/10.1007/s12282-022-01375-9.

[39] J. Melnikow et al., Supplemental Screening for Breast Cancer in Women With Dense Breasts: A Systematic Review for the U.S. Preventive Service Task Force. in U.S. Preventive Services Task Force Evidence Syntheses, formerly Systematic Evidence Reviews. Rockville (MD): Agency for Healthcare Research and Quality (US), 2016. Accessed: Jun. 02, 2024. [Online]. Available: http://www.ncbi.nlm.nih.gov/books/NBK343793/.

[40] E. Movik, T. K. Dalsbø, B. C. Fagelund, E. G. Friberg, L. L. Håheim, and Å. Skår, Digital Breast Tomosynthesis with Hologic 3D Mammography Selenia Dimensions System for Use in Breast Cancer Screening: A Single Technology Assessment, 2017, https://fhi.brage.unit.no/fhi-xmlui/bitstream/handle/11250/2457872/Movik_2017_Dig.pdf?sequence=1.

[41] M. A. Secretariat, “Cancer Screening with Digital Mammography for Women at Average Risk for Breast Cancer, Magnetic Resonance Imaging (MRI) for Women at High Risk,” Ont Health Technol Assess Ser, vol. 10, no. 3, pp. 1–55, Mar. 2010, https://pmc.ncbi.nlm.nih.gov/articles/PMC3377503/.

Downloads

Published

2024-11-24

How to Cite

[1]
E. I. Muryadi, I. Futri, and D. C. E. Saputra, “iGWO-RF: an Improved Grey Wolfed Optimization for Random Forest Hyperparameter Optimization to Identification Breast Cancer”, J. Ilm. Tek. Elektro Komput. Dan Inform, vol. 10, no. 4, pp. 665–680, Nov. 2024.

Issue

Section

Articles

Similar Articles

1 2 > >> 

You may also start an advanced similarity search for this article.