Comparing Machine Learning and Human Judge in SATU Indonesia Awarding Processes

Onno W. Purbo

doi:10.26555/jiteki.v7i3.22201

Authors

Onno W. Purbo Institut Teknologi Tangerang Selatan (ITTS)

DOI:

https://doi.org/10.26555/jiteki.v7i3.22201

Keywords:

Machine Learning, Random Forest, Orange Data Mining

Abstract

For more than ten years, SATU Indonesia Awards, with PT. Astra International Tbk's support is given to inspiring young Indonesians. Every year, more than 10,000 nominations must be short-listed to 90 nominations within one week with five (5) assessment parameters. The research contributions are (1) creating a machine learning mechanism for the awarding process from ten years of the SATU Indonesia Awards nomination archive, (2) creating two (2) models of training data for the five (5) assessed parameters, namely motivation, obstacle, outcome, outreach, and sustainability, and (3) compare machine learning prediction with 2021 judge's assessment. TEMPO Data and Analysis Center (PDAT) extracts the corpus training data from ten years' SATU Indonesia Awards data in six months. The corpus training data contains nomination texts with Judges' scores on motivation, obstacle, outcome, outreach, and sustainability. Two (2) corpus training data and two models were generated with, namely, (1) the average Judges' parameter value per instance and (2) the Judges' smallest value and stored in two (2) corpus of 1220 instances each. The classification model was generated by Random Forest, which has the slightest error among the classification algorithms tested. The first model aims to predict the nomination assessment parameters. The second model is to detect the outlier in the incoming nominees for extraordinary nominees. The machine learning predictions were compared and found to be similar to the 2021 judge's assessment in the awarding processes at SATU Indonesia Awards. The average Judges' pre-final 2021 nominees' scores are compared to the Random Forest's predictions and found to be reasonably similar, with a small RMSE error around 1.1 to 1.6 for all assessment parameters. The smallest RMSE was obtained in the Sustainability parameter. The Obstacle parameter was found to have the largest RMSE.

Author Biography

Onno W. Purbo, Institut Teknologi Tangerang Selatan (ITTS)

Vice Rector

References

M. Fathony, A. Khaq, and E. Endri, â€œThe effect of corporate social responsibility and financial performance on stock returns,â€ Int. J. Innov. Creat. Chang., vol. 13, no. 1, 2020. https://www.ijicc.net/images/vol_13/13120_Fathony_2020_E_R.pdf

N. D. Hidayati, â€œPattern of corporate social responsibility programs: A case study,â€ Soc. Responsib. J., vol. 7, no. 1, 2011. https://doi.org/10.1108/17471111111114576

Y. Liu, R. Huang, and J. Yu, â€œTowards award prediction based on big data co-author network,â€ 2019. https://doi.org/10.1109/ICCCBDA.2019.8725612

J. Wu et al., â€œProduct Design Award Prediction Modeling: Design Visual Aesthetic Quality Assessment via DCNNs,â€ IEEE Access, vol. 8, 2020. https://doi.org/10.1109/ACCESS.2020.3039715

M. KokoÃ§, G. AkÃ§apÄ±nar, and M. N. Hasnine, â€œUnfolding Studentsâ€™ Online Assignment Submission Behavioral Patterns Using Temporal Learning Analytics,â€ Educ. Technol. Soc., vol. 24, no. 1, 2021. https://eric.ed.gov/?id=EJ1292999

D. Tempelaar, B. Rienties, and Q. Nguyen, â€œThe Contribution of Dispositional Learning Analytics to Precision Education,â€ Educ. Technol. Soc., vol. 24, no. 1, 2021. http://oro.open.ac.uk/74065/8/74065VOR.pdf

H. Luan and C. C. Tsai, â€œA Review of Using Machine Learning Approaches for Precision Education,â€ Educ. Technol. Soc., vol. 24, no. 1, pp. 250â€“266, 2021. https://eric.ed.gov/?id=EJ1292868

C. C. Y. Yang, I. Y. L. Chen, and H. Ogata, â€œToward Precision Education: Educational Data Mining and Learning Analytics for Identifying Studentsâ€™ Learning Patterns with Ebook Systems,â€ Educ. Technol. Soc., vol. 24, no. 1, 2021. https://eric.ed.gov/?id=EJ1292957

X. Chen, H. Xie, D. Zou, and G.-J. Hwang, â€œApplication and theory gaps during the rise of Artificial Intelligence in Education,â€ Comput. Educ. Artif. Intell., vol. 1, p. 100002, 2020. https://doi.org/10.1016/j.caeai.2020.100002

J. Y. Wu, C. C. Y. Yang, C. H. Liao, and M. W. Nian, â€œAnalytics 2.0 for Precision Education: An Integrative Theoretical Framework of the Human and Machine Symbiotic Learning,â€ Educ. Technol. Soc., vol. 24, no. 1, 2021. https://eric.ed.gov/?id=EJ1292867

Y. Lan, Y. Hao, K. Xia, B. Qian, and C. Li, â€œStacked Residual Recurrent Neural Networks with Cross-Layer Attention for Text Classification,â€ IEEE Access, vol. 8, 2020. https://doi.org/10.1109/ACCESS.2020.2987101

J. Du, C. M. Vong, and C. L. Philip Chen, â€œNovel Efficient RNN and LSTM-Like Architectures: Recurrent and Gated Broad Learning Systems and Their Applications for Text Classification,â€ IEEE Trans. Cybern., vol. 51, no. 3, 2021. https://doi.org/10.1109/TCYB.2020.2969705

M. U. Salur and I. Aydin, â€œA Novel Hybrid Deep Learning Model for Sentiment Classification,â€ IEEE Access, vol. 8, 2020. https://doi.org/10.1109/ACCESS.2020.2982538

J. Zheng and L. Zheng, â€œA Hybrid Bidirectional Recurrent Convolutional Neural Network Attention-Based Model for Text Classification,â€ IEEE Access, vol. 7, 2019. https://doi.org/10.1109/ACCESS.2019.2932619

Y. S. Mehanna and M. Bin Mahmuddin, â€œA Semantic Conceptualization Using Tagged Bag-of-Concepts for Sentiment Analysis,â€ IEEE Access, vol. 9, 2021. https://doi.org/10.1109/ACCESS.2021.3107237

K. Liu and L. Chen, â€œMedical Social Media Text Classification Integrating Consumer Health Terminology,â€ IEEE Access, vol. 7, 2019. https://doi.org/10.1109/ACCESS.2019.2921938

H. Tang, Y. Mi, F. Xue, and Y. Cao, â€œAn Integration Model Based on Graph Convolutional Network for Text Classification,â€ IEEE Access, vol. 8, 2020. https://doi.org/10.1109/ACCESS.2020.3015770

K. Fiok et al., â€œText Guide: Improving the Quality of Long Text Classification by a Text Selection Method Based on Feature Importance,â€ IEEE Access, vol. 9, 2021. https://doi.org/10.1109/ACCESS.2021.3099758

C. N. Tulu, O. Ozkaya, and U. Orhan, â€œAutomatic Short Answer Grading with SemSpace Sense Vectors and MaLSTM,â€ IEEE Access, vol. 9, 2021. https://doi.org/10.1109/ACCESS.2021.3054346

O. J. Ying, M. M. A. Zabidi, N. Ramli, and U. U. Sheikh, â€œSentiment analysis of informal malay tweets with deep learning,â€ IAES Int. J. Artif. Intell., vol. 9, no. 2, 2020. https://doi.org/10.11591/ijai.v9.i2.pp212-220

A. Amalia, O. S. Sitompul, E. B. Nababan, M. S. Lydia, and N. Rahmatunnisa, â€œBahasa Indonesia text corpus generation using web corpora approaches,â€ J. Theor. Appl. Inf. Technol., vol. 97, no. 24, 2019. http://www.jatit.org/volumes/Vol97No24/14Vol97No24.pdf

B. B. Kadaru, M. Umamaheswararao, and C. Science, â€œAn Overview of General Data Mining Tools,â€ Int. Res. J. Eng. Technol., vol. 4, no. 9, 2017. https://www.irjet.net/archives/V4/i9/IRJET-V4I9165.pdf

A. Naik and L. Samant, â€œCorrelation Review of Classification Algorithm Using Data Mining Tool: WEKA, Rapidminer, Tanagra, Orange and Knime,â€ in Procedia Computer Science, 2016, vol. 85. https://doi.org/10.1016/j.procs.2016.05.251

J. DemÅ¡ar et al., â€œOrange: Data mining toolbox in python,â€ J. Mach. Learn. Res., vol. 14, 2013. https://jmlr.org/papers/volume14/demsar13a/demsar13a.pdf

D. N. Gujarati, Linear Regression: A Mathematical Introduction. 2020. https://doi.org/10.4135/9781071802571

A. Pant, â€œIntroduction to Linear Regression and Polynomial Regression,â€ Towards Data Science, 2019.

M. Pal and P. Bharati, â€œIntroduction to Correlation and Linear Regression Analysis,â€ in Applications of Regression Techniques, 2019. https://doi.org/10.1007/978-981-13-9314-3

J. Fox and S. Weisberg, An R Companion to Applied Regression, Third edition, Sage publications, 2019.

M. Kwak and S. B. Kim, â€œUnsupervised Abnormal Sensor Signal Detection with Channelwise Reconstruction Errors,â€ IEEE Access, vol. 9, 2021. https://doi.org/10.1109/ACCESS.2021.3064563

T. S. Buda, M. Khwaja, and A. Matic, â€œOutliers in Smartphone Sensor Data Reveal Outliers in Daily Happiness,â€ Proc. ACM Interactive, Mobile, Wearable Ubiquitous Technol., vol. 5, no. 1, 2021. https://doi.org/10.1145/3448095

H. O. Marques, R. J. G. B. Campello, J. Sander, and A. Zimek, â€œInternal Evaluation of Unsupervised Outlier Detection,â€ ACM Trans. Knowl. Discov. Data, vol. 14, no. 4, 2020. https://doi.org/10.1145/3394053

N. Reunanen, T. RÃ¤ty, and T. Lintonen, â€œAutomatic optimization of outlier detection ensembles using a limited number of outlier examples,â€ Int. J. Data Sci. Anal., vol. 10, no. 4, 2020. https://doi.org/10.1007/s41060-020-00222-4

H. Wang, M. J. Bah, and M. Hammad, â€œProgress in Outlier Detection Techniques: A Survey,â€ IEEE Access, vol. 7, 2019. https://doi.org/10.1109/ACCESS.2019.2932769

M. Kang and E. Choi, Machine Learning. WORLD SCIENTIFIC, 2021. https://doi.org/10.1142/12037

M. Kubat, An Introduction to Machine Learning. Cham: Springer International Publishing, 2021. https://doi.org/10.1007/978-3-030-81935-4

M. Nabipour, P. Nayyeri, H. Jabani, S. Shahab, and A. Mosavi, â€œPredicting Stock Market Trends Using Machine Learning and Deep Learning Algorithms Via Continuous and Binary Data; A Comparative Analysis,â€ IEEE Access, vol. 8, 2020. https://doi.org/10.1109/ACCESS.2020.3015966

H. Zhang, Z. Fu, and K. I. Shu, â€œRecognizing Ping-Pong Motions Using Inertial Data Based on Machine Learning Classification Algorithms,â€ IEEE Access, vol. 7, 2019. https://doi.org/10.1109/ACCESS.2019.2953772

Y. Nieto, V. Gacia-Diaz, C. Montenegro, C. C. Gonzalez, and R. Gonzalez Crespo, â€œUsage of Machine Learning for Strategic Decision Making at Higher Educational Institutions,â€ IEEE Access, vol. 7, 2019. https://doi.org/10.1109/ACCESS.2019.2919343

I. Kaur and A. Kaur, â€œA Novel Four-Way Approach Designed with Ensemble Feature Selection for Code Smell Detection,â€ IEEE Access, vol. 9, 2021. https://doi.org/10.1109/ACCESS.2021.3049823

C. Yin, Y. Zhu, J. Fei, and X. He, â€œA Deep Learning Approach for Intrusion Detection Using Recurrent Neural Networks,â€ IEEE Access, vol. 5, 2017. https://doi.org/10.1109/ACCESS.2017.2762418

J. Hartmann, J. Huppertz, C. Schamp, and M. Heitmann, â€œComparing automated text classification methods,â€ Int. J. Res. Mark., vol. 36, no. 1, 2019. https://doi.org/10.1016/j.ijresmar.2018.09.009

A. Mamgain, â€œGuidance to Data Mining in Python,â€ Int. J. Sci. Res. Comput. Sci. Eng. Inf. Technol., vol. 3, no. 6, 2018. https://ijsrcseit.com/CSEIT1836128

D. Lilja, Linear Regression Using R: An Introduction to Data Modeling. 2016. https://doi.org/10.24926/8668/1301

About the Journal	Journal Policies	Author	Information
Focus and Scope Editorial Board Reviewer Open Access Policy Sponsorships Contact Us Google Scholar Most Cited Paper	Publication Ethics Peer Review Process Review Guideline Archiving Advertising	Author Guidelines Online Submission Publication Charge / Fee Plagiarism Policy Article Withdrawal	For Readers For Authors Journal History For Editor For Reviewer

Comparing Machine Learning and Human Judge in SATU Indonesia Awarding Processes

Authors

DOI:

Keywords:

Abstract

Author Biography

Onno W. Purbo, Institut Teknologi Tangerang Selatan (ITTS)

References

Downloads

Published

How to Cite

Issue

Section

License

Similar Articles

special_links

journal_metrics

current_indexing

journal_template_2

Make a Submission

sinta_certificate

visitor_country

visitors

Information