Speech classification using combination virtual center of gravity and k-means clustering based on audio feature extraction

Authors

  • Diah Kumalasari, Politeknik Negeri Samarinda
  • Arief Bramanto Wicaksono Putra, Politeknik Negeri Samarinda
  • Achmad Fanany Onnilita Gaffar, Politeknik Negeri Samarinda

Keywords:

Classification, Feature Extraction, K-Means, Virtual Center of Gravity

Abstract

Voice recognition can be performed in a variety of ways. Sound patterns can be recognized through sound feature extraction. The training sound data is built by selecting the best sound data using a correlation coefficient, based on the level of similarity between sound data, to obtain optimal sound features. In this research, sound feature extraction uses the Virtual Center of Gravity method. This method calculates the distance between the sound data and center-of-gravity points, visualized in three-dimensional form as white, black, and grey pattern spaces. The preprocessing stage produces complex-valued data consisting of real and imaginary parts. The Euclidean distance from these values to the Virtual Center of Gravity pattern spaces is then computed. The resulting sound features are tested using K-Means Clustering to classify speech based on the sound data. The results show an accuracy of 92.5%.
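The pipeline the abstract describes, mapping complex-valued spectral data to distances from white/black/grey pattern-space centers and then clustering the resulting features with K-Means, can be sketched roughly as below. This is a minimal illustration under stated assumptions, not the paper's actual implementation: the reference-point coordinates in `CENTERS`, the 64-sample toy signals, and the function names are all hypothetical.

```python
import numpy as np

# Hypothetical reference points for the three pattern spaces the abstract
# names (white, black, grey); the paper's real Virtual Center of Gravity
# coordinates are not given here, so these values are illustrative only.
CENTERS = {
    "white": np.array([1.0, 1.0]),
    "black": np.array([0.0, 0.0]),
    "grey":  np.array([0.5, 0.5]),
}

def vcg_features(spectrum):
    """Treat each complex FFT coefficient as a (real, imaginary) point and
    return the mean Euclidean distance to each pattern-space center."""
    points = np.column_stack([spectrum.real, spectrum.imag])
    return np.array(
        [np.linalg.norm(points - c, axis=1).mean() for c in CENTERS.values()]
    )

def kmeans(features, k=2, iters=20, seed=0):
    """Plain k-means over the extracted feature vectors (sketch)."""
    rng = np.random.default_rng(seed)
    centers = features[rng.choice(len(features), size=k, replace=False)].copy()
    labels = np.zeros(len(features), dtype=int)
    for _ in range(iters):
        # assign each feature vector to its nearest cluster center
        dists = np.linalg.norm(features[:, None, :] - centers[None, :, :], axis=2)
        labels = dists.argmin(axis=1)
        # move each center to the mean of its assigned vectors
        for j in range(k):
            if np.any(labels == j):
                centers[j] = features[labels == j].mean(axis=0)
    return labels, centers

# Toy usage: two synthetic "sounds" with clearly different spectra.
t = np.linspace(0, 8 * np.pi, 64, endpoint=False)
fft_a = np.fft.fft(np.sin(t))           # pure tone
fft_b = np.fft.fft(np.sign(np.sin(t)))  # square wave, rich in harmonics
feats = np.vstack([vcg_features(fft_a), vcg_features(fft_b)])
labels, _ = kmeans(feats, k=2)
```

Each sound is thus reduced to a three-component feature vector (one distance per pattern space), and K-Means groups sounds whose distance profiles are similar.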

Author Biographies

Diah Kumalasari, Politeknik Negeri Samarinda

Information Technology

Arief Bramanto Wicaksono Putra, Politeknik Negeri Samarinda

Information Technology Department

Achmad Fanany Onnilita Gaffar, Politeknik Negeri Samarinda

Information Technology

References

B. Dave and P. D. S. Pipalia, “Speech Recognition: A Review,” Int. J. Adv. Eng. Res. Dev., vol. 1, no. 12, pp. 230–236, 2014, doi: 10.21090/ijaerd.011244.

K. R. Ghule and R. R. Deshmukh, “Feature Extraction Techniques for Speech Recognition: A Review,” Int. J. Sci. Eng. Res., vol. 6, no. 5, pp. 143–147, 2015.

M. Ference and A. M. Weinberg, “Center of Gravity and Center of Mass,” Am. J. Phys., vol. 6, no. 2, pp. 106–106, 1938, doi: 10.1119/1.1991277.

Y. A. Ibrahim, J. C. Odiketa, and T. S. Ibiyemi, “Preprocessing technique in automatic speech recognition for human computer interaction: an overview,” Ann. Comput. Sci. Ser., vol. XV, no. 1, pp. 186–191, 2017.

A. G. Jondya and B. H. Iswanto, “Indonesian’s Traditional Music Clustering Based on Audio Features,” Procedia Comput. Sci., vol. 116, pp. 174–181, 2017, doi: 10.1016/j.procs.2017.10.019.

O. Of and E. For, “PCA-Based Palmprint Recognition,” Electr. Eng., no. i, pp. 2–5, 2009.

P. Schober and L. A. Schwarte, “Correlation coefficients: Appropriate use and interpretation,” Anesth. Analg., vol. 126, no. 5, pp. 1763–1768, 2018, doi: 10.1213/ANE.0000000000002864.

O. K. Hamid, “Frame Blocking and Windowing Speech Signal,” J. Information, Commun. Intell. Syst., vol. 4, no. 5, 2019.

H. Triwiyanto, O. Wahyunggoro, and H. A. Nugroho, “Performance Analysis of the Windowing Technique on Elbow Joint Angle Estimation Using Electromyography,” J. Phys., 2018.

H. Hauser, E. Gröller, and T. Theußl, “Mastering Windows: Improving Reconstruction,” 2000 IEEE Symp. Vol. Vis. VV 2000, pp. 101–109, 2000, doi: 10.1109/VV.2000.10002.

R. Hibare and A. Vibhute, “Feature Extraction Techniques in Speech Processing: A Survey,” Int. J. Comput. Appl., vol. 107, no. 5, pp. 1–8, 2014, doi: 10.5120/18744-9997.

A. K. F. Haque, “FFT and Wavelet-Based Feature Extraction for Acoustic Audio Classification,” Int. J. Adv. Innov. Thoughts Ideas, pp. 1–7, 2012.

A. B. W. Putra, S. Pramono, and A. Naba, “Rancang Bangun Prototype Ciri Citra Kulit Luar Kayu Tanaman Karet Menggunakan Metode Virtual Center of Gravity” [Prototype Design of Outer Bark Image Features of Rubber Plants Using the Virtual Center of Gravity Method], J. EECCIS, vol. 8, no. 1, pp. 19–26, 2014.

A. V. D. Sano and H. Nindito, “Application of k-means algorithm for cluster analysis on poverty of provinces in Indonesia,” ComTech, no. 6, pp. 141–150, 2011.

O. J. Oyelade and O. O. Oladipupo, “Application of k-Means Clustering algorithm for prediction of Students’ Academic Performance,” Int. J. Comput. Sci. Inf. Secur., vol. 7, pp. 292–295, 2010.

S. Saito, Y. Tomioka, and H. Kitazawa, “A Theoretical Framework for Estimating False Acceptance Rate of PRNU-Based Camera Identification,” IEEE Trans. Inf. Forensics Secur., vol. 12, no. 9, pp. 2026–2035, 2017, doi: 10.1109/TIFS.2017.2692683.

Published

2020-05-29

Section

Articles