Comparative Analysis of Support Vector Machine, Decision Tree, and Naive Bayes in Evaluating Machine Learning Effectiveness
Abstract
This study analyzes and compares the performance of three widely used machine learning algorithms for data classification: Support Vector Machine (SVM), Decision Tree, and Naïve Bayes. Because these algorithms handle data in distinct ways, their effectiveness and efficiency in classification tasks need to be evaluated side by side. In a digital era characterized by massive data growth, the choice of classification algorithm is a critical determinant of accurate and efficient data-driven decision-making. The main contribution of this research is a comprehensive account of the relative strengths and limitations of each algorithm under varying data conditions. Beyond reporting comparative performance, the study draws out practical implications for researchers and data science practitioners who must select an algorithm for a specific need, and in doing so addresses a research gap concerning integrated evaluations of data characteristics and algorithmic performance. The methodology is quantitative and based on computational experiments with three standard datasets (Titanic, Spam Email, and Wine). Each dataset was divided into training and testing sets and analyzed in Python with the scikit-learn library. Performance was evaluated in terms of accuracy, precision, recall, and F1-score, and validated with cross-validation to ensure the reliability of the results. The findings indicate that SVM achieves the highest accuracy and recall on complex datasets, that Naïve Bayes is the most efficient in computational time, particularly for text data, and that Decision Tree stands out for model interpretability despite slightly lower accuracy. These results are expected to serve as a practical reference for selecting algorithms according to data characteristics, thereby supporting more targeted modeling strategies in the era of digital transformation.
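The four evaluation measures named in the abstract can be illustrated with a minimal sketch. This is not the authors' code: the function name `binary_metrics`, the `BinaryMetrics` container, and the toy label vectors are hypothetical, the example assumes a binary task with a designated positive class, and it uses only the Python standard library. In practice the study's scikit-learn workflow would more likely rely on ready-made helpers such as `sklearn.metrics.precision_recall_fscore_support`.

```python
from dataclasses import dataclass
from typing import Sequence


@dataclass(frozen=True)
class BinaryMetrics:
    """Hypothetical container for the four measures named in the abstract."""
    accuracy: float
    precision: float
    recall: float
    f1: float


def binary_metrics(y_true: Sequence[int], y_pred: Sequence[int],
                   positive: int = 1) -> BinaryMetrics:
    """Compute accuracy, precision, recall and F1 for one positive class."""
    if len(y_true) != len(y_pred) or len(y_true) == 0:
        raise ValueError("y_true and y_pred must be non-empty and equal in length")

    pairs = list(zip(y_true, y_pred))
    tp = sum(t == positive and p == positive for t, p in pairs)  # true positives
    fp = sum(t != positive and p == positive for t, p in pairs)  # false positives
    fn = sum(t == positive and p != positive for t, p in pairs)  # false negatives
    tn = len(pairs) - tp - fp - fn                               # true negatives

    accuracy = (tp + tn) / len(pairs)
    # Assumption: report 0.0 when a denominator is zero instead of raising.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if (precision + recall) else 0.0)
    return BinaryMetrics(accuracy, precision, recall, f1)


if __name__ == "__main__":
    # Toy labels invented for illustration; they are not results from the study.
    y_true = [1, 0, 1, 1, 0, 0, 1, 0]
    y_pred = [1, 0, 0, 1, 0, 1, 1, 0]
    print(binary_metrics(y_true, y_pred))
```

With cross-validation, a function like this would be applied to each held-out fold and the per-fold scores averaged. For multi-class data such as Wine, the per-class values would additionally need to be combined, for example by macro-averaging; the abstract does not state which averaging scheme was used.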
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.