Comparative Analysis of Support Vector Machine, Decision Tree, and Naive Bayes in Evaluating Machine Learning Effectiveness

Main Article Content

Authors

    Susanto Hariyanto( 1 ) Indah Fenriana( 2 ) Desiyanna Lasut( 3 ) Febrian( 4 )

    (1) Buddhi Dharma University | Indonesia
    (2) Buddhi Dharma University | Indonesia
    (3) Buddhi Dharma University | Indonesia
    (4) Buddhi Dharma University | Indonesia

Abstract

This study aims to analyze and compare the performance of three widely used machine learning algorithms for data classification: Support Vector Machine (SVM), Decision Tree, and Naïve Bayes. These algorithms employ distinct approaches in handling data, making it essential to evaluate their effectiveness and efficiency in classification tasks. In the digital era characterized by massive data growth, the selection of an appropriate classification algorithm is a critical determinant for accurate and efficient data-driven decision-making. The main contribution of this research is to provide a comprehensive understanding of the relative strengths and limitations of each algorithm under varying data conditions. This study not only highlights comparative performance outcomes but also emphasizes practical implications for researchers and data science practitioners in selecting algorithms suited to specific needs. In doing so, it addresses a research gap concerning integrated evaluations of data characteristics and algorithmic performance. The methodology adopts a quantitative approach through computational experiments using standardized datasets (Titanic, Spam Email, and Wine). The datasets were divided into training and testing sets and analyzed using Python with the scikit-learn library. Performance evaluation was conducted based on accuracy, precision, recall, and F1-score, validated through cross-validation techniques to ensure reliability of results. The findings indicate that SVM outperforms in terms of accuracy and recall on complex datasets, Naïve Bayes is more efficient in computational time particularly for text data, while Decision Tree stands out for model interpretability despite slightly lower accuracy. These results are expected to serve as a practical reference for selecting suitable algorithms according to data characteristics, thereby supporting more targeted and intelligent modeling strategies in the era of digital transformation.

Downloads

Download data is not yet available.

Article Details

How to Cite
[1]
S. . Hariyanto, Indah Fenriana, Desiyanna Lasut, and Febrian, “Comparative Analysis of Support Vector Machine, Decision Tree, and Naive Bayes in Evaluating Machine Learning Effectiveness”, rubin, vol. 4, no. 1, pp. 1–15, Dec. 2025.
Section
Articles

References

Asaad, R. R., & Abdulazeez, A. M. (2024). Comprehensive Classification of Iris Flower Species: A Machine Learning Approach. Indonesian Journal of Computer Science, 13(1), 242–262. http://ijcs.stmikindonesia.ac.id/ijcs/index.php/ijcs/article/view/3135

Asshiddiqi, M. F., & Lhaksmana, K. M. (2020). Perbandingan Metode Decision Tree dan Support Vector Machine untuk Analisis Sentimen pada Instagram Mengenai Kinerja PSSI. E-Proceeding of Engineering, 7(3), 9936–9948.

Aulia, M., & Hermawan, A. (2023). Analisis Perbandingan Algoritma SVM, Naïve Bayes, dan Perceptron untuk Analisis Sentimen Ulasan Produk Tokopedia. Jurnal Media Informatika Budidarma, 7(4), 1850. https://doi.org/10.30865/mib.v7i4.6839

Boyd, A., Czajka, A., & Bowyer, K. (2019). Deep Learning-Based Feature Extraction in Iris Recognition: Use Existing Models, Fine-tune or Train from Scratch? 2019 IEEE 10th International Conference on Biometrics Theory, Applications and Systems, BTAS 2019. https://doi.org/10.1109/BTAS46853.2019.9185978

Desiani, A. (2022). Penerapan Metode Support Vector Machine. IJAI: Indonesian Jpurnal of Applied Informatics, 7(1), 12–18.

Dozono, C., Aragaki, M., Hebishima, H., & Inage, S.-I. (n.d.). Proposal and Verification of Novel Machine Learning on Classification Problems. 4.

Hackeling, G. (2017). Mastering Machine Learning with scikit-learn. Packt Publishing.

Hermawan, A., Lunardi, L., Kurnia, Y., Daniawan, B., & Junaedi. (2025). Optimizing Convolutional Neural Networks with Particle Swarm Optimization for Enhanced Hoax News Detection. Journal of Information Systems Engineering and Business Intelligence, 11(1), 53–64. https://doi.org/10.20473/jisebi.11.1.53-64

Kerrigan, D., Trokielewicz, M., Czajka, A., & Bowyer, K. W. (2019). Iris Recognition with Image Segmentation Employing Retrained Off-the-Shelf Deep Neural Networks. 2019 International Conference on Biometrics, ICB 2019. https://doi.org/10.1109/ICB45273.2019.8987299

Khumaidi, A., & Herinanto, D. (2023). Sistem Pendukung Keputusan Penerimaan Siswa Baru dengan Menggunakan Metode SAW di SMP Xaverius Pringsewu Berbasis Web Mobile. Rubinstein, 2(1), 74–83. https://doi.org/10.31253/rubin.v2i1.2729

Liu, S., & Lee, I. (2018). Email Sentiment Analysis Through k-Means Labeling and Support Vector Machine Classification. Cybernetics and Systems, 49(3), 181–199. https://doi.org/10.1080/01969722.2018.1448242

Pinto, J. P., Kelur, S., & Shetty, J. (2018). Iris Flower Species Identification Using Machine Learning Approach. 2018 4th International Conference for Convergence in Technology (I2CT), 1–4. https://doi.org/10.1109/I2CT42659.2018.9057891

Putra, P., M. H. Pardede, A., & Syahputra, S. (2022). Analisis Metode K-Nearest Neighbour (KNN) Dalam Klasifikasi Data Iris Bunga. Jurnal Teknik Informatika Kaputama (JTIK), 6(1), 297–305.

Rahman, B., Fauzi, F., & Amri, S. (2023). Perbandingan Hasil Klasifikasi Data Iris menggunakan Algoritma K-Nearest Neighbor dan Random Forest. Journal Of Data Insights, 1(1), 19–26. https://doi.org/10.26714/jodi.v1i1.135

Rao, T. S., Hema, M., Priya, K. S., Krishna, K. V., & Ali, M. S. (2021). Iris Flower Classification Using Machine Learning. International Journal of All Research Education and Scientific Methods (IJARESM), 9(6), 2083–2090.

Rofianto, D., Arifin, O., & Widyawati, D. K. (2023). Perbandingan Metode Klasifikasi Learning Vector Quantization Dengan Diskriminan Fisher Pada Data Bunga Iris. JSR : Jaringan Sistem Informasi Robotik, 7(1), 44–49. https://doi.org/10.58486/jsr.v7i1.209

Ronaldo, R., & Kurnia, Y. (2024). Perbandingan Kinerja Algoritma SVM, Decision Tree, dan Naive Bayes untuk Klasifikasi dan Pengelompokan Spesies Iris. POTERS : Proceedings of Technology, Engineering and Computers, 1(1), 70–77.

Satyagraha, A., & Kurnia, Y. (2025). Forecasting Tourism Demand: A Bibliometric Review of Trends, Methodologies, and Big Data Integration (2015-2024). Rubinstein, 3(2), 106–117. https://doi.org/10.31253/rubin.v3i2.3783

Sunaryono. (2017). Penelitian Komparasi Algoritma Klasifikasi dalam Menentukan Website Palsu. TEKNIKOM, 1(1), 1–12.

Suresh Kumar, K., Radha Mani, A. S., Ananth Kumar, T., Jalili, A., Gheisari, M., Malik, Y., Chen, H. C., & Jahangir Moshayedi, A. (2024). Sentiment Analysis of Short Texts Using SVMs and VSMs-Based Multiclass Semantic Classification. Applied Artificial Intelligence, 38(1). https://doi.org/10.1080/08839514.2024.2321555

Susanti, N. A., & Walid, M. (2022). Klasifikasi Data Tweet Ujaran Kebencian Di Media Sosial. JATI: Jurnal Mahasiswa Teknik Informatika, 6(2), 538–543.

VanderPlas, J. (2022). Python Data Science Handbook. O’Reilly Media.


Abstract views: 307 / PDF downloads: 293