“COMPARATIVE STUDY OF KNN AND NAIVE BAYES ALGORITHMS WITH QUESTIONNAIRE DATA FOR STUDY PROGRAM RECOMMENDATION IN THE FACULTY OF ART AND DESIGN”
DOI:
https://doi.org/10.54314/jssr.v9i2.6267Keywords:
K-Nearest Neighbor (KNN), Naïve Bayes, Machine Learning, Classification, Study Program Recommendation.Abstract
Abstract: The purpose of this study is to analyze and compare the performance of the K-Nearest Neighbor (KNN) and Naïve Bayes algorithms in providing study program recommendations in the Faculty of Arts and Design. The data were obtained from 250 respondents through a questionnaire consisting of 20 indicators related to students’ interests, abilities, creativity, technology, and career preferences. The research process included data preprocessing, data transformation, dataset splitting into training and testing data, modeling using the KNN and Naïve Bayes algorithms, and model performance evaluation using accuracy metrics. The data processing was carried out using the Python programming language on the Google Colab platform. The results showed that the KNN algorithm achieved an accuracy of 94%, while the Naïve Bayes algorithm obtained an accuracy of 92%. These findings indicate that the KNN algorithm performed better in classifying study program recommendations compared to the Naïve Bayes algorithm. It is expected that this research can serve as a foundation for developing a more effective decision support system to assist prospective students in selecting study programs that match their interests and abilities.
Keywords: K-Nearest Neighbor (KNN), Naïve Bayes, Machine Learning, Classification, Study Program Recommendation.
Abstrak: Tujuan dari penelitian ini adalah untuk menganalisis dan membandingkan kinerja algoritma K-Nearest Neighbor (KNN) dan Naïve Bayes dalam memberikan rekomendasi program studi di Fakultas Seni dan Desain. Data diperoleh dari 250 responden melalui kuesioner yang terdiri dari 20 indikator yang berkaitan dengan minat, kemampuan, kreativitas, teknologi, dan preferensi karier mahasiswa. Proses preprocessing data, transformasi data, pembagian dataset menjadi data pelatihan dan pengujian, pemodelan menggunakan algoritma KNN dan Naive Bayes, dan evaluasi kinerja model dengan akurasi adalah bagian dari penelitian. Proses pengolahan data dilakukan pada platform Google Colab menggunakan bahasa pemrograman Python. Hasil penelitian menunjukkan bahwa algoritma KNN memiliki akurasi sebesar 94%, sedangkan algoritma Naïve Bayes memiliki akurasi sebesar 92%. Hasil ini menunjukkan bahwa algoritma KNN lebih baik dalam mengklasifikasikan rekomendasi program studi daripada algoritma Naïve Bayes. Diharapkan penelitian ini akan menjadi dasar untuk membuat sistem pendukung keputusan yang lebih baik yang membantu calon mahasiswa memilih program studi yang sesuai dengan minat dan kemampuan mereka.
Kata Kunci: KNN, Naïve Bayes, Machine Learning, Klasifikasi, Rekomendasi Program Studi.
Downloads
References
Agarwal, A., Sharma, P., Alshehri, M., Mohamed, A. A., & Alfarraj, O. (2021). Classification model for accuracy and intrusion detection using machine learning approach. Peerj Computer Science, 7, e437. https://doi.org/10.7717/peerj-cs.437
AL-Bakri, N. F., Yonan, J. F., & Sadiq, A. T. (2022). Tourism Companies Assessment via Social Media Using Sentiment Analysis. Baghdad Science Journal, 19(2), 0422. https://doi.org/10.21123/bsj.2022.19.2.0422
Al Fayed, A. J., Darma, S., Sinabariba, Z., & Pardede, S. M. P. (2025). Comparison of Naïve Bayes, K-Nearest Neighbors, and Decision Tree methods for classifying heart disease risk factors. Journal of Computer Science and Research (JoCoSiR), 3(3), 81–88.
Al Fayed, A. J., Darma, S., Aqsha, M. H., Pardede, S. M. P., & Amin, M. (2026). Perbandingan kinerja algoritma machine learning dalam memprediksi tingkat stres mahasiswa berdasarkan faktor akademik dan non-akademik. Journal of Science and Social Research, 9(1), 483–490. https://doi.org/10.54314/jssr.v9i1.5805.
Ayeni, O. O., Unachukwu, C. C., Hamad, N. M. A., Osawaru, B., & Adewusi, O. E. (2024). A multidisciplinary approach to STEM education: Combining HR, counseling, and mentorship. Magna Scientia Advanced Research and Reviews, 10(1), 351–360. https://doi.org/10.30574/msarr.2024.10.1.0026
Bruen, M. (2021). Uptake and Dissemination of Multi-Criteria Decision Support Methods in Civil Engineering—Lessons from the Literature. Applied Sciences, 11(7), 2940. https://doi.org/10.3390/app11072940
Elrahman, A. A. (2024). Predicting Adults’ Income using Naive Bayes Classifier. https://doi.org/10.21203/rs.3.rs-3890646/v1
Fokoué, E. (2018). To Bayes or Not To Bayes? That’s no longer the question! https://doi.org/10.48550/arxiv.1805.11012
https://doi.org/10.30864/eksplora.v13i1.994
Guo, Y., Graber, A., McBurney, R., & Balasubramanian, R. (2010). Sample size and statistical power considerations in high-dimensionality data settings: a comparative study of classification algorithms. BMC Bioinformatics, 11(1). https://doi.org/10.1186/1471-2105-11-447
Gyll, S. P. (2021). Career development by design, not default: Creating a more efficient and data?driven process by connecting aptitude?based learner guidance to post?secondary pathways, competency?based credentials, and high?demand jobs. The Journal of Competency-Based Education, 6(1). https://doi.org/10.1002/cbe2.1236
Ha, K. M., Naseem, U., Keya, A. J., Maitra, S., Mithu, K., & Alam, Md. G. R. (2023). A Systematic Review on Airlines Industries based on Sentiment Analysis and Topic Modeling. https://doi.org/10.21203/rs.3.rs-3475984/v1
Jen, L. (2021). A Brief Overview of the Accuracy of Classification Algorithms for Data Prediction in Machine Learning Applications. Journal of Applied Data Sciences, 2(3), 84–92. https://doi.org/10.47738/jads.v2i3.38
Jin, H.-Y., & Cutumisu, M. (2023). Predicting pre-service teachers’ computational thinking skills using machine learning classifiers. Education and Information Technologies, 28(9), 11447–11467. https://doi.org/10.1007/s10639-023-11642-7
Khotimah, B. K. (2022). Performance of the K-Nearest Neighbors method on identification of maize plant nutrients. Jurnal Infotel, 14(1), 8–14. https://doi.org/10.20895/infotel.v14i1.735
Kurban, H., Kurban, M., Sharma, P., & Dalk?l?ç, M. (2021). Predicting Atom Types of Anatase TiO2 Nanoparticles with Machine Learning. Key Engineering Materials, 880, 89–94. https://doi.org/10.4028/www.scientific.net/kem.880.89
Kuyo, M., Mwalili, S., & Okango, E. (2021). Machine Learning Approaches for Classifying the Distribution of Covid-19 Sentiments. Open Journal of Statistics, 11(05), 620–632. https://doi.org/10.4236/ojs.2021.115037
Mavhemwa, P. M., Zennaro, M., Nsengiyumva, P., & Nzanywayingoma, F. (2024). Weighted naïve bayes multi-user classification for adaptive authentication. Journal of Physics Communications, 8(10), 105005. https://doi.org/10.1088/2399-6528/ad8a16
Meens, E., Bakx, A., Mulder, J., & Denissen, J. J. A. (2020). The Development and Validation of an Interest and Skill Inventory on Educational Choices (ISEC). European Journal of Psychological Assessment, 36(5), 817–828. https://doi.org/10.1027/1015-5759/a000546
Meyer, M. S., Cranmore, J., Rinn, A. N., & Hodges, J. (2020). College Choice: Considerations for Academically Advanced High School Seniors. Gifted Child Quarterly, 65(1), 52–74. https://doi.org/10.1177/0016986220957258
Missaghian, R. (2021). Social Capital and Post-Secondary Decision-Making Alignment for Low-Income Students. Social Sciences, 10(3), 83. https://doi.org/10.3390/socsci10030083
Pereira, N., Bright, S., Ozen, Z., Safitri, S., Castillo-Hermosilla, H., Matos, B. T. P. de, Karatas, T., & Fonseca, P. (2025). A Multitiered Approach to Computer Science Talent Development. Gifted Child Quarterly, 69(2), 130–146. https://doi.org/10.1177/00169862241307662
Patulin, E. P. (2019). Predictability of School Administrator’s Job Satisfaction through Hybrid Segmentation-based Prediction Model. International Journal of Advanced Trends in Computer Science and Engineering, 8(3), 507–512. https://doi.org/10.30534/ijatcse/2019/26832019
Reynolds, J., Elliott, J. L., Castillo, K., Sliwak, R. M., & Halligan, C. S. (2023). I lost my mentor, now what? The experiences of counseling psychology women doctoral students who lost their mentor: Training and program implications. Qualitative Psychology, 10(2), 227–244. https://doi.org/10.1037/qup0000237
Sari, A. A., Pramesty, A. S., Malik, A. Z., Al-Hidayah, D. N. H., Alviani, L. S., & Assyifa, R. A. (2023). The Influence of Transformational Education Prediction on Softskills of Madrasa Student using Data Mining. Khazanah Journal of Religion and Technology, 1(1), 20–25. https://doi.org/10.15575/kjrt.v1i1.157
Schelfhout, S., Wille, B., Fonteyne, L., Roels, E., Derous, E., Fruyt, F. D., & Duyck, W. (2021). How interest fit relates to STEM study choice: Female students fit their choices better. Journal of Vocational Behavior, 129, 103614. https://doi.org/10.1016/j.jvb.2021.103614
Tapidingan, Y. C., & Paseru, D. (2020). Comparative Analysis of Classification Methods of KNN and Naïve Bayes to Determine Stress Level of Junior High School Students. Indonesian Journal of Information Systems, 2(2), 80–89. https://doi.org/10.24002/ijis.v2i2.303
Taslim, T., Handayani, S., & Fajrizal, F. (2023). Kinerja Komparatif Optimasi Algoritma Naive Bayes dalam Klasifikasi Teks untuk Uji Klinis Kanker. Eksplora Informatika, 13(1), 113–123.
Zhao, Z., Tang, X., & Hu, M. (2025). Establishment of a value assessment framework for orphan drugs in China: an application of the discrete choice experiment in multicriteria decision analysis. Frontiers in Pharmacology, 16. https://doi.org/10.3389/fphar.2025.1677627
Downloads
Published
Issue
Section
License
Copyright (c) 2026 Surya Darma, Nita Syahputri, Nurhayati

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.




