IT CAREER NAVIGATION: PERFORMANCE EVALUATION OF KNN AND NAÏVE BAYES IN CAREER PATH RECOMMENDATIONS FOR COMPUTER SCIENCE STUDENTS (CASE STUDY: BATTUTA UNIVERSITY)
DOI:
https://doi.org/10.54314/jssr.v9i2.6269Keywords:
Machine Learning, KNN, Naïve Bayes, Career Recommendation, ClassificationAbstract
Abstract: With the rapid development of information technology, there are many career options available in the field of informatics. However, it is often difficult for students to choose a specialization that matches their interests and abilities. The purpose of this study is to develop a career path recommendation system for informatics students and to evaluate the performance of the K-Nearest Neighbor (KNN) and Naive Bayes algorithms in classification tasks. The data used in this study were collected via a questionnaire comprising 22 assessment indicators related to students’ interests, academic understanding, and preferred work styles. A total of 300 respondent data points were utilized, with 20% allocated for testing and 80% for training. The research process included preprocessing, data transformation, modeling, and evaluation using accuracy, precision, recall, and F1-score metrics. The results show that the Naive Bayes algorithm outperforms KNN, achieving an accuracy of 97%, precision of 93%, recall of 93%, and an F1-score of 93%. Therefore, Naive Bayes is considered more optimal in terms of classification performance. It is expected that the developed system can assist students in determining their career paths in a more data-driven and objective manner.
Keywords: Machine Learning, KNN, Naïve Bayes, Career Recommendation, Classification
Abstrak: Dengan berkembangnya teknologi informasi yang cepat, ada banyak pilihan karir di bidang informatika. Namun, sulit bagi mahasiswa untuk memilih spesialisasi yang sesuai dengan minat dan kemampuan mereka. Tujuan dari penelitian ini adalah untuk membuat sistem rekomendasi jalur karier untuk mahasiswa informatika dan juga untuk mengevaluasi bagaimana algoritma K-Nearest Neighbor (KNN) dan Naive Bayes bekerja dalam klasifikasi. Data yang digunakan diperoleh melalui kuesioner yang terdiri dari 22 indikator penilaian yang berkaitan dengan minat mahasiswa, pemahaman akademik, dan gaya kerja yang mereka sukai. Sebanyak 300 data dari responden digunakan, dengan 20% data dialokasikan untuk pengujian dan 80% untuk pelatihan. Proses penelitian termasuk tahapan preprocessing, transformasi data, pemodelan, dan evaluasi menggunakan metrik akurasi, presisi, recall, dan skor F1. Hasil penelitian menunjukkan bahwa algoritma Naïve Bayes lebih baik dibandingkan KNN dengan nilai akurasi 97%, presisi 93%, recall 93%, dan skor F1. Akibatnya, Naïve Bayes lebih optimal dalam member. Diharapkan sistem yang dibuat dapat membantu mahasiswa dalam menentukan karir mereka secara lebih berbasis data dan objektif.
Kata Kunci: Machine Learning, KNN, Naïve Bayes, Rekomendasi Karier, Klasifikasi
Downloads
References
Ahmed, S. M. (2024). The Impact of Artificial Intelligence on Cybersecurity. IJCI, 3(2), 39–70. https://doi.org/10.59992/ijci.2024.v3n2p3
Al Fayed, A. J., Darma, S., Sinabariba, Z., & Pardede, S. M. P. (2025). Comparison of Naïve Bayes, K-Nearest Neighbors, and Decision Tree methods for classifying heart disease risk factors. Journal of Computer Science and Research (JoCoSiR), 3(3), 81–88.
Al Fayed, A. J., Darma, S., Aqsha, M. H., Pardede, S. M. P., & Amin, M. (2026). Perbandingan kinerja algoritma machine learning dalam memprediksi tingkat stres mahasiswa berdasarkan faktor akademik dan non-akademik. Journal of Science and Social Research, 9(1), 483–490. https://doi.org/10.54314/jssr.v9i1.5805.
Anand, Agash, R., Jayaram, & K., G. R. (2024). AI-Assisted User Interface: To Achieve Personal and Professional Goals. Irish Interdisciplinary Journal of Science & Research, 08(02), 01–15. https://doi.org/10.46759/iijsr.2024.8201
Bello, A., & Abdallah, S. M. S. (2024). Exploration of Teachers’ Perceptions of Their Roles in Career Guidance Regarding Secondary School Students’ Career Choices. Journal of Education Society and Behavioral Science, 37(4), 44–55. https://doi.org/10.9734/jesbs/2024/v37i41317
Bielza, C., & Larrañaga, P. (2014). Discrete Bayesian Network Classifiers. ACM Computing Surveys, 47(1), 1–43. https://doi.org/10.1145/2576868
Boateng, E. Y., Otoo, J., & Abaye, D. A. (2020). Basic Tenets of Classification Algorithms K-Nearest-Neighbor, Support Vector Machine, Random Forest, and Neural Network: A Review. Journal of Data Analysis and Information Processing, 08(04), 341–357. https://doi.org/10.4236/jdaip.2020.84020
Çetinkaya, A., Baykan, Ö. K., & K?rg?z, H. (2023). Analysis of Machine Learning Classification Approaches for Predicting Students’ Programming Aptitude. Sustainability, 15(17), 12917. https://doi.org/10.3390/su151712917
Choudhari, M., Rangari, S., Badge, P., Chopde, P., & Paraskar, A. (2024). Review on Educational Academic Performance Analysis and Dropout Visualization by Analyzing Student Grades. International Research Journal on Advanced Engineering and Management (IRJAEM), 2(05), 1408–1422. https://doi.org/10.47392/irjaem.2024.0194
Dubey, S., Tiwari, G., Singh, S., Goldberg, S., & Pinsky, E. (2023). Using machine learning for healthcare treatment planning. Frontiers in Artificial Intelligence, 6. https://doi.org/10.3389/frai.2023.1124182
Guo, H., Zhou, J., & Wu, C. (2018). Imbalanced Learning Based on Data-Partition and SMOTE. Information, 9(9), 238. https://doi.org/10.3390/info9090238
Journal, I. (2024). Career Guidance System. International Journal of Scientific Research in Engineering and Management, 08(01), 1–11. https://doi.org/10.55041/ijsrem28005
Kamal, N., Sarker, F., Rahman, A., Hossain, S., & Mamun, K. A. (2024). Recommender System in Academic Choices of Higher Education: A Systematic Review. IEEE Access, 12, 35475–35501. https://doi.org/10.1109/access.2024.3368058
Karl?k, B., & Öztoprak, E. (2012). Personalized Cancer Treatment Using a Naive Bayes Classifier. International Journal of Machine Learning and Computing, 339–344. https://doi.org/10.7763/ijmlc.2012.v2.141
Khan, Md. A. R., Paul, A. R., Rahman, F., Akter, J., Sultana, Z., & Rahman, M. (2023). Appropriate Job Selection Using Machine Learning Techniques. https://doi.org/10.21203/rs.3.rs-3164137/v1
Khare, Ms. P. (2024). CAREER PATH. International Journal of Scientific Research in Engineering and Management, 08(05), 1–5. https://doi.org/10.55041/ijsrem35160
Kurban, H., Sharma, P., Dalk?l?ç, M., & Kurban, M. (2025). Accelerating density of states prediction in Zn-doped MgO nanoparticles via kernel-optimized weighted k-NN. Scientific Reports, 15(1). https://doi.org/10.1038/s41598-025-07887-6
Kim, D., & Suh, Y.-J. (2023). GConvLoc: WiFi Fingerprinting-Based Indoor Localization Using Graph Convolutional Networks. IEICE Transactions on Information and Systems, E106.D(4), 570–574. https://doi.org/10.1587/transinf.2022edl8081.
Lapan, R. T., Turner, S. L., & Pierce, M. E. (2012). College and career readiness: Policy and research to support effective counseling in schools. 57–73. https://doi.org/10.1037/13755-003
Majumder, A., & Veilleux, C. B. (2022). Smart Health and Cybersecurity in the Era of Artificial Intelligence. https://doi.org/10.5772/intechopen.97196
Maraden, Y., Wibisono, G., Nugraha, I. G. D., Sudiarto, B., Jufri, F. H., Kazutaka, K., & Prabuwono, A. S. (2023). Enhancing Electricity Theft Detection through K-Nearest Neighbors and Logistic Regression Algorithms with Synthetic Minority Oversampling Technique: A Case Study on State Electricity Company (PLN) Customer Data. Energies, 16(14), 5405. https://doi.org/10.3390/en16145405
Oladipo, J. O., Okoye, C. C., Elufioye, O. A., Falaiye, T., & Nwankwo, E. E. (2024). Human factors in cybersecurity: Navigating the fintech landscape. International Journal of Science and Research Archive, 11(1), 1959–1967.
Palmer, X., Akafia, C., Woodson, E.,
Woodson, A., & Potter, L. (2024). Organoids, Biocybersecurity, and Cyberbiosecurity—A Light Exploration. Organoids, 3(2), 83–112. https://doi.org/10.3390/organoids3020007
Samtani, S., Kantarc?o?lu, M., & Chen, H. (2020). Trailblazing the Artificial Intelligence for Cybersecurity Discipline. ACM Transactions on Management Information Systems, 11(4), 1–19. https://doi.org/10.1145/3430360
Shakeel, C. S., Khan, S. J., Chaudhry, B. M., Aijaz, S. F., & Hassan, U. (2021). Classification Framework for Healthy Hairs and Alopecia Areata: A Machine Learning (ML) Approach. Computational and Mathematical Methods in Medicine, 2021, 1–10. https://doi.org/10.1155/2021/1102083
Shete, S. (2023). AI in Cybersecurity and User Interface Design beyond Chatbots. Design of Single Chip Microcomputer Control System for Stepping Motor, 1–4. https://doi.org/10.47363/jaicc/2023(2)164
Sutradhar, P., Tarefder, P. K., Prodan, I., Saddi, Md. S., & Rozario, V. S. (2021). Multi-Modal Case Study on MRI Brain Tumor Detection Using Support Vector Machine, Random Forest, Decision Tree, K-Nearest Neighbor, Temporal Convolution & Transfer Learning. Aiub Journal of Science and Engineering (Ajse), 20(3), 107–117. https://doi.org/10.53799/ajse.v20i3.175
Tomašev, N., Búza, K., & Mladeni?, D. (2016). Correcting the hub occurrence prediction bias in many dimensions. Computer Science and Information Systems, 13(1), 1–21. https://doi.org/10.2298/csis140929039t
Vellingiri, B., & Venkatesh, K. A. (2025). A multi-dimensional student performance prediction model (MSPP): An advanced framework for accurate academic classification and analysis. Methodsx, 14, 103148. https://doi.org/10.1016/j.mex.2024.103148
Waleed, M., Um, T.-W., Kamal, T., & Usman, S. M. (2021). Classification of Agricultural Farm Machinery Using Machine Learning and the Internet of Things. Symmetry, 13(3), 403. https://doi.org/10.3390/sym13030403
Yargholi, E., & Hossein?Zadeh, G. (2016). Brain Decoding-Classification of Hand Written Digits from fMRI Data Employing Bayesian Networks. Frontiers in Human Neuroscience, 10. https://doi.org/10.3389/fnhum.2016.00351
Zhang, Y., Gao, Z., Sun, J., & Liu, L. (2023). Machine-Learning Algorithms for Process Condition Data-Based Inclusion Prediction in Continuous-Casting Process: A Case Study. Sensors, 23(15), 6719. https://doi.org/10.3390/s23156719
Zhang, M., & Shi, W. (2020). Systematic comparison of five machine-learning methods in classification and interpolation of soil particle size fractions using different transformed data. https://doi.org/10.5194/hess-2019-648
Downloads
Published
Issue
Section
License
Copyright (c) 2026 Surya Darma, Muhammad Irfan Sarif, Ahmad Jihad Alfayed, Andika, Katharina Tyas

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.




