IT CAREER NAVIGATION: PERFORMANCE EVALUATION OF KNN AND NAÏVE BAYES IN CAREER PATH RECOMMENDATIONS FOR COMPUTER SCIENCE STUDENTS (CASE STUDY: BATTUTA UNIVERSITY)

Authors

  • Surya Darma Potensi Utama University
  • Muhammad Irfan Sarif Pancabudi Development University
  • Ahmad Jihad Alfayed Pancabudi Development University
  • Andika Pancabudi Development University
  • Katharina Tyas Pancabudi Development University

DOI:

https://doi.org/10.54314/jssr.v9i2.6269

Keywords:

Machine Learning, KNN, Naïve Bayes, Career Recommendation, Classification

Abstract

Abstract: With the rapid development of information technology, there are many career options available in the field of informatics. However, it is often difficult for students to choose a specialization that matches their interests and abilities. The purpose of this study is to develop a career path recommendation system for informatics students and to evaluate the performance of the K-Nearest Neighbor (KNN) and Naive Bayes algorithms in classification tasks. The data used in this study were collected via a questionnaire comprising 22 assessment indicators related to students’ interests, academic understanding, and preferred work styles. A total of 300 respondent data points were utilized, with 20% allocated for testing and 80% for training. The research process included preprocessing, data transformation, modeling, and evaluation using accuracy, precision, recall, and F1-score metrics. The results show that the Naive Bayes algorithm outperforms KNN, achieving an accuracy of 97%, precision of 93%, recall of 93%, and an F1-score of 93%. Therefore, Naive Bayes is considered more optimal in terms of classification performance. It is expected that the developed system can assist students in determining their career paths in a more data-driven and objective manner.

Keywords: Machine Learning, KNN, Naïve Bayes, Career Recommendation, Classification

 

Abstrak: Dengan berkembangnya teknologi informasi yang cepat, ada banyak pilihan karir di bidang informatika. Namun, sulit bagi mahasiswa untuk memilih spesialisasi yang sesuai dengan minat dan kemampuan mereka. Tujuan dari penelitian ini adalah untuk membuat sistem rekomendasi jalur karier untuk mahasiswa informatika dan juga untuk mengevaluasi bagaimana algoritma K-Nearest Neighbor (KNN) dan Naive Bayes bekerja dalam klasifikasi. Data yang digunakan diperoleh melalui kuesioner yang terdiri dari 22 indikator penilaian yang berkaitan dengan minat mahasiswa, pemahaman akademik, dan gaya kerja yang mereka sukai. Sebanyak 300 data dari responden digunakan, dengan 20% data dialokasikan untuk pengujian dan 80% untuk pelatihan. Proses penelitian termasuk tahapan preprocessing, transformasi data, pemodelan, dan evaluasi menggunakan metrik akurasi, presisi, recall, dan skor F1. Hasil penelitian menunjukkan bahwa algoritma Naïve Bayes lebih baik dibandingkan KNN dengan nilai akurasi 97%, presisi 93%, recall 93%, dan skor F1. Akibatnya, Naïve Bayes lebih optimal dalam member. Diharapkan sistem yang dibuat dapat membantu mahasiswa dalam menentukan karir mereka secara lebih berbasis data dan objektif.

Kata Kunci: Machine Learning, KNN, Naïve Bayes, Rekomendasi Karier, Klasifikasi

Downloads

Download data is not yet available.

References

Ahmed, S. M. (2024). The Impact of Artificial Intelligence on Cybersecurity. IJCI, 3(2), 39–70. https://doi.org/10.59992/ijci.2024.v3n2p3

Al Fayed, A. J., Darma, S., Sinabariba, Z., & Pardede, S. M. P. (2025). Comparison of Naïve Bayes, K-Nearest Neighbors, and Decision Tree methods for classifying heart disease risk factors. Journal of Computer Science and Research (JoCoSiR), 3(3), 81–88.

Al Fayed, A. J., Darma, S., Aqsha, M. H., Pardede, S. M. P., & Amin, M. (2026). Perbandingan kinerja algoritma machine learning dalam memprediksi tingkat stres mahasiswa berdasarkan faktor akademik dan non-akademik. Journal of Science and Social Research, 9(1), 483–490. https://doi.org/10.54314/jssr.v9i1.5805.

Anand, Agash, R., Jayaram, & K., G. R. (2024). AI-Assisted User Interface: To Achieve Personal and Professional Goals. Irish Interdisciplinary Journal of Science & Research, 08(02), 01–15. https://doi.org/10.46759/iijsr.2024.8201

Bello, A., & Abdallah, S. M. S. (2024). Exploration of Teachers’ Perceptions of Their Roles in Career Guidance Regarding Secondary School Students’ Career Choices. Journal of Education Society and Behavioral Science, 37(4), 44–55. https://doi.org/10.9734/jesbs/2024/v37i41317

Bielza, C., & Larrañaga, P. (2014). Discrete Bayesian Network Classifiers. ACM Computing Surveys, 47(1), 1–43. https://doi.org/10.1145/2576868

Boateng, E. Y., Otoo, J., & Abaye, D. A. (2020). Basic Tenets of Classification Algorithms K-Nearest-Neighbor, Support Vector Machine, Random Forest, and Neural Network: A Review. Journal of Data Analysis and Information Processing, 08(04), 341–357. https://doi.org/10.4236/jdaip.2020.84020

Çetinkaya, A., Baykan, Ö. K., & K?rg?z, H. (2023). Analysis of Machine Learning Classification Approaches for Predicting Students’ Programming Aptitude. Sustainability, 15(17), 12917. https://doi.org/10.3390/su151712917

Choudhari, M., Rangari, S., Badge, P., Chopde, P., & Paraskar, A. (2024). Review on Educational Academic Performance Analysis and Dropout Visualization by Analyzing Student Grades. International Research Journal on Advanced Engineering and Management (IRJAEM), 2(05), 1408–1422. https://doi.org/10.47392/irjaem.2024.0194

Dubey, S., Tiwari, G., Singh, S., Goldberg, S., & Pinsky, E. (2023). Using machine learning for healthcare treatment planning. Frontiers in Artificial Intelligence, 6. https://doi.org/10.3389/frai.2023.1124182

Guo, H., Zhou, J., & Wu, C. (2018). Imbalanced Learning Based on Data-Partition and SMOTE. Information, 9(9), 238. https://doi.org/10.3390/info9090238

Journal, I. (2024). Career Guidance System. International Journal of Scientific Research in Engineering and Management, 08(01), 1–11. https://doi.org/10.55041/ijsrem28005

Kamal, N., Sarker, F., Rahman, A., Hossain, S., & Mamun, K. A. (2024). Recommender System in Academic Choices of Higher Education: A Systematic Review. IEEE Access, 12, 35475–35501. https://doi.org/10.1109/access.2024.3368058

Karl?k, B., & Öztoprak, E. (2012). Personalized Cancer Treatment Using a Naive Bayes Classifier. International Journal of Machine Learning and Computing, 339–344. https://doi.org/10.7763/ijmlc.2012.v2.141

Khan, Md. A. R., Paul, A. R., Rahman, F., Akter, J., Sultana, Z., & Rahman, M. (2023). Appropriate Job Selection Using Machine Learning Techniques. https://doi.org/10.21203/rs.3.rs-3164137/v1

Khare, Ms. P. (2024). CAREER PATH. International Journal of Scientific Research in Engineering and Management, 08(05), 1–5. https://doi.org/10.55041/ijsrem35160

Kurban, H., Sharma, P., Dalk?l?ç, M., & Kurban, M. (2025). Accelerating density of states prediction in Zn-doped MgO nanoparticles via kernel-optimized weighted k-NN. Scientific Reports, 15(1). https://doi.org/10.1038/s41598-025-07887-6

Kim, D., & Suh, Y.-J. (2023). GConvLoc: WiFi Fingerprinting-Based Indoor Localization Using Graph Convolutional Networks. IEICE Transactions on Information and Systems, E106.D(4), 570–574. https://doi.org/10.1587/transinf.2022edl8081.

Lapan, R. T., Turner, S. L., & Pierce, M. E. (2012). College and career readiness: Policy and research to support effective counseling in schools. 57–73. https://doi.org/10.1037/13755-003

Majumder, A., & Veilleux, C. B. (2022). Smart Health and Cybersecurity in the Era of Artificial Intelligence. https://doi.org/10.5772/intechopen.97196

Maraden, Y., Wibisono, G., Nugraha, I. G. D., Sudiarto, B., Jufri, F. H., Kazutaka, K., & Prabuwono, A. S. (2023). Enhancing Electricity Theft Detection through K-Nearest Neighbors and Logistic Regression Algorithms with Synthetic Minority Oversampling Technique: A Case Study on State Electricity Company (PLN) Customer Data. Energies, 16(14), 5405. https://doi.org/10.3390/en16145405

Oladipo, J. O., Okoye, C. C., Elufioye, O. A., Falaiye, T., & Nwankwo, E. E. (2024). Human factors in cybersecurity: Navigating the fintech landscape. International Journal of Science and Research Archive, 11(1), 1959–1967.

Palmer, X., Akafia, C., Woodson, E.,

Woodson, A., & Potter, L. (2024). Organoids, Biocybersecurity, and Cyberbiosecurity—A Light Exploration. Organoids, 3(2), 83–112. https://doi.org/10.3390/organoids3020007

Samtani, S., Kantarc?o?lu, M., & Chen, H. (2020). Trailblazing the Artificial Intelligence for Cybersecurity Discipline. ACM Transactions on Management Information Systems, 11(4), 1–19. https://doi.org/10.1145/3430360

Shakeel, C. S., Khan, S. J., Chaudhry, B. M., Aijaz, S. F., & Hassan, U. (2021). Classification Framework for Healthy Hairs and Alopecia Areata: A Machine Learning (ML) Approach. Computational and Mathematical Methods in Medicine, 2021, 1–10. https://doi.org/10.1155/2021/1102083

Shete, S. (2023). AI in Cybersecurity and User Interface Design beyond Chatbots. Design of Single Chip Microcomputer Control System for Stepping Motor, 1–4. https://doi.org/10.47363/jaicc/2023(2)164

Sutradhar, P., Tarefder, P. K., Prodan, I., Saddi, Md. S., & Rozario, V. S. (2021). Multi-Modal Case Study on MRI Brain Tumor Detection Using Support Vector Machine, Random Forest, Decision Tree, K-Nearest Neighbor, Temporal Convolution & Transfer Learning. Aiub Journal of Science and Engineering (Ajse), 20(3), 107–117. https://doi.org/10.53799/ajse.v20i3.175

Tomašev, N., Búza, K., & Mladeni?, D. (2016). Correcting the hub occurrence prediction bias in many dimensions. Computer Science and Information Systems, 13(1), 1–21. https://doi.org/10.2298/csis140929039t

Vellingiri, B., & Venkatesh, K. A. (2025). A multi-dimensional student performance prediction model (MSPP): An advanced framework for accurate academic classification and analysis. Methodsx, 14, 103148. https://doi.org/10.1016/j.mex.2024.103148

Waleed, M., Um, T.-W., Kamal, T., & Usman, S. M. (2021). Classification of Agricultural Farm Machinery Using Machine Learning and the Internet of Things. Symmetry, 13(3), 403. https://doi.org/10.3390/sym13030403

Yargholi, E., & Hossein?Zadeh, G. (2016). Brain Decoding-Classification of Hand Written Digits from fMRI Data Employing Bayesian Networks. Frontiers in Human Neuroscience, 10. https://doi.org/10.3389/fnhum.2016.00351

Zhang, Y., Gao, Z., Sun, J., & Liu, L. (2023). Machine-Learning Algorithms for Process Condition Data-Based Inclusion Prediction in Continuous-Casting Process: A Case Study. Sensors, 23(15), 6719. https://doi.org/10.3390/s23156719

Zhang, M., & Shi, W. (2020). Systematic comparison of five machine-learning methods in classification and interpolation of soil particle size fractions using different transformed data. https://doi.org/10.5194/hess-2019-648

Downloads

Published

2026-04-30

Issue

Section

Artikel

How to Cite

IT CAREER NAVIGATION: PERFORMANCE EVALUATION OF KNN AND NAÏVE BAYES IN CAREER PATH RECOMMENDATIONS FOR COMPUTER SCIENCE STUDENTS (CASE STUDY: BATTUTA UNIVERSITY). (2026). JOURNAL OF SCIENCE AND SOCIAL RESEARCH, 9(2), 2838-2847. https://doi.org/10.54314/jssr.v9i2.6269