ANALISIS KARIR ALUMNI MENGGUNAKAN RANDOM FOREST DENGAN INTERPRETASI SHAP DAN EVALUASI BIAS MODEL FAIRNESS

Authors

  • Irfan Abadi Saragih
  • Arip Muhridan
  • Andisyah Putra
  • Ridwan
  • Mhd. Ihsan Abidi

DOI:

https://doi.org/10.54314/jssr.v9i3.6362

Keywords:

random forest, SHAP, fairness, tracer study, alumni career

Abstract

 

Higher education institutions are required to produce graduates who are competitive in the job market, yet tracer study data is often analyzed only descriptively, leaving deeper patterns unexplored. This study proposes an integrated approach combining Random Forest, SHAP (SHapley Additive exPlanations), and fairness evaluation to analyze alumni career outcomes. Data were sourced from the 2024 tracer study of Universitas Royal, comprising 466 alumni records with a binary target variable indicating employment status. Preprocessing included missing value handling, one-hot encoding, and stratified train-test splitting with an 80:20 ratio. Class imbalance was addressed using the class_weight='balanced' parameter. The Random Forest model achieved an accuracy of 95.7%, F1-score of 0.946, and AUC-ROC of 0.938, significantly outperforming the dummy baseline of 58.5%. Cross-validation across five folds yielded a mean accuracy of 96.4%, confirming model stability. SHAP analysis revealed that features related to educational level appropriateness and relevance of study field to current occupation were the most dominant predictors. Fairness evaluation per study program showed minimal performance disparity between Sistem Informasi (94.7%) and Sistem Komputer (100%), indicating no significant predictive bias. This framework demonstrates that explainable and fair machine learning can serve as a reliable foundation for evidence-based academic policy making

Downloads

Download data is not yet available.

References

Alfadia Shauqie, S., Nurkamal Fauzan, M., Prianto, C., Sariasih No, J., Sukasari, K., Bandung, K., & Barat, J. (2025). Analisis Pengaruh Fitur Terhadap Tinggi Badan Anak menggunakan SHAP. JEPIN (Jurnal Edukasi Dan Penelitian Informatika), 11(2), 271–276.

Aprilia, K. T., Sitorus, I. P., Ridha, M. R., & Novelan, M. S. (2026). Comparison of Random Forest and Naïve Bayes Classifier Methods for Monkeypox Classification. 3(1), 58–67.

Barutu, S., Iqbal, M., & Nasution, D. (2025). Comparative Analysis of the C4 . 5 and Random Forest Algorithms for the Prediction of Diarrheal Disease. 4(07), 625–635. https://doi.org/10.58471/esaprom.v4i07

Ernawati, A., Sitorus, Z., Iqbal, M., & Nasution, D. (2025). Penerapan Data Mining Untuk Klasifikasi Penduduk Miskin Di Kabupaten Labuhanbatu Menggunakan Random Forest Dan K-Nearest Neighbors. Bulletin of Information Technology (BIT), 6(2), 23–35. https://doi.org/10.47065/bit.v5i2.1783

Fatimah, A., Tania, K. D., Meiriza, A., Studi, P., Informasi, S., Sriwijaya, U., Palembang-prabumulih, J., & Ilir, O. (2025). Analisis Komparatif Model Data Mining Dalam Prediksi Ketepatan. Journal Of Information Systems And Informatics Engineering, 9(1), 100–108.

Helmy, A., Sitorus, Z., Ardya, D., Hrp, A. C., T, S. I. S., & Sukrianto, S. (2025). Analysis of Social Assistance Donor Classification at the Muhammadiyah Medan Orphanage Using SVM. Sinkron, 9(1), 283–290. https://doi.org/10.33395/sinkron.v9i1.14299

Hidayat, I., Iqbal, M., Marlina, L., Putera, A., & Siahaan, U. (2025). Analysis of Heart Failure Prediction With. International Journal of Computer Sciences and Mathematics Engineering, 4(1), 10–17.

Parhusip, N., Iqbal, M., & Sitorus, Z. (2025). Analysis Of Licensing Data Using Naive Bayes And Decision Tree Algorithms To Evaluate Analysis Of Licensing Data Using Naive Bayes And Decision Tree Algorithms To Evaluate The Performance Of Digital Public Services (Case Study: Invesment and One-Stop Integrated Services Office Of Medan City). 07(02), 24–35. https://journal.cattleyadf.org/index.php/jatilima/index

Sholeha, S. H., Faid, M., & Yaqin, M. A. (2024). Prediksi Prediksi Perpindahan Pelanggan Pada Toko Online Menggunakan Metode Tree-Based Gradient Boosted Models. Journal of Computer System and Informatics (JoSYC), 5(3), 605–614. https://doi.org/10.47065/josyc.v5i3.5215

Sitorus, R. A., Arya, D., Buyung Satrio Dasopang, B. S. D., & Ilka Zufria, I. Z. (2023). Analisis Tracer Study Alumni Program Studi S1 Ilmu Komputer UIN Sumatera Utara. Jurnal Kridatama Sains Dan Teknologi, 5(02), 411–420. https://doi.org/10.53863/kst.v5i02.967

Subekti, E. E., Cahyadi, F., & Priyanto, W. (2021). Inventa : Jurnal Pendidikan Guru Sekolah Dasar STUDI PENELUSURAN ( TRACER STUDY ) PROGRAM STUDI PENDIDIKAN GURU SEKOLAH DASAR TAHUN LULUS 2015-2019. V(2).

Suci Amaliah, Nusrang, M., & Aswi, A. (2022). Penerapan Metode Random Forest Untuk Klasifikasi Varian Minuman Kopi di Kedai Kopi Konijiwa Bantaeng. VARIANSI: Journal of Statistics and Its Application on Teaching and Research, 4(3), 121–127. https://doi.org/10.35580/variansiunm31

Usman, M., Hidayat, A., Sambas, P. N., Sambas, P. N., Statistik, B. P., Explanations, S. A., & Vector, S. (2024). Prosiding Seminar Hasil Penelitian dan Pengabdian Kepada Masyarakat ALGORITMA SHAPLEY ADDITIVE EXPLANATIONS UNTUK ANALISIS Prosiding Seminar Hasil Penelitian dan Pengabdian Kepada Masyarakat. 7(1), 76–81.

Downloads

Published

2026-06-06

Issue

Section

Artikel

How to Cite

ANALISIS KARIR ALUMNI MENGGUNAKAN RANDOM FOREST DENGAN INTERPRETASI SHAP DAN EVALUASI BIAS MODEL FAIRNESS. (2026). JOURNAL OF SCIENCE AND SOCIAL RESEARCH, 9(3), 3041-3048. https://doi.org/10.54314/jssr.v9i3.6362

Most read articles by the same author(s)