IMPLEMENTASI REINFORCEMENT LEARNING UNTUK SUMMARIZATION PADA ARTIKEL BERITA DI INDONESIAMENGGUNAKAN MODEL TRANSFORMER

Authors

DOI:

https://doi.org/10.54314/jssr.v%25vi%25i.3152

Abstract

Abstract: This research aims to develop a Transformer-based text summarization model for Indonesian news articles and implement Reinforcement Learning to enhance summary quality. The research methodology involved using the Liputan6 dataset (Canonical subset), applying a T5-base model pre-trained with Supervised Fine-Tuning (SFT) on the same dataset, and subsequently optimizing it using the Reinforcement Learning algorithm Proximal Policy Optimization (PPO). Model performance was evaluated using the ROUGE-L metric. The results showed that the implementation of RL successfully increased the ROUGE-L F-measure score, from 0.314801 for the SFT-only model to 0.345324 for the SFT model with RL. This increase indicates an improvement in the longest common subsequence (LCS) similarity between the model-generated summaries and the reference summaries after optimization with RL. However, qualitative analysis of the generated summaries found that the RL model's summaries tended to be very short (often just one sentence) and omitted some important information present in the original articles. This suggests a limitation of the ROUGE-L metric, which focuses on lexical overlap, in fully capturing the semantic quality and completeness of the summary.

 

Keyword: Reinforcement Learning; Transformer; Abstractive Summarization; Indonesian News

 

Abstrak: Penelitian ini bertujuan untuk mengembangkan model peringkasan teks berbasis Transformer untuk artikel berita berbahasa Indonesia dan mengimplementasikan Reinforcement Learning guna meningkatkan kualitas ringkasan. Metodologi penelitian melibatkan penggunaan dataset Liputan6 (subset Canonical), penerapan model T5-base yang telah dilakukan Supervised Fine-Tuning (SFT) pada dataset yang sama, kemudian dioptimalkan menggunakan algoritma Reinforcement Learning, Proximal Policy Optimization (PPO). Evaluasi performa model dilakukan menggunakan metrik ROUGE-L. Hasil penelitian menunjukkan bahwa implementasi RL berhasil meningkatkan skor ROUGE-L F-measure, dari 0.314801 pada model SFT saja menjadi 0.345324 pada model SFT dengan RL. Peningkatan ini mengindikasikan adanya perbaikan dalam kesamaan urutan kata terpanjang (LCS) antara ringkasan hasil model dan ringkasan referensi setelah dioptimalkan dengan RL. Namun, analisis kualitatif terhadap ringkasan yang dihasilkan menemukan bahwa ringkasan hasil model RL cenderung sangat pendek (sering kali hanya satu kalimat) dan mengabaikan beberapa informasi penting yang ada di artikel asli. Hal ini menunjukkan adanya keterbatasan metrik ROUGE-L yang berfokus pada kesamaan leksikal dalam menangkap kualitas semantik dan kelengkapan ringkasan secara keseluruhan.

 

Kata kunci: Reinforcement Learning; Transformer; Peringkasan Abstraktif; Berita

                    Bahasa Indonesia

Downloads

Download data is not yet available.

Author Biographies

  • Muhammad Alif Nasrulloh, Universitas Muhammadiyah Malang
    Program Studi Informatika
  • Christian Sri Kusuma Aditya, Universitas Muhammadiyah Malang
    Program Studi Informatika

References

Alfhi Saputra, M. (2021). Peringkas Teks Otomatis Bahasa Indonesia secara Abstraktif Menggunakan Metode Long Short-Term Memory. E-Proceeding of Engineering : Vol.8, No.2 April 2021 |, 8(2), 3474–3488.

Itsnaini, Q. A., Hayaty, M., Putra, A. D., & Jabari, N. A. . (2023). Abstractive Text Summarization using Pre-Trained Language Model “Text-to-Text Transfer Transformer (T5).†ILKOM Jurnal Ilmiah, 15(1), 124–131. https://doi.org/10.33096/ilkom.v15i1.1532.124-131

Keneshloo, Y., Ramakrishnan, N., & Reddy, C. K. (2019). Deep transfer reinforcement learning for text summarization. SIAM International Conference on Data Mining, SDM 2019, 675–683. https://doi.org/10.1137/1.9781611975673.76

Khasanah, A. N., & Hayaty, M. (2023). Abstractive-Based Automatic Text Summarization on Indonesian News Using Gpt-2. JURTEKSI (Jurnal Teknologi Dan Sistem Informasi), 10(1), 9–18. https://doi.org/10.33330/jurteksi.v10i1.2492

Koto, F., Lau, J. H., & Baldwin, T. (2020). Liputan6: A Large-scale Indonesian Dataset for Text Summarization. 1, 598–608. http://arxiv.org/abs/2011.00679

Maulidia Sari, Y., & Siti Fatonah, N. (2021). JEPIN (Jurnal Edukasi dan Penelitian Informatika) Peringkasan Teks Otomatis pada Modul Pembelajaran Berbahasa Indonesia Menggunakan Metode Cross Latent Semantic Analysis (CLSA). Jurnal Edukasi Dan Penelitian Informatika, 7(2), 153–159. www.kompas.com.

Raihanunnisa, F., Arhami, M., & Hidayat, R. (2023). Pendekatan Hybrid Pada Sistem Peringkas Teks Artikel Berita Bahasa Inggris Menggunakan Natural Language Processing. Telematika Mkom, 15(2), 86–92. https://journal.budiluhur.ac.id/index.php/telematika/

Stiennon, N., Ouyang, L., Wu, J., Ziegler, D. M., Lowe, R., Voss, C., Radford, A., Amodei, D., & Christiano, P. (2020). Learning to summarize from human feedback. Advances in Neural Information Processing Systems, 2020-Decem(NeurIPS), 1–14.

Wang, M., Xie, P., Du, Y., & Hu, X. (2023). T5-Based Model for Abstractive Summarization: A Semi-Supervised Learning Approach with Consistency Loss Functions. Applied Sciences (Switzerland), 13(12). https://doi.org/10.3390/app13127111

Downloads

Published

2025-08-28

How to Cite

IMPLEMENTASI REINFORCEMENT LEARNING UNTUK SUMMARIZATION PADA ARTIKEL BERITA DI INDONESIAMENGGUNAKAN MODEL TRANSFORMER. (2025). JOURNAL OF SCIENCE AND SOCIAL RESEARCH, 8(3), 4553-4560. https://doi.org/10.54314/jssr.v%vi%i.3152

Most read articles by the same author(s)

<< < 152 153 154 155 156 157 158 159 160 161 > >>