Optimizing abstractive summarization with fine-tuned PEGASUS
Abstract
Abstractive text summarization is the technique of generating a short, concise
summary that captures the salient ideas of a source text, rather than merely
extracting a subset of its sentences. The introduction of transformer models such
as BART, T5, and PEGASUS has made abstractive summarization markedly more
efficient and accurate. The objective of this paper is to analyze the performance of
several transformer models, compare them to identify the most effective one, and
fine-tune that model on the English subset of the csebuetnlp/xlsum corpus. The
summaries generated by the fine-tuned PEGASUS model are evaluated with the
ROUGE metric, which compares automatically generated summaries against
human-written reference summaries. The fine-tuned PEGASUS model achieves
state-of-the-art performance on the XL-Sum English corpus.
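As an illustration of the pipeline the abstract describes, the sketch below loads a PEGASUS checkpoint, generates a summary for one article from the XL-Sum English test split, and scores it with ROUGE. This is a minimal sketch, not the paper's actual training or evaluation code: the checkpoint name google/pegasus-xsum stands in for the authors' fine-tuned weights, and the generation settings (beam count, length limits) are illustrative assumptions.

```python
# Minimal sketch of the summarize-then-score pipeline from the abstract.
# Assumption: "google/pegasus-xsum" is a placeholder for the authors'
# fine-tuned checkpoint; generation settings are illustrative.
from datasets import load_dataset
from rouge_score import rouge_scorer
from transformers import PegasusForConditionalGeneration, PegasusTokenizer

# Load one example from the English split of XL-Sum
# (each record has a "text" field and a human-written "summary" field).
dataset = load_dataset("csebuetnlp/xlsum", "english", split="test")
example = dataset[0]

model_name = "google/pegasus-xsum"  # placeholder for the fine-tuned model
tokenizer = PegasusTokenizer.from_pretrained(model_name)
model = PegasusForConditionalGeneration.from_pretrained(model_name)

# Generate an abstractive summary with beam search.
inputs = tokenizer(example["text"], truncation=True, max_length=512,
                   return_tensors="pt")
summary_ids = model.generate(**inputs, num_beams=4, max_length=64)
generated = tokenizer.decode(summary_ids[0], skip_special_tokens=True)

# ROUGE compares the generated summary against the reference summary.
scorer = rouge_scorer.RougeScorer(["rouge1", "rouge2", "rougeL"],
                                  use_stemmer=True)
print(scorer.score(example["summary"], generated))
```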