Text summarization using transfer learning: Extractive and abstractive summarization using BERT and GPT-2 on news and podcast data

RISNE, VICTOR; SIITOVA, ADÉLE

Download

CSE 19-83 ODR Risne Siltova.pdf (2.6 MB)

Published

2019

Authors

RISNE, VICTOR

SIITOVA, ADÉLE

Type

Master's thesis

Program

Computer systems and networks (MPCSN), MSc

Abstract

A summary of a long text document lets readers grasp its main content without having to read the whole document. This thesis aims to automate text summarization using two approaches: extractive and abstractive. The extractive approach uses submodular functions together with the language representation model BERT, while the abstractive approach uses the language model GPT-2. We work with two datasets: CNN/DailyMail, a benchmark news-article dataset, and Podcast, a dataset of podcast episode transcripts. The results obtained with GPT-2 on the CNN/DailyMail dataset are competitive with the state of the art. In addition to the quantitative evaluation, we carry out a qualitative investigation in the form of a human evaluation, and we inspect the trained model, which shows that it learns reasonable abstractions.

Subject/keywords

transformer, BERT, GPT-2, text summarization, natural language processing

URI

https://hdl.handle.net/20.500.12380/300416

Collections

Master's theses
