Deep Active Learning for Swedish Named Entity Recognition An empiric evaluation of active learning algorithms for Named Entity Recognition

Hagatulah, Nadim; Arvidsson, Kalle

Deep Active Learning for Swedish Named Entity Recognition An empiric evaluation of active learning algorithms for Named Entity Recognition

Ladda ner

CSE 21-75 Hagatulah Arvidsson.pdf (2.76 MB)

Publicerad

2021

Författare

Hagatulah, Nadim

Arvidsson, Kalle

Typ

Examensarbete för masterexamen

Program

Computer science – algorithms, languages and logic (MPALG), MSc

Sammanfattning

Named entity recognition holds promise for numerous practical applications involving text data, such as keyword extraction and automated anonymization. However, successfully train a machine learning model for Named Entity Recognition is challenging due to the amount of annotated data required, especially for cases where language that is not globally common such as Swedish is involved. In such cases, using a Deep pre-trained model such as BERT in conjunction with the practice of active learning may be preferred. To obtain some insight into the implementation of such an approach, this thesis serves as an empirical study of various active learning strategies when used in conjunction with BERT-based name entity recognition. The performance of different active learning algorithms and the effect of acquisition size on the performance of active learning is the main focus of this study. In conclusion, after comparing and evaluating 17 different active learning methods, the study’s empirical results demonstrate entropy sampling to be the best performing active learning algorithm for Named Entity Recognition of Swedish texts, and the choice of acquisition sizes is practically negligible to performance.

Ämne/nyckelord

Active Learning, Deep Learning, Transformer, BERT, NLP, Named Entity Recognition, Diversity-Based Sampling, Uncertainty-Based Sampling, Pool- Based Sampling, Cumulative Training

URI

https://hdl.handle.net/20.500.12380/302940

Samlingar

Examensarbeten för masterexamen

Visa fullständig post

Deep Active Learning for Swedish Named Entity Recognition An empiric evaluation of active learning algorithms for Named Entity Recognition

Ladda ner

Publicerad

Författare

Typ

Program

Modellbyggare

Tidskriftstitel

ISSN

Volymtitel

Utgivare

Sammanfattning

Beskrivning

Ämne/nyckelord

Citation

Arkitekt (konstruktör)

Geografisk plats

Byggnad (typ)

Byggår

Modelltyp

Skala

Teknik / material

Index

URI

Samlingar

item.page.endorsement

item.page.review

item.page.supplemented

item.page.referenced