Ensemble model of Bidirectional Encoder Representation from Transformers for Named Entity Recognition

dc.contributor.author: Jendle, Carl
dc.contributor.author: Schönbeck, Linus
dc.contributor.department: Chalmers tekniska högskola / Institutionen för data- och informationsteknik
dc.contributor.examiner: Axelson-Fisk, Marina
dc.contributor.supervisor: Brown-Cohen, Jonah
dc.date.accessioned: 2021-08-20T12:49:26Z
dc.date.available: 2021-08-20T12:49:26Z
dc.date.issued: 2021
dc.date.submitted: 2020
dc.description.abstract: Named entity recognition (NER) has been widely modeled using Bidirectional Encoder Representations from Transformers (BERT) in state-of-the-art implementations since its appearance in 2018. Various configurations based on BERT models currently hold 4 of the top 5 positions on the GLUE leaderboard, an acknowledged benchmark for natural language processing and understanding. Building on the BERT architecture, a range of NER model designs was investigated to predict entities in a comparatively small set of medical press releases. Early in the project, transfer learning with the publicly available CoNLL-2003 and BC5CDR datasets was shown to boost the performance of all investigated model designs. Transfer learning was therefore implemented in the best NER system found, the separate submodel system described in Section 6.3.6. This final design consisted of two submodels, each independently classifying a different subset of entities. The CoNLL-2003 and BC5CDR datasets were used for transfer learning in the respective submodels prior to the introduction of the medical press release data. The separate submodel system reached an F1-score of 0.79 (CoNLL submodel) and 0.78 (BC5CDR submodel). The effect of pre-training a selection of publicly available BERT models on the medical press releases was also investigated, but was given less emphasis due to the insufficient amount of data.
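The abstract describes a "separate submodel system" in which two independently trained submodels each label a different subset of entity types, and their outputs are combined into one prediction. The thesis section with the exact combination rule is not included here, so the sketch below is a hypothetical illustration of one simple way such per-token merging could work: each submodel emits BIO tags only for its own entity subset and "O" elsewhere, and a non-"O" tag from either submodel is kept (the entity labels and the tie-breaking preference for the first submodel are assumptions, not the authors' documented method).

```python
def merge_predictions(preds_a, preds_b):
    """Combine per-token BIO tags from two independent NER submodels.

    Assumes each submodel only labels its own entity subset and emits
    "O" (outside) everywhere else, so a non-"O" tag from either model
    is kept. Ties (both non-"O") default to submodel A here -- an
    illustrative design choice, not the thesis's documented rule.
    """
    merged = []
    for tag_a, tag_b in zip(preds_a, preds_b):
        merged.append(tag_a if tag_a != "O" else tag_b)
    return merged


# Hypothetical example: submodel A was transferred from CoNLL-style
# entities, submodel B from BC5CDR-style (chemical/disease) entities.
tokens  = ["Pfizer", "announced", "results", "for", "tanezumab"]
preds_a = ["B-ORG", "O", "O", "O", "O"]
preds_b = ["O", "O", "O", "O", "B-CHEMICAL"]
print(merge_predictions(preds_a, preds_b))
# → ['B-ORG', 'O', 'O', 'O', 'B-CHEMICAL']
```

Keeping the submodels' label spaces disjoint is what makes this trivial merge well defined; with overlapping label sets, a tie-breaking policy (or a shared CRF layer, which the subject keywords suggest the thesis also explores) would be needed.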
dc.identifier.coursecode: MPDSC
dc.identifier.uri: https://hdl.handle.net/20.500.12380/303941
dc.language.iso: eng
dc.setspec.uppsok: Technology
dc.subject: Transfer learning
dc.subject: natural language processing
dc.subject: named entity recognition
dc.subject: BERT
dc.subject: conditional random field
dc.title: Ensemble model of Bidirectional Encoder Representation from Transformers for Named Entity Recognition
dc.type.degree: Master's thesis
dc.type.uppsok: H
Original bundle
Name:
CSE 21-54 Jendle Schönbeck.pdf
Size:
1.96 MB
Format:
Adobe Portable Document Format
License bundle
Name:
license.txt
Size:
1.51 KB
Format:
Item-specific license agreed to upon submission