Flair NER Model trained on CleanCoNLL Dataset

This (unofficial) Flair NER model was trained on the awesome CleanCoNLL dataset.

The CleanCoNLL dataset was proposed by Susanna Rücker and Alan Akbik and introduces a corrected version of the classic CoNLL-03 dataset, with updated and more consistent NER labels.

Fine-Tuning

We use XLM-RoBERTa Large as backbone language model and the following hyper-parameters for fine-tuning:

Hyper-Parameter	Value
Batch Size	`4`
Learning Rate	`5-06`
Max. Epochs	`10`

Additionally, the FLERT approach is used for fine-tuning the model. Training logs and TensorBoard are also available for each model.

Results

We report micro F1-Score on development (in brackets) and test set for five runs with different seeds:

Seed 1	Seed 2	Seed 3	Seed 4	Seed 5	Avg.
(97.34) / 97.00	(97.26) / 96.90	(97.66) / 97.02	(97.42) / 96.96	(97.46) / 96.99	(97.43) / 96.97

Rücker and Akbik report 96.98 on three different runs, so our results are very close to their reported performance!

Flair Demo

The following snippet shows how to use the CleanCoNLL NER models with Flair:

from flair.data import Sentence
from flair.models import SequenceTagger

# load tagger
tagger = SequenceTagger.load("stefan-it/flair-clean-conll-2")

# make example sentence
sentence = Sentence("According to the BBC George Washington went to Washington.")

# predict NER tags
tagger.predict(sentence)

# print sentence
print(sentence)

# print predicted NER spans
print('The following NER tags are found:')
# iterate over entities and print
for entity in sentence.get_spans('ner'):
    print(entity)

stefan-it
/

flair-clean-conll-2

Flair NER Model trained on CleanCoNLL Dataset

Fine-Tuning

Results

Flair Demo

Finetuned from

Collection including stefan-it/flair-clean-conll-2

🧹 Fine-Tuned CleanCoNLL Models

Flair NER Model trained on CleanCoNLL Dataset

Fine-Tuning

Results

Flair Demo

Finetuned from FacebookAI/xlm-roberta-large

Collection including stefan-it/flair-clean-conll-2

Finetuned from