Update README.md
README.md
CHANGED
@@ -15,6 +15,11 @@ This is a reduced version of the Portuguese capitalisation and punctuation resto
 
 You can try the model in the following [SPACE](https://huggingface.co/spaces/VOCALINLP/punctuation_and_capitalization_restoration_sanivert)
 ## Details of the dataset
+This is a neuralmind/bert-base-portuguese-cased model fine-tuned for punctuation restoration using the following data distribution.
+
+| Language | Number of text samples | Number of tokens |
+| -------- | ---------------------- | ---------------- |
+| Portuguese | 2,974,058 | 49,720,263 |
 
 ## Evaluation Metrics
 The metrics used for the evaluation of the model are the Macro and the Weighted F1 scores.
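
For readers unfamiliar with the two averages named under Evaluation Metrics, the following is a minimal sketch of how Macro and Weighted F1 differ on token-level labels. The label names (`O`, `COMMA`, `PERIOD`) and the toy sequences are hypothetical and chosen only for illustration; the README does not state the model's actual label set or the evaluation code that was used.

```python
from collections import Counter


def f1_scores(y_true, y_pred):
    """Return (macro_f1, weighted_f1) for two equal-length label sequences."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    if not y_true:
        raise ValueError("label sequences must not be empty")

    labels = sorted(set(y_true) | set(y_pred))
    support = Counter(y_true)  # number of gold tokens per label
    pairs = list(zip(y_true, y_pred))

    per_label = {}
    for label in labels:
        tp = sum(1 for t, p in pairs if t == label and p == label)
        fp = sum(1 for t, p in pairs if t != label and p == label)
        fn = sum(1 for t, p in pairs if t == label and p != label)
        denom = 2 * tp + fp + fn
        # F1 = 2*TP / (2*TP + FP + FN); defined as 0 when the label never occurs.
        per_label[label] = 2 * tp / denom if denom else 0.0

    # Macro: every label counts equally, regardless of how frequent it is.
    macro = sum(per_label.values()) / len(labels)
    # Weighted: each label's F1 is weighted by its share of the gold tokens.
    weighted = sum(per_label[l] * support[l] for l in labels) / len(y_true)
    return macro, weighted


# Hypothetical gold and predicted labels for eight tokens.
y_true = ["O", "O", "COMMA", "O", "PERIOD", "O", "O", "PERIOD"]
y_pred = ["O", "O", "COMMA", "O", "PERIOD", "COMMA", "O", "O"]

macro_f1, weighted_f1 = f1_scores(y_true, y_pred)
print(f"Macro F1: {macro_f1:.4f}, Weighted F1: {weighted_f1:.4f}")
```

Because most tokens in running text carry no punctuation, the Weighted F1 is dominated by the majority label, while the Macro F1 gives rare punctuation marks the same influence as common ones. Reporting both gives a fuller picture than either alone.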