Update README.md
README.md
CHANGED
@@ -18,6 +18,10 @@ This is a reduced version of the Spanish capitalisation and punctuation restorat
You can try the model in the following [Space](https://huggingface.co/spaces/VOCALINLP/punctuation_and_capitalization_restoration_sanivert).
## Details of the dataset
+This is a dccuchile/bert-base-spanish-wwm-uncased model fine-tuned for punctuation restoration on the following data distribution.
+| Language | Number of text samples | Number of tokens |
+| -------- | ---------------------- | ---------------- |
+| Spanish  | 2,153,296              | 51,049,602       |
## Evaluation Metrics
The model is evaluated with the macro-averaged and the weighted F1 scores.
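
To make the difference between the two averages concrete, here is a minimal sketch in plain Python. The label names (`O`, `COMMA`, `PERIOD`) and the toy sequences are hypothetical and are not taken from the model's actual label set or evaluation data; the helper `f1_scores` is an illustrative name, not part of any library.

```python
from collections import Counter


def f1_scores(y_true, y_pred):
    """Return (macro_f1, weighted_f1) for two equal-length label sequences."""
    labels = sorted(set(y_true) | set(y_pred))
    support = Counter(y_true)  # number of true occurrences of each label
    per_class = {}
    for label in labels:
        pairs = list(zip(y_true, y_pred))
        tp = sum(1 for t, p in pairs if t == label and p == label)
        fp = sum(1 for t, p in pairs if t != label and p == label)
        fn = sum(1 for t, p in pairs if t == label and p != label)
        denom = 2 * tp + fp + fn
        # F1 = 2*TP / (2*TP + FP + FN); defined as 0 when the label never occurs
        per_class[label] = 2 * tp / denom if denom else 0.0
    # Macro: unweighted mean over labels, so rare labels count as much as frequent ones
    macro = sum(per_class.values()) / len(labels)
    # Weighted: mean over labels weighted by how often each label occurs in y_true
    weighted = sum(per_class[l] * support[l] for l in labels) / len(y_true)
    return macro, weighted


# Hypothetical token-level labels, for illustration only
y_true = ["O", "O", "O", "O", "COMMA", "PERIOD"]
y_pred = ["O", "O", "O", "COMMA", "COMMA", "PERIOD"]
macro, weighted = f1_scores(y_true, y_pred)
print(round(macro, 4), round(weighted, 4))  # 0.8413 0.8492
```

Because most tokens in a punctuation restoration task typically carry no punctuation mark, the weighted score tends to be dominated by the majority label, while the macro score exposes weak performance on rarer marks.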