readerbench
/

ro-offense

@@ -18,13 +18,13 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [readerbench/RoBERT-base](https://huggingface.co/readerbench/RoBERT-base) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.7285
-- Accuracy: 0.8132
-- Precision: 0.8131
-- Recall: 0.8173
-- F1 Macro: 0.8123
-- F1 Micro: 0.8132
-- F1 Weighted: 0.8094
 ## Model description
@@ -43,25 +43,26 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 2e-05
 - train_batch_size: 64
 - eval_batch_size: 128
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - num_epochs: 10
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Accuracy | Precision | Recall | F1 Macro | F1 Micro | F1 Weighted |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|:---------:|:------:|:--------:|:--------:|:-----------:|
-| No log        | 1.0   | 125  | 0.6284          | 0.7675   | 0.7662    | 0.7721 | 0.7681   | 0.7675   | 0.7654      |
-| No log        | 2.0   | 250  | 0.5576          | 0.7820   | 0.7826    | 0.7799 | 0.7796   | 0.7820   | 0.7803      |
-| No log        | 3.0   | 375  | 0.5405          | 0.8001   | 0.8122    | 0.8077 | 0.8026   | 0.8001   | 0.7943      |
-| 0.5338        | 4.0   | 500  | 0.5853          | 0.8172   | 0.8140    | 0.8120 | 0.8124   | 0.8172   | 0.8161      |
-| 0.5338        | 5.0   | 625  | 0.6476          | 0.8157   | 0.8143    | 0.8098 | 0.8118   | 0.8157   | 0.8148      |
-| 0.5338        | 6.0   | 750  | 0.6607          | 0.8122   | 0.8137    | 0.8173 | 0.8120   | 0.8122   | 0.8082      |
-| 0.5338        | 7.0   | 875  | 0.7285          | 0.8132   | 0.8131    | 0.8173 | 0.8123   | 0.8132   | 0.8094      |
 ### Framework versions

 This model is a fine-tuned version of [readerbench/RoBERT-base](https://huggingface.co/readerbench/RoBERT-base) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.8411
+- Accuracy: 0.8232
+- Precision: 0.8235
+- Recall: 0.8210
+- F1 Macro: 0.8207
+- F1 Micro: 0.8232
+- F1 Weighted: 0.8210
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 4e-05
 - train_batch_size: 64
 - eval_batch_size: 128
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- lr_scheduler_warmup_ratio: 0.2
 - num_epochs: 10
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Accuracy | Precision | Recall | F1 Macro | F1 Micro | F1 Weighted |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|:---------:|:------:|:--------:|:--------:|:-----------:|
+| No log        | 1.0   | 125  | 0.7789          | 0.7037   | 0.6825    | 0.7000 | 0.6873   | 0.7037   | 0.7132      |
+| No log        | 2.0   | 250  | 0.5170          | 0.8006   | 0.8066    | 0.8016 | 0.7986   | 0.8006   | 0.7971      |
+| No log        | 3.0   | 375  | 0.5139          | 0.8096   | 0.8168    | 0.8237 | 0.8120   | 0.8096   | 0.8047      |
+| 0.6074        | 4.0   | 500  | 0.6180          | 0.8247   | 0.8251    | 0.8187 | 0.8210   | 0.8247   | 0.8233      |
+| 0.6074        | 5.0   | 625  | 0.7311          | 0.8096   | 0.8071    | 0.8085 | 0.8064   | 0.8096   | 0.8071      |
+| 0.6074        | 6.0   | 750  | 0.8365          | 0.8101   | 0.8117    | 0.8191 | 0.8105   | 0.8101   | 0.8051      |
+| 0.6074        | 7.0   | 875  | 0.8411          | 0.8232   | 0.8235    | 0.8210 | 0.8207   | 0.8232   | 0.8210      |
 ### Framework versions