dhiya96 commited on
Commit
b081fbf
1 Parent(s): 4bfbca3

dhiya96/zephyr_summarisation_finetuned

Browse files
README.md ADDED
@@ -0,0 +1,104 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ base_model: google-t5/t5-base
4
+ tags:
5
+ - generated_from_trainer
6
+ metrics:
7
+ - rouge
8
+ model-index:
9
+ - name: t5-base-finetuned-stocknews_1900_100
10
+ results: []
11
+ ---
12
+
13
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
14
+ should probably proofread and complete it, then remove this comment. -->
15
+
16
+ # t5-base-finetuned-stocknews_1900_100
17
+
18
+ This model is a fine-tuned version of [google-t5/t5-base](https://huggingface.co/google-t5/t5-base) on an unknown dataset.
19
+ It achieves the following results on the evaluation set:
20
+ - Loss: 1.2997
21
+ - Rouge1: 16.6203
22
+ - Rouge2: 8.7831
23
+ - Rougel: 13.9116
24
+ - Rougelsum: 15.4831
25
+ - Gen Len: 19.0
26
+
27
+ ## Model description
28
+
29
+ More information needed
30
+
31
+ ## Intended uses & limitations
32
+
33
+ More information needed
34
+
35
+ ## Training and evaluation data
36
+
37
+ More information needed
38
+
39
+ ## Training procedure
40
+
41
+ ### Training hyperparameters
42
+
43
+ The following hyperparameters were used during training:
44
+ - learning_rate: 2e-05
45
+ - train_batch_size: 4
46
+ - eval_batch_size: 4
47
+ - seed: 42
48
+ - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
49
+ - lr_scheduler_type: linear
50
+ - num_epochs: 40
51
+ - mixed_precision_training: Native AMP
52
+
53
+ ### Training results
54
+
55
+ | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
56
+ |:-------------:|:-----:|:----:|:---------------:|:-------:|:------:|:-------:|:---------:|:-------:|
57
+ | No log | 1.0 | 102 | 1.5488 | 14.6381 | 6.8963 | 12.1802 | 13.6527 | 19.0 |
58
+ | No log | 2.0 | 204 | 1.4139 | 15.0451 | 6.9216 | 12.6068 | 14.1445 | 19.0 |
59
+ | No log | 3.0 | 306 | 1.3627 | 15.3864 | 7.115 | 12.6537 | 14.267 | 19.0 |
60
+ | No log | 4.0 | 408 | 1.3288 | 15.6891 | 7.5106 | 13.0451 | 14.6203 | 19.0 |
61
+ | 1.8685 | 5.0 | 510 | 1.3087 | 15.8071 | 7.6382 | 13.103 | 14.7587 | 19.0 |
62
+ | 1.8685 | 6.0 | 612 | 1.2938 | 15.6775 | 7.6448 | 13.0823 | 14.6034 | 19.0 |
63
+ | 1.8685 | 7.0 | 714 | 1.2870 | 15.7672 | 7.89 | 13.3325 | 14.7821 | 19.0 |
64
+ | 1.8685 | 8.0 | 816 | 1.2779 | 16.1616 | 8.1642 | 13.4471 | 15.0305 | 19.0 |
65
+ | 1.8685 | 9.0 | 918 | 1.2731 | 16.3679 | 8.4804 | 13.7618 | 15.3468 | 19.0 |
66
+ | 1.1991 | 10.0 | 1020 | 1.2695 | 16.2821 | 8.456 | 13.7692 | 15.2461 | 19.0 |
67
+ | 1.1991 | 11.0 | 1122 | 1.2647 | 16.4056 | 8.5019 | 13.7217 | 15.3711 | 19.0 |
68
+ | 1.1991 | 12.0 | 1224 | 1.2667 | 16.4259 | 8.6692 | 13.8396 | 15.4122 | 19.0 |
69
+ | 1.1991 | 13.0 | 1326 | 1.2654 | 16.6988 | 8.9574 | 14.0239 | 15.6864 | 19.0 |
70
+ | 1.1991 | 14.0 | 1428 | 1.2648 | 16.7394 | 9.0588 | 14.0529 | 15.6644 | 19.0 |
71
+ | 1.0382 | 15.0 | 1530 | 1.2642 | 16.6864 | 9.106 | 13.9046 | 15.5687 | 19.0 |
72
+ | 1.0382 | 16.0 | 1632 | 1.2662 | 16.6786 | 8.8288 | 13.9603 | 15.5724 | 19.0 |
73
+ | 1.0382 | 17.0 | 1734 | 1.2651 | 16.7446 | 8.9211 | 13.9999 | 15.6617 | 19.0 |
74
+ | 1.0382 | 18.0 | 1836 | 1.2702 | 16.6361 | 8.8503 | 14.0324 | 15.546 | 19.0 |
75
+ | 1.0382 | 19.0 | 1938 | 1.2676 | 16.7046 | 9.0089 | 14.073 | 15.6342 | 19.0 |
76
+ | 0.9273 | 20.0 | 2040 | 1.2732 | 16.4339 | 8.6714 | 13.8422 | 15.44 | 19.0 |
77
+ | 0.9273 | 21.0 | 2142 | 1.2743 | 16.5655 | 8.7747 | 13.839 | 15.4958 | 19.0 |
78
+ | 0.9273 | 22.0 | 2244 | 1.2781 | 16.7749 | 8.9154 | 14.1216 | 15.6395 | 19.0 |
79
+ | 0.9273 | 23.0 | 2346 | 1.2814 | 16.535 | 8.7436 | 13.971 | 15.5056 | 19.0 |
80
+ | 0.9273 | 24.0 | 2448 | 1.2795 | 16.6612 | 8.7045 | 14.0096 | 15.5692 | 19.0 |
81
+ | 0.8539 | 25.0 | 2550 | 1.2844 | 16.6083 | 8.6106 | 13.9202 | 15.5641 | 19.0 |
82
+ | 0.8539 | 26.0 | 2652 | 1.2817 | 16.6449 | 8.8127 | 14.0562 | 15.5792 | 19.0 |
83
+ | 0.8539 | 27.0 | 2754 | 1.2856 | 16.6185 | 8.7475 | 14.0134 | 15.5439 | 19.0 |
84
+ | 0.8539 | 28.0 | 2856 | 1.2868 | 16.4913 | 8.7293 | 13.9367 | 15.4702 | 19.0 |
85
+ | 0.8539 | 29.0 | 2958 | 1.2905 | 16.4887 | 8.6461 | 13.8893 | 15.4342 | 19.0 |
86
+ | 0.8006 | 30.0 | 3060 | 1.2893 | 16.5861 | 8.695 | 13.9081 | 15.4307 | 19.0 |
87
+ | 0.8006 | 31.0 | 3162 | 1.2919 | 16.5972 | 8.8314 | 13.9069 | 15.4967 | 19.0 |
88
+ | 0.8006 | 32.0 | 3264 | 1.2940 | 16.5957 | 8.789 | 13.9202 | 15.4839 | 19.0 |
89
+ | 0.8006 | 33.0 | 3366 | 1.2946 | 16.6313 | 8.8011 | 13.9684 | 15.5256 | 19.0 |
90
+ | 0.8006 | 34.0 | 3468 | 1.2945 | 16.6711 | 8.8915 | 14.0228 | 15.5394 | 19.0 |
91
+ | 0.7598 | 35.0 | 3570 | 1.2970 | 16.67 | 8.891 | 13.9749 | 15.5174 | 19.0 |
92
+ | 0.7598 | 36.0 | 3672 | 1.2975 | 16.6223 | 8.7522 | 13.9528 | 15.4761 | 19.0 |
93
+ | 0.7598 | 37.0 | 3774 | 1.2987 | 16.6444 | 8.8594 | 13.9567 | 15.5117 | 19.0 |
94
+ | 0.7598 | 38.0 | 3876 | 1.2993 | 16.6444 | 8.8594 | 13.9567 | 15.5117 | 19.0 |
95
+ | 0.7598 | 39.0 | 3978 | 1.2996 | 16.6196 | 8.8108 | 13.9213 | 15.4806 | 19.0 |
96
+ | 0.7463 | 40.0 | 4080 | 1.2997 | 16.6203 | 8.7831 | 13.9116 | 15.4831 | 19.0 |
97
+
98
+
99
+ ### Framework versions
100
+
101
+ - Transformers 4.38.2
102
+ - Pytorch 2.1.0+cu121
103
+ - Datasets 2.18.0
104
+ - Tokenizers 0.15.2
generation_config.json ADDED
@@ -0,0 +1,6 @@
 
 
 
 
 
 
 
1
+ {
2
+ "decoder_start_token_id": 0,
3
+ "eos_token_id": 1,
4
+ "pad_token_id": 0,
5
+ "transformers_version": "4.38.2"
6
+ }
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:8252f805e9eceb9ea5b7c9df3779643f92e45545a779c8c0577a204d09328ba9
3
  size 891644712
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:7fed6700d26c484f3a2bcb21e0cd4c1e534ea58bc11f1f9ae7494752728503bb
3
  size 891644712
runs/Mar08_06-33-01_1ca75f98838e/events.out.tfevents.1709879590.1ca75f98838e.998.0 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:b2676a4b6e6cfced38799461154fed3a7b24a8cd65c4d0a45ae74b61baac97ec
3
- size 27768
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:af1658eb5e35ecec013d37323fc70f47e30f072e40b0043c85dc04223696a7c2
3
+ size 28647