patnelt60 commited on
Commit
1c780d6
1 Parent(s): 20defc1

Training completed!

Browse files
Files changed (5) hide show
  1. README.md +15 -15
  2. config.json +1 -1
  3. pytorch_model.bin +1 -1
  4. tokenizer_config.json +4 -0
  5. training_args.bin +2 -2
README.md CHANGED
@@ -1,6 +1,6 @@
1
  ---
2
  license: apache-2.0
3
- base_model: distilbert-base-uncased
4
  tags:
5
  - generated_from_trainer
6
  datasets:
@@ -22,7 +22,7 @@ model-index:
22
  metrics:
23
  - name: Accuracy
24
  type: accuracy
25
- value: 0.8461290322580645
26
  ---
27
 
28
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -30,10 +30,10 @@ should probably proofread and complete it, then remove this comment. -->
30
 
31
  # distilbert-base-uncased-finetuned-clinc
32
 
33
- This model is a fine-tuned version of [distilbert-base-uncased](https://huggingface.co/distilbert-base-uncased) on the clinc_oos dataset.
34
  It achieves the following results on the evaluation set:
35
- - Loss: 1.2616
36
- - Accuracy: 0.8461
37
 
38
  ## Model description
39
 
@@ -64,16 +64,16 @@ The following hyperparameters were used during training:
64
 
65
  | Training Loss | Epoch | Step | Validation Loss | Accuracy |
66
  |:-------------:|:-----:|:----:|:---------------:|:--------:|
67
- | No log | 1.0 | 40 | 2.8668 | 0.1906 |
68
- | No log | 2.0 | 80 | 2.5167 | 0.5610 |
69
- | No log | 3.0 | 120 | 2.1945 | 0.6932 |
70
- | No log | 4.0 | 160 | 1.9222 | 0.7497 |
71
- | No log | 5.0 | 200 | 1.7033 | 0.7877 |
72
- | No log | 6.0 | 240 | 1.5338 | 0.8090 |
73
- | No log | 7.0 | 280 | 1.4084 | 0.8265 |
74
- | 2.1094 | 8.0 | 320 | 1.3259 | 0.8339 |
75
- | 2.1094 | 9.0 | 360 | 1.2770 | 0.8439 |
76
- | 2.1094 | 10.0 | 400 | 1.2616 | 0.8461 |
77
 
78
 
79
  ### Framework versions
 
1
  ---
2
  license: apache-2.0
3
+ base_model: patnelt60/distilbert-base-uncased-finetuned-clinc
4
  tags:
5
  - generated_from_trainer
6
  datasets:
 
22
  metrics:
23
  - name: Accuracy
24
  type: accuracy
25
+ value: 0.9267741935483871
26
  ---
27
 
28
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 
30
 
31
  # distilbert-base-uncased-finetuned-clinc
32
 
33
+ This model is a fine-tuned version of [patnelt60/distilbert-base-uncased-finetuned-clinc](https://huggingface.co/patnelt60/distilbert-base-uncased-finetuned-clinc) on the clinc_oos dataset.
34
  It achieves the following results on the evaluation set:
35
+ - Loss: 0.1904
36
+ - Accuracy: 0.9268
37
 
38
  ## Model description
39
 
 
64
 
65
  | Training Loss | Epoch | Step | Validation Loss | Accuracy |
66
  |:-------------:|:-----:|:----:|:---------------:|:--------:|
67
+ | No log | 1.0 | 40 | 0.4572 | 0.8619 |
68
+ | No log | 2.0 | 80 | 0.3775 | 0.8881 |
69
+ | No log | 3.0 | 120 | 0.3184 | 0.9013 |
70
+ | No log | 4.0 | 160 | 0.2753 | 0.9110 |
71
+ | No log | 5.0 | 200 | 0.2441 | 0.9187 |
72
+ | No log | 6.0 | 240 | 0.2224 | 0.9232 |
73
+ | No log | 7.0 | 280 | 0.2073 | 0.9248 |
74
+ | 0.3426 | 8.0 | 320 | 0.1982 | 0.9268 |
75
+ | 0.3426 | 9.0 | 360 | 0.1923 | 0.9265 |
76
+ | 0.3426 | 10.0 | 400 | 0.1904 | 0.9268 |
77
 
78
 
79
  ### Framework versions
config.json CHANGED
@@ -1,5 +1,5 @@
1
  {
2
- "_name_or_path": "distilbert-base-uncased",
3
  "activation": "gelu",
4
  "architectures": [
5
  "DistilBertForSequenceClassification"
 
1
  {
2
+ "_name_or_path": "patnelt60/distilbert-base-uncased-finetuned-clinc",
3
  "activation": "gelu",
4
  "architectures": [
5
  "DistilBertForSequenceClassification"
pytorch_model.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:567ced3837dd281b7962fe34838fcff571fb2b542a4beeabd07ecd4162908478
3
  size 268314282
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:73bfbfe0077d1362bdfd8b7e66536f94930e12400570f19b3b5fd922ad4e5ed7
3
  size 268314282
tokenizer_config.json CHANGED
@@ -3,11 +3,15 @@
3
  "cls_token": "[CLS]",
4
  "do_lower_case": true,
5
  "mask_token": "[MASK]",
 
6
  "model_max_length": 512,
7
  "pad_token": "[PAD]",
8
  "sep_token": "[SEP]",
 
9
  "strip_accents": null,
10
  "tokenize_chinese_chars": true,
11
  "tokenizer_class": "DistilBertTokenizer",
 
 
12
  "unk_token": "[UNK]"
13
  }
 
3
  "cls_token": "[CLS]",
4
  "do_lower_case": true,
5
  "mask_token": "[MASK]",
6
+ "max_length": 512,
7
  "model_max_length": 512,
8
  "pad_token": "[PAD]",
9
  "sep_token": "[SEP]",
10
+ "stride": 0,
11
  "strip_accents": null,
12
  "tokenize_chinese_chars": true,
13
  "tokenizer_class": "DistilBertTokenizer",
14
+ "truncation_side": "right",
15
+ "truncation_strategy": "longest_first",
16
  "unk_token": "[UNK]"
17
  }
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:cae2c5b528d2fadf0f1c803519efd155c925ee775c81bcc6db5977170fe63efc
3
- size 4600
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f8a870ebbd9d0512d0511e29138cf6ac696c1406020acd043129c6e9b6afcce2
3
+ size 4536