theSOL1 commited on
Commit
3a3b6d8
1 Parent(s): 4e8bda9

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +25 -0
README.md CHANGED
@@ -1,3 +1,28 @@
1
  ---
 
2
  license: mit
 
 
 
3
  ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
+ language: ko
3
  license: mit
4
+ tags:
5
+ - bart
6
+ - grammar
7
  ---
8
+
9
+ # kogrammar-distil
10
+
11
+ Dataset: 국립국어원 맞춤법 교정 말뭉치
12
+ <br>
13
+ <br>
14
+ Backbone Model:
15
+ - [kobart-base-v2](https://huggingface.co/gogamza/kobart-base-v2/blob/main/README.md)
16
+ - [kogrammar-base](https://huggingface.co/theSOL1/kogrammar-base)
17
+ <br>
18
+ GitHub Repo: [SOL1archive/KoGrammar](https://github.com/SOL1archive/KoGrammar)
19
+
20
+ ## Train Method
21
+ 전체 데이터셋 중 약 67.5%를 학습데이터로 활용하여 학습함.
22
+ <br>
23
+ SFT Distillation을 이용해 [kogrammar-base](https://huggingface.co/theSOL1/kogrammar-base) 모델의 Decoder Layer를 6개에서 3개로 줄여 다시 학습시킴.
24
+
25
+ ## Metric
26
+ |BLEU-2|ROUGE-2 F1|
27
+ |-|-|
28
+ |77.8 %|55.0 %|