EleutherAI
/

polyglot-ko-1.3b

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

jason9693 commited on Sep 15, 2022

Commit

515df60

•

1 Parent(s): 03fc19a

update metric score

Files changed (1) hide show

README.md +4 -15

README.md CHANGED Viewed

@@ -72,11 +72,11 @@ As with all language models, it is hard to predict in advance how GPT-NeoX-Ko wi
 <figure>
-|  Model                   | Public      | Training FLOPs | LAMBADA PPL ↓ | LAMBADA Acc ↑ | Winogrande ↑ | Hellaswag ↑ | PIQA ↑    | Dataset Size (GB) |
 |--------------------------|-------------|----------------|---            |---            |---           |---          |---        |-------------------|
-| KoGPT-trinity&ddagger;   | &cross;     | -----          | 3.0           | 75%           | 72%          | 78%         | 80%       | -----             |
-| KoGPT-KakaoBrain&ddagger;   | &cross;     | -----          | 3.0           | 75%           | 72%          | 78%         | 80%       | -----             |
-| GPT-NeoX-Ko-1.3B(ours)&ddagger;   | &cross;     | -----          | 3.0           | 75%           | 72%          | 78%         | 80%       | -----             |
 <figcaption><p>Models roughly sorted by performance, or by FLOPs if not available.</p>
@@ -111,17 +111,6 @@ To cite this model:
 }
 ```
-To cite the codebase that trained this model:
-```bibtex
-@misc{mesh-transformer-jax,
-  author = {Wang, Ben},
-  title = {{Mesh-Transformer-JAX: Model-Parallel Implementation of Transformer Language Model with JAX}},
-  howpublished = {\url{https://github.com/kingoflolz/mesh-transformer-jax}},
-  year = 2021,
-  month = May
-}
-```
 If you use this model, we would love to hear about it! Reach out on [GitHub](https://github.com/kingoflolz/mesh-transformer-jax), Discord, or shoot Ben an email.
 ## Acknowledgements

 <figure>
+|  Model                   | Public      | Training FLOPs | kobest_boolq ↓ | kobest_copa ↑ | kobest_wic ↑ | kobest_hellaswag ↑ | kobest_sentineg ↑    | Dataset Size (GB) |
 |--------------------------|-------------|----------------|---            |---            |---           |---          |---        |-------------------|
+| KoGPT-trinity&ddagger;   | &cross;     | -----          | 0.6663           | 0.6222           | 0.656          | 0.4011         | 0.3534       | -----             |
+| KoGPT-KakaoBrain&ddagger;   | &cross;     | -----          | 0.3241           | 0.719           | 0.1356          | 0.4616         | 0.8065       | -----             |
+| GPT-NeoX-Ko-1.3B(ours)&ddagger;   | &cross;     | -----          | 0.5174           | 0.7072           | 0.6567          | 0.417         | 0.8444       | -----             |
 <figcaption><p>Models roughly sorted by performance, or by FLOPs if not available.</p>
 }
 ```
 If you use this model, we would love to hear about it! Reach out on [GitHub](https://github.com/kingoflolz/mesh-transformer-jax), Discord, or shoot Ben an email.
 ## Acknowledgements