File size: 215 Bytes
ce2f031 |
1 2 3 4 5 6 |
---
license: mit
pipeline_tag: feature-extraction
---
This repository contains the GPT-Soft-300m model described in [Cottention: Linear Transformers With Cosine Attention](https://huggingface.co/papers/2409.18747). |