gmongaras's picture
Update README.md
ce2f031 verified
|
raw
history blame contribute delete
No virus
215 Bytes
metadata
license: mit
pipeline_tag: feature-extraction

This repository contains the GPT-Soft-300m model described in Cottention: Linear Transformers With Cosine Attention.