
Unofficially reproduced PRepBN-Llama-350M checkpoints for SLAB.

Model Sources

Evaluation

https://github.com/xinghaochen/SLAB/tree/main/llama

python evaluation.py --ckpt <checkpoint-path>
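For scripted runs, the command above can be wrapped in a small Python helper. This is only a minimal sketch: the function names `build_eval_command` and `run_evaluation` are hypothetical, and it assumes the repository has been cloned so that `evaluation.py` sits in a local `SLAB/llama` directory, matching the URL above. Only the `--ckpt` flag shown in the command is used.

```python
import subprocess
import sys
from pathlib import Path


def build_eval_command(ckpt_path: str) -> list:
    """Build the argv for `python evaluation.py --ckpt <checkpoint-path>`."""
    # sys.executable keeps the run inside the current Python environment.
    return [sys.executable, "evaluation.py", "--ckpt", ckpt_path]


def run_evaluation(ckpt_path: str, llama_dir: str = "SLAB/llama") -> int:
    """Run evaluation.py from the cloned repo's llama directory.

    `llama_dir` is an assumed local clone location, not something the
    model card specifies.
    """
    workdir = Path(llama_dir)
    if not (workdir / "evaluation.py").is_file():
        raise FileNotFoundError(f"evaluation.py not found in {workdir}")
    # Resolve the checkpoint path first, because the subprocess runs
    # with a different working directory.
    ckpt_abs = str(Path(ckpt_path).resolve())
    result = subprocess.run(build_eval_command(ckpt_abs), cwd=workdir, check=True)
    return result.returncode
```

Calling `run_evaluation("<checkpoint-path>")` with the real checkpoint location then behaves like running the command by hand inside the `llama` directory.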

Results

BibTeX:

@inproceedings{guo2024slab,
  title={SLAB: Efficient Transformers with Simplified Linear Attention and Progressive Re-parameterized Batch Normalization},
  author={Guo, Jialong and Chen, Xinghao and Tang, Yehui and Wang, Yunhe},
  booktitle={International Conference on Machine Learning},
  year={2024}
}