OuteAI
/

Lite-Oute-2-Mamba2Attn-250M-Instruct

Model card Files Files and versions Community

edwko commited on 29 days ago

Commit

4e3820e

•

1 Parent(s): 4ae1472

Update README.md

Files changed (1) hide show

README.md +2 -2

README.md CHANGED Viewed

@@ -143,8 +143,8 @@ For instruction training, we first trained the model with Supervised Fine-tuning
 </table>
 ## Interfacing with the Instruct Model
-Model weights were converted from the original Mamba2 implementation to be Hugging Face compatible.
-Due to the lack of official support for Mamba2 attention layers in Hugging Face Transformers, custom modeling files are included.
 The attention layer implementation is based on the work from Pull Request #32027 in the Hugging Face Transformers repository: [https://github.com/huggingface/transformers/pull/32027](https://github.com/huggingface/transformers/pull/32027)
 > [!IMPORTANT]

 </table>
 ## Interfacing with the Instruct Model
+Model weights were converted from the original Mamba2 implementation to be Hugging Face compatible. <br>
+Due to the lack of official support for Mamba2 attention layers in Hugging Face Transformers, custom modeling files are included. <br>
 The attention layer implementation is based on the work from Pull Request #32027 in the Hugging Face Transformers repository: [https://github.com/huggingface/transformers/pull/32027](https://github.com/huggingface/transformers/pull/32027)
 > [!IMPORTANT]