Update README.md
README.md CHANGED
@@ -143,9 +143,6 @@ For instruction training, we first trained the model with Supervised Fine-tuning
 </table>
 
 ## Interfacing with the Instruct Model
-Model weights were converted from the original Mamba2 implementation to be Hugging Face compatible. <br>
-Due to the lack of official support for Mamba2 attention layers in Hugging Face Transformers, custom modeling files are included. <br>
-The attention layer implementation for the modeling files are based on the work from Pull Request #32027 in the Hugging Face Transformers repository: [https://github.com/huggingface/transformers/pull/32027](https://github.com/huggingface/transformers/pull/32027)
 
 > [!IMPORTANT]
 > To ensure optimal performance, please use the following template when interacting with the model:
@@ -199,7 +196,9 @@ The play "Romeo and Juliet" by William Shakespeare is a classic example of a tra
 ```
 
 ## Usage with HuggingFace transformers
-Model weights were converted
|
+Model weights were converted from the original Mamba2 implementation to be Hugging Face compatible. <br>
+Due to the lack of official support for Mamba2 attention layers in Hugging Face Transformers, custom modeling files are included. <br>
+The attention layer implementation for the modeling files is based on the work from Pull Request #32027 in the Hugging Face Transformers repository: [https://github.com/huggingface/transformers/pull/32027](https://github.com/huggingface/transformers/pull/32027)
 
 To speed up inference, we recommend installing mamba-ssm and flash attention 2.
 
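For readers following the added "Usage with HuggingFace transformers" lines, here is a minimal loading sketch. The repository id below is a hypothetical placeholder (the diff does not name the model repo), and `trust_remote_code=True` is what pulls in the custom modeling files the commit describes, since Mamba2 attention layers lack official Transformers support.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# Hypothetical repo id; substitute the actual Hugging Face repository.
model_id = "your-org/your-instruct-model"

# trust_remote_code=True loads the custom modeling files bundled with the
# checkpoint, which implement the Mamba2 attention layers.
tokenizer = AutoTokenizer.from_pretrained(model_id, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,
    device_map="auto",
    trust_remote_code=True,
)
```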
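The recommended prompt template referenced in the `[!IMPORTANT]` note lives in an unchanged part of the README and is not reproduced in this diff. Assuming the tokenizer ships that template as a chat template, a sketch continuing from the loading example above applies it without hand-formatting the prompt:

```python
# Build a prompt with the repo's chat template (assumed to be bundled with
# the tokenizer; the exact template text is defined in the README, not here).
messages = [{"role": "user", "content": "Summarize the plot of Romeo and Juliet."}]
input_ids = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

output = model.generate(input_ids, max_new_tokens=256)
# Decode only the newly generated tokens, skipping the prompt.
print(tokenizer.decode(output[0][input_ids.shape[-1]:], skip_special_tokens=True))
```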
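For the speed-up recommendation, a sketch of the optional fast path. The package names `mamba-ssm` and `flash-attn` come from the README; `causal-conv1d` is an assumption here (mamba-ssm commonly pairs with it), and whether the custom modeling files honor the `attn_implementation` argument is also an assumption worth checking against the repository.

```python
# Optional fast kernels (shell):
#   pip install mamba-ssm flash-attn
#   pip install causal-conv1d  # assumption: commonly used alongside mamba-ssm

# Reload with Flash Attention 2 enabled for the attention layers.
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,
    device_map="auto",
    trust_remote_code=True,
    attn_implementation="flash_attention_2",
)
```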