edwko committed on
Commit
4e3820e
1 Parent(s): 4ae1472

Update README.md

Files changed (1)
  1. README.md +2 -2
README.md CHANGED
@@ -143,8 +143,8 @@ For instruction training, we first trained the model with Supervised Fine-tuning
  </table>
 
  ## Interfacing with the Instruct Model
- Model weights were converted from the original Mamba2 implementation to be Hugging Face compatible.
- Due to the lack of official support for Mamba2 attention layers in Hugging Face Transformers, custom modeling files are included.
+ Model weights were converted from the original Mamba2 implementation to be Hugging Face compatible. <br>
+ Due to the lack of official support for Mamba2 attention layers in Hugging Face Transformers, custom modeling files are included. <br>
  The attention layer implementation is based on the work from Pull Request #32027 in the Hugging Face Transformers repository: [https://github.com/huggingface/transformers/pull/32027](https://github.com/huggingface/transformers/pull/32027)
 
  > [!IMPORTANT]