Commit 227e029 by edwko (1 parent: 0d376fc)

Update README.md

  1. README.md +1 -1
README.md CHANGED
@@ -198,7 +198,7 @@ The play "Romeo and Juliet" by William Shakespeare is a classic example of a tra
  ## Usage with HuggingFace transformers
  Model weights were converted from the original Mamba2 implementation to be Hugging Face compatible. <br>
  Due to the lack of official support for Mamba2 attention layers in Hugging Face Transformers, custom modeling files are included. <br>
- The attention layer implementation for the modeling files are based on the work from Pull Request #32027 in the Hugging Face Transformers repository: [https://github.com/huggingface/transformers/pull/32027](https://github.com/huggingface/transformers/pull/32027)
+ The implementation of Mamba2 with attention in the modeling files comes from Pull Request #32027 in the Hugging Face Transformers repository: [https://github.com/huggingface/transformers/pull/32027](https://github.com/huggingface/transformers/pull/32027)
 
  To speed up inference, we recommend installing mamba-ssm and flash attention 2.
 