prithivida committed on
Commit
123b3b5
1 Parent(s): 2f80623

Update README.md

  1. README.md +1 -1
README.md CHANGED
@@ -158,7 +158,7 @@ for query, query_embedding in zip(queries, query_embeddings):
 # FAQs:
 
 #### How can I reduce overall inference cost ?
-- You can host these models without heavy torch dependency using the ONNX flavours of these models via [FlashRetrieve](https://github.com/PrithivirajDamodaran/FlashRetrieve) library.
+- You can host these models without heavy torch dependency using the ONNX flavours of these models via [FlashEmbed](https://github.com/PrithivirajDamodaran/flashembed) library.
 
 #### How do I reduce vector storage cost ?
 [Use Binary and Scalar Quantisation](https://huggingface.co/blog/embedding-quantization)
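
The vector storage FAQ entry in this README only links out, so here is a minimal, dependency-free sketch of what binary and scalar quantisation do to a single embedding. It is an illustration under assumptions, not the method from the linked post or from any library: the function names, the fixed `[-1.0, 1.0]` calibration range, and the 384-dimension example are all hypothetical choices made for this sketch.

```python
def binary_quantize(vector):
    """Keep only the sign of each dimension, packed 8 dimensions per byte."""
    packed = bytearray((len(vector) + 7) // 8)
    for i, value in enumerate(vector):
        if value > 0:
            # Most significant bit first within each byte.
            packed[i // 8] |= 1 << (7 - i % 8)
    return bytes(packed)


def scalar_quantize(vector, lo, hi):
    """Map each value linearly from [lo, hi] onto one byte (0..255).

    lo and hi are an assumed calibration range; values outside it are clamped.
    """
    scale = (hi - lo) / 255 or 1.0  # avoid dividing by zero when hi == lo
    return bytes(min(255, max(0, round((v - lo) / scale))) for v in vector)


def hamming_distance(a, b):
    """Count differing bits between two binary-quantised vectors."""
    return sum(bin(x ^ y).count("1") for x, y in zip(a, b))


# Hypothetical 384-dimensional embedding, 4 bytes per dimension as float32.
embedding = [0.1 if i % 2 == 0 else -0.1 for i in range(384)]
float32_bytes = 4 * len(embedding)
binary_code = binary_quantize(embedding)
scalar_code = scalar_quantize(embedding, lo=-1.0, hi=1.0)

print(float32_bytes, len(scalar_code), len(binary_code))
```

Scalar quantisation stores one byte per dimension instead of four, and binary quantisation stores one bit per dimension, compared with Hamming distance rather than cosine similarity. Both trade some retrieval accuracy for the smaller footprint, so it is worth measuring the effect on your own data before adopting either.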