view article Article Training and Finetuning Embedding Models with Sentence Transformers v3 May 28 • 140
Llama 3.1 GPTQ, AWQ, and BNB Quants Collection Optimised Quants for high-throughput deployments! Compatible with Transformers, TGI & VLLM 🤗 • 9 items • Updated Jul 24 • 44
Qwen2 Collection Qwen2 language models, including pretrained and instruction-tuned models of 5 sizes, including 0.5B, 1.5B, 7B, 57B-A14B, and 72B. • 35 items • Updated 29 days ago • 313
EVIDENT PlatVR [datasets] Collection This work is supported by the Ministry of Industry, Trade and Tourism, Spain (AEI-010500-2023-280). • 3 items • Updated Apr 17 • 1
EVIDENT PlatVR [models] Collection This work is supported by the Ministry of Industry, Trade and Tourism, Spain (AEI-010500-2023-280). • 3 items • Updated Apr 17 • 1
MT5 release Collection The MT5 release follows the T5 family, but is pretrained on multilingual data. The update UMT5 models are pretrained on an updated corpus. • 10 items • Updated Jul 31 • 14
Flan-T5 release Collection The Flan-T5 covers 4 checkpoints of different sizes each time. It also includes upgrades versions trained using Universal sampling • 7 items • Updated Jul 31 • 18
T5 release Collection The original T5 transformer release was done in two steps, the original T5 checkpoints and the improved T5v1 • 9 items • Updated Jul 31 • 11
ELECTRA release Collection This collection regroups the ELECTRA models released by the Google team. • 6 items • Updated Jul 31 • 7
ALBERT release Collection The ALBERT release was done in two steps, over 4 checkpoints of different sizes each time. The first version is noted as "v1", the second as "v2". • 8 items • Updated Jul 31 • 5
Gemma release Collection Groups the Gemma models released by the Google team. • 40 items • Updated Jul 31 • 324
BERT release Collection Regroups the original BERT models released by the Google team. Except for the models marked otherwise, the checkpoints support English. • 8 items • Updated Jul 31 • 18
Parakeet Collection NeMo Parakeet ASR Models attain strong speech recognition accuracy while being efficient for inference. Available in CTC and RNN-Transducer variants. • 7 items • Updated Jul 17 • 16
Nemotron 3 8B Collection The Nemotron 3 8B Family of models is optimized for building production-ready generative AI applications for the enterprise. • 5 items • Updated Jul 17 • 43