Edit model card

VikhrT5-3b: Улучшенная модель на базе FLAN T5 3b для русского языка

Cкорее всего она лучше чем FRED T5XL

Dataset VikhrT5-3b FRED-T5-XL(1.7b) FLAN-t5-xl(3b)
ru_mmlu 0.32 0.252 0.28 (лол2)
xwinograd_ru 0.71 (lol) 0.57 0.52
xnli_ru 0.4280 0.34 0.33
Downloads last month
7
Safetensors
Model size
2.96B params
Tensor type
F32
·
Inference API
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Dataset used to train Vikhrmodels/VikhrT5-3b

Spaces using Vikhrmodels/VikhrT5-3b 2