xls-r_cv_ur / vocab.json
hadiqa123's picture
add tokenizer
697b7f6
raw
history blame
No virus
202 Bytes
{
" ": 11,
"[PAD]": 16,
"[UNK]": 15,
"ا": 4,
"ب": 5,
"ت": 14,
"د": 13,
"ز": 12,
"س": 3,
"ل": 9,
"م": 7,
"ڑ": 8,
"ک": 2,
"ھ": 10,
"ہ": 6,
"ی": 0,
"ے": 1
}