File size: 1,097 Bytes
cd8c8f0
 
 
12aeda2
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
9a544eb
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
---
license: mit
---
## Bangla Glove Vectors
This is a collection of pre-trained glove vectors for the Bengali language. You can find details training in [this](https://github.com/sagorbrur/GloVe-Bengali) repository.

This model is build for [bnlp](https://github.com/sagorbrur/bnlp) package.


## Datasets
- [Wikipedia dump datasets](https://dumps.wikimedia.org/bnwiki/latest/)

## Model Details
- wikipedia+crawl_news_articles (39M(39055685) tokens, 0.18M(178152) vocab size, 195.4MB download):
bn_glove.39M.300d.zip

- wikipeida+crawl_news_articles (39M(39055685) tokens, 0.18M(178152) vocab size, 100d vectors, 65MB download):
bn_glove.39M.100d.zip

- Wikipedia (20M(19965328) tokens, 0.13M(134255) vocab, 300d vectors, 145.9MB download):
bn_glove.300d.zip

- Wikipedia (20M(19965328) tokens, 0.13M(134255), 100d vectors, 51.2MB download):
bn_glove.100d.zip

## Usage
```py
from bnlp import BengaliGlove
glove_path = "bn_glove.39M.100d.txt"
word = "গ্রাম"
bng = BengaliGlove()
res = bng.closest_word(glove_path, word)
print(res)
vec = bng.word2vec(glove_path, word)
print(vec)

  ```