charmen-electra / README.md
berendg's picture
Update README.md
0541a0e
|
raw
history blame
649 Bytes
# Charmen-Electra
A byte-based transformer model trained on Hungarian language. In order to use the model you will need a custom Tokenizer which is available at: [https://github.com/szegedai/byte-offset-tokenizer](https://github.com/szegedai/byte-offset-tokenizer).
Since we use a custom architecture with Gradient Boosting, Down- and Up-Sampling, you have to enable Trusted Code like:
```python
model = AutoModel.from_pretrained("SzegedAI/charmen-electra", trust_remote_code=True)
```
#Acknowledgement
[![Artificial Intelligence - National Laboratory - Hungary](https://milab.tk.hu/uploads/images/milab_logo_en.png)](https://mi.nemzetilabor.hu/)