LOLA: Large and Open Source Multilingual Language Model

Model Description

How to Get Started with the Model

This model is pre-trained with a causal language modeling objective. Out of the box it supports text generation only; any other downstream task requires further fine-tuning.

How to use

You can use this model directly with a pipeline for text generation. The example passes trust_remote_code=True because the model repository contains custom code.

>>> from transformers import pipeline

>>> generator = pipeline('text-generation', model="dice-research/lola_v1", trust_remote_code=True)
>>> generator("The quick brown fox", max_length=13)
[{'generated_text': 'The quick brown fox jumps over the lazy dog.'}]

To use top-k sampling, set do_sample to True.
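For intuition about what top-k sampling does once sampling is enabled, here is a minimal pure-Python sketch. It is not the transformers implementation: the function name top_k_sample and the example logits are hypothetical and chosen only for illustration. The idea is to keep the k highest-scoring tokens, renormalize their probabilities with a softmax, and draw one token at random from that reduced set.

```python
import math
import random


def top_k_sample(logits, k, rng):
    """Sample one token index from the k highest-scoring logits (illustrative helper)."""
    if not 1 <= k <= len(logits):
        raise ValueError("k must be between 1 and len(logits)")
    # Keep the indices of the k largest logits.
    top = sorted(range(len(logits)), key=lambda i: logits[i], reverse=True)[:k]
    # Softmax over the surviving logits only (shifted by the max for stability).
    peak = max(logits[i] for i in top)
    weights = [math.exp(logits[i] - peak) for i in top]
    # Draw one index in proportion to its renormalized probability.
    return rng.choices(top, weights=weights, k=1)[0]


# Hypothetical scores for a 5-token vocabulary.
logits = [2.0, 0.5, 1.5, -1.0, 0.1]
rng = random.Random(0)
samples = [top_k_sample(logits, k=2, rng=rng) for _ in range(100)]
```

With k=2, only the two highest-scoring indices (0 and 2 here) can ever be drawn. In the pipeline call above, the corresponding generation arguments in transformers are do_sample=True together with top_k, for example generator("The quick brown fox", max_length=13, do_sample=True, top_k=50); the value 50 is an arbitrary illustration, not a setting recommended by the model authors.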

Note: The model uses the tokenizer from mGPT (https://github.com/ai-forever/mgpt).

Training Details

Training Framework

Pretraining Dataset

LOLA v1 Training:

Model size: 7.46B params (Safetensors, tensor type F32)