milmor commited on
Commit
c8310b0
1 Parent(s): 3a806c4

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -52,7 +52,7 @@ Since the Axolotl corpus contains misaligments, we just select the best samples
52
 
53
 
54
  ### Model and training
55
- We employ two training-stages using a multilingual T5-small. This model was chosen because it can handle different vocabularies and suffixes. The model is pretrained on different tasks and lenguages (French, Romanian, English, German).
56
 
57
  ### Training-stage 1 (learning Spanish)
58
  In training stage 1 we first introduce Spanish to the model. The objective is to learn a new language rich in data (Spanish) and not lose the previous knowledge acquired. We use the English-Spanish [Anki](https://www.manythings.org/anki/) dataset, which consists of 118.964 text pairs. We train the model till convergence adding the suffix "Translate Spanish to English: ".
 
52
 
53
 
54
  ### Model and training
55
+ We employ two training-stages using a multilingual T5-small. This model was chosen because it can handle different vocabularies and suffixes. The model is pretrained on different tasks and languages (French, Romanian, English, German).
56
 
57
  ### Training-stage 1 (learning Spanish)
58
  In training stage 1 we first introduce Spanish to the model. The objective is to learn a new language rich in data (Spanish) and not lose the previous knowledge acquired. We use the English-Spanish [Anki](https://www.manythings.org/anki/) dataset, which consists of 118.964 text pairs. We train the model till convergence adding the suffix "Translate Spanish to English: ".