legekka commited on
Commit
0b44c0d
1 Parent(s): 85c7947

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +3 -3
README.md CHANGED
@@ -15,9 +15,9 @@ This vits model was trained on the [KTH/hungarian-single-speaker-tts](https://hu
15
  The corpus consists of a single speaker, with 4515 segments extracted from [this](https://librivox.org/egri-csillagok-by-geza-gardonyi/) single LibriVox audiobook. It consists about 10 hours of audio data.
16
 
17
  ## Training
18
- The model was trained on a single RTX 3090 GPU. The training took about 1 day for the first checkpoint (Step 93000).
19
- Based on the quality of the preview model, we are aiming for 250K Steps.
20
 
21
  ## Usage
22
- The model can be used with [JayWalnut's git repo](https://github.com/jaywalnut310/vits), but you have to modify the `text/cleaners.py` file to contain our `hungarian_cleaners` method.
23
  We provided the necessary files in our repo to do so.
 
15
  The corpus consists of a single speaker, with 4515 segments extracted from [this](https://librivox.org/egri-csillagok-by-geza-gardonyi/) single LibriVox audiobook. It consists about 10 hours of audio data.
16
 
17
  ## Training
18
+ The model was trained on a single RTX 3090 GPU for 3 days, 200K steps with a batchsize of 16.
19
+ We saved some checkpoints with the optimizers, so the model could be train further, however we didn't notice any noticable effect after step 150K.
20
 
21
  ## Usage
22
+ The model `diana_final.pth` can be used with [JayWalnut's git repo](https://github.com/jaywalnut310/vits), but you have to modify the `text/cleaners.py` file to contain our `hungarian_cleaners` method.
23
  We provided the necessary files in our repo to do so.