# Finetuned TorToiSe Models
The `./finetunes/` folder contains a collection of my finetuned models. Each model folder contains:
* the `pickle`'d finetuned model for tortoise-tts
* the LJSpeech-formatted dataset used to train it, also containing:
  - the generated YAML for training, stored in `train.yaml`
  - the openai/whisper output, stored in `whisper.json`
* pre-computed voice latents (using the auto-suggested setting of parsing each chunk at 10 seconds, which seems to be decent)
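
To sanity-check a downloaded model folder against the layout above, something like the following minimal sketch may help. It only assumes the two file names mentioned above (`train.yaml` and `whisper.json`); the function names, the recursive search, and the idea that `whisper.json` holds a top-level object or list are my assumptions for illustration, not something the folders guarantee. The line parser relies only on the LJSpeech convention of `|`-separated fields.

```python
import json
from pathlib import Path


def summarize_model_folder(folder):
    """Report which of the files described above exist in one model folder."""
    folder = Path(folder)
    # Search recursively, since the exact nesting of the dataset is an assumption.
    found = {p.name: p for p in folder.rglob("*") if p.is_file()}
    summary = {
        "has_train_yaml": "train.yaml" in found,
        "has_whisper_json": "whisper.json" in found,
        "whisper_entries": None,
    }
    if summary["has_whisper_json"]:
        with open(found["whisper.json"], encoding="utf-8") as f:
            data = json.load(f)
        # Only count top-level entries; the schema itself is not documented here.
        if isinstance(data, (dict, list)):
            summary["whisper_entries"] = len(data)
    return summary


def parse_ljspeech_line(line):
    """Split one pipe-delimited LJSpeech-style line into (audio id, transcript)."""
    parts = line.rstrip("\n").split("|")
    if len(parts) < 2:
        raise ValueError(f"expected at least two '|'-separated fields: {line!r}")
    return parts[0], parts[1]
```

Calling `summarize_model_folder` on one of the folders under `./finetunes/` (the exact path depends on the model) returns a small dict, which makes it easy to spot a folder that is missing its training config or transcription output.
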

Most of these were quickly trained on either my dedicated system (2x 6800 XTs) or my personal system (1x 2060), with a learning rate of `1e-4` for about 200 epochs each. That is enough for acceptable results, and they are mainly here to provide some examples. In the future, I'll retrain these at lower LRs to compare.
## Model List
* Harry Mason (Silent Hill)
* James Sunderland (Silent Hill 2)
* Mitsuru Kirijo (Persona 3)
* Melina (Elden Ring)
* Japanese
### Planned
* Patrick Bateman (American Psycho)
* Shadow, Rouge, and Knuckles (Sonic Adventure 2)