fishaudio
/

fish-speech-1.2-sft

Model card Files Files and versions Community

Edit model card

Fish Speech V1.2

Fish Speech V1.2 is a leading text-to-speech (TTS) model trained on 300k hours of English, Chinese, and Japanese audio data.

Please refer to Fish Speech Github for more info.
Demo available at Fish Audio.

Citation

If you found this repository useful, please consider citing this work:

@misc{fish-speech-v1,
  author = {Shijia Liao, Tianyu Li},
  title = {Fish Speech V1},
  year = {2024},
  publisher = {GitHub},
  journal = {GitHub repository},
  howpublished = {\url{https://github.com/fishaudio/fish-speech}}
}

License

This model is permissively licensed under the BY-CC-NC-SA-4.0 license. The source code is released under BY-CC-NC-SA-4.0 license.

Downloads last month: 1,966

Inference API

This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Spaces using fishaudio/fish-speech-1.2-sft 2