Automatic Speech Recognition
Transformers
Safetensors
Japanese
whisper
Inference Endpoints
Edit model card

How to use:

audio_file ="path/to/audio_file"

model = "sin2piusc/whisper-medium-JP-5k"

pipe = pipeline("automatic-speech-recognition", "model")

pipe(audio_file, generate_kwargs={"language": "japanese", "task":"translate"})

-- or

client = InferenceClient(model, token="hf_token")

client.automatic_speech_recognition(audio_file).text

--

Make sure to set task and language.

--

--Original sentence:

集団内のすべての個体が特定の表現形質に関して同一である場合 それらは単形性と呼ばれます。

When all individuals in a population are identical with respect to a particular phenotypic trait, they are called monomorphic.

--sin2piusc/whisper-medium-5k:

集団内のすべての個体が特定の表現形質に関して同一である場合 それらは単形性と呼ばれます

When all individuals in a population are identical with respect to a particular phenotypic trait, they are called monomorphic.

--openai/whisper-medium:

集団内のすべての個体が特定の表現形式に関して同一である場合、それらは単形性と呼ばれます。

If all individuals in a population are identical with respect to a particular form of expression, they are called monomorphic.

Downloads last month
43
Safetensors
Model size
764M params
Tensor type
F32
·
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Datasets used to train sin2piusc/whisper-medium-JP-5k