---
datasets:
- Bingsu/zeroth-korean
language:
- ko
metrics:
- cer
- wer
base_model:
- openai/whisper-large-v3-turbo
pipeline_tag: automatic-speech-recognition
---

## Description

Whisper Large V3 Turbo fine-tuned on the Zeroth-Korean dataset.

## Dataset split:

- The original Zeroth-Korean test set is split in half: 50% validation, 50% test
- Train set duration: 206 hours 43 minutes
- Validation set duration: 2 hours 22 minutes
- Test set duration: 2 hours 22 minutes

## Results:

- Initial validation WER: 26.26%
- Final validation WER: 4.90%
- Initial validation CER: 6.67%
- Final validation CER: 1.78%
- Initial test WER: 26.75%
- Final test WER: 4.89%
- Initial test CER: 7.58%
- Final test CER: 2.06%

## Notes

- Training did not converge, so better results are possible with further training.
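
## Metric sketch

For readers unfamiliar with the two reported metrics, here is a minimal sketch of how WER and CER are commonly defined: the edit distance between reference and hypothesis, divided by the reference length, counted over words or characters. This card does not state which library or text normalization produced the numbers above, so the function names (`edit_distance`, `wer`, `cer`) and the choice to count spaces as characters are assumptions for illustration only, and results from this sketch may not match the reported figures exactly.

```python
from typing import List, Sequence


def edit_distance(ref: Sequence, hyp: Sequence) -> int:
    """Levenshtein distance: minimum substitutions, insertions, and deletions."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(
                min(
                    prev[j] + 1,         # delete a reference token
                    curr[j - 1] + 1,     # insert a hypothesis token
                    prev[j - 1] + cost,  # substitute, or match at no cost
                )
            )
        prev = curr
    return prev[-1]


def wer(references: List[str], hypotheses: List[str]) -> float:
    """Word error rate over a corpus; words are split on whitespace."""
    errors = sum(edit_distance(r.split(), h.split()) for r, h in zip(references, hypotheses))
    total = sum(len(r.split()) for r in references)
    return errors / total


def cer(references: List[str], hypotheses: List[str]) -> float:
    """Character error rate over a corpus.

    Assumption: spaces count as characters. Other tools may normalize differently.
    """
    errors = sum(edit_distance(r, h) for r, h in zip(references, hypotheses))
    total = sum(len(r) for r in references)
    return errors / total


if __name__ == "__main__":
    # Made-up sentences for illustration, not taken from the dataset.
    refs = ["오늘 날씨가 좋다"]
    hyps = ["오늘 날씨는 좋다"]
    print(f"WER: {wer(refs, hyps):.2%}")
    print(f"CER: {cer(refs, hyps):.2%}")
```

A single wrong particle changes one word out of three but only one character out of nine, which is why CER is typically much lower than WER for Korean, as in the results above.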