Does anyone know why the WER does not decay? I'm fine tuning the openai whisper medium model for low resource language?
(https://i.stack.imgur.com/4bmPl.png)
per_device_train_batch_size="32"
per_device_eval_batch_size="16"
learning_rate="1e-5"
Does anyone know why the WER does not decay? I'm fine tuning the openai whisper medium model for low resource language?
(https://i.stack.imgur.com/4bmPl.png)
per_device_train_batch_size="32"
per_device_eval_batch_size="16"
learning_rate="1e-5"
Copyright © 2021 Jogjafile Inc.