Trained a new model with the Japanese tokenizer, and after ~55 epochs (~825000 samples processed),…
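As a quick sanity check on those figures, the implied number of samples per epoch can be worked out directly. This is only back-of-the-envelope arithmetic on the approximate numbers quoted in the post; the helper name `samples_per_epoch` is made up for illustration, and it assumes every epoch processes the same number of samples.

```python
def samples_per_epoch(total_samples: int, epochs: int) -> float:
    """Average number of samples seen in one epoch (assumes equal-sized epochs)."""
    return total_samples / epochs


# Approximate figures from the post: ~825000 samples over ~55 epochs.
print(samples_per_epoch(825_000, 55))  # 15000.0
```

So the run above corresponds to roughly 15000 samples per pass over the data, if the quoted numbers are taken at face value.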
Would you say that the minimum would need to be around 5000 to avoid overfitting, then?
That is really hard to say. It really depends on dataset entropy and variance... I usually don't go below…
I agree, an LR finder would be awesome. I did try very low learning rates, but the model stays stuck at pretty high values (around 2) and doesn't go down any further (I've read somewhere that…
Hey! Thank you. Yes, indeed, I'm just trying to refine an English-speaking model for a game character. Thank you for the finetuned model. If I understand correctly, James Becker took the feminine…
Is there a supported method in the GUI for testing out different training settings and switching between the resulting finetunes?
No, not really. It would be nice to have a learning rate finder, but…
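For anyone unfamiliar with the idea, here is a minimal sketch of what a learning rate finder (an "LR range test") does. It is a pure-Python toy on a one-parameter regression problem, not a feature of the GUI discussed here and not tied to any real training code; the names `lr_range_test` and `suggest_lr`, the toy data, and the divergence threshold are all assumptions made for illustration. It uses the full dataset at every step so the loss curve is noise-free; real implementations work on mini-batches and usually smooth the loss first.

```python
import math


def lr_range_test(lr_min=1e-5, lr_max=10.0, steps=100, diverge_factor=4.0):
    """Toy LR range test: fit y = 3x with gradient descent while the
    learning rate grows exponentially from lr_min to lr_max."""
    xs = [i / 50 for i in range(-50, 51)]   # fixed toy inputs in [-1, 1]
    data = [(x, 3.0 * x) for x in xs]       # targets generated with w = 3
    w = 0.0                                 # single trainable weight
    growth = (lr_max / lr_min) ** (1.0 / (steps - 1))
    lr = lr_min
    best_loss = math.inf
    history = []                            # (lr, loss measured before the update)
    for _ in range(steps):
        loss = sum((w * x - y) ** 2 for x, y in data) / len(data)
        # Stop once the loss blows up relative to the best value seen so far.
        if not math.isfinite(loss) or loss > diverge_factor * best_loss:
            break
        best_loss = min(best_loss, loss)
        history.append((lr, loss))
        grad = sum(2 * (w * x - y) * x for x, y in data) / len(data)
        w -= lr * grad                      # one gradient descent step at this LR
        lr *= growth                        # exponential LR schedule
    return history


def suggest_lr(history):
    """Return the LR whose update produced the steepest drop in loss."""
    best_i = min(range(1, len(history)),
                 key=lambda i: history[i][1] - history[i - 1][1])
    return history[best_i - 1][0]


history = lr_range_test()
print(f"tested {len(history)} learning rates, suggested LR: {suggest_lr(history):.4g}")
```

The usual way to read the result is to plot loss against learning rate on a log axis and pick a value somewhat below the point where the loss starts to climb; the "steepest drop" rule used here is just one common heuristic.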
Depends on what you are trying to achieve - when I finetune to a new language I have a dataset of roughly 500k clips (about 1000 hours of audio) - James Becker (the original creator of tortoise…