captcha | ||
scripts | ||
.gitignore | ||
LICENSE | ||
README.md | ||
setup.py |
Tentative Title For A ResNet-Based Image Classifier
This is a simple ResNet based image classifier for """specific images""", using a similar training framework I use to train VALL-E.
Training
-
Throw the images you want to train under
./data/images/
. -
Modify the
./data/config.yaml
accordingly. -
Install using
pip3 install -e ./captcha/
. -
Train using
python3 -m captcha.train yaml='./data/config.yaml'
. -
Wait.
Inferencing
To be implemented.
Caveats
This was cobbled together in a night, partly to test how well my training framework fares when not married to my VALL-E implementation, and partly to solve a problem I have recently faced. Since I've been balls deep in learning the ins and outs of making VALL-E work, why not do the exact opposite (a tiny, image classification model of fixed lengths) to test the framework and my knowledge? Thus, this """ambiguous""" project is born.
This is by no ways state of the art, as it just leverages an existing ResNet arch provided by torchvision
.