A ResNet-based image classifier for """specific""" images
Tentative Title For A ResNet-Based Image Classifier

This is a simple ResNet-based image classifier for """specific images""", using a training framework similar to the one I use to train VALL-E.

Premise

This was cobbled together in a night, partly to test how well my training framework fares when not married to my VALL-E implementation, and partly to solve a minor problem I recently faced. Since I've been balls deep in learning the ins and outs of making VALL-E work, why not do the exact opposite (a tiny image-classification model over fixed-length inputs) to test both the framework and my knowledge? Thus, this """ambiguous""" project was born.

This is by no means state of the art; it just leverages an existing ResNet architecture provided by torchvision.

Training

  1. Throw the images you want to train under ./data/images/.

  2. Modify the ./data/config.yaml accordingly.

  3. Install using pip3 install -e ./image_classifier/.

  4. Train using python3 -m image_classifier.train yaml='./data/config.yaml'.

  5. Wait.
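The exact schema of ./data/config.yaml depends on the trainer; as a purely hypothetical sketch of the kind of fields involved (every key name here is an assumption — consult the shipped ./data/config.yaml for the real schema):

```yaml
# Hypothetical sketch only; key names are assumptions, not the real schema.
dataset:
  path: ./data/images/
  workers: 0        # see Known Issues: keep at 0 with the local engine backend
trainer:
  backend: deepspeed
```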

Inferencing

Simply invoke the inferencer with the following command: python3 -m image_classifier --path="./data/path-to-your-image.png" yaml="./data/config.yaml" --temp=1.0
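The --temp flag presumably scales the logits before softmax in the usual way (this is the standard meaning of a temperature knob, not something confirmed by this repo's code). A minimal sketch in plain Python:

```python
import math

def softmax_with_temperature(logits, temp=1.0):
    """Divide logits by temp, then softmax. temp=1.0 leaves the logits
    unchanged; lower temperatures sharpen the distribution."""
    scaled = [l / temp for l in logits]
    m = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]

probs = softmax_with_temperature([2.0, 1.0, 0.1], temp=1.0)
predicted = max(range(len(probs)), key=probs.__getitem__)  # index of the top label
```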

Continuous Usage

If you're looking to continuously classify images, use python3 -m image_classifier --listen --port=7860 yaml="./data/config.yaml" --temp=1.0 instead to spin up a light web server using simple_http_server. Send a GET request to http://127.0.0.1:7860/?b64={base64-encoded image string} and a JSON response will be returned with the classified label.
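A hypothetical client for the GET endpoint described above (the helper name is mine; it assumes the server is already running on 127.0.0.1:7860):

```python
import base64
import urllib.parse
import urllib.request

def classify(image_path, host="http://127.0.0.1:7860"):
    """Base64-encode an image file and query the classifier's GET endpoint."""
    with open(image_path, "rb") as f:
        b64 = base64.b64encode(f.read()).decode("ascii")
    # urlencode handles the '+', '/', '=' characters base64 can produce
    url = host + "/?" + urllib.parse.urlencode({"b64": b64})
    with urllib.request.urlopen(url) as resp:
        return resp.read().decode("utf-8")  # JSON body with the classified label
```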

Known Issues

  • Setting dataset.workers higher than 0 will cause issues when using the local engine backend. Use DeepSpeed.
  • Using float16 with the local engine backend will cause instability in the losses. Use DeepSpeed.
  • The web server doesn't emit Content-Type: application/json, nor does it accept JSON POSTs at the moment.

Strawmen

> UGH... Why another training framework!!! Just subjugate DLAS even more!!!

I want code of my own to own. The original VALL-E implementation had a rather nice and clean setup that mostly just made sense. DLAS was a nightmare to comb through for the gorillion models it houses.

> OK. But how do I use it for [thing that isn't the specific use case only I know/care about]?

Simply provide your own symmapping under ./image_classifier/data.py, and be sure to set the delimiter (where exactly is an exercise left to the reader).

Because this is for a very specific use case, I don't really care right now to make this a little more generalized, despite most of the bits and bobs needed for it to generalize being there.

> ur [a slur] for using a ResNet... why not use [CRNN / some other meme arch]??

I don't care; I'd rather keep the copypasting from other people's code to a minimum. Lazily adapting my phoneme tokenizer from my VALL-E implementation into something practically fixed-length by introducing start/stop tokens should be grounds for using a CRNN, or at least something recurrent, but again, I don't care: it just works for my use case at the moment.

> UGH!!! What are you talking about """specific images"""???

ひみつ (it's a secret)

> NOOOO!!!! WHY AREN'T YOU USING [cuck license]???

:)