From 4dd053f6943fc37355d398554f742fb91d0cf271 Mon Sep 17 00:00:00 2001 From: James Betker Date: Sun, 6 Jun 2021 16:56:40 -0600 Subject: [PATCH] Add distributed training guide to docs --- README.md | 5 ++++- 1 file changed, 4 insertions(+), 1 deletion(-) diff --git a/README.md b/README.md index 0c2020cd..059a5b0c 100644 --- a/README.md +++ b/README.md @@ -77,9 +77,12 @@ DLAS comes with some Dataset instances that I have created for my own use. Unles There are currently 3 base scripts for interacting with models. They all take a single parameter, `-opt` which specifies the configuration file which controls how they work. Configs (will be) documented above in the user guide. #### train.py -Starts (or continues) a training session. +Start (or continue) a training session: `python train.py -opt ` +Start a distributed training session: +`python -m torch.distributed.launch --nproc_per_node= --master_port=1234 train.py -o --launcher=pytorch` + #### test.py Runs a model against a validation or test set of data and reports metrics (for now, just PSNR and a custom perceptual metric) `python test.py -opt `