forked from mrq/DL-Art-School
Add distributed training guide to docs
This commit is contained in:
parent
2ad2b56438
commit
4dd053f694
|
@ -77,9 +77,12 @@ DLAS comes with some Dataset instances that I have created for my own use. Unles
|
|||
There are currently 3 base scripts for interacting with models. They all take a single parameter, `-opt` which specifies the configuration file which controls how they work. Configs (will be) documented above in the user guide.
|
||||
|
||||
#### train.py
|
||||
Starts (or continues) a training session.
|
||||
Start (or continue) a training session:
|
||||
`python train.py -opt <your_config.yml>`
|
||||
|
||||
Start a distributed training session:
|
||||
`python -m torch.distributed.launch --nproc_per_node=<gpus> --master_port=1234 train.py -o <opt> --launcher=pytorch`
|
||||
|
||||
#### test.py
|
||||
Runs a model against a validation or test set of data and reports metrics (for now, just PSNR and a custom perceptual metric)
|
||||
`python test.py -opt <your_config.yml>`
|
||||
|
|
Loading…
Reference in New Issue
Block a user