diff --git a/docs/models.md b/docs/models.md index c45030f..2d5e0d7 100644 --- a/docs/models.md +++ b/docs/models.md @@ -33,6 +33,16 @@ This script implements the core underlying model for VALL-E. This handle: This script aims to implement everything as required per VALL-E agnostically, to allow for different implementations to contain little extra code. +### Tasks + +The base model handles processing inputs into token sequences, per the requested task assigned to each input in a batch. + +Most sequences follow a `` sequence, but some tasks will receive the prompt as a list of tensors, instead. + +The length predictor `len` task will naively output the length in base-10 followed by a stop token. + +Speech-To-Text will follow a reverse sequence of `