Training error: RuntimeError: Could not infer dtype of NoneType #6

New Issue

Bluebomber182 · 2023-08-27T00:49:08Z

Bluebomber182 commented

2023-08-27 00:49:08 +00:00

I get this error message when I try training a model. Here is the zip file containing the files causing the errors
https://files.catbox.moe/q9shx6.zip

Traceback (most recent call last):
File "/usr/lib/python3.10/runpy.py", line 196, in _run_module_as_main
return _run_code(code, main_globals, None,
File "/usr/lib/python3.10/runpy.py", line 86, in _run_code
exec(code, run_globals)
File "/home/user/ai-voice-cloning/modules/vall-e/vall_e/train.py", line 170, in
main()
File "/home/user/ai-voice-cloning/modules/vall-e/vall_e/train.py", line 163, in main
trainer.train(
File "/home/user/ai-voice-cloning/modules/vall-e/vall_e/utils/trainer.py", line 249, in train
for batch in _make_infinite_epochs(train_dl):
File "/home/user/ai-voice-cloning/modules/vall-e/vall_e/utils/trainer.py", line 199, in _make_infinite_epochs
yield from tqdm(dl, "Epoch progress", dynamic_ncols=True)
File "/home/user/ai-voice-cloning/venv/lib/python3.10/site-packages/tqdm/std.py", line 1182, in iter
for obj in iterable:
File "/home/user/ai-voice-cloning/venv/lib/python3.10/site-packages/torch/utils/data/dataloader.py", line 633, in next
data = self._next_data()
File "/home/user/ai-voice-cloning/venv/lib/python3.10/site-packages/torch/utils/data/dataloader.py", line 1325, in _next_data
return self._process_data(data)
File "/home/user/ai-voice-cloning/venv/lib/python3.10/site-packages/torch/utils/data/dataloader.py", line 1371, in _process_data
data.reraise()
File "/home/user/ai-voice-cloning/venv/lib/python3.10/site-packages/torch/_utils.py", line 644, in reraise
raise exception
RuntimeError: Caught RuntimeError in DataLoader worker process 3.
Original Traceback (most recent call last):
File "/home/user/ai-voice-cloning/venv/lib/python3.10/site-packages/torch/utils/data/_utils/worker.py", line 308, in _worker_loop
data = fetcher.fetch(index)
File "/home/user/ai-voice-cloning/venv/lib/python3.10/site-packages/torch/utils/data/_utils/fetch.py", line 51, in fetch
data = [self.dataset[idx] for idx in possibly_batched_index]
File "/home/user/ai-voice-cloning/venv/lib/python3.10/site-packages/torch/utils/data/_utils/fetch.py", line 51, in
data = [self.dataset[idx] for idx in possibly_batched_index]
File "/home/user/ai-voice-cloning/modules/vall-e/vall_e/data.py", line 286, in getitem
text = torch.tensor([*map(self.phone_symmap.get, _get_phones(path))]).to(self.text_dtype)
RuntimeError: Could not infer dtype of NoneType

[2023-08-26 17:39:28,803] [INFO] [launch.py:315:sigkill_handler] Killing subprocess 32045
[2023-08-26 17:39:28,803] [ERROR] [launch.py:321:sigkill_handler] ['/home/user/ai-voice-cloning/venv/bin/python3.10', '-u', '-m', 'vall_e.train', '--local_rank=0', 'yaml=/home/user/ai-voice-cloning/training/Merida/config.yaml'] exits with return code = 1

I get this error message when I try training a model. Here is the zip file containing the files causing the errors https://files.catbox.moe/q9shx6.zip Traceback (most recent call last): File "/usr/lib/python3.10/runpy.py", line 196, in _run_module_as_main return _run_code(code, main_globals, None, File "/usr/lib/python3.10/runpy.py", line 86, in _run_code exec(code, run_globals) File "/home/user/ai-voice-cloning/modules/vall-e/vall_e/train.py", line 170, in <module> main() File "/home/user/ai-voice-cloning/modules/vall-e/vall_e/train.py", line 163, in main trainer.train( File "/home/user/ai-voice-cloning/modules/vall-e/vall_e/utils/trainer.py", line 249, in train for batch in _make_infinite_epochs(train_dl): File "/home/user/ai-voice-cloning/modules/vall-e/vall_e/utils/trainer.py", line 199, in _make_infinite_epochs yield from tqdm(dl, "Epoch progress", dynamic_ncols=True) File "/home/user/ai-voice-cloning/venv/lib/python3.10/site-packages/tqdm/std.py", line 1182, in __iter__ for obj in iterable: File "/home/user/ai-voice-cloning/venv/lib/python3.10/site-packages/torch/utils/data/dataloader.py", line 633, in __next__ data = self._next_data() File "/home/user/ai-voice-cloning/venv/lib/python3.10/site-packages/torch/utils/data/dataloader.py", line 1325, in _next_data return self._process_data(data) File "/home/user/ai-voice-cloning/venv/lib/python3.10/site-packages/torch/utils/data/dataloader.py", line 1371, in _process_data data.reraise() File "/home/user/ai-voice-cloning/venv/lib/python3.10/site-packages/torch/_utils.py", line 644, in reraise raise exception RuntimeError: Caught RuntimeError in DataLoader worker process 3. Original Traceback (most recent call last): File "/home/user/ai-voice-cloning/venv/lib/python3.10/site-packages/torch/utils/data/_utils/worker.py", line 308, in _worker_loop data = fetcher.fetch(index) File "/home/user/ai-voice-cloning/venv/lib/python3.10/site-packages/torch/utils/data/_utils/fetch.py", line 51, in fetch data = [self.dataset[idx] for idx in possibly_batched_index] File "/home/user/ai-voice-cloning/venv/lib/python3.10/site-packages/torch/utils/data/_utils/fetch.py", line 51, in <listcomp> data = [self.dataset[idx] for idx in possibly_batched_index] File "/home/user/ai-voice-cloning/modules/vall-e/vall_e/data.py", line 286, in __getitem__ text = torch.tensor([*map(self.phone_symmap.get, _get_phones(path))]).to(self.text_dtype) RuntimeError: Could not infer dtype of NoneType [2023-08-26 17:39:28,803] [INFO] [launch.py:315:sigkill_handler] Killing subprocess 32045 [2023-08-26 17:39:28,803] [ERROR] [launch.py:321:sigkill_handler] ['/home/user/ai-voice-cloning/venv/bin/python3.10', '-u', '-m', 'vall_e.train', '--local_rank=0', 'yaml=/home/user/ai-voice-cloning/training/Merida/config.yaml'] exits with return code = 1

mrq commented

2023-08-27 02:14:36 +00:00

You'll either need to:

update the symmap table in ./vall_e/data.py:35 with the first line printed out when the dataset loads (it'll look like the existing symmap, but with more entries).
create an HDF5 dataset which will automatically create and store its own symmap with `python3 -m vall_e.data --action=hdf5 yaml="./path/to/your/config.yaml."

It used to automatically update the symmap just-in-time rather than ahead-of-time, but doing so incurred a penalty when using an extremely large dataset.

You'll either need to: * update the symmap table in [`./vall_e/data.py:35`](https://git.ecker.tech/mrq/vall-e/src/branch/master/vall_e/data.py#L35) with the first line printed out when the dataset loads (it'll look like the existing symmap, but with more entries). * create an HDF5 dataset which will automatically create and store its own symmap with `python3 -m vall_e.data --action=hdf5 yaml="./path/to/your/config.yaml." It *used* to automatically update the symmap just-in-time rather than ahead-of-time, but doing so incurred a penalty when using an extremely large dataset.

Bluebomber182 commented

2023-08-27 02:58:31 +00:00

I created the HDF5 dataset. I still got the "RuntimeError: Could not infer dtype of NoneType" error message. I tried setting to "True" under use_hdf5: in config.yaml and I got this error message
raise ValueError("No valid path is found for training.") ValueError: No valid path is found for training.

I created the HDF5 dataset. I still got the "RuntimeError: Could not infer dtype of NoneType" error message. I tried setting to "True" under use_hdf5: in config.yaml and I got this error message ` raise ValueError("No valid path is found for training.") ValueError: No valid path is found for training.`

Sign in to join this conversation.

No Label

No Milestone

No project

No Assignees

2 Participants

Notifications

Due Date

The due date is invalid or out of range. Please use the format 'yyyy-mm-dd'.

No due date set.

Dependencies

No dependencies set.

Reference: mrq/vall-e#6