DL-Art-School/codes/models/tacotron2/text/symbols.py

""" from https://github.com/keithito/tacotron """

'''
Defines the set of symbols used in text input to the model.

The default is a set of ASCII characters that works well for English or text that has been run
through Unidecode. For other data, you can modify the character sets below (_letters,
_punctuation, _special). See TRAINING_DATA.md for details.
'''
from models.tacotron2.text import cmudict

_pad = '_'
_punctuation = '!\'(),.:;? '
_special = '-'
_letters = 'ABCDEFGHIJKLMNOPQRSTUVWXYZabcdefghijklmnopqrstuvwxyz'

# Prepend "@" to ARPAbet symbols to ensure uniqueness (some are the same as uppercase letters):
_arpabet = ['@' + s for s in cmudict.valid_symbols]

# Export all symbols:
symbols = [_pad] + list(_special) + list(_punctuation) + list(_letters) + _arpabet
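
The exported list is typically turned into lookup tables that map each symbol to an integer ID and back. The following is a minimal sketch of that step, not code from this repository: `valid_symbols` here is a small hypothetical stand-in for `cmudict.valid_symbols`, and the names `symbol_to_id` and `id_to_symbol` are assumptions chosen for illustration.

```python
# Hypothetical stand-in for cmudict.valid_symbols (a small subset, for illustration only).
valid_symbols = ['AA', 'AE', 'B', 'D', 'K', 'S', 'T']

_pad = '_'
_punctuation = '!\'(),.:;? '
_special = '-'
_letters = 'ABCDEFGHIJKLMNOPQRSTUVWXYZabcdefghijklmnopqrstuvwxyz'

# The "@" prefix keeps ARPAbet 'B' distinct from the letter 'B'.
_arpabet = ['@' + s for s in valid_symbols]

symbols = [_pad] + list(_special) + list(_punctuation) + list(_letters) + _arpabet

# Assumed helper tables: symbol -> integer ID and the reverse mapping.
symbol_to_id = {s: i for i, s in enumerate(symbols)}
id_to_symbol = {i: s for i, s in enumerate(symbols)}

# Every symbol must be unique, otherwise two symbols would share one ID.
assert len(symbol_to_id) == len(symbols)
```

Because `_pad` comes first in the list, the padding symbol gets ID 0 in this sketch, and the letter `B` and the phoneme `@B` receive different IDs.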
Initial checkin of nvidia tacotron model & dataset. These two are tested, full support for training to come. 2021-07-06 17:11:35 +00:00