ManaTTS is the largest open Persian speech dataset with 86+ hours of transcribed audio. Includes data collection pipeline and tools. Suitable for Persian text-to-speech models.
text-to-speech
tts
speech-synthesis
persian
data-collection
data-preprocessing
speech-processing
forced-alignment
speech-dataset
speech-corpus
dataset-preparation
persian-speech
tts-dataset
text-to-speech-dataset
mana-tts
speech-data-collection
-
Updated
Sep 13, 2024 - Jupyter Notebook