A dataset of informal Persian audio and text chunks, along with a fully open processing pipeline, suitable for ASR and TTS tasks. Created from crawled content on virgool.io.
tts
persian
speech-processing
asr
forced-alignment
speech-dataset
persian-speech-recognition
asr-evaluation
persian-speech-dataset
persian-text-to-speech
speech-data-collection
persian-speech-corpus
-
Updated
Sep 13, 2024 - Jupyter Notebook