Description
Updates
2024/07/09: we also uploaded a new version of YODAS as YODAS2, it provides unsegmented audios and higher sampling rate (24k)
README
This is the YODAS manual/automatic subset from our YODAS dataset, it has 369,510 hours of speech.
This dataset contains audio utterances and corresponding captions (manual or automatic) from YouTube. Note that manual caption only indicates that it is uploaded by users, but not necessarily transcribed by a human
For more details about YODAS… See the full description on the dataset page: https://huggingface.co/datasets/espnet/yodas.
What can I do with this?
Tags
license:cc-by-3.0arxiv:2406.00899region:us