Speech & AudioCommercial OK
Cantone
by AlienKevin
49.4Kdownloads
3likes
10K<n<100KDescription
Cantone
A dataset of 34,489 recordings of Cantonese syllables by 10 speakers.
Those syllables are generated through the Cantonese speech synthesis engines of Amazon, Apple, Google, and Microsoft.
All recordings are stored as WAV files with the following format
Channel: mono
Sample rate: 16 kHz
Bits per sample: 16
Here's a breakdown of the number of recordings under each speaker:
Company
Speaker
# Syllables
Amazon
Hiujin
3,885
Apple
Aasing
2,977
Apple
Sinji
2,977… See the full description on the dataset page: https://huggingface.co/datasets/AlienKevin/cantone.
What can I do with this?
Tags
task_categories:audio-classificationlanguage:yuelicense:mitsize_categories:10K<n<100Kmodality:audioregion:usspeechcantoneseyuesyllablepronunciation