Speech & Audio | Commercial OK

Cantone

by AlienKevin

Bronze 45
49.4K downloads
3 likes
10K<n<100K

Description

A dataset of 34,489 recordings of Cantonese syllables by 10 speakers. The syllables are generated with the Cantonese speech synthesis engines of Amazon, Apple, Google, and Microsoft.

All recordings are stored as WAV files in the following format:

Channel: mono
Sample rate: 16 kHz
Bits per sample: 16

Number of recordings per speaker:

Company  Speaker  # Syllables
Amazon   Hiujin   3,885
Apple    Aasing   2,977
Apple    Sinji    2,977
…

See the full description on the dataset page: https://huggingface.co/datasets/AlienKevin/cantone.
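The stated recording format (mono, 16 kHz, 16 bits per sample) can be checked on a downloaded file with Python's standard `wave` module. The following is a minimal sketch, not part of the dataset itself; the function name `matches_cantone_format` and the constant names are illustrative assumptions.

```python
import wave

# Format stated in the dataset description.
EXPECTED_CHANNELS = 1          # mono
EXPECTED_SAMPLE_RATE = 16000   # 16 kHz
EXPECTED_SAMPLE_WIDTH = 2      # 16 bits per sample = 2 bytes


def matches_cantone_format(source):
    """Return True if `source` is a mono, 16 kHz, 16-bit WAV.

    `source` may be a file path or an open binary file-like object.
    The function name is hypothetical; it is not provided by the dataset.
    """
    with wave.open(source, "rb") as wav:
        return (
            wav.getnchannels() == EXPECTED_CHANNELS
            and wav.getframerate() == EXPECTED_SAMPLE_RATE
            and wav.getsampwidth() == EXPECTED_SAMPLE_WIDTH
        )
```

Calling `matches_cantone_format("example.wav")` on a local recording (the file name here is a placeholder, since the dataset's file layout is not given above) returns `True` only when all three properties match the description.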

Tags

task_categories:audio-classification, language:yue, license:mit, size_categories:10K<n<100K, modality:audio, region:us, speech, cantonese, yue, syllable, pronunciation