The Cross-lingual TRansfer Evaluation of Multilingual Encoders for Speech (XTREME-S) benchmark evaluates speech representations across languages, tasks, domains, and data regimes. It covers 102 languages from more than 10 language families, 3 domains, and 4 task families: speech recognition, translation, classification, and retrieval.
by google
41.0K downloads
384 likes
10K<n<100K
Description
FLEURS
Fleurs is the speech version of the FLoRes machine translation benchmark.
We use 2009 n-way parallel sentences from the publicly available FLoRes dev and devtest sets, in 102 languages.
Training sets have around 10 hours of supervision. Speakers in the training sets are different from the speakers in the dev/test sets. Multilingual fine-tuning is used, and the "unit error rate" (characters, signs) is averaged over all languages. Languages and results are also grouped into seven… See the full description on the dataset page: https://huggingface.co/datasets/google/fleurs.
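The averaged unit error rate mentioned in the description can be illustrated with a minimal sketch. It assumes that a "unit" is a single character, that each language's rate is the total edit distance divided by the total number of reference characters, and that the final score is the unweighted mean over languages. The function names and the toy transcripts are hypothetical and are not taken from the benchmark's own evaluation code.

```python
from statistics import mean


def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences of units (here: characters)."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def unit_error_rate(pairs):
    """Total edits divided by total reference units for one language."""
    edits = sum(edit_distance(ref, hyp) for ref, hyp in pairs)
    units = sum(len(ref) for ref, _ in pairs)
    return edits / units


def average_unit_error_rate(by_language):
    """Unweighted mean of the per-language rates (an assumption of this sketch)."""
    return mean(unit_error_rate(pairs) for pairs in by_language.values())


# Toy (reference, hypothesis) pairs keyed by language code; invented for illustration.
toy = {
    "cat": [("bon dia", "bon dia"), ("gat", "gas")],
    "ces": [("ahoj", "ahoi")],
}

print(f"{average_unit_error_rate(toy):.3f}")
```

Averaging per-language rates, rather than pooling all characters, keeps languages with long transcripts from dominating the score.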
Tags
task_categories:automatic-speech-recognition, annotations_creators:expert-generated, annotations_creators:crowdsourced, annotations_creators:machine-generated, language_creators:crowdsourced, language_creators:expert-generated, multilinguality:multilingual, language:afr, language:amh, language:ara, language:asm, language:ast, language:azj, language:bel, language:ben, language:bos, language:cat, language:ceb, language:cmn, language:ces