Code, Synthetic Data, Commercial OK
synthetic_text_to_sql
by gretelai
2.4K downloads
633 likes
100K<n<1M
Description
gretelai/synthetic_text_to_sql is a rich dataset of high-quality synthetic Text-to-SQL samples,
designed and generated using Gretel Navigator, and released under Apache 2.0.
Please see our release blog post for more details.
The dataset includes:
105,851 records partitioned into 100,000 train and 5,851 test records
~23M total tokens, including ~12M SQL tokens
Coverage across 100 distinct…
See the full description on the dataset page: https://huggingface.co/datasets/gretelai/synthetic_text_to_sql.
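The split sizes listed above can be checked after download. The following is a minimal sketch, not an official loader: it assumes the Hugging Face `datasets` library is installed (the tags list `library:datasets`) and that the two splits are exposed under the keys `train` and `test`. The helper names `load_splits` and `split_size_mismatches` are hypothetical and exist only for this illustration.

```python
# Split sizes as stated on the dataset card (100,000 train + 5,851 test).
# Assumption: the splits are exposed under the keys "train" and "test".
EXPECTED_SPLIT_SIZES = {"train": 100_000, "test": 5_851}


def load_splits(repo_id: str = "gretelai/synthetic_text_to_sql"):
    """Download the dataset from the Hugging Face Hub (needs network access)."""
    # Imported lazily so the pure helper below works without `datasets` installed.
    from datasets import load_dataset

    return load_dataset(repo_id)


def split_size_mismatches(actual: dict, expected: dict = EXPECTED_SPLIT_SIZES) -> dict:
    """Return {split: (expected_size, actual_size)} for each split that differs.

    A split missing from `actual` is reported with an actual size of None.
    """
    return {
        name: (size, actual.get(name))
        for name, size in expected.items()
        if actual.get(name) != size
    }
```

To use it, call `load_splits()`, build `{name: ds.num_rows for name, ds in splits.items()}` from the result, and pass that dictionary to `split_size_mismatches`. An empty result means the downloaded splits match the counts on the card.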
Tags
task_categories:question-answering, task_categories:table-question-answering, task_categories:text-generation, language:en, license:apache-2.0, size_categories:100K<n<1M, format:parquet, modality:text, library:datasets, library:pandas, library:polars, library:mlcroissant, library:datadesigner, arxiv:2306.05685, region:us, synthetic, SQL, text-to-SQL, code, datadesigner