CodeSynthetic DataCommercial OK

synthetic_text_to_sql

by gretelai

Silver51
2.4Kdownloads
633likes
100K<n<1M

Description

Image generated by DALL-E. See prompt for more details synthetic_text_to_sql gretelai/synthetic_text_to_sql is a rich dataset of high quality synthetic Text-to-SQL samples, designed and generated using Gretel Navigator, and released under Apache 2.0. Please see our release blogpost for more details. The dataset includes: 105,851 records partitioned into 100,000 train and 5,851 test records ~23M total tokens, including ~12M SQL tokens Coverage across 100 distinct… See the full description on the dataset page: https://huggingface.co/datasets/gretelai/synthetic_text_to_sql.

What can I do with this?

Tags

task_categories:question-answeringtask_categories:table-question-answeringtask_categories:text-generationlanguage:enlicense:apache-2.0size_categories:100K<n<1Mformat:parquetmodality:textlibrary:datasetslibrary:pandaslibrary:polarslibrary:mlcroissantlibrary:datadesignerarxiv:2306.05685region:ussyntheticSQLtext-to-SQLcodedatadesigner