Benchmarks & EvaluationOther
AG’s News Corpus
by fancyzhx
78.2Kdownloads
186likes
100K<n<1MDescription
Dataset Card for "ag_news"
Dataset Summary
AG is a collection of more than 1 million news articles. News articles have been
gathered from more than 2000 news sources by ComeToMyHead in more than 1 year of
activity. ComeToMyHead is an academic news search engine which has been running
since July, 2004. The dataset is provided by the academic comunity for research
purposes in data mining (clustering, classification, etc), information retrieval
(ranking, search, etc), xml… See the full description on the dataset page: https://huggingface.co/datasets/fancyzhx/ag_news.
What can I do with this?
Tags
task_categories:text-classificationtask_ids:topic-classificationannotations_creators:foundlanguage_creators:foundmultilinguality:monolingualsource_datasets:originallanguage:enlicense:unknownsize_categories:100K<n<1Mformat:parquetmodality:textlibrary:datasetslibrary:pandaslibrary:mlcroissantlibrary:polarsregion:us