Benchmarks & EvaluationOther

AG’s News Corpus

by fancyzhx

Silver57
78.2Kdownloads
186likes
100K<n<1M

Description

Dataset Card for "ag_news" Dataset Summary AG is a collection of more than 1 million news articles. News articles have been gathered from more than 2000 news sources by ComeToMyHead in more than 1 year of activity. ComeToMyHead is an academic news search engine which has been running since July, 2004. The dataset is provided by the academic comunity for research purposes in data mining (clustering, classification, etc), information retrieval (ranking, search, etc), xml… See the full description on the dataset page: https://huggingface.co/datasets/fancyzhx/ag_news.

What can I do with this?

Tags

task_categories:text-classificationtask_ids:topic-classificationannotations_creators:foundlanguage_creators:foundmultilinguality:monolingualsource_datasets:originallanguage:enlicense:unknownsize_categories:100K<n<1Mformat:parquetmodality:textlibrary:datasetslibrary:pandaslibrary:mlcroissantlibrary:polarsregion:us