CodeCommercial OK

arxiv-papers-by-subject

by permutans

Bronze43
69.2Kdownloads
0likes
1M<n<10M

Description

arXiv Papers by Subject A reorganised version of the nick007x/arxiv-papers dataset, partitioned by subject code, year, and month for efficient selective access. Dataset Description This dataset contains metadata for over 2.5 million arXiv papers, organised into a hierarchical directory structure that allows users to download only the specific subjects and time periods they need, rather than the entire dataset. Motivation The original nick007x/arxiv-papers… See the full description on the dataset page: https://huggingface.co/datasets/permutans/arxiv-papers-by-subject.

What can I do with this?

Tags

task_categories:text-generationtask_categories:feature-extractionsource_datasets:nick007x/arxiv-paperslanguage:enlicense:mitsize_categories:1M<n<10Mregion:usarxivacademic-papersscientific-literatureresearchmetadata