arxiv-papers-by-subject
by permutans
69.2K downloads
0 likes
1M<n<10M
Description
arXiv Papers by Subject
A reorganised version of the nick007x/arxiv-papers dataset, partitioned by subject code, year, and month for efficient selective access.
Dataset Description
This dataset contains metadata for over 2.5 million arXiv papers, organised into a hierarchical directory structure that allows users to download only the specific subjects and time periods they need, rather than the entire dataset.
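As a rough illustration of selective access, the sketch below builds a glob pattern for one subject and time period and passes it to `snapshot_download` from `huggingface_hub`. The directory layout `<subject>/<year>/<month>/...`, the zero-padded month, and the helper names `build_pattern` and `download_partition` are assumptions made for this example; check the repository's file listing for the actual paths before relying on it.

```python
from typing import Optional

REPO_ID = "permutans/arxiv-papers-by-subject"


def build_pattern(subject: str, year: int, month: Optional[int] = None) -> str:
    """Build a glob pattern for one subject/year(/month) partition.

    ASSUMPTION: files are laid out as <subject>/<year>/<month>/..., with a
    zero-padded month. The real layout may differ.
    """
    if month is not None and not 1 <= month <= 12:
        raise ValueError(f"month must be in 1..12, got {month}")
    # With no month given, match every month directory under the year.
    month_part = f"{month:02d}" if month is not None else "*"
    return f"{subject}/{year}/{month_part}/*"


def download_partition(subject: str, year: int, month: Optional[int] = None) -> str:
    """Download only the files matching the partition; return the local path."""
    # Imported lazily so the pattern helper works without huggingface_hub installed.
    from huggingface_hub import snapshot_download

    return snapshot_download(
        repo_id=REPO_ID,
        repo_type="dataset",
        allow_patterns=[build_pattern(subject, year, month)],
    )
```

For example, `download_partition("cs.LG", 2023, 5)` would request only files matching `cs.LG/2023/05/*`, assuming that path exists in the repository, instead of fetching the full dataset.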
Motivation
The original nick007x/arxiv-papers… See the full description on the dataset page: https://huggingface.co/datasets/permutans/arxiv-papers-by-subject.
Tags
task_categories:text-generation, task_categories:feature-extraction, source_datasets:nick007x/arxiv-papers, language:en, license:mit, size_categories:1M<n<10M, region:us, arxiv, academic-papers, scientific-literature, research, metadata