Instruction FollowingPPO, SFTCommercial OK
Llama-Nemotron-Post-Training-Dataset
by nvidia
3.7Kdownloads
647likes
Description
Llama-Nemotron-Post-Training-Dataset-v1.1 Release
Update [4/8/2025]:
v1.1: We are releasing an additional 2.2M Math and 500K Code Reasoning Data in support of our release of Llama-3.1-Nemotron-Ultra-253B-v1. 🎉
Data Overview
This dataset is a compilation of SFT and RL data that supports improvements of math, code, general reasoning, and instruction following capabilities of the original Llama instruct model, in support of NVIDIA’s release of… See the full description on the dataset page: https://huggingface.co/datasets/nvidia/Llama-Nemotron-Post-Training-Dataset.
What can I do with this?
Tags
license:cc-by-4.0size_categories:1M<n<10Mformat:jsonmodality:textlibrary:datasetslibrary:dasklibrary:mlcroissantarxiv:2505.00949region:us