Instruction Following
PPO, SFT
Commercial OK

Llama-Nemotron-Post-Training-Dataset

by nvidia

Silver 52
3.7K downloads
647 likes

Description

Llama-Nemotron-Post-Training-Dataset-v1.1 Release

Update [4/8/2025]: v1.1: We are releasing an additional 2.2M Math and 500K Code Reasoning Data in support of our release of Llama-3.1-Nemotron-Ultra-253B-v1. 🎉

Data Overview

This dataset is a compilation of SFT and RL data that supports improvements to the math, code, general reasoning, and instruction-following capabilities of the original Llama instruct model, in support of NVIDIA’s release of…

See the full description on the dataset page: https://huggingface.co/datasets/nvidia/Llama-Nemotron-Post-Training-Dataset

Tags

license:cc-by-4.0, size_categories:1M<n<10M, format:json, modality:text, library:datasets, library:dask, library:mlcroissant, arxiv:2505.00949, region:us