Creative WritingRAGNon-Commercial
ATM-Bench
by Jingbiao
4.4Kdownloads
4likes
1K<n<10KDescription
ATM-Bench: Long-Term Personalized Referential Memory QA
ATM-Bench is the first benchmark for multimodal, multi-source personalized referential memory QA over long time horizons (~4 years) with evidence-grounded retrieval and answering.
Paper: According to Me: Long-Term Personalized Referential Memory QA
Overview
Existing long-term memory benchmarks focus primarily on dialogue history, failing to capture realistic personalized references grounded in lived experience.… See the full description on the dataset page: https://huggingface.co/datasets/Jingbiao/ATM-Bench.
What can I do with this?
Tags
task_categories:question-answeringtask_categories:visual-question-answeringlanguage:enlanguage:zhlicense:cc-by-nc-4.0size_categories:1K<n<10Kmodality:imagearxiv:2603.01990region:uspersonal-memorymultimodallong-term-memoryretrieval-augmented-generationbenchmark