Creative Writing, RAG, Non-Commercial

ATM-Bench

by Jingbiao

4.4K downloads
4 likes
1K<n<10K

Description

ATM-Bench: Long-Term Personalized Referential Memory QA

ATM-Bench is the first benchmark for multimodal, multi-source personalized referential memory QA over long time horizons (~4 years) with evidence-grounded retrieval and answering.

Paper: According to Me: Long-Term Personalized Referential Memory QA

Overview

Existing long-term memory benchmarks focus primarily on dialogue history, failing to capture realistic personalized references grounded in lived experience.…

See the full description on the dataset page: https://huggingface.co/datasets/Jingbiao/ATM-Bench.

Tags

task_categories:question-answering, task_categories:visual-question-answering, language:en, language:zh, license:cc-by-nc-4.0, size_categories:1K<n<10K, modality:image, arxiv:2603.01990, region:us, personal-memory, multimodal, long-term-memory, retrieval-augmented-generation, benchmark
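
The tag list mixes namespaced entries (such as `language:en`) with free-form keywords (such as `benchmark`). As a minimal sketch of how one might group them for filtering, the snippet below transcribes the tags from this listing and splits each on its first colon. The function name `group_tags` and the `keywords` bucket are hypothetical choices made for illustration, not part of any dataset API.

```python
from collections import defaultdict

# Tags transcribed from the listing above.
TAGS = [
    "task_categories:question-answering",
    "task_categories:visual-question-answering",
    "language:en",
    "language:zh",
    "license:cc-by-nc-4.0",
    "size_categories:1K<n<10K",
    "modality:image",
    "arxiv:2603.01990",
    "region:us",
    "personal-memory",
    "multimodal",
    "long-term-memory",
    "retrieval-augmented-generation",
    "benchmark",
]


def group_tags(tags):
    """Group 'namespace:value' tags by namespace.

    Tags without a colon are collected under the (hypothetical) key 'keywords'.
    """
    grouped = defaultdict(list)
    for tag in tags:
        # partition splits on the first colon only, so values keep any later colons.
        key, sep, value = tag.partition(":")
        if sep:
            grouped[key].append(value)
        else:
            grouped["keywords"].append(tag)
    return dict(grouped)


grouped = group_tags(TAGS)
for namespace, values in grouped.items():
    print(f"{namespace}: {', '.join(values)}")
```

Grouping this way makes it straightforward to check, for example, which languages or task categories the listing declares without string-matching the raw tag line.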