<aside> 💡
ARC(Challenge) benchmark(Common Sense Reasoning)
A new dataset of 7,787 genuine grade-school level, multiple-choice science questions, assembled to encourage research in advanced question-answering. The dataset is partitioned into a Challenge Set and an Easy Set, where the former contains only questions answered incorrectly by both a retrieval-based algorithm and a word co-occurrence algorithm. We are also including a corpus of over 14 million science sentences relevant to the task, and an implementation of three neural baseline models for this dataset. We pose ARC as a challenge to the community.
from datasets import load_dataset
ds = load_dataset("allenai/ai2_arc", "ARC-Challenge")
## ARC-Challenge
{
"answerKey": "B",
"choices": {
"label": ["A", "B", "C", "D"],
"text": ["Shady areas increased.", "Food sources increased.", "Oxygen levels increased.", "Available water increased."]
},
"id": "Mercury_SC_405487",
"question": "One year, the oak trees in a park began producing more acorns than usual. The next year, the population of chipmunks in the park also increased. Which best explains why there were more chipmunks the next year?"
}