BAbI: Tackling Commonsense Reasoning
The BAbI benchmark presents a difficult set of tasks designed to evaluate the skills of AI systems in interpreting commonsense knowledge. It includes a wide range of cases that require reasoning about everyday concepts. By assessing how well AI models can resolve these problems, researchers hope to