ai
MMLU
Massive Multitask Language Understanding
Definition
MMLU is a benchmark consisting of approximately 15,000 multiple-choice questions across 57 academic and professional subjects, from elementary mathematics to professional law and medicine. It measures a model's breadth of world knowledge and reasoning ability.
MMLU scores are widely reported in model release papers, though models trained on instruction-following datasets can achieve high scores without genuine understanding.
Related Terms
Ship secure code faster
Crash Override integrates security into the developer workflow. No context switching, no waiting on reviews.