Skip to content
ai

ROUGE

Recall-Oriented Understudy for Gisting Evaluation

Definition

ROUGE is a set of automatic metrics for evaluating text summarization and translation quality by measuring n-gram overlap between generated text and human references. ROUGE-N measures n-gram recall, ROUGE-L measures the longest common subsequence, and ROUGE-S measures skip-bigram overlap.

Like BLEU, ROUGE correlates poorly with human judgment on abstractive generation tasks.


Ship secure code faster

Crash Override integrates security into the developer workflow. No context switching, no waiting on reviews.