ROUGE

Recall-Oriented Understudy for Gisting Evaluation

Definition

ROUGE is a set of automatic metrics for evaluating text summarization and translation quality by measuring n-gram overlap between generated text and human references. ROUGE-N measures n-gram recall, ROUGE-L measures the longest common subsequence, and ROUGE-S measures skip-bigram overlap.

Like BLEU, ROUGE correlates poorly with human judgment on abstractive generation tasks.

Related Terms

BLEU

Bilingual Evaluation Understudy

Evaluation

LLM Evaluation

Perplexity

Language Model Perplexity

← Back to Glossary

Ship secure code faster

Crash Override integrates security into the developer workflow. No context switching, no waiting on reviews.

Talk to a Human See the Product