📊 LLM Evaluation Framework for Professional Content Rewriting

Evaluate the quality of LLM-generated content using multiple metrics, each normalized so they can be combined into a single Hybrid Score.

📥 Input Options

  • Input Mode

⚙️ Configuration

  • Select Model
  • Select Prompt Template

📄 Text Comparison

📈 Evaluation Metrics

📌 Overall Assessment

Hybrid Score Interpretation

The Hybrid Score combines the individual evaluation metrics into a single normalized score; a sketch of the computation follows the scale below:

  • 0.85 or above: Outstanding performance (A) - ready for professional use
  • 0.70-0.85: Strong performance (B) - good quality, minor improvements needed
  • 0.50-0.70: Adequate performance (C) - usable but needs refinement
  • 0.30-0.50: Weak performance (D) - requires significant revision
  • Below 0.30: Poor performance (F) - likely needs a complete rewrite
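
As an illustration, here is a minimal Python sketch of one way the weighted combination and grading could work. The metric names, weights, and the `hybrid_score` function are assumptions made for this example, not the framework's actual implementation; only the grade thresholds come from the scale above.

```python
# Illustrative sketch only: the metric names and weights below are assumptions,
# not the framework's actual configuration.
from typing import Dict, Tuple

# Hypothetical weights for combining per-metric scores (each already 0-1).
WEIGHTS = {
    "answer_relevancy": 0.20,
    "faithfulness":     0.20,
    "geval":            0.20,
    "bertscore":        0.15,
    "rouge":            0.10,
    "bleu":             0.05,
    "meteor":           0.10,
}

def hybrid_score(scores: Dict[str, float]) -> Tuple[float, str]:
    """Weighted average of normalized metric scores, mapped to the letter
    grades from the scale above."""
    total_weight = sum(WEIGHTS[name] for name in scores)
    combined = sum(WEIGHTS[name] * s for name, s in scores.items()) / total_weight

    if combined >= 0.85:
        grade = "A"
    elif combined >= 0.70:
        grade = "B"
    elif combined >= 0.50:
        grade = "C"
    elif combined >= 0.30:
        grade = "D"
    else:
        grade = "F"
    return combined, grade

# Example: yields approximately (0.76, "B") for these per-metric scores.
print(hybrid_score({"answer_relevancy": 0.90, "faithfulness": 0.80, "geval": 0.75,
                    "bertscore": 0.70, "rouge": 0.60, "bleu": 0.50, "meteor": 0.80}))
```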

Key Metrics Explained

| Metric | What It Measures | Why It Matters |
| --- | --- | --- |
| AnswerRelevancy | Is the output on-topic with the input? | Does the output stay focused despite messy input? |
| Faithfulness | Are ALL facts preserved correctly? | Does it maintain accuracy when the input has encoding errors? |
| GEval | Overall quality assessment by another AI | How professional does the output appear? |
| BERTScore | Semantic similarity to the reference | How well does it capture the meaning of the cleaned text? |
| ROUGE | Content overlap with the reference | How much key information is preserved? |
| BLEU | Phrasing precision | How closely does the wording match a human-quality standard? |
| METEOR | Linguistic quality with synonyms | How natural does the cleaned output read? |
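
Of the metrics above, BERTScore, ROUGE, BLEU, and METEOR are reference-based and can be computed locally, while AnswerRelevancy, Faithfulness, and GEval are LLM-judged metrics (the names match DeepEval's, for example) and need a judge model rather than a reference text. Here is a minimal sketch of the reference-based metrics, assuming the bert-score, rouge-score, and nltk packages are installed; the example sentences and whitespace tokenization are illustrative only.

```python
# Sketch of the reference-based metrics, assuming bert-score, rouge-score,
# and nltk are installed (METEOR also needs nltk's "wordnet" data and a
# recent nltk version that accepts pre-tokenized input).
from bert_score import score as bert_score
from rouge_score import rouge_scorer
from nltk.translate.bleu_score import sentence_bleu, SmoothingFunction
from nltk.translate.meteor_score import meteor_score

# Illustrative strings; in practice these come from the model output and
# the cleaned reference text.
candidate = "The quarterly report was revised and sent to all stakeholders."
reference = "The quarterly report was corrected and distributed to every stakeholder."

# BERTScore: semantic similarity from contextual embeddings (P, R, F1 tensors).
_, _, f1 = bert_score([candidate], [reference], lang="en")
print("BERTScore F1:", f1.mean().item())

# ROUGE-L: longest-common-subsequence overlap with the reference.
scorer = rouge_scorer.RougeScorer(["rougeL"], use_stemmer=True)
print("ROUGE-L F1:", scorer.score(reference, candidate)["rougeL"].fmeasure)

# BLEU: n-gram precision against the reference, smoothed for short texts.
ref_tokens, cand_tokens = reference.split(), candidate.split()
print("BLEU:", sentence_bleu([ref_tokens], cand_tokens,
                             smoothing_function=SmoothingFunction().method1))

# METEOR: unigram matching with stemming and WordNet synonyms.
print("METEOR:", meteor_score([ref_tokens], cand_tokens))
```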