Evaluation of RAG Systems
Evaluating RAG systems is more complex than evaluating traditional LLMs, because RAG involves two distinct processes:
-
Retrieval quality
-
Generation quality (based on retrieved documents)
A good RAG system must fetch relevant documents and generate coherent, factually correct, and context-aware responses.