Modular RAG
Modular RAG systems evolve from rigid pipelines into reconfigurable architectures, where each stage is represented as a module. These systems are flexible, scalable, and easier to debug or extend.
Core Concepts of Modular RAG:
-
Modules
-
Represent high-level stages like retrieval, summarization, generation, orchestration.
-
Can be replaced or extended independently.
-
-
Sub-Modules
- Functional units inside modules (e.g., reranking inside the retrieval module).
-
Operators
- Lowest-level functional units (e.g., cosine similarity, token eliminator, prompt formatter).
-
Orchestration
- A control unit that dynamically routes queries through different modules depending on context or task.