AgenticRAG-Survey
AgentFreeAgentic-RAG explores advanced Retrieval-Augmented Generation systems enhanced with AI LLM agents.
Capabilities16 decomposed
reflection pattern implementation for agent self-evaluation
Medium confidenceEnables autonomous agents to evaluate their own outputs and decisions by implementing a feedback loop where agents assess correctness, identify errors, and determine areas for improvement. This pattern integrates introspection mechanisms that allow agents to critique their reasoning chains and trigger iterative refinement cycles without external intervention, forming the basis for self-correcting RAG pipelines.
Implements reflection as a first-class agentic pattern within RAG pipelines rather than as post-hoc validation, enabling agents to autonomously trigger re-retrieval and re-generation cycles based on internal quality assessment without requiring external feedback loops.
Differs from traditional RAG validation by embedding reflection directly into agent decision-making, enabling continuous self-improvement rather than one-shot generation followed by external review.
planning pattern for multi-step task decomposition
Medium confidenceEnables agents to create structured, hierarchical task plans by decomposing complex queries into sequential or parallel sub-tasks with explicit dependencies and execution order. The pattern uses LLM-based planning to generate task graphs that specify retrieval steps, reasoning stages, and tool invocations, allowing agents to orchestrate complex workflows autonomously rather than following fixed pipelines.
Treats planning as a generative capability where agents dynamically create task graphs tailored to specific queries, rather than using static workflow templates, enabling adaptive task orchestration that responds to query complexity and available resources.
Provides more flexibility than fixed prompt-chaining pipelines by allowing agents to determine task structure dynamically, and more efficiency than exhaustive search by using LLM reasoning to prune suboptimal task sequences.
multi-agent rag architecture with specialized retriever and generator agents
Medium confidenceImplements a RAG system where distinct agents specialize in retrieval and generation, coordinating through shared context or message passing. The retriever agent focuses on finding relevant documents and evaluating retrieval quality, while the generator agent synthesizes responses from retrieved context. This separation enables specialization where each agent optimizes for its specific task while maintaining coordination through explicit communication protocols.
Separates retrieval and generation into distinct agents with independent optimization objectives, enabling specialization where each agent can be tuned for its specific task without compromising the other, rather than forcing a single agent to optimize for both.
Enables better specialization than single-agent systems by allowing independent optimization of retrieval and generation, and more modular than monolithic systems by enabling independent testing and deployment of retriever and generator.
hierarchical agentic rag with multi-level agent organization
Medium confidenceOrganizes agents in a hierarchical structure where high-level agents handle task decomposition and coordination, mid-level agents manage specialized domains or processing stages, and low-level agents execute specific operations. Information flows up and down the hierarchy, with higher-level agents making strategic decisions and lower-level agents executing tactical operations. This enables scalable organization of complex reasoning across many agents with clear responsibility boundaries.
Organizes agents in explicit hierarchical structures with clear parent-child relationships and delegation protocols, rather than flat multi-agent systems, enabling scalable organization of complex reasoning with clear responsibility boundaries.
Scales better than flat multi-agent systems by organizing agents hierarchically, and provides clearer responsibility assignment than peer-to-peer agent networks by establishing explicit authority relationships.
corrective agentic rag with feedback-driven iterative refinement
Medium confidenceImplements RAG systems with explicit feedback loops where agents detect retrieval or generation failures and trigger corrective actions. When agents identify that retrieved context is insufficient or generated responses are inaccurate, they autonomously adjust retrieval strategies (e.g., different query formulation, expanded search scope) or re-generate responses with corrected reasoning. This pattern enables self-correcting systems that improve output quality through iterative refinement driven by internal error detection.
Implements error correction as an autonomous capability where agents detect failures and trigger corrective actions without external feedback, rather than treating errors as terminal failures, enabling self-improving systems that adapt retrieval and generation strategies based on quality feedback.
More autonomous than systems requiring human feedback by implementing automatic error detection and correction, and more adaptive than fixed retrieval strategies by adjusting approach based on detected failures.
adaptive agentic rag with dynamic strategy selection based on query characteristics
Medium confidenceImplements RAG systems that dynamically adjust retrieval and generation strategies based on query analysis, task complexity, and available resources. Agents analyze incoming queries to determine optimal processing approach (e.g., simple retrieval vs multi-step reasoning, local vs remote execution) and select strategies that balance quality, latency, and cost. This pattern enables efficient resource utilization by matching processing complexity to query requirements rather than using uniform strategies for all queries.
Implements adaptive strategy selection where agents analyze query characteristics to determine optimal processing approach, rather than using uniform strategies for all queries, enabling efficient resource utilization by matching complexity to requirements.
More efficient than fixed-strategy systems by adapting to query characteristics, and more intelligent than simple routing by using query analysis to select strategies that balance multiple optimization objectives.
graph-based agentic rag with knowledge graph integration and semantic reasoning
Medium confidenceImplements RAG systems that leverage knowledge graphs to structure information and enable semantic reasoning across entities and relationships. Agents traverse knowledge graphs to find relevant information, reason about entity relationships, and synthesize responses based on graph structure. This pattern enables more sophisticated retrieval and reasoning by treating knowledge as interconnected entities and relationships rather than flat documents, supporting complex queries that require understanding of semantic relationships.
Leverages knowledge graph structure for both retrieval and reasoning, enabling agents to traverse semantic relationships and reason about entity connections, rather than treating knowledge as flat documents, enabling more sophisticated reasoning about interconnected information.
Enables more sophisticated reasoning than document-based RAG by leveraging semantic relationships, and more efficient retrieval than keyword search by using graph structure to identify relevant information.
agentic document workflow pattern for document-centric processing and analysis
Medium confidenceImplements specialized workflows for processing and analyzing documents where agents manage document ingestion, chunking, indexing, and multi-stage analysis. Agents coordinate document processing pipelines, apply domain-specific analysis (e.g., contract analysis, research paper summarization), and synthesize insights across documents. This pattern treats documents as first-class entities with explicit processing workflows, enabling sophisticated document analysis that goes beyond simple retrieval.
Treats documents as first-class entities with explicit processing workflows managed by agents, rather than treating documents as passive sources of text, enabling sophisticated document analysis with explicit coordination of ingestion, analysis, and synthesis stages.
Enables more sophisticated document analysis than simple retrieval by implementing explicit document processing workflows, and more flexible than fixed document processing pipelines by allowing agents to adapt processing based on document characteristics.
tool use pattern with schema-based function binding
Medium confidenceEnables agents to invoke external tools, APIs, and knowledge bases through a schema-based function registry that defines tool capabilities, parameters, and return types. Agents parse tool invocation requests from LLM outputs, validate parameters against schemas, execute tools with error handling, and integrate results back into the reasoning loop. This pattern supports both synchronous tool calls and asynchronous tool chains with result aggregation.
Implements tool use as a structured, schema-validated capability where agents operate against a formal tool registry with explicit parameter contracts, enabling type-safe tool invocations and systematic error handling rather than ad-hoc string parsing of tool calls.
More robust than simple string-based tool parsing by enforcing schema validation, and more flexible than hardcoded tool integrations by supporting dynamic tool discovery and parameter validation at runtime.
multi-agent collaboration pattern with role-based specialization
Medium confidenceEnables multiple specialized agents to work together on complex tasks by assigning distinct roles (e.g., retriever, analyzer, synthesizer) and implementing coordination mechanisms for task delegation, result aggregation, and conflict resolution. Agents communicate through shared context or message-passing protocols, with a coordinator agent managing task distribution and ensuring outputs from specialized agents are integrated coherently into final responses.
Treats multi-agent systems as first-class agentic patterns with explicit role definitions and coordination protocols, rather than running independent agents in parallel, enabling structured collaboration where agents understand their specialization and coordinate outputs.
Provides better output coherence than parallel independent agents by implementing explicit coordination, and more scalable than monolithic agents by distributing reasoning across specialized sub-agents.
prompt chaining workflow pattern for sequential task execution
Medium confidenceStructures complex tasks as sequences of dependent prompts where the output of one step becomes the input to the next, enabling step-by-step reasoning with explicit state transitions. Each step in the chain is a distinct LLM invocation with its own prompt, context, and validation logic, allowing agents to build up complex reasoning progressively while maintaining clear separation of concerns and enabling intermediate result inspection.
Implements prompt chaining as an explicit workflow pattern where each step is a distinct LLM invocation with independent prompts and validation, enabling fine-grained control over reasoning stages and intermediate result inspection rather than single-shot generation.
More transparent and auditable than single-shot generation by making each reasoning step explicit, and more flexible than fixed pipelines by allowing dynamic step selection based on intermediate results.
routing pattern for dynamic task direction based on query classification
Medium confidenceClassifies incoming queries or tasks and directs them to specialized processing pipelines or agents based on query type, complexity, or domain. The routing decision is made by an LLM-based classifier that analyzes query characteristics and selects the most appropriate handler from a registry of specialized processors, enabling efficient resource allocation and domain-specific optimization without requiring all queries to traverse the same pipeline.
Implements routing as an intelligent classification step that analyzes query characteristics to select specialized handlers, rather than using static rules or random assignment, enabling adaptive pipeline selection based on query semantics.
More efficient than single-pipeline systems by avoiding unnecessary processing steps, and more adaptive than rule-based routing by using LLM reasoning to classify queries based on semantic content.
parallelization pattern for concurrent task execution with result aggregation
Medium confidenceExecutes multiple independent tasks concurrently rather than sequentially, with explicit result aggregation and conflict resolution. The pattern identifies task independence, launches parallel executions, waits for all tasks to complete, and combines results using aggregation logic (e.g., voting, merging, ranking). This enables efficient utilization of computational resources and reduces total execution time for tasks with independent sub-components.
Implements parallelization as a first-class workflow pattern with explicit result aggregation logic, rather than simply launching tasks concurrently, enabling structured combination of parallel outputs with conflict resolution and ranking.
Reduces latency compared to sequential execution by leveraging parallelism, and provides more control than simple concurrent execution by implementing explicit aggregation strategies tailored to task semantics.
orchestrator-workers pattern for dynamic task delegation and coordination
Medium confidenceImplements a hierarchical coordination model where a central orchestrator agent analyzes tasks, decomposes them into sub-tasks, and delegates work to specialized worker agents. The orchestrator monitors worker progress, collects results, handles failures with retry logic, and synthesizes final outputs. Workers execute assigned tasks autonomously and report results back to the orchestrator, enabling scalable task distribution without requiring workers to understand the overall task structure.
Implements orchestrator-workers as an explicit coordination pattern where the orchestrator maintains global task state and makes intelligent delegation decisions, rather than simple task queue distribution, enabling adaptive load balancing and failure recovery.
Provides better fault tolerance than simple worker pools by implementing intelligent task reassignment, and more efficient than flat multi-agent systems by centralizing coordination logic in the orchestrator.
evaluator-optimizer pattern for iterative output refinement
Medium confidenceImplements a feedback loop where an evaluator agent assesses outputs against quality criteria, identifies deficiencies, and an optimizer agent iteratively refines outputs based on evaluation feedback. The pattern cycles between evaluation and optimization until quality thresholds are met or iteration limits are reached. This enables continuous improvement of agent outputs without external intervention, with clear quality metrics driving the refinement process.
Implements evaluation and optimization as a coupled feedback loop where evaluation results directly drive optimization decisions, rather than treating evaluation as post-hoc validation, enabling continuous quality improvement within the agent execution flow.
Provides more targeted refinement than simple re-generation by using evaluation feedback to guide optimization, and more efficient than exhaustive search by using LLM reasoning to identify specific improvement opportunities.
single-agent rag architecture with integrated retrieval and generation
Medium confidenceImplements a unified RAG system where a single agent manages both retrieval and generation within a single reasoning loop. The agent decides when to retrieve, what to retrieve, evaluates retrieved context, and generates responses iteratively. This architecture integrates all agentic patterns (reflection, planning, tool use) into a single agent's decision-making process, enabling end-to-end control over the RAG pipeline without inter-agent coordination overhead.
Unifies retrieval and generation within a single agent's reasoning loop, enabling tight coupling where retrieval decisions are informed by generation context and vice versa, rather than treating retrieval and generation as separate pipeline stages.
Simpler to implement and debug than multi-agent systems, and more efficient than rigid retrieval-then-generation pipelines by enabling adaptive retrieval based on generation progress.
Capabilities are decomposed by AI analysis. Each maps to specific user intents and improves with match feedback.
Related Artifactssharing capabilities
Artifacts that share capabilities with AgenticRAG-Survey, ranked by overlap. Discovered automatically through the match graph.
star the repo
to get notified when new templates ship.**
PocketFlow
Pocket Flow: 100-line LLM framework. Let Agents build Agents!
awesome-generative-ai-guide
A one stop repository for generative AI research updates, interview resources, notebooks and much more!
awesome-llm-apps
100+ AI Agent & RAG apps you can actually run — clone, customize, ship.
ai-agents-for-beginners
12 Lessons to Get Started Building AI Agents
hello-agents
📚 《从零开始构建智能体》——从零开始的智能体原理与实践教程
Best For
- ✓Teams building autonomous RAG systems requiring high accuracy without human-in-the-loop validation
- ✓Developers implementing multi-turn agent workflows with quality gates
- ✓Researchers exploring self-improving LLM agent architectures
- ✓Teams building complex question-answering systems requiring multi-stage reasoning
- ✓Developers implementing adaptive RAG where retrieval strategy depends on task complexity
- ✓Organizations needing explainable agent decision-making with visible task decomposition
- ✓Teams with domain expertise in both retrieval optimization and generation quality
- ✓Developers implementing systems where retrieval and generation have different SLAs or resource requirements
Known Limitations
- ⚠Reflection adds computational overhead — requires additional LLM calls per evaluation cycle, typically 2-3x the base inference cost
- ⚠Reflection quality depends on LLM capability — weaker models may fail to identify genuine errors in their reasoning
- ⚠Risk of reflection loops becoming stuck in local optima without external guidance or hard stopping criteria
- ⚠Planning overhead — generating detailed task plans adds 500ms-2s latency before execution begins
- ⚠Plan quality varies with LLM capability — weaker models may generate suboptimal or redundant task sequences
- ⚠No guarantee of plan feasibility — agents may plan tasks requiring unavailable tools or data sources
Requirements
Input / Output
UnfragileRank
UnfragileRank is computed from adoption signals, documentation quality, ecosystem connectivity, match graph feedback, and freshness. No artifact can pay for a higher rank.
Repository Details
Last commit: Oct 20, 2025
About
Agentic-RAG explores advanced Retrieval-Augmented Generation systems enhanced with AI LLM agents.
Categories
Alternatives to AgenticRAG-Survey
Are you the builder of AgenticRAG-Survey?
Claim this artifact to get a verified badge, access match analytics, see which intents users search for, and manage your listing.
Get the weekly brief
New tools, rising stars, and what's actually worth your time. No spam.
Data Sources
Looking for something else?
Search →