Retrieval Augmented Generation Rag Technique Documentation With Architecture Patterns

1

langchainFramework67/100

via “retrieval-augmented generation (rag) pipeline composition”

Typescript bindings for langchain

Unique: RetrievalQA is a pre-built chain that combines a Retriever (vector store query interface) with a PromptTemplate and LLM. The chain automatically formats retrieved documents into context and passes them to the LLM. Multiple retrieval strategies (similarity, MMR) are supported through the Retriever interface, enabling optimization for different use cases.

vs others: More accessible than building custom RAG pipelines because it provides a standard pattern, and more flexible than monolithic RAG frameworks because retrievers, prompts, and LLMs are swappable.

2

MetaGPTFramework60/100

via “retrieval-augmented generation (rag) with configurable engines and semantic search”

Multi-agent software company simulator — PM, architect, engineer roles collaborate on projects.

Unique: Implements a pluggable RAG system with support for multiple retrieval engines (vector, BM25, hybrid) and per-role/per-action configuration. RAG is integrated into the action framework, enabling agents to automatically augment prompts with retrieved context before LLM invocation.

vs others: More flexible than single-engine RAG systems because it supports multiple retrieval strategies and allows fine-grained configuration per role/action. Compared to external RAG pipelines, MetaGPT's RAG is tightly integrated with the agent framework and automatically handles context injection.

3

Anthropic CookbookRepository59/100

via “retrieval-augmented-generation-pipeline-templates”

Official Anthropic recipes for building with Claude.

Unique: Demonstrates RAG patterns specifically optimized for Claude's context window and instruction-following capabilities, including techniques for injecting retrieved context into system prompts and handling multi-document synthesis. Uses LlamaIndex as an abstraction layer to support multiple vector databases without rewriting core logic.

vs others: More complete than generic RAG tutorials because it shows Claude-specific patterns (like using retrieved context in system prompts); more flexible than monolithic RAG frameworks because examples are modular and can be adapted to different vector databases.

4

ragflowRepository57/100

via “retrieval-augmented generation (rag) engine with agentic capabilities”

RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs

Unique: RAGFlow uniquely combines deep document parsing with a visual agent workflow builder, allowing users to create complex AI applications seamlessly.

vs others: Unlike other RAG solutions, RAGFlow offers a comprehensive agentic workflow framework that enhances document processing and contextual understanding.

5

LangChain RAG TemplateTemplate57/100

via “rag framework for building retrieval-augmented generation applications”

LangChain reference RAG implementation from scratch.

Unique: This repository uniquely focuses on building RAG systems from scratch, providing educational insights and customizable code examples.

vs others: Unlike other RAG frameworks, this implementation emphasizes a step-by-step educational approach, allowing for deeper understanding and flexibility.

6

awesome-llm-appsRepository56/100

via “retrieval-augmented generation (rag) pattern library with multiple retrieval strategies”

100+ AI Agent & RAG apps you can actually run — clone, customize, ship.

Unique: Provides 8+ distinct RAG patterns (basic, corrective, hybrid, database routing, agentic, autonomous, reasoning-enhanced) with working implementations for each, allowing developers to compare trade-offs between retrieval quality and latency. Most RAG tutorials show only basic vector search; this library treats RAG as a design space with multiple valid solutions.

vs others: More comprehensive RAG pattern coverage than LangChain's built-in RAG examples; more practical than academic RAG papers with runnable code for each pattern

7

LM StudioApp55/100

via “document attachment and retrieval-augmented generation (rag) for chat”

Desktop app for running local LLMs — model discovery, chat UI, and OpenAI-compatible server.

Unique: Implements end-to-end RAG entirely locally without external vector databases or cloud services, with document attachment directly in the chat UI and automatic retrieval/injection into model context

vs others: Eliminates dependency on external vector databases (Pinecone, Weaviate) and cloud embedding services (OpenAI embeddings), reducing infrastructure complexity and ensuring document privacy vs cloud-based RAG solutions

8

RAG_TechniquesRepository54/100

via “foundational-rag-pipeline-implementation”

This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. Each technique has a detailed notebook tutorial.

Unique: Provides a unified pedagogical pipeline architecture that all 40+ techniques build upon, with dual-framework implementations (LangChain and LlamaIndex) showing how the same logical pipeline maps to different frameworks, enabling developers to understand RAG concepts independent of framework choice

vs others: More comprehensive than single-technique tutorials because it shows the complete pipeline context and how techniques compose, whereas most RAG guides focus on isolated techniques without showing integration points

9

PocketFlowFramework53/100

via “rag (retrieval-augmented generation) system composition”

Pocket Flow: 100-line LLM framework. Let Agents build Agents!

Unique: Implements RAG as a composable workflow pattern using the Graph + Shared Store model, enabling retrieval results to be cached and reused across multiple agent iterations without external vector database dependencies

vs others: Simpler than LlamaIndex/LangChain RAG (no index management overhead) but less feature-rich than specialized RAG frameworks (no built-in reranking, no vector DB integration)

10

hello-agentsAgent52/100

via “rag pipeline with document processing and retrieval integration”

📚 《从零开始构建智能体》——从零开始的智能体原理与实践教程

Unique: Integrates RAG as a core agent capability with explicit examples of document chunking strategies, embedding generation, and retrieval integration into agent prompts, rather than treating RAG as a separate system bolted onto agents

vs others: More practical than fine-tuning for handling document-specific knowledge, but less precise than full-text search for exact phrase matching; best for semantic understanding of document content

11

awesome-generative-ai-guideRepository51/100

via “retrieval augmented generation system design and implementation”

A one stop repository for generative AI research updates, interview resources, notebooks and much more!

Unique: Organizes RAG design around explicit decision points (retriever type, embedding model, vector database, ranking strategy) with research-backed guidance on trade-offs. Includes dedicated section on agentic RAG patterns for knowledge-grounded agent decision making.

vs others: More comprehensive than framework-specific RAG documentation; provides cross-framework architectural patterns and research-backed design guidance, whereas most RAG resources focus on implementation in a specific framework.

12

agentscopeAgent51/100

via “retrieval-augmented generation (rag) with vector stores and document readers”

Build and run agents you can see, understand and trust.

Unique: Integrates RAG through a Knowledge Base abstraction that works with pluggable vector stores and document readers, allowing agents to augment reasoning with retrieved context while maintaining separation between retrieval logic and agent reasoning

vs others: More modular than LangChain's RAG because vector stores and document readers are pluggable; more integrated than AutoGen's RAG support because it's built into the agent framework rather than requiring external libraries

13

ai-agents-for-beginnersAgent49/100

via “agentic-rag-pattern-with-context-engineering”

12 Lessons to Get Started Building AI Agents

Unique: Frames RAG as an agentic decision (agents decide when to retrieve) rather than a static pipeline, and explicitly teaches context engineering techniques like chat summarization and scratchpad management to handle token constraints — most RAG tutorials treat retrieval as a fixed preprocessing step.

vs others: Covers the full context lifecycle (types, management, summarization) rather than just retrieval mechanics, making it more applicable to long-running agent conversations where context budgets are critical.

14

GenerativeAIExamplesRepository49/100

via “retrieval-augmented generation (rag) pipeline orchestration across multiple frameworks”

Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.

Unique: Decouples RAG stages (retrieval, reranking, generation) as independent microservices with pluggable implementations, enabling framework-agnostic RAG that supports both cloud-hosted and self-hosted inference patterns — differentiates from framework-specific RAG by providing portable, composable reference implementations

vs others: More flexible than framework-locked RAG because components are swappable, and more cost-effective than cloud-only RAG because self-hosted NIM deployment avoids per-query API costs while maintaining production-grade performance

15

AgentGuideRepository49/100

via “rag system design and vector database reference”

Unique: Bridges research papers (agentic RAG, GraphRAG) with practical tooling choices, including explicit document parsing guide that addresses production challenges like heterogeneous formats and metadata preservation

vs others: Connects theoretical RAG advances (agentic RAG, GraphRAG) to implementation choices; most tutorials focus only on basic RAG patterns

16

ai-notesRepository49/100

via “semantic search and rag architecture documentation”

notes for software engineers getting up to speed on new AI developments. Serves as datastore for https://latent.space writing, and product brainstorming, but has cleaned up canonical references under the /Resources folder.

Unique: Explicitly documents the interaction between embedding model choice, vector storage architecture, and LLM prompt injection patterns, treating RAG as an integrated system rather than separate components

vs others: More comprehensive than individual vector database documentation because it covers the full RAG pipeline, but less detailed than specialized RAG frameworks like LangChain

17

happy-llmRepository48/100

via “rag (retrieval-augmented generation) system implementation”

📚 从零开始构建大模型

Unique: Implements RAG as a modular pipeline with separate, swappable components for embedding generation, retrieval, ranking, and generation, allowing learners to understand each stage independently and experiment with different retrieval strategies without modifying the generation component

vs others: More transparent than using LangChain RAG chains because it shows the underlying retrieval and ranking logic explicitly, enabling customization and debugging of retrieval quality rather than treating it as a black box

18

awesome-chatgpt-zhRepository47/100

via “rag implementation pattern guide with vector database integration examples”

ChatGPT 中文指南🔥，ChatGPT 中文调教指南，指令指南，应用开发指南，精选资源清单，更好的使用 chatGPT 让你的生产力 up up up! 🚀

Unique: Provides end-to-end RAG implementation patterns with specific focus on Chinese language models and multilingual document handling. Includes vector database comparison matrix with performance metrics and cost analysis, enabling developers to make informed architectural decisions.

vs others: More comprehensive than individual framework documentation because it covers the full RAG pipeline with cross-framework comparisons, whereas LangChain or LlamaIndex docs focus on their specific abstractions.

19

awesome-generative-aiRepository45/100

via “retrieval-augmented-generation-system-resource-mapping”

A curated list of Generative AI tools, works, models, and references

Unique: Treats RAG as a distinct capability with dedicated resources covering the full pipeline (embeddings → vector databases → retrieval → reranking), rather than treating it as an LLM application pattern. Recognizes that RAG requires specialized infrastructure (vector databases, embedding models) beyond base LLMs

vs others: More comprehensive than single-tool documentation (Pinecone, Weaviate) by covering the full RAG ecosystem, but less detailed than specialized communities (Hugging Face, Papers with Code) which provide benchmarks and comparative analysis of retrieval methods

20

DecryptPromptRepository44/100

via “retrieval-augmented generation (rag) and knowledge integration research collection”

总结Prompt&LLM论文，开源数据&模型，AIGC应用

Unique: Organizes RAG research across the full pipeline (document retrieval, knowledge base construction, integration methods, table/chart understanding) showing how techniques like dense retrieval and knowledge base augmentation (KBLAM) work together to ground LLM outputs in external knowledge sources.

vs others: More comprehensive than framework documentation (LangChain RAG guides) by covering underlying retrieval research; more practical than pure information retrieval papers by organizing knowledge around LLM-specific challenges like context window constraints and hallucination reduction.

Top Matches

Also Known As

Company