┌─────────────────────────────────────────────────────┐
│                    RAG 流程                          │
├─────────────────────────────────────────────────────┤
│                                                      │
│  用户问题 ──▶ 编码为向量 ──▶ 向量数据库检索           │
│                   │                                  │
│                   ▼                                  │
│           找到最相关的文档                            │
│                   │                                  │
│                   ▼                                  │
│      把文档和问题一起发送给 LLM                       │
│                   │                                  │
│                   ▼                                  │
│              生成答案                                 │
│                                                      │
└─────────────────────────────────────────────────────┘

为什么需要 RAG？

LLM 的知识有截止日期
LLM 不知道你私有的数据
RAG 让 LLM 能"查阅"外部知识

关键组件

组件	作用
Embedding Model	把文本变成向量
Vector Database	存储和检索向量
Retrieval	找到最相关的文档
Generation	用检索结果生成答案

简化实现

class SimpleRAG:
    """简化版 RAG 系统"""

    def __init__(self):
        # 文档存储
        self.documents = []
        # 向量存储（简化版，用关键词）
        self.vectors = {}

    def add_document(self, text: str, metadata: dict = None):
        """添加文档"""
        self.documents.append({
            "text": text,
            "metadata": metadata or {}
        })

    def retrieve(self, query: str, top_k: int = 3) -> list:
        """检索相关文档"""
        # 简化版：基于关键词匹配
        results = []
        for doc in self.documents:
            # 计算简单相关性分数
            score = sum(1 for word in query if word in doc["text"].lower())
            if score > 0:
                results.append((score, doc))
        # 排序并返回 top_k
        results.sort(key=lambda x: x[0], reverse=True)
        return [doc for _, doc in results[:top_k]]

    def generate(self, query: str, llm) -> str:
        """RAG 生成"""
        docs = self.retrieve(query)
        context = "\n".join([d["text"] for d in docs])

        prompt = f"""
根据以下上下文回答问题：

上下文：
{context}

问题：{query}

答案：
"""
        return llm.invoke(prompt)

Step 06: Self-Correction - 自我修正

核心概念

Self-Correction = 让 Agent 能够自我发现并修复错误

┌─────────────────────────────────────────────────────┐
│                Self-Correction 流程                  │
├─────────────────────────────────────────────────────┤
│                                                      │
│  生成结果 ──▶ 验证 ──▶ 有问题？                       │
│                   │                                  │
│              ┌────┴────┐                            │
│              │         │                            │
│             是         否                            │
│              │         │                            │
│              ▼         ▼                            │
│        分析错误    返回结果                          │
│              │                                        │
│              ▼                                        │
│        生成修复方案                                   │
│              │                                        │
│              ▼                                        │
│        重新生成 ──▶ 再次验证                          │
│                                                      │
└─────────────────────────────────────────────────────┘

实现要点

class SelfCorrectingAgent:
    """自我修正 Agent"""

    def __init__(self):
        self.max_retries = 3

    def try_generate(self, requirement: str) -> str:
        """带自我修正的生成"""
        for attempt in range(self.max_retries):
            # 1. 生成
            result = self.generate(requirement)

            # 2. 验证
            validation = self.validate(result)

            # 3. 检查是否通过
            if validation["passed"]:
                return result

            # 4. 分析错误
            error = validation["error"]
            print(f"尝试 {attempt + 1} 失败: {error}")

            # 5. 准备修复
            requirement = self.prepare_fix(requirement, error, result)

        return f"经过 {self.max_retries} 次尝试仍失败"

Step 07: Multi-Agent - 多 Agent 协作

核心概念

Multi-Agent = 多个专门的 Agent 协同工作

┌─────────────────────────────────────────────────────┐
│                  Multi-Agent 架构                   │
├─────────────────────────────────────────────────────┤
│                                                      │
│                  ┌──────────────┐                    │
│                  │ Orchestrator │                    │
│                  │  (协调者)    │                    │
│                  └──────┬───────┘                    │
│                         │                            │
│     ┌───────────────────┼───────────────────┐         │
│     │                   │                   │         │
│     ▼                   ▼                   ▼         │
│ ┌──────────┐      ┌──────────┐      ┌──────────┐   │
│ │ Generator│      │ Validator│      │ Searcher │   │
│ │ (生成者) │      │ (验证者) │      │ (搜索者) │   │
│ └──────────┘      └──────────┘      └──────────┘   │
│                                                      │
└─────────────────────────────────────────────────────┘

协作模式

模式	说明	适用场景
串行	A → B → C 依次执行	步骤有依赖
并行	A / B / C 同时执行	步骤独立
循环	A → B → A → B 循环	需要反复验证

简化实现

class MultiAgentSystem:
    """多 Agent 协作系统"""

    def __init__(self):
        # 注册各个 Agent
        self.agents: dict[str, Agent] = {}

    def register(self, agent: Agent) -> None:
        self.agents[agent.name] = agent

    def process(self, requirement: str):
        # 1. 搜索相关知识
        searcher = self.agents.get("searcher")
        context = searcher.process(requirement) if searcher else ""

        # 2. 生成（可能需要多轮）
        generator = self.agents.get("generator")
        draft = generator.process({"requirement": requirement, "context": context}) if generator else requirement

        # 3. 验证
        validator = self.agents.get("validator")
        if validator:
            validation = validator.process(draft)
            if not validation.get("passed", True):
                return {"error": "验证失败", "validation": validation}

        return draft

README.md Unescape Escape

Step 05-07: RAG / Self-Correction / Multi-Agent

Step 05: RAG - 知识检索

核心概念

为什么需要 RAG？

关键组件

简化实现

Step 06: Self-Correction - 自我修正

核心概念

实现要点

Step 07: Multi-Agent - 多 Agent 协作

核心概念

协作模式

简化实现

📚 学习资源

README.md