agent_jrxml

Author	SHA1	Message	Date
panda	4e14334030	fix: per-node max_tokens + validation 502 guard + correct_jrxml output validity - backend/llm.py: per-node max_tokens via get_llm(max_tokens=N), LLM_MAX_TOKENS env var (default 8192) - agent/nodes.py: 5 generation nodes use max_tokens=32768, generate_skeleton retries at 65536 - agent/nodes.py: fix ns:field regex (<field → <[\w:]*field) to handle namespace prefixes - agent/nodes.py: fix correct_jrxml never writing back to state["current_jrxml"] - agent/nodes.py: correct_jrxml rejects non-JRXML output (no <jasperReport tag) - agent/nodes.py: _strip_continuation_wrapper strips markdown/prefixes from continuation rounds - agent/nodes.py: _extract_jrxml iterates multiple markdown code blocks, skips fragments - agent/graph.py: route_after_validate skips correction loop when service_unavailable - agent/graph.py: route_after_save skips validation for empty JRXML - backend/validation.py: returns service_unavailable: True for ConnectError and HTTP 5xx - Docs: CLAUDE.md v14 changelog, README.md LLM_MAX_TOKENS, .env.example LLM_MAX_TOKENS	2026-05-24 15:20:25 +08:00
panda	1210b926c3	fix: MAX_RETRY 5 + rolling continuation + namespace-aware JRXML extraction - MAX_RETRY: 3→5 (graph.py:35, nodes.py:25) with env override - Rolling continuation: _generate_with_continuation() auto-detects truncated JRXML and sends anchor-based continuation, max 3 rounds - JRXML extraction: regex/end-tag now namespace-prefix aware (ns0:jasperReport, ns:jasperReport, etc.) - All 5 generation nodes refactored to use continuation helper - Tests updated: scenario1 accepts ns-prefixed root, max_retry verifies graph termination - stop_reason capture + WARNING log on max_tokens truncation - Correction prompt now injects OCR context + layout schema	2026-05-23 10:58:46 +08:00
panda	1e5ce9725b	feat: FastAPI+SSE API server, JRXML auto-reorder, session integrity fixes	2026-05-22 17:53:59 +08:00
panda	83c7da7517	fix: system env vars silently overriding .env — load_dotenv(override=True) Root cause: load_dotenv() default override=False meant system-level ANTHROPIC_BASE_URL (https://api.deepseek.com/anthropic) took precedence over .env's OPENAI_BASE_URL (https://api.minimaxi.com/anthropic). All Anthropic API calls went to DeepSeek with a MiniMax key, causing 401. Changes: - backend/llm.py: load_dotenv(override=True) — .env always wins - .env.example: add explicit ANTHROPIC_API_KEY + ANTHROPIC_BASE_URL - CLAUDE.md: document env var priority pitfall	2026-05-21 22:36:43 +08:00
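panda	067880bf2e	feat: add structured logging system, update LLM configuration and all docs Added: - backend/logger.py: centralized logging module (JSON format + trace_id + separate llm.log) - @log_node / @_log_route decorators covering 17 nodes and 8 routes Improved: - backend/llm.py: _LLMLoggingWrapper automatically logs LLM input and output - backend/llm.py: API key is read from ANTHROPIC_API_KEY first, model name changed to MiniMax-M2.7 - backend/llm.py: get_llm() gains a caller parameter identifying the call site - backend/validation.py: added logging of validation results and connection failures - backend/session.py: added session create/delete logging - app.py: added user interaction logging (input/execution/exceptions/session operations) - app.py: import torchvision early to suppress transformers lazy-loading errors - .env.example: added LOG_DIR/LOG_LEVEL/ANTHROPIC_API_KEY and related settings - .gitignore: added ignore rules for logs/ and db/ Docs: - ROADMAP.md: added Phase 4: Observability - README.md: added logging architecture/LLM configuration/project structure - CLAUDE.md: synced latest configuration/logging/MAX_RETRY(3) - CODE_GUIDE.md: added Chapter 15 on the logging system, updated architecture diagram/LLM/configuration	2026-05-19 23:40:01 +08:00
panda	70614dff5e	feat: comprehensive v2 upgrade (streaming, error KB, file upload, layout analysis) Major changes: - Streaming: unified _BaseLLM interface for LLMs (invoke + stream); generate/modify/correct nodes use get_stream_writer() for token-by-token output; UI nodes are laid out flat, expanded, and collapse automatically - Prompt externalization: 7 prompts split into prompts/*.md, loader.py supports hot reload - Self-growing error KB: backend/error_kb.py with fingerprint deduplication + ChromaDB persistence; entries are stored automatically when correct_jrxml→validate passes; retrieve also searches the error KB - File upload: backend/file_parser.py with PDF/DOCX/image/text parsing, multi-file upload in the sidebar, text automatically injected into the next message - A4 template recognition: backend/layout_analyzer.py with three modes (full A4 / row-fragment modification / row-fragment creation), PaddleOCR element extraction + row grouping + JRXML section matching - Session history download: jrxml_versions version tracking + sidebar download buttons for historical versions - Preview fix: route_after_save skips the validation loop for preview/export intents - Ctrl+C fix: injected JS intercepts the bare c key that makes Streamlit clear its cache Docs: CLAUDE.md (full project documentation), ROADMAP.md (improvement roadmap) Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-19 15:02:53 +08:00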
panda	b280c2b453	feat: integrate RAG rag_jrxml submodule and fix Anthropic API key Add rag submodule for semantic JRXML chunk retrieval, refactor retrieve node to use RAGSearcher, and fix missing api_key in Anthropic SDK client initialization. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-19 09:42:57 +08:00
panda	664de945f1	fix: use raw Anthropic SDK for MiniMax with NO_PROXY workaround The langchain-anthropic wrapper fails auth with MiniMax because it sends an api_key that conflicts with ANTHROPIC_AUTH_TOKEN at the SDK level, causing the request to be sent with incorrect auth headers. Use raw Anthropic SDK directly with a simple MiniMaxLLM wrapper class instead. Root cause: MiniMax requires the API key ONLY via ANTHROPIC_AUTH_TOKEN (system env), not via api_key parameter or OPENAI_API_KEY. Setting os.environ["NO_PROXY"]="*" is also needed to prevent httpx from using a proxy that interferes with the auth header. Note: E2E testing with streamlit run app.py still pending.	2026-05-15 00:35:41 +08:00
panda	76f98a7aeb	feat: add Anthropic API provider support and missing env vars - Add LLM_PROVIDER env var (openai/anthropic) to switch cloud backend - Use ChatAnthropic for anthropic provider with custom base_url - Add CONTEXT_MAX_TOKENS, CONTEXT_KEEP_RECENT, SESSIONS_DIR, HISTORY_MAX_SNAPSHOTS to .env and .env.example - Add langchain-anthropic dependency to requirements.txt Note: E2E testing blocked — the configured MiniMax API key (sk-cp-...) returns 401 across all endpoints (Anthropic and OpenAI). The API key may be expired or lack text-generation model access.	2026-05-14 23:39:00 +08:00
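panda	21a5fdf930	feat: backend infrastructure (LLM factory / embedding factory / validation client / session persistence) - backend/llm.py: supports switching between OpenAI-compatible APIs and local Ollama models - backend/embeddings.py: supports cloud and local embedding models (sentence-transformers) - backend/validation.py: HTTP client for the FastAPI validation service - backend/session.py: JSON-file session management (create/load/save/list/delete) - .env.example: complete environment variable template - requirements.txt: declares all Python dependencies	2026-05-14 23:20:56 +08:00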

10 Commits
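
Commits 4e14334030 and 1210b926c3 describe making the JRXML field regex and root-tag detection namespace-prefix aware (for example `ns0:jasperReport`). The sketch below illustrates that idea only; it is not the code in agent/nodes.py. The names `FIELD_TAG`, `ROOT_OPEN`, `ROOT_CLOSE`, `count_fields`, and `looks_truncated` are hypothetical, and a `\b` is added to the field pattern here as an assumption so that tags such as `<fieldDescription` are not counted.

```python
import re

# Pattern from the commit message (<[\w:]*field), with a trailing \b added
# in this sketch so that <fieldDescription is not counted as a field.
FIELD_TAG = re.compile(r"<[\w:]*field\b")

# Hypothetical root-tag patterns that accept an optional namespace prefix,
# e.g. <jasperReport ...>, <ns:jasperReport ...>, <ns0:jasperReport ...>.
ROOT_OPEN = re.compile(r"<(?:\w+:)?jasperReport\b")
ROOT_CLOSE = re.compile(r"</(?:\w+:)?jasperReport\s*>")


def count_fields(jrxml: str) -> int:
    """Count <field> declarations, with or without a namespace prefix."""
    return len(FIELD_TAG.findall(jrxml))


def looks_truncated(jrxml: str) -> bool:
    """Return True when a root tag is opened but its end tag is missing."""
    return bool(ROOT_OPEN.search(jrxml)) and not ROOT_CLOSE.search(jrxml)


if __name__ == "__main__":
    sample = '<ns0:jasperReport><ns0:field name="a"/><field name="b"/>'
    print(count_fields(sample), looks_truncated(sample))
```

A check like `looks_truncated` is one plausible way to decide whether another continuation round is needed; the actual trigger used by `_generate_with_continuation()` is not shown in the log.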