feat: v4 multimodal chat input, multi-format support, and annotation detection

- Replace st.chat_input with st-multimodal-chatinput (Ctrl+V paste, drag-drop, file button)
- Extract _process_uploaded_file() shared handler (eliminates ~70 duplicated lines)
- Add XLSX (openpyxl), XLS (xlrd), DOC (olefile) parsers to file_parser.py
- Add backend/annotation_detector.py: circle detection (HoughCircles) + arrow detection (HoughLinesP clustering) + OCR correlation + LLM context formatting
- Add annotation_result field to AgentState with session persistence
- Wire annotation detection into process_input and _format_ocr_context
- Add 11 new tests: 7 annotation detector + 4 multi-format parser
- Update all docs: CLAUDE.md, README.md, CODE_GUIDE.md, ROADMAP.md

This commit is contained in:

panda

2026-05-20 23:43:16 +08:00

parent c9f003e1b7

commit 9bb011e429

16 changed files with 1257 additions and 164 deletions

prompts/modification.md

View File

@@ -8,6 +8,8 @@
 - 如果添加新字段，正确声明它们。
 - 确保 <queryString> 是 <![CDATA[...]]> 中有效的 SQL。
 {ocr_context}
 当前 JRXML：
 {current_jrxml}