ragflow

mirror of https://github.com/infiniflow/ragflow.git synced 2026-06-29 15:31:05 +08:00

Author	SHA1	Message	Date
Hz_	e35860ad74	feat(go-api): Align document metadata batch APIs and upload_info with Python (#16269 ) ## Summary Align the Go implementations of these APIs with the Python behavior: - `POST /api/v1/datasets/:dataset_id/metadata/update` - `PATCH /api/v1/datasets/:dataset_id/documents/metadatas` - `POST /api/v1/documents/upload` ## What changed - Added the Go routes and handlers for the 3 APIs. - Aligned batch document metadata updates with Python semantics: - support `match` in update items - support list append / replace behavior - support deleting specific list values - remove metadata entirely when it becomes empty - create metadata for documents that previously had none when updates apply - count `updated` only when a document actually changes - Aligned `documents/upload` file uploads with Python-style `upload_info` behavior: - store upload-info blobs in the per-user downloads bucket - return lightweight upload descriptors instead of normal file-management responses - Improved URL upload behavior: - SSRF-guarded fetch with redirect validation - redirect limit aligned to Python behavior - normalize filename and MIME type - add `.pdf` when the fetched content is PDF - normalize HTML content into readable text instead of storing raw HTML shells ## Validation ### Unit tests Passed: - `go test ./internal/service` - `go test ./internal/handler` Also verified targeted cases for: - batch metadata update semantics - upload_info URL handling - upload_info download bucket behavior ### curl checks Verified the new Go endpoints with `curl` and compared the response shape and behavior with Python for: - `POST /api/v1/datasets/{dataset_id}/metadata/update` - `PATCH /api/v1/datasets/{dataset_id}/documents/metadatas` - `POST /api/v1/documents/upload` The Go responses were checked against Python for: - argument validation - success response shape - metadata update results - upload_info result structure - file vs URL input handling	2026-06-24 14:52:47 +08:00
Haruko386	97718ec779	feat[Go] implement datasets/<dataset_id>/<index_type> DELETE (#16257 ) ### What problem does this PR solve? As title ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2026-06-24 14:47:55 +08:00
Hz_	368db6fa58	feat(go-api): migrate datasets tags aggregation API to Go (#16181 ) ### Description Migrates the datasets tags aggregation API `GET /api/v1/datasets/tags/aggregation` from Python to Go. ### Changes - Registered the `GET /api/v1/datasets/tags/aggregation` route. - Implemented `AggregateTags` in datasets `handler` and `service`. - Added handler and service `unit tests`. ### Test Verification - Verified by comparing results between Python (9380) and Go (9384) services. - Tested scenarios: single dataset, multiple datasets, empty parameters, and unauthorized/invalid IDs. - All tests and Go `unit tests` passed.	2026-06-24 14:42:10 +08:00
Jin Hai	e615e4faab	Go CLI: fix mode switch (#16294 ) ### What problem does this PR solve? ``` RAGFlow(api/default)> add admin host '127.0.0.1:9383'; SUCCESS RAGFlow(api/default)> use admin; SUCCESS RAGFlow(admin)> delete api 'default'; SUCCESS RAGFlow(admin)> delete api 'default'; CLI error: api server: default not found RAGFlow(admin)> add api 'default' host '127.0.0.1:9384'; SUCCESS RAGFlow(admin)> use api 'default'; SUCCESS RAGFlow(api/default)> delete admin SUCCESS RAGFlow(api/default)> delete admin; CLI error: admin server not exists RAGFlow(api/default)> list api server; +------------+---------------+-----------------+---------+ \| api_server \| api_server_ip \| api_server_port \| auth \| +------------+---------------+-----------------+---------+ \| default \| 127.0.0.1 \| 9384 \| no auth \| +------------+---------------+-----------------+---------+ RAGFlow(api/default)> add admin host '127.0.0.1:9383'; SUCCESS RAGFlow(api/default)> show admin server; +-------------------+-----------+ \| field \| value \| +-------------------+-----------+ \| admin_server_ip \| 127.0.0.1 \| \| admin_server_port \| 9383 \| \| auth \| no auth \| +-------------------+-----------+ ``` ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-06-24 13:41:01 +08:00
Jin Hai	919f596066	Fix release (#16278 ) ### What problem does this PR solve? Fix release ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) --------- Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-06-23 22:04:34 +08:00
Rander	017adf841f	fix(paddleocr): support PP-OCRv6 ocrResults fallback and integrate image parsing (#16150 ) ## Summary This PR fixes two issues discovered during testing of the PaddleOCR async API refactoring: ### 1. PP-OCRv6 returns `ocrResults` instead of `layoutParsingResults` Models like PP-OCRv6 are pure text recognition models that return results in `ocrResults.prunedResult.rec_texts` format rather than the `layoutParsingResults.prunedResult.parsing_res_list` format used by layout-aware models (PaddleOCR-VL series). Changes: - `deepdoc/parser/paddleocr_parser.py`: Extract `ocrResults` alongside `layoutParsingResults` in `_send_request()`, add fallback logic in `_transfer_to_sections()` and `parse_image()` - `internal/entity/models/paddleocr.go`: Add `ocrResults` struct and fallback extraction in Go OCR handler ### 2. Image parsing not integrated into picture chunker The `parse_image()` method existed in PaddleOCRParser but was never called from `rag/app/picture.py` (the module that handles image file uploads). Users configuring PaddleOCR as their layout recognizer would still get local deepdoc OCR for images. Changes: - `rag/app/picture.py`: When `layout_recognize` is set to PaddleOCR, use `PaddleOCROcrModel.parse_image()` instead of local OCR. Falls back gracefully to local OCR on failure. ## Testing Verified end-to-end in Docker: - PaddleOCR-VL-1.6 PDF parsing: ✅ (10 text blocks with bbox) - PaddleOCR-VL-1.6 image parsing: ✅ (219 chars) - PP-OCRv6 PDF parsing with ocrResults fallback: ✅ (10 text blocks) - PP-OCRv6 image parsing with ocrResults fallback: ✅ (136 chars) ## Related PRs - #15967 (merged) - PaddleOCR async Job API refactoring + new models - #16086 (merged) - PaddleOCR image parsing support	2026-06-23 22:02:54 +08:00
Yingfeng	706e0d2d06	Refactor harness framework (#16271 ) ### What problem does this PR solve? - Tools management - Pregel engine wrapper for better usage - UT race - Coding style ### Type of change - [x] Refactoring	2026-06-23 20:18:04 +08:00
Jin Hai	4f02ba4cf4	Go: show model and list all models (#16272 ) ### What problem does this PR solve? ``` RAGFlow(admin)> show model 'abc'; +------------+----------------------------------------------------------------+ \| field \| value \| +------------+----------------------------------------------------------------+ \| command \| get_model_by_model_name \| \| error \| 'get model by model name' is implemented in enterprise edition \| \| model_name \| abc \| +------------+----------------------------------------------------------------+ RAGFlow(admin)> list models; +-----------------+--------------------------------------------------------+ \| command \| error \| +-----------------+--------------------------------------------------------+ \| list_all_models \| 'list all models' is implemented in enterprise edition \| +-----------------+--------------------------------------------------------+ ``` ### Type of change - [x] Refactoring Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-06-23 19:29:06 +08:00
Jin Hai	49714865c1	Go: rename ragflow_cli to ragflow-cli (#16270 ) ### What problem does this PR solve? rename ragflow cli binary ### Type of change - [x] Refactoring Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-06-23 19:20:49 +08:00
Haruko386	d89e29fba8	Document[Go-develop]: update Go development docs (#16229 ) ### What problem does this PR solve? Document updated: ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-06-23 19:19:44 +08:00
Haruko386	5046626c17	feat[Go]: implement /datasets/<dataset_id>/documents/batch-update-status (#16258 ) ### What problem does this PR solve? accident close #16072 As title ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2026-06-23 19:19:08 +08:00
Haruko386	6cbd069ea3	feat[Go]: implement <document_id>/chunks/<chunk_id> PATCH (#16232 ) ### What problem does this PR solve? Implement: 1. `/api/v1/datasets/<dataset_id>/documents/<document_id>/chunks GET` 2. `/api/v1/datasets/<dataset_id>/documents/<document_id>/chunks/<chunk_id> PATCH` 3. `/api/v1/datasets/<dataset_id>/documents/<document_id>/chunks PATCH` ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2026-06-23 18:50:36 +08:00
maoyifeng	643cb4788f	Go CLI: add response output (#16263 ) ### What problem does this PR solve? Go CLI: add response output	2026-06-23 18:12:15 +08:00
Jin Hai	e15130534f	Go: default public key (#16265 ) ### What problem does this PR solve? Provider default public key for CLI ### Type of change - [x] Refactoring Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-06-23 17:43:26 +08:00
Jin Hai	dec2ce4a60	Go CLI: admin model framework (#16252 )	2026-06-23 16:57:05 +08:00
Zhichang Yu	2362210caf	refactor(log): unify Go logging to zap with rotation, strip per-package levels (#16261 ) Refactor the Go agent port's logging so every log line — gin access, agent canvas events, harness warnings, fatal boot errors — flows through a single common.Logger (zap) backed by a rotated file, with structured fields, level filtering, and configurable rotation. --------- Co-authored-by: Claude <noreply@anthropic.com>	2026-06-23 16:21:46 +08:00
Jin Hai	b661e9c19e	Go CLI: admin list providers (#16243 ) ### What problem does this PR solve? ``` RAGFlow(admin)> list providers; +----------------------+-------------------------------------------------------------+ \| command \| error \| +----------------------+-------------------------------------------------------------+ \| list_model_providers \| 'list model providers' is implemented in enterprise edition \| +----------------------+-------------------------------------------------------------+ RAGFlow(admin)> add provider 'zhipu-ai'; +-------------+-----------------------------------------------------------+ \| field \| value \| +-------------+-----------------------------------------------------------+ \| command \| add_model_provider \| \| error \| 'add model provider' is implemented in enterprise edition \| \| provider_id \| admin \| \| user_id \| zhipu-ai \| +-------------+-----------------------------------------------------------+ RAGFlow(admin)> delete provider 'zhipu-ai'; +-------------+--------------------------------------------------------------+ \| field \| value \| +-------------+--------------------------------------------------------------+ \| command \| delete_model_provider \| \| error \| 'delete model provider' is implemented in enterprise edition \| \| provider_id \| admin \| \| user_id \| zhipu-ai \| +-------------+--------------------------------------------------------------+ RAGFlow(admin)> add provider 'zhipu-ai' instance 'instance1'; +---------------+-----------------------------------------------------------+ \| field \| value \| +---------------+-----------------------------------------------------------+ \| command \| add_model_instance \| \| error \| 'add model instance' is implemented in enterprise edition \| \| instance_name \| instance1 \| \| provider_id \| zhipu-ai \| \| user_id \| admin \| +---------------+-----------------------------------------------------------+ RAGFlow(admin)> delete provider 'zhipu-ai' instance 'test' +-------------+--------------------------------------------------------------+ \| field \| value \| +-------------+--------------------------------------------------------------+ \| instances \| [test] \| \| provider_id \| zhipu-ai \| \| user_id \| admin \| \| command \| delete_model_provider \| \| error \| 'delete model instance' is implemented in enterprise edition \| +-------------+--------------------------------------------------------------+ RAGFlow(admin)> add provider 'zhipu-ai' instance 'instance1' model 'xxx'; +---------------+--------------------------------------------------+ \| field \| value \| +---------------+--------------------------------------------------+ \| command \| add_model \| \| error \| 'add model' is implemented in enterprise edition \| \| instance_name \| instance1 \| \| model_names \| [xxx] \| \| provider_id \| zhipu-ai \| \| user_id \| admin \| +---------------+--------------------------------------------------+ RAGFlow(admin)> delete provider 'zhipu-ai' instance 'test' model 'xxx'; +---------------+------------------------------------------------------+ \| field \| value \| +---------------+------------------------------------------------------+ \| command \| delete_model_provider \| \| error \| 'delete models' is implemented in enterprise edition \| \| instance_name \| test \| \| models \| [xxx] \| \| provider_id \| zhipu-ai \| \| user_id \| admin \| +---------------+------------------------------------------------------+ ``` ### Type of change - [x] New Feature (non-breaking change which adds functionality) --------- Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-06-23 10:26:31 +08:00
Zhichang Yu	06ededb26a	test(go): ensure go unit tests pass (#16241 ) ## Summary Stabilizes the Go unit-test surface so the test suite can run reliably in CI and locally via \`bash build.sh --test\`. ## Verification \`\`\`bash bash build.sh --test -- -count=10 -run TestWithCancel_SequentialAgent ./internal/harness/core/ bash build.sh --test -- -count=5 -run TestSiliconflowChatExtracts ./internal/entity/models/ bash build.sh --test # full suite \`\`\` All previously failing packages (\`admin\`, \`cli\`, \`handler\`, \`parser\`, \`router\`, \`service\`, \`service/chunk\`) now build and test successfully. \`TestWithCancel_SequentialAgent\` passes 10/10 (was flaky). SiliconFlow reasoning test passes after switching the assertion to the SiliconFlow wire format. --------- Co-authored-by: Claude <noreply@anthropic.com>	2026-06-22 20:43:29 +08:00
Jin Hai	0e6b28a7fe	Add show / set role default models (#16240 ) ### What problem does this PR solve? ``` RAGFlow(admin)> show role 'user' default models; +--------------------------+-----------------------------------------------------------------+-----------+ \| command \| error \| role_name \| +--------------------------+-----------------------------------------------------------------+-----------+ \| show_role_default_models \| 'show role default models' is implemented in enterprise edition \| user \| +--------------------------+-----------------------------------------------------------------+-----------+ RAGFlow(admin)> set role 'user' default chat 'glm4.5@test@zhipu-ai'; +------------+---------------------------------------------------------------+ \| field \| value \| +------------+---------------------------------------------------------------+ \| model_id \| \| \| model_type \| chat \| \| role_name \| user \| \| command \| set_role_default_model \| \| error \| 'set role default model' is implemented in enterprise edition \| +------------+---------------------------------------------------------------+ RAGFlow(admin)> reset role 'user' default chat; +------------+-----------------------------------------------------------------+ \| field \| value \| +------------+-----------------------------------------------------------------+ \| command \| reset_role_default_model \| \| error \| 'reset role default model' is implemented in enterprise edition \| \| model_type \| chat \| \| role_name \| user \| +------------+-----------------------------------------------------------------+ ``` --------- Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-06-22 19:03:36 +08:00
Hz_	9eb7cee473	feat(go-api): migrate searchbot share detail endpoint to go (#16124 ) ## Summary - add public Go route for `/api/v1/searchbots/detail` - implement beta-token auth flow for shared search access - add tenant-based access check for shared search apps - add joined search detail query for the share response - align Go response shape with the current Python runtime behavior - add DAO / service / handler tests for the new endpoint	2026-06-22 18:17:37 +08:00
Hz_	2856cde2d1	feat(go-api): Implement BulkDeleteChats Go API and fix ListChats (#16157 ) ### Description - Bulk Delete Chats: Implemented Go endpoint `DELETE /api/v1/chats` supporting bulk delete by `ids`, `delete_all` flag, and backward-compatible `chat_id` body payload (with tenant-ownership security checks). - Bug Fix: Fixed a parameter swap in Go `ListChats` handler to properly exclude soft-deleted chats.	2026-06-22 18:16:52 +08:00
Hz_	4e0db3053d	feat(go-api): complete chat channel API migration with tests (#16139 ) close #16132 ## Summary This PR completes the Go-side merge and cleanup for chat channel APIs, including handler/service wiring, route registration, and test coverage. Implemented and aligned 5 chat channel APIs: ``` - POST `/api/v1/chat-channels` - GET `/api/v1/chat-channels` - GET `/api/v1/chat-channels/:channel_id` - PATCH `/api/v1/chat-channels/:channel_id` - DELETE `/api/v1/chat-channels/:channel_id` ``` Co-authored-by: Haruko386 <tryeverypossible@163.com>	2026-06-22 18:16:15 +08:00
Haruko386	02cc1d6438	fix: unable to chat after set model (#16195 ) ### What problem does this PR solve? ``` fixed: RAGFlow(api/default)> use model 'minimax-m2.5@test@minimax' SUCCESS RAGFlow(api/default)> chat message 'who r u' Answer: Hey! I'm MiniMax-M2.5, an AI assistant here to help you with questions, tasks, or whatever you need. What can I do for you? Time: 1.727263 ``` ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-06-22 18:14:58 +08:00
Haruko386	b777e50291	feat[Go]: implement api /api/v1/datasets/<dataset_id>/chunks POST (#16067 ) ### What problem does this PR solve? As title ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2026-06-22 18:14:01 +08:00
Jin Hai	05e758e4fe	Go CLI: Fix alter role (#16226 ) ### What problem does this PR solve? As title. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) --------- Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-06-22 17:33:47 +08:00
Zhichang Yu	3f805a64f1	feat(agent): align Go agent behavior with Python (except retrieval component) (#16225 ) ## Summary Aligns the Go agent runtime/canvas/components/tools behavior with the Python `agent/` implementation so the same stored canvas DSL produces the same execution result on either side. Every component, tool, and runtime primitive in `internal/agent/` is now driven by the same semantics as its Python counterpart — variable resolution, template substitution, control flow, error reporting, retry/cancel, and stream event shapes. The retrieval component is the one explicit exception in this PR. It is being reworked in a separate change and is excluded from this alignment pass; the wrapper slot (`universe_a_wrappers.go → newRetrievalComponent`) is preserved. ## Scope of alignment ### Components (all aligned with `agent/component/`) `Begin` · `Message` · `LLM` (incl. ChatTemplateKwargs, MessageHistoryWindowSize, VisualFiles, Cite, OutputStructure, JSONOutput, TopP, MaxRetries, DelayAfterError, credentials) · `Agent` (react + tool artifact capture + `Reset()` interface-assert) · `Switch` (12/12 operators, Python-equivalent semantics) · `Categorize` · `Invoke` · `Iteration` · `Loop` (macro-expansion through `workflowx.AddLoopNode`) · `UserFillUp` (Python-equivalent interrupt/resume via eino `compose.Interrupt`/`ResumeWithData`) · `FillUp` · `DataOperations` · `ListOperations` · `StringTransform` · `VariableAggregator` · `VariableAssigner` · `Browser` (full stagehand runtime parity) · `DocsGenerator` · `ExcelProcessor`. ### Tools (all aligned with `agent/tools/`) `Retrieval` (wrapper slot only — logic out of scope) · `MCPToolAdapter` (streamable-HTTP) · `CodeExec` (sandbox bridge with `code_exec_contract.go` matching Python contract) · `AkShare` · `ArXiv` · `Crawler` · `DeepL` · `DuckDuckGo` · `Email` · `ExeSQL` · `GitHub` · `Google` · `GoogleScholar` · `Jin10` · `PubMed` · `QWeather` · `SearXNG` · `Tavily` · `Tushare` · `Wencai` · `Wikipedia` · `YahooFinance` — uniform `eino tool.InvokableTool` interface, SSRF protection, shared HTTP client. ### Canvas execution engine (`internal/agent/canvas/`) Aligned with Python's `agent/canvas.py`: - Scheduler (`scheduler.go`): state pre/post handlers, node lambdas, per-component timeout resolver (4-level: per-class env → per-class table → uniform env → 600s fallback), `legacyNoOpNames`. - Loop subgraph (`loop_subgraph.go`): Python-equivalent `AddLoopNode` macro expansion + condition translation. - Multibranch (`multibranch.go`): `Switch` / `Categorize` routing via `compose.NewGraphMultiBranch` — same branch selection semantics as Python. - Parallel subgraph (`parallel_subgraph.go`): matches Python's parallel fan-out contract. - Interrupt/Resume (`interrupt_resume.go`): `UserFillUpNodeBody` / `IsInterruptError` / `ExtractInterruptContexts` — replaces the deprecated Python sentinel chain with eino's native interrupt API, preserving the same external behavior. - Checkpoint (`checkpoint_store.go`): `RedisCheckPointStore` Get/Set/Delete, with business metadata (status / canvas_id / parent_run_id) on a parallel Redis Hash. - RunTracker (`run_tracker.go`): Start / MarkSucceeded / MarkFailed / MarkCancelled / AttachCheckpoint — same lifecycle as the Python run record. - Cancel (`cancel.go`): Redis pub/sub watch. - Stream (`stream.go`): SSE channel with `messages` / `waiting` / `errors` / `done` events, same shape as Python's `agent.canvas.RunEvent` payload. ### DSL bridge (`internal/agent/dsl/`) - `normalize.go`: v1↔v2 collapsed into a single wire format — Python and Go consume the same stored JSON. - `reset.go`: per-run state reset matches Python's `Canvas.reset()` semantics. - Testdata mirrors Python's `agent_msg.json` / `all.json` / etc. ### Runtime (`internal/agent/runtime/`) - `CanvasState` / `NewCanvasState` / `GetVar` / `SetVar` / `ReadVars`: same `{{cpn_id@param}}` resolution model. - `ResolveTemplate` (regex fast path + gonja fallback) — Python Jinja-style semantics. - `selector.go`, `metrics.go`, `component.go`: shared runtime contracts. ## Out of scope (intentionally) - `Retrieval` component logic — wrapped only; full parity lands in a follow-up PR. - Frontend — only minor dsl-bridge / canvas UX fixes ride along. - CLI / admin / model registry — orthogonal to agent behavior. ## How alignment is verified `internal/service/agent_run_e2e_test.go` exercises the full production chain against real Python-shaped DSL fixtures: ``` loadCanvasForUser → versionDAO.GetLatest → decodeCanvasFromDSL → canvas.Compile → cc.Workflow.Invoke → answer extraction ``` using in-memory SQLite + miniredis (no Docker). Covers: - `TestRunAgent_RealCanvas_BeginMessage` — happy path, `{{sys.query}}` resolution - `TestRunAgent_RealCanvas_WaitForUserResume` — two-run resume cycle (Python-equivalent) - `TestRunAgent_RealCanvas_CompileFails` — unknown component name → sanitized error (Python-equivalent) - `TestRunAgent_RealCanvas_InvokeFails` — unresolvable template ref (Python-equivalent) - `TestRunAgent_RunTracker_AttachCheckpoint_CallSequence` — Start→AttachCheckpoint→MarkSucceeded lifecycle `internal/handler/agent_test.go` — SSE streaming parity (`Content-Type: text/event-stream`, `data: {…}\n\n`, trailing `data: [DONE]\n\n`, OpenAI-compatible non-stream `choices`). `internal/agent/canvas/fixture_compile_test.go` + per-component tests pin the Python-equivalent outputs. ``` go test -count=1 -v -run 'TestRunAgent_RealCanvas\|TestRunAgent_RunTracker' ./internal/service/ ``` ## Design reference `docs/develop/agent-go-port-design.md` (1329 lines, last cross-checked 2026-06-17) — module layout, per-component / per-tool inventory, corner-case catalogue, and the actionable backlog (Section 14, including the retrieval alignment follow-up). --------- Co-authored-by: Claude <noreply@anthropic.com>	2026-06-22 11:58:29 +08:00
Jin Hai	760229d917	Go CLI: admin list configs (#16221 ) ### What problem does this PR solve? - list configs; ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) --------- Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-06-22 08:19:23 +08:00
Jin Hai	5039f46999	Go CLI: refactor commands (#16213 ) ### What problem does this PR solve? As title. ### Type of change - [x] Refactoring --------- Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-06-21 16:50:02 +08:00
Jin Hai	1b712be599	Go CLI: refactor some commands (#16204 ) ### What problem does this PR solve? - list resources ### Type of change - [x] New Feature (non-breaking change which adds functionality) - [x] Refactoring --------- Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-06-20 02:31:07 +08:00
Jin Hai	11499f7bb3	Go CLI: add list user commands framework (#16201 )	2026-06-19 15:09:54 +08:00
Jin Hai	7214a23614	Go: fix duplicate models (#16197 ) ### What problem does this PR solve? 1. Remove unused file 2. Remove duplicate models 3. Resort the function order ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) - [x] Refactoring Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-06-19 09:57:58 +08:00
qinling0210	563d855780	Implement OpenAI chat completions in GO (#16177 ) ### What problem does this PR solve? Implement OpenAI chat completions in GO POST /api/v1/openai/<chat_id>/chat/completions OpenAI chat cli: internal/development.md ### Type of change - [x] Refactoring	2026-06-18 18:07:27 +08:00
Haruko386	217c2a94c2	feat[Go]: implement datasets/<dataset_id>/index P/G (#16153 ) ### What problem does this PR solve? ``` POST: http://localhost:9384/api/v1/datasets/433b390c630411f1a13eab5f89540b2a/index?type=graph Output: { "code": 0, "data": { "task_id": "ff5a3546bafa49d794a9a050d99c4a52" }, "message": "success" } ``` --- ``` GET: http://localhost:9384/api/v1/datasets/433b390c630411f1a13eab5f89540b2a/index?type=graph Output: { "code": 0, "data": { "id": "ff5a3546bafa49d794a9a050d99c4a52", "doc_id": "graph_raptor_x", "from_page": 100000000, "to_page": 100000000, "task_type": "graphrag", "priority": 0, "begin_at": "2026-06-17T18:07:45+08:00", "process_duration": 4.108135, "progress": -1, "progress_msg": "18:07:45 created task graphrag\n18:07:47 Task has been received.\n18:07:49 [ERROR][Exception]: Model config not found: Qwen/Qwen3-235B-A22B@test@SILICONFLOW", "retry_count": 1, "digest": "f16fd067d5c92cec", "create_time": 1781690865552, "create_date": "2026-06-17T18:07:45+08:00", "update_time": 1781690869108, "update_date": "2026-06-17T18:07:49+08:00" }, "message": "success" } ``` ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2026-06-18 17:57:24 +08:00
Haruko386	5f6ebc97c6	feat[go]: implement /api/v1/datasets/<dataset_id> PUT (#16122 ) ### What problem does this PR solve? As pic shows ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2026-06-18 17:57:07 +08:00
Haruko386	6beae949d8	feat[Go]: add modelID for delete_model and update_status (#16025 ) ### What problem does this PR solve? 1. add modelID for delete_model and update_status 2. fix the bug when update-status delete model ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) - [x] New Feature (non-breaking change which adds functionality)	2026-06-18 17:56:51 +08:00
Jin Hai	3eb49ca7f8	Go: add command, list, remove, stop tasks (#16190 ) ### What problem does this PR solve? ``` RAGFlow(admin)> stop user 'abc' ingestion tasks; +-----------------------------------+-------+--------------------------------------------------------------------------+-------+ \| command \| email \| error \| tasks \| +-----------------------------------+-------+--------------------------------------------------------------------------+-------+ \| stop_ingestion_tasks_by_condition \| abc \| 'Stop ingestion tasks by condition' is implemented in enterprise edition \| \| +-----------------------------------+-------+--------------------------------------------------------------------------+-------+ RAGFlow(admin)> stop user 'abc' ingestion tasks 'created; +-----------------------------------+-------+--------------------------------------------------------------------------+----------+-------+ \| command \| email \| error \| status \| tasks \| +-----------------------------------+-------+--------------------------------------------------------------------------+----------+-------+ \| stop_ingestion_tasks_by_condition \| abc \| 'Stop ingestion tasks by condition' is implemented in enterprise edition \| created; \| \| +-----------------------------------+-------+--------------------------------------------------------------------------+----------+-------+ RAGFlow(admin)> stop user 'abc' ingestion tasks 'create'; +-----------------------------------+-------+--------------------------------------------------------------------------+--------+-------+ \| command \| email \| error \| status \| tasks \| +-----------------------------------+-------+--------------------------------------------------------------------------+--------+-------+ \| stop_ingestion_tasks_by_condition \| abc \| 'Stop ingestion tasks by condition' is implemented in enterprise edition \| create \| \| +-----------------------------------+-------+--------------------------------------------------------------------------+--------+-------+ RAGFlow(admin)> remove user 'abc' ingestion tasks 'create'; +-------------------------------------+-------+----------------------------------------------------------------------------+--------+-------+ \| command \| email \| error \| status \| tasks \| +-------------------------------------+-------+----------------------------------------------------------------------------+--------+-------+ \| remove_ingestion_tasks_by_condition \| abc \| 'Remove ingestion tasks by condition' is implemented in enterprise edition \| create \| \| +-------------------------------------+-------+----------------------------------------------------------------------------+--------+-------+ RAGFlow(admin)> remove user 'abc' ingestion tasks; +-------------------------------------+-------+----------------------------------------------------------------------------+-------+ \| command \| email \| error \| tasks \| +-------------------------------------+-------+----------------------------------------------------------------------------+-------+ \| remove_ingestion_tasks_by_condition \| abc \| 'Remove ingestion tasks by condition' is implemented in enterprise edition \| \| +-------------------------------------+-------+----------------------------------------------------------------------------+-------+ ``` ### Type of change - [x] New Feature (non-breaking change which adds functionality) --------- Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-06-18 17:50:21 +08:00
Jin Hai	5eedd13d49	Go: add command, show tasks summary (#16187 ) ### What problem does this PR solve? RAGFlow(admin)> show tasks summary; +---------+-----------------------------------------------------------------+ \| field \| value \| +---------+-----------------------------------------------------------------+ \| command \| show_users_quota_summary \| \| error \| 'Show users quota summary' is implemented in enterprise edition \| +---------+-----------------------------------------------------------------+ ### Type of change - [x] New Feature (non-breaking change which adds functionality) Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-06-18 17:09:20 +08:00
Jin Hai	20d11648a4	Go: add statistics command (#16119 ) ### What problem does this PR solve? Prepare for enterprise command ### Type of change - [x] New Feature (non-breaking change which adds functionality) --------- Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-06-18 15:21:44 +08:00
Haruko386	27d723e13a	fix: fix some bugs in check_conn and drop_inst (#16180 ) ### What problem does this PR solve? As title: ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-06-18 14:19:46 +08:00
Hz_	69dbc44983	feat(go-api): migrate MCP server detail and download API to Go (#16113 ) ### What problem does this PR solve? - Migrated MCP server detail and export (download) API from Python to Go. - Registered route: `GET /api/v1/mcp/servers/:mcp_id` (supporting `?mode=download` query parameter).	2026-06-18 11:09:22 +08:00
Hz_	f59332bc37	feat(go-api): implement Go-side document PATCH API & align parsing/metadata sync behavior (#15975 ) ### What problem does this PR solve? This PR implements the Go backend counterpart for the document partial update API: `PATCH /api/v1/datasets/:dataset_id/documents/:document_id` ### Type of change - [x] New Feature (non-breaking change which adds functionality) - [x] Refactoring	2026-06-18 11:08:47 +08:00
Hz_	065797b047	Refactor(go-cli): improve variable and label naming in CLI parseAddModel (#16145 ) ### What problem does this PR solve? This PR improves code readability in the CLI parser by renaming the loop index `i` to `modelIndex`. It also renames the loop label `A` to `optionsLoop` to align with standard Go naming conventions. ### Type of change - [x] Refactoring	2026-06-17 20:21:42 +08:00
BitToby	2ab9256e8a	fix(go): correct OpenRouter streaming URL routing and reasoning parameter (#16111 ) ### What problem does this PR solve? Fixes two bugs in the OpenRouter streaming chat request builder (`internal/entity/models/openrouter.go`, `ChatStreamlyWithSender`): 1. qwen/glm models streamed to a broken URL. The code routed any `qwen`/`glm` model to `URLSuffix.AsyncChat`, but `conf/models/openrouter.json` defines no `async_chat` suffix (empty), so the request was POSTed to `<base>/` instead of `<base>/chat/completions` — breaking streaming for every qwen/glm model. The non-stream path has no such branch. Fix: all models use the standard `Chat` suffix, consistent with the non-stream path. 2. Streaming reasoning was never enabled. The request set reasoning via a non-standard `thinking` key, which OpenRouter ignores. OpenRouter's API — and this provider's own non-stream request (line ~110) and its streamed `delta.reasoning` parser (line ~311) — use the `reasoning` object. Fix: send `reasoning: {"enabled": <thinking>}` (and `{"effort": ...}` when set, taking precedence as in the non-stream path). Closes #16110 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-06-17 19:14:13 +08:00
Hunnyboy1217	e178c81bb4	refactor(go-models): harden Ollama ListModels and route through ParseListModel (#15853 ) (#15955 ) ### What problem does this PR solve? Part of #15853 (provider model-list refactor). Refactors Ollama `ListModels` onto the shared `ParseListModel` pattern and fixes two correctness issues: - Endpoint: switch the models suffix from `api/ps` (only currently-running models) to `api/tags` (all installed models) — the latter is what a model picker should show. - Parsing: Ollama returns `{"models":[{"name","model"}]}`, a non-OpenAI shape. Decode it into a typed struct, map the names into `ModelList`, then enrich through `ParseListModel`. This removes the previous unchecked type assertions (`result["models"].([]interface{})` / `.(map[string]interface{})` / `.(string)`) that panicked when the body was missing the `models` array or any field, and adds a fallback to the `model` field when `name` is blank. - Drops the no-op GET request body and a dead base-URL reassignment. #### Drive-by fix Shared gitee_test.go `DSModelList` -> `ModelList` compile fix (renamed in #15900) so the models test package builds; auto-resolves against the sibling #15853 PRs. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) - [x] Refactoring	2026-06-17 18:47:27 +08:00
Zhichang Yu	e45659868a	feat(agent): ship the Go agent canvas port — eino interrupt/resume + Redis check-pointing (#16035 ) Replaces the Python agent canvas runtime with a Go implementation that runs inside `cmd/server_main`. The canvas compiles into an eino Workflow that pauses on wait-for-user via native Interrupt/Resume (no sentinel flag) and resumes from a Redis-backed CheckPointStore. All 21 Python agent components and ~35 tools are ported with functional parity. Sandbox providers now read their JSON config from the admin-panel system_settings table with env fallback. 234 files / +35,413 / -6,111. All Go files are gofmt-clean (CI gate added); drops the v2 DSL E2E step and the gap-analysis plan (both redundant after the port ships). ## Type of change - [x] Refactoring - [x] New feature - [x] Bug fix 🤖 Generated with [Claude Code](https://claude.com/claude-code) --------- Co-authored-by: Claude <noreply@anthropic.com>	2026-06-17 13:24:03 +08:00
euvre	9bd53ce675	fix: return full record in get_ingestion_log (#16120 ) ### What problem does this PR solve? The `get_ingestion_log` endpoint (both Python `dataset_api_service.get_ingestion_log` and Go `DatasetService.GetIngestionLog`) was returning only the dataset-level field set, which omits critical fields such as `dsl`, `document_id`, `parser_id`, `document_name`, `pipeline_id`, etc. This caused the front-end dataflow-result page to be unable to render the pipeline timeline and chunks when viewing a single ingestion log, regardless of whether the log was a dataset-level operation (graph/raptor/mindmap) or a per-file parse. ### Background `PipelineOperationLogService` provides two field sets: \| Method \| Fields \| \|---\|---\| \| `get_dataset_logs_fields` \| Minimal set (progress, status, timestamps, etc.) \| \| `get_file_logs_fields` \| Superset — includes `document_id`, `dsl`, `parser_id`, `document_name`, `pipeline_id`, … \| When listing logs, the API correctly distinguishes dataset-level vs file-level logs and uses the appropriate converter. However, when fetching a single log by ID, both the Python and Go implementations were hardcoded to the dataset-level set, dropping the extra fields that the front-end needs.	2026-06-17 13:03:51 +08:00
Hunnyboy1217	fd196f694e	feat(go-models): harden ListModels for FishAudio (#15853 ) (#15957 ) ### What problem does this PR solve? Part of #15853 (provider model-list refactor). Final two providers. - voyage: Voyage AI exposes no live model-list endpoint — its public API only has `/v1/embeddings` and `/v1/rerank` — so the previous `ListModels` was a `no such method` stub. Replace it with a static-catalog listing sourced from the loaded provider definition, carrying each model's `max_tokens`, `model_types`, and embedding `dimensions`. `list models from voyage` now returns the 13-model catalog instead of erroring. - fishaudio: route the existing `/model` voice listing through the shared `ParseListModel` helper for consistency; keep the human-readable `title` as the model name and fall back to `_id` when a title is blank. #### Drive-by fix Shared gitee_test.go `DSModelList` -> `ModelList` compile fix (renamed in #15900); auto-resolves against the sibling #15853 PRs. ### Type of change - [x] New Feature (non-breaking change which adds functionality) - [x] Refactoring Co-authored-by: Haruko386 <tryeverypossible@163.com>	2026-06-17 11:56:20 +08:00
Hz_	b48f03d0f5	feat(go/dao): migrate chat channel database entity and DAO to Go (#16055 ) ## Changes 1. Entity (`internal/entity/chat_channel.go`): - Implemented `ChatChannel` struct mapping the `chat_channel` database table. - Declared `ChatChannelListResponse` as a DTO to filter out sensitive credentials (`config` field) and fetch the associated `dialog_name` via left join. 2. GORM Migration (`internal/dao/database.go`): - Registered `&entity.ChatChannel{}` in the `dataModels` array inside `InitDB()` to enable safe GORM schema synchronization. 3. DAO (`internal/dao/chat_channel.go`): - Implemented `ChatChannelDAO` wrapping GORM CRUD methods (`Create`, `GetByID`, `UpdateByID`, `DeleteByID`). - Implemented `ListByTenantID` performing a `LEFT JOIN` on the `dialog` table to retrieve `dialog_name` while excluding `config` values to avoid credential leaks. 4. Test (`internal/dao/chat_channel_test.go`): - Added integration unit tests testing the full CRUD lifecycle and GORM left-join mapping list querying.	2026-06-17 11:26:13 +08:00
Jin Hai	6865039a22	Go: add more start server parameters (#16093 ) ### What problem does this PR solve? ``` $ ./bin/ragflow_server --version RAGFlow version: v0.26.0-65-g549f6109c $ ./bin/ragflow_server --debug # start server with debug log level $ ./bin/admin_server --version RAGFlow version: v0.26.0-65-g549f6109c $ ./bin/admin_server --debug # start server with debug log level $ ./bin/admin_server --init-superuser # init default superuser $ ./bin/ingestor --version RAGFlow version: v0.26.0-68-g6f6c39706 $ ./bin/ingestor --debug ``` ### Type of change - [x] New Feature (non-breaking change which adds functionality) --------- Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-06-16 20:27:37 +08:00
Rander	1235da7093	refactor(paddleocr): migrate from sync API to async Job API (#15967 ) ## Summary Migrate PaddleOCR integration from the deprecated synchronous HTTP API to the new asynchronous Job API (`submit → poll → fetch`), aligning with PaddleOCR 3.6.0+ architecture. ## Changes ### Python (`deepdoc/parser/paddleocr_parser.py`) - Replace synchronous `requests.post()` with async Job API flow (submit → poll → fetch) - Authentication: `token {token}` → `Bearer {token}` - File transfer: base64 JSON body → multipart file upload - Polling: exponential backoff (initial 3s, ×1.5, max 15s, timeout controlled by `request_timeout`) - Result: fetch full JSONL from result URL, preserving `prunedResult` with bbox info for crop functionality - Rename `api_url` → `base_url` (backward compatible: `api_url` still accepted as fallback) ### Python (`rag/llm/ocr_model.py`) - Prefer `paddleocr_base_url` / `PADDLEOCR_BASE_URL`, fallback to `paddleocr_api_url` / `PADDLEOCR_API_URL` ### Go (`internal/entity/models/paddleocr.go`) - Add `Client-Platform: ragflow` header to submit and poll requests - Change polling from fixed 3s to exponential backoff (initial 3s, ×1.5, max 15s) ### Python (`common/constants.py`) - Add `PADDLEOCR_BASE_URL` to env keys and default config ## Backward Compatibility - Old env var `PADDLEOCR_API_URL` still works (used as fallback) - Frontend field `paddleocr_api_url` still works (backend reads it as fallback) - No user-facing configuration changes required for existing setups ## Why not use the `paddleocr` SDK package directly? RAGFlow's `_transfer_to_sections()` relies on `prunedResult` (containing `block_bbox`, `block_label`, `parsing_res_list`) from the raw API response for PDF crop functionality. The SDK's public `parse_document()` API only returns `DocParsingResult` with `markdown_text`, discarding the bbox data. Therefore we implement the async Job API flow directly via HTTP, following the same logic as the SDK internally.	2026-06-16 19:34:21 +08:00

1 2 3 4 5 ...

447 Commits