ragflow

mirror of https://github.com/infiniflow/ragflow.git synced 2026-06-29 15:31:05 +08:00

Author	SHA1	Message	Date
Harsh Kashyap	b4a8a90c73	fix(rag/raptor): handle max_cluster edge case in GMM cluster selection (#16199 ) ### What problem does this PR solve? `_get_optimal_clusters` in `rag/raptor.py` had two edge-case issues in GMM cluster-count selection: 1. It used `np.arange(1, max_clusters)`, which never evaluates the upper-bound candidate (`max_clusters`). 2. When effective `max_clusters` becomes `1`, the candidate list was empty and `argmin` crashed. This PR makes candidate evaluation inclusive (`1..max_clusters`) and guards the single-cluster case by returning `1` directly. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) ### Validation - `pytest test/unit_test/rag/test_raptor_psi_tree_builder.py --config-file pyproject.toml -q` - `ruff check rag/raptor.py test/unit_test/rag/test_raptor_psi_tree_builder.py` ### Tests added - Regression test for `max_cluster == 1` path (no crash, returns 1) - Regression test verifying upper-bound candidate is evaluated and can be selected _AI-assistance disclosure: parts of this change (bug triage and test scaffolding) were drafted with AI assistance and fully reviewed and verified by me._ --------- Co-authored-by: Harsh Kashyap <harshkashyap@Harshs-MacBook-Pro.local> Co-authored-by: Cursor <cursoragent@cursor.com>	2026-06-23 21:07:26 +08:00
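The two edge cases described in commit b4a8a90c73 above can be illustrated with a minimal sketch. This is not the code from `rag/raptor.py`: `pick_cluster_count` and the `score` callable (standing in for a per-candidate GMM criterion such as BIC) are hypothetical names chosen only for illustration.

```python
from typing import Callable


def pick_cluster_count(max_clusters: int, score: Callable[[int], float]) -> int:
    """Return the candidate in 1..max_clusters with the lowest score.

    `score` is a hypothetical stand-in for fitting a GMM with `n` components
    and returning a criterion to minimise (for example BIC).
    """
    # Guard: with a single possible cluster there is nothing to compare,
    # and an empty candidate list would make an argmin-style call fail.
    if max_clusters <= 1:
        return 1
    # Inclusive upper bound, so `max_clusters` itself is a candidate.
    candidates = range(1, max_clusters + 1)
    return min(candidates, key=score)


# Toy criterion whose minimum sits exactly at the upper bound.
best = pick_cluster_count(4, score=lambda n: abs(n - 4))
# Single-cluster case: returns 1 without evaluating any candidate.
single = pick_cluster_count(1, score=lambda n: 0.0)
```

With an exclusive range such as `range(1, max_clusters)`, the first call could never return 4 and the second would have no candidates at all, which is the behaviour the commit message describes.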
Yingfeng	706e0d2d06	Refactor harness framework (#16271 ) ### What problem does this PR solve? - Tools management - Pregel engine wrapper for better usage - UT race - Coding style ### Type of change - [x] Refactoring	2026-06-23 20:18:04 +08:00
Jin Hai	4f02ba4cf4	Go: show model and list all models (#16272 ) ### What problem does this PR solve?
```
RAGFlow(admin)> show model 'abc';
+------------+----------------------------------------------------------------+
| field      | value                                                          |
+------------+----------------------------------------------------------------+
| command    | get_model_by_model_name                                        |
| error      | 'get model by model name' is implemented in enterprise edition |
| model_name | abc                                                            |
+------------+----------------------------------------------------------------+
RAGFlow(admin)> list models;
+-----------------+--------------------------------------------------------+
| command         | error                                                  |
+-----------------+--------------------------------------------------------+
| list_all_models | 'list all models' is implemented in enterprise edition |
+-----------------+--------------------------------------------------------+
```
### Type of change - [x] Refactoring Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-06-23 19:29:06 +08:00
Jin Hai	49714865c1	Go: rename ragflow_cli to ragflow-cli (#16270 ) ### What problem does this PR solve? rename ragflow cli binary ### Type of change - [x] Refactoring Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-06-23 19:20:49 +08:00
Haruko386	d89e29fba8	Document[Go-develop]: update Go development docs (#16229 ) ### What problem does this PR solve? Document updated: ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-06-23 19:19:44 +08:00
Haruko386	5046626c17	feat[Go]: implement /datasets/<dataset_id>/documents/batch-update-status (#16258 ) ### What problem does this PR solve? Replaces #16072, which was closed by accident. As the title says. ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2026-06-23 19:19:08 +08:00
Haruko386	6cbd069ea3	feat[Go]: implement <document_id>/chunks/<chunk_id> PATCH (#16232 ) ### What problem does this PR solve? Implement: 1. `/api/v1/datasets/<dataset_id>/documents/<document_id>/chunks GET` 2. `/api/v1/datasets/<dataset_id>/documents/<document_id>/chunks/<chunk_id> PATCH` 3. `/api/v1/datasets/<dataset_id>/documents/<document_id>/chunks PATCH` ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2026-06-23 18:50:36 +08:00
maoyifeng	643cb4788f	Go CLI: add response output (#16263 ) ### What problem does this PR solve? Go CLI: add response output	2026-06-23 18:12:15 +08:00
Wang Qi	a4f325be24	Fix: add /v1/document/upload_info -> /api/v1/documents/upload back (#16264 )	2026-06-23 17:47:55 +08:00
buua436	aba5d172bd	feat: add whatsapp web qr chat channel (#16238 ) Adds a WhatsApp chat channel backed by a QR-based web login flow so users can connect without manual token setup.	2026-06-23 17:45:31 +08:00
Jin Hai	e15130534f	Go: default public key (#16265 ) ### What problem does this PR solve? Provide a default public key for the CLI ### Type of change - [x] Refactoring Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-06-23 17:43:26 +08:00
Jin Hai	dec2ce4a60	Go CLI: admin model framework (#16252 )	2026-06-23 16:57:05 +08:00
Zhichang Yu	2362210caf	refactor(log): unify Go logging to zap with rotation, strip per-package levels (#16261 ) Refactor the Go agent port's logging so every log line — gin access, agent canvas events, harness warnings, fatal boot errors — flows through a single common.Logger (zap) backed by a rotated file, with structured fields, level filtering, and configurable rotation. --------- Co-authored-by: Claude <noreply@anthropic.com>	2026-06-23 16:21:46 +08:00
VincentLambert	11e14a8353	fix: propagate contextvars through thread_pool_exec (#16247 ) ## Problem `thread_pool_exec()` dispatches work via `loop.run_in_executor()`, which submits the callable with a plain `executor.submit(func, *args)` and does not copy the caller's `contextvars.Context`. So a `ContextVar` set in the async caller is not visible inside the function running in the worker thread. This differs from `asyncio.to_thread()`, which runs the callable inside a copied context. `run_in_executor()` has never propagated context (verified on Python 3.12 and 3.13), so this is a pre-existing gap in the helper, not a regression or a Python-version compatibility issue. Concretely, any code that sets a `ContextVar` in async code and reads it inside a function dispatched via `thread_pool_exec` (request tracing, per-task state, Langfuse trace propagation, etc.) silently loses that context. ## Fix Copy the current context before submitting and run the callable inside it with `ctx.run()`, matching what `asyncio.to_thread()` does:
```python
import asyncio
import contextvars
import functools


# `_thread_pool_executor()` is the helper's existing executor accessor, defined elsewhere in the module.
async def thread_pool_exec(func, *args, **kwargs):
    loop = asyncio.get_running_loop()
    # Snapshot the caller's context so ContextVars are visible in the worker thread.
    ctx = contextvars.copy_context()
    if kwargs:
        inner = functools.partial(func, *args, **kwargs)
        return await loop.run_in_executor(_thread_pool_executor(), ctx.run, inner)
    return await loop.run_in_executor(_thread_pool_executor(), ctx.run, func, *args)
```
This explicitly adds ContextVar propagation to the helper (it does not restore any prior behavior). Backward-compatible. ## Tests `TestThreadPoolExec` covers propagation, the kwargs path, per-call isolation and the unset-default case. > Note: the branch name still contains `python313` for historical reasons; the change is unrelated to any Python version.	2026-06-23 15:17:42 +08:00
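The propagation gap described in commit 11e14a8353 above can be reproduced in isolation. The following is a minimal sketch, not RAGFlow code: `request_id`, `exec_plain`, `exec_with_context` and the module-level `_executor` are hypothetical names, and the private executor stands in for whatever pool the real helper uses.

```python
import asyncio
import contextvars
import functools
from concurrent.futures import ThreadPoolExecutor

# Hypothetical ContextVar standing in for request tracing or per-task state.
request_id = contextvars.ContextVar("request_id", default="unset")
_executor = ThreadPoolExecutor(max_workers=2)


def read_request_id() -> str:
    # Runs in a worker thread and reads whatever context that thread sees.
    return request_id.get()


async def exec_plain(func, *args):
    # Plain dispatch: the callable is submitted without the caller's context.
    loop = asyncio.get_running_loop()
    return await loop.run_in_executor(_executor, func, *args)


async def exec_with_context(func, *args, **kwargs):
    # Copy the caller's context and run the callable inside it.
    loop = asyncio.get_running_loop()
    ctx = contextvars.copy_context()
    call = functools.partial(func, *args, **kwargs)
    return await loop.run_in_executor(_executor, ctx.run, call)


async def demo():
    request_id.set("req-42")
    plain = await exec_plain(read_request_id)
    propagated = await exec_with_context(read_request_id)
    return plain, propagated


plain_value, propagated_value = asyncio.run(demo())
_executor.shutdown()
```

`propagated_value` is `"req-42"` because the worker runs inside the copied context. `plain_value` is typically the default `"unset"` on a standard CPython build, although interpreter builds that let new threads inherit the starting thread's context may behave differently.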
balibabu	d8ee1ffaad	Fix: When re-entering the agent page, the data from the previous session flashes briefly. (#16251 )	2026-06-23 14:13:47 +08:00
Haruko386	9f9433e218	fix: handle SIMDe headers installation for arm64 (#16244 ) ### What problem does this PR solve? Updated the release workflow to install SIMDe headers into the MSYS2 toolchain include directory. Adjusted CMake flags to remove references to the previous SIMDE_INCLUDE_DIR. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) dev-20260623	2026-06-23 10:37:04 +08:00
Jin Hai	b661e9c19e	Go CLI: admin list providers (#16243 ) ### What problem does this PR solve? ``` RAGFlow(admin)> list providers; +----------------------+-------------------------------------------------------------+ \| command \| error \| +----------------------+-------------------------------------------------------------+ \| list_model_providers \| 'list model providers' is implemented in enterprise edition \| +----------------------+-------------------------------------------------------------+ RAGFlow(admin)> add provider 'zhipu-ai'; +-------------+-----------------------------------------------------------+ \| field \| value \| +-------------+-----------------------------------------------------------+ \| command \| add_model_provider \| \| error \| 'add model provider' is implemented in enterprise edition \| \| provider_id \| admin \| \| user_id \| zhipu-ai \| +-------------+-----------------------------------------------------------+ RAGFlow(admin)> delete provider 'zhipu-ai'; +-------------+--------------------------------------------------------------+ \| field \| value \| +-------------+--------------------------------------------------------------+ \| command \| delete_model_provider \| \| error \| 'delete model provider' is implemented in enterprise edition \| \| provider_id \| admin \| \| user_id \| zhipu-ai \| +-------------+--------------------------------------------------------------+ RAGFlow(admin)> add provider 'zhipu-ai' instance 'instance1'; +---------------+-----------------------------------------------------------+ \| field \| value \| +---------------+-----------------------------------------------------------+ \| command \| add_model_instance \| \| error \| 'add model instance' is implemented in enterprise edition \| \| instance_name \| instance1 \| \| provider_id \| zhipu-ai \| \| user_id \| admin \| +---------------+-----------------------------------------------------------+ RAGFlow(admin)> delete provider 'zhipu-ai' instance 'test' 
+-------------+--------------------------------------------------------------+ \| field \| value \| +-------------+--------------------------------------------------------------+ \| instances \| [test] \| \| provider_id \| zhipu-ai \| \| user_id \| admin \| \| command \| delete_model_provider \| \| error \| 'delete model instance' is implemented in enterprise edition \| +-------------+--------------------------------------------------------------+ RAGFlow(admin)> add provider 'zhipu-ai' instance 'instance1' model 'xxx'; +---------------+--------------------------------------------------+ \| field \| value \| +---------------+--------------------------------------------------+ \| command \| add_model \| \| error \| 'add model' is implemented in enterprise edition \| \| instance_name \| instance1 \| \| model_names \| [xxx] \| \| provider_id \| zhipu-ai \| \| user_id \| admin \| +---------------+--------------------------------------------------+ RAGFlow(admin)> delete provider 'zhipu-ai' instance 'test' model 'xxx'; +---------------+------------------------------------------------------+ \| field \| value \| +---------------+------------------------------------------------------+ \| command \| delete_model_provider \| \| error \| 'delete models' is implemented in enterprise edition \| \| instance_name \| test \| \| models \| [xxx] \| \| provider_id \| zhipu-ai \| \| user_id \| admin \| +---------------+------------------------------------------------------+ ``` ### Type of change - [x] New Feature (non-breaking change which adds functionality) --------- Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-06-23 10:26:31 +08:00
Zhichang Yu	06ededb26a	test(go): ensure go unit tests pass (#16241 ) ## Summary Stabilizes the Go unit-test surface so the test suite can run reliably in CI and locally via `bash build.sh --test`. ## Verification
```bash
bash build.sh --test -- -count=10 -run TestWithCancel_SequentialAgent ./internal/harness/core/
bash build.sh --test -- -count=5 -run TestSiliconflowChatExtracts ./internal/entity/models/
bash build.sh --test # full suite
```
All previously failing packages (`admin`, `cli`, `handler`, `parser`, `router`, `service`, `service/chunk`) now build and test successfully. `TestWithCancel_SequentialAgent` passes 10/10 (was flaky). SiliconFlow reasoning test passes after switching the assertion to the SiliconFlow wire format. --------- Co-authored-by: Claude <noreply@anthropic.com>	2026-06-22 20:43:29 +08:00
VincentLambert	a4fcc988e7	i18n(fr): add missing French translations for chat channels, username validation and model editing (#16217 ) ## Summary Several keys added in recent releases were missing from the French (`fr.ts`) locale file. - `top`: missing in both the common section and the dataset section - Chat channels: all UI strings for the new chat channels feature (`chatChannels`, `chatChannelDesc.`, `connectDialog`, `notConnected`, etc.) - Username validation: `usernameMaxLength`, `usernameInvalidCharacters` - Model editing: `editCustomModelTitle` ## Changes - `web/src/locales/fr.ts`: 47 lines added, no other files touched 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-06-22 20:09:59 +08:00
balibabu	c849c76f8a	Feat: Add a prefix to the `name` of the `FormField` associated with the chat. (#16178 ) Fix: Add a prefix to the `name` of the `FormField` associated with the chat.	2026-06-22 19:18:11 +08:00
Jin Hai	0e6b28a7fe	Add show / set role default models (#16240 ) ### What problem does this PR solve?
```
RAGFlow(admin)> show role 'user' default models;
+--------------------------+-----------------------------------------------------------------+-----------+
| command                  | error                                                           | role_name |
+--------------------------+-----------------------------------------------------------------+-----------+
| show_role_default_models | 'show role default models' is implemented in enterprise edition | user      |
+--------------------------+-----------------------------------------------------------------+-----------+
RAGFlow(admin)> set role 'user' default chat 'glm4.5@test@zhipu-ai';
+------------+---------------------------------------------------------------+
| field      | value                                                         |
+------------+---------------------------------------------------------------+
| model_id   |                                                               |
| model_type | chat                                                          |
| role_name  | user                                                          |
| command    | set_role_default_model                                        |
| error      | 'set role default model' is implemented in enterprise edition |
+------------+---------------------------------------------------------------+
RAGFlow(admin)> reset role 'user' default chat;
+------------+-----------------------------------------------------------------+
| field      | value                                                           |
+------------+-----------------------------------------------------------------+
| command    | reset_role_default_model                                        |
| error      | 'reset role default model' is implemented in enterprise edition |
| model_type | chat                                                            |
| role_name  | user                                                            |
+------------+-----------------------------------------------------------------+
```
--------- Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-06-22 19:03:36 +08:00
Hz_	9eb7cee473	feat(go-api): migrate searchbot share detail endpoint to go (#16124 ) ## Summary - add public Go route for `/api/v1/searchbots/detail` - implement beta-token auth flow for shared search access - add tenant-based access check for shared search apps - add joined search detail query for the share response - align Go response shape with the current Python runtime behavior - add DAO / service / handler tests for the new endpoint	2026-06-22 18:17:37 +08:00
Hz_	2856cde2d1	feat(go-api): Implement BulkDeleteChats Go API and fix ListChats (#16157 ) ### Description - Bulk Delete Chats: Implemented Go endpoint `DELETE /api/v1/chats` supporting bulk delete by `ids`, `delete_all` flag, and backward-compatible `chat_id` body payload (with tenant-ownership security checks). - Bug Fix: Fixed a parameter swap in Go `ListChats` handler to properly exclude soft-deleted chats.	2026-06-22 18:16:52 +08:00
Hz_	4e0db3053d	feat(go-api): complete chat channel API migration with tests (#16139 ) close #16132 ## Summary This PR completes the Go-side merge and cleanup for chat channel APIs, including handler/service wiring, route registration, and test coverage. Implemented and aligned 5 chat channel APIs: ``` - POST `/api/v1/chat-channels` - GET `/api/v1/chat-channels` - GET `/api/v1/chat-channels/:channel_id` - PATCH `/api/v1/chat-channels/:channel_id` - DELETE `/api/v1/chat-channels/:channel_id` ``` Co-authored-by: Haruko386 <tryeverypossible@163.com>	2026-06-22 18:16:15 +08:00
Haruko386	02cc1d6438	fix: unable to chat after set model (#16195 ) ### What problem does this PR solve?
```
fixed:
RAGFlow(api/default)> use model 'minimax-m2.5@test@minimax'
SUCCESS
RAGFlow(api/default)> chat message 'who r u'
Answer: Hey! I'm MiniMax-M2.5, an AI assistant here to help you with questions, tasks, or whatever you need. What can I do for you?
Time: 1.727263
```
### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-06-22 18:14:58 +08:00
Haruko386	b777e50291	feat[Go]: implement api /api/v1/datasets/<dataset_id>/chunks POST (#16067 ) ### What problem does this PR solve? As title ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2026-06-22 18:14:01 +08:00
Haruko386	c6b3618f9a	fix: fix release workflow for Windows and MAC builds (#16235 ) ### What problem does this PR solve? Removed references to 'simde' from the package lists and updated paths for compiler detection and CMake configuration to ensure proper handling of Windows executables. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-06-22 18:13:12 +08:00
Jin Hai	05e758e4fe	Go CLI: Fix alter role (#16226 ) ### What problem does this PR solve? As title. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) --------- Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-06-22 17:33:47 +08:00
Nick M	329e09f16a	Fix: metadata add modal sends empty value due to stale closure (#15229 ) Closes #15139. The "+ Add" flow in the Set/Edit Metadata modal posted updates with an empty value, so backend saves were silent no-ops and the document's "X fields" count stayed at 0 despite a "Success" toast. The value `<Input>` updates `tempValues` synchronously per keystroke but only writes through to `metaData.values` on blur (via `handleValueBlur`). When the user clicks the nested modal's Confirm button without first blurring, the click handler races the blur and `handleSave` closes over the pre-blur `metaData.values` — still the initial `['']`. `addUpdateValue` then queues an empty-string update; the auto-fire save sends it, and after `resetOperations()` the outer Save button posts `updates: []`. Read from `tempValues` instead so the queued update carries the typed value. Regression test in `tests/use-manage-values-modal.test.ts` asserts that `handleSave` passes the typed value (not the pre-blur empty string) to `addUpdateValue` in the add-new code path.	2026-06-22 16:30:42 +08:00
Haruko386	b337534a6c	fix: Enhance Windows build configuration in release.yml (#16227 ) ### What problem does this PR solve? Updated rust_target and added simde support for Windows builds. Modified CMake commands to include simde and adjusted paths for compilers. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-06-22 15:29:05 +08:00
Zhichang Yu	3f805a64f1	feat(agent): align Go agent behavior with Python (except retrieval component) (#16225 ) ## Summary Aligns the Go agent runtime/canvas/components/tools behavior with the Python `agent/` implementation so the same stored canvas DSL produces the same execution result on either side. Every component, tool, and runtime primitive in `internal/agent/` is now driven by the same semantics as its Python counterpart — variable resolution, template substitution, control flow, error reporting, retry/cancel, and stream event shapes. The retrieval component is the one explicit exception in this PR. It is being reworked in a separate change and is excluded from this alignment pass; the wrapper slot (`universe_a_wrappers.go → newRetrievalComponent`) is preserved. ## Scope of alignment ### Components (all aligned with `agent/component/`) `Begin` · `Message` · `LLM` (incl. ChatTemplateKwargs, MessageHistoryWindowSize, VisualFiles, Cite, OutputStructure, JSONOutput, TopP, MaxRetries, DelayAfterError, credentials) · `Agent` (react + tool artifact capture + `Reset()` interface-assert) · `Switch` (12/12 operators, Python-equivalent semantics) · `Categorize` · `Invoke` · `Iteration` · `Loop` (macro-expansion through `workflowx.AddLoopNode`) · `UserFillUp` (Python-equivalent interrupt/resume via eino `compose.Interrupt`/`ResumeWithData`) · `FillUp` · `DataOperations` · `ListOperations` · `StringTransform` · `VariableAggregator` · `VariableAssigner` · `Browser` (full stagehand runtime parity) · `DocsGenerator` · `ExcelProcessor`. 
### Tools (all aligned with `agent/tools/`) `Retrieval` (wrapper slot only — logic out of scope) · `MCPToolAdapter` (streamable-HTTP) · `CodeExec` (sandbox bridge with `code_exec_contract.go` matching Python contract) · `AkShare` · `ArXiv` · `Crawler` · `DeepL` · `DuckDuckGo` · `Email` · `ExeSQL` · `GitHub` · `Google` · `GoogleScholar` · `Jin10` · `PubMed` · `QWeather` · `SearXNG` · `Tavily` · `Tushare` · `Wencai` · `Wikipedia` · `YahooFinance` — uniform `eino tool.InvokableTool` interface, SSRF protection, shared HTTP client. ### Canvas execution engine (`internal/agent/canvas/`) Aligned with Python's `agent/canvas.py`: - Scheduler (`scheduler.go`): state pre/post handlers, node lambdas, per-component timeout resolver (4-level: per-class env → per-class table → uniform env → 600s fallback), `legacyNoOpNames`. - Loop subgraph (`loop_subgraph.go`): Python-equivalent `AddLoopNode` macro expansion + condition translation. - Multibranch (`multibranch.go`): `Switch` / `Categorize` routing via `compose.NewGraphMultiBranch` — same branch selection semantics as Python. - Parallel subgraph (`parallel_subgraph.go`): matches Python's parallel fan-out contract. - Interrupt/Resume (`interrupt_resume.go`): `UserFillUpNodeBody` / `IsInterruptError` / `ExtractInterruptContexts` — replaces the deprecated Python sentinel chain with eino's native interrupt API, preserving the same external behavior. - Checkpoint (`checkpoint_store.go`): `RedisCheckPointStore` Get/Set/Delete, with business metadata (status / canvas_id / parent_run_id) on a parallel Redis Hash. - RunTracker (`run_tracker.go`): Start / MarkSucceeded / MarkFailed / MarkCancelled / AttachCheckpoint — same lifecycle as the Python run record. - Cancel (`cancel.go`): Redis pub/sub watch. - Stream (`stream.go`): SSE channel with `messages` / `waiting` / `errors` / `done` events, same shape as Python's `agent.canvas.RunEvent` payload. 
### DSL bridge (`internal/agent/dsl/`) - `normalize.go`: v1↔v2 collapsed into a single wire format — Python and Go consume the same stored JSON. - `reset.go`: per-run state reset matches Python's `Canvas.reset()` semantics. - Testdata mirrors Python's `agent_msg.json` / `all.json` / etc. ### Runtime (`internal/agent/runtime/`) - `CanvasState` / `NewCanvasState` / `GetVar` / `SetVar` / `ReadVars`: same `{{cpn_id@param}}` resolution model. - `ResolveTemplate` (regex fast path + gonja fallback) — Python Jinja-style semantics. - `selector.go`, `metrics.go`, `component.go`: shared runtime contracts. ## Out of scope (intentionally) - `Retrieval` component logic — wrapped only; full parity lands in a follow-up PR. - Frontend — only minor dsl-bridge / canvas UX fixes ride along. - CLI / admin / model registry — orthogonal to agent behavior. ## How alignment is verified `internal/service/agent_run_e2e_test.go` exercises the full production chain against real Python-shaped DSL fixtures: ``` loadCanvasForUser → versionDAO.GetLatest → decodeCanvasFromDSL → canvas.Compile → cc.Workflow.Invoke → answer extraction ``` using in-memory SQLite + miniredis (no Docker). Covers: - `TestRunAgent_RealCanvas_BeginMessage` — happy path, `{{sys.query}}` resolution - `TestRunAgent_RealCanvas_WaitForUserResume` — two-run resume cycle (Python-equivalent) - `TestRunAgent_RealCanvas_CompileFails` — unknown component name → sanitized error (Python-equivalent) - `TestRunAgent_RealCanvas_InvokeFails` — unresolvable template ref (Python-equivalent) - `TestRunAgent_RunTracker_AttachCheckpoint_CallSequence` — Start→AttachCheckpoint→MarkSucceeded lifecycle `internal/handler/agent_test.go` — SSE streaming parity (`Content-Type: text/event-stream`, `data: {…}\n\n`, trailing `data: [DONE]\n\n`, OpenAI-compatible non-stream `choices`). `internal/agent/canvas/fixture_compile_test.go` + per-component tests pin the Python-equivalent outputs. 
``` go test -count=1 -v -run 'TestRunAgent_RealCanvas|TestRunAgent_RunTracker' ./internal/service/ ``` ## Design reference `docs/develop/agent-go-port-design.md` (1329 lines, last cross-checked 2026-06-17): module layout, per-component / per-tool inventory, corner-case catalogue, and the actionable backlog (Section 14, including the retrieval alignment follow-up). --------- Co-authored-by: Claude <noreply@anthropic.com>	2026-06-22 11:58:29 +08:00
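The SSE framing named in commit 3f805a64f1 above (`data: {…}\n\n` events followed by a trailing `data: [DONE]\n\n`) can be consumed with a few lines of client code. This is a rough sketch under stated assumptions: `parse_sse_data` is a hypothetical helper, and the payload fields in `sample` are invented for illustration rather than taken from the RAGFlow event schema.

```python
import json
from typing import Any, Dict, List


def parse_sse_data(stream_text: str) -> List[Dict[str, Any]]:
    """Return the decoded JSON payloads that precede the [DONE] sentinel."""
    payloads = []
    # Events are separated by a blank line, i.e. two consecutive newlines.
    for block in stream_text.split("\n\n"):
        block = block.strip()
        if not block.startswith("data:"):
            continue  # ignore empty trailing blocks and non-data lines
        body = block[len("data:"):].strip()
        if body == "[DONE]":
            break  # end-of-stream marker, not JSON
        payloads.append(json.loads(body))
    return payloads


# Invented payloads; only the framing mirrors the description above.
sample = (
    'data: {"kind": "message", "content": "hi"}\n\n'
    'data: {"kind": "message", "content": "there"}\n\n'
    "data: [DONE]\n\n"
)
events = parse_sse_data(sample)
```

A real client would read the response incrementally instead of buffering the whole body; the sketch only shows how the two delimiters are interpreted.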
Haruko386	dfe841a3e3	fix: Enhance Windows build process for office_oxide and rag tokenizer (#16223 ) ### What problem does this PR solve? Updated MSYS2 package list for Windows builds and added Rust target specifications. Modified build steps for office_oxide and rag tokenizer libraries to improve compatibility and streamline the build process. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-06-22 10:17:40 +08:00
Manan Bansal	70c0121b78	Fix: preserve tables when parsing DOCX with the laws parser (#16008 ) (#16155 ) ## What Fixes #16008 — tables contained in a DOCX are silently dropped when the document is parsed with the laws chunking method. ## Root cause `Docx.__call__` in `rag/app/laws.py` iterated `self.doc.paragraphs`, which only yields paragraph elements. Tables are separate `tbl` blocks in the document body, so they were never visited and were lost from the output. (The `naive` parser already handles tables by iterating the document body.) ## Changes - Iterate `self.doc._element.body` so tables are visited in document order alongside paragraphs. - Add a `__table_to_html` helper that renders each table to HTML, including merged-cell `colspan` detection (mirrors the `naive` parser's logic). - Inject each table into the section tree with a sentinel level deeper than any heading, so `Node.build_tree` merges it into its enclosing section — keeping the chapter/article title path as retrieval context rather than producing an orphaned chunk. - Guard the `h2_level` computation against an empty heading set, so a tables-only or empty DOCX no longer raises `IndexError`. This keeps the laws parser's hierarchical chunking and adds table extraction, so users no longer have to choose between losing structure (naive) or losing tables (laws). ## Tests Adds `test/unit_test/rag/test_laws_docx_tables.py` covering: - table content is preserved and carries its section title path, - merged adjacent cells collapse to `colspan`, - tables-only document does not crash, - empty document returns `[]`. All four pass; `ruff check` / `ruff format` are clean.	2026-06-22 09:46:44 +08:00
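The merged-cell handling mentioned in commit 70c0121b78 above can be sketched without any DOCX library. This is not the `__table_to_html` helper from `rag/app/laws.py`: `row_to_html` and `table_to_html` are hypothetical names, and the sketch assumes that a horizontally merged cell appears as the same text repeated in adjacent positions of a row.

```python
from html import escape
from typing import List


def row_to_html(cells: List[str]) -> str:
    """Render one table row, collapsing runs of identical adjacent cells.

    Assumption: a run of n identical adjacent texts represents one merged
    cell, so it is emitted once with colspan="n".
    """
    parts = []
    i = 0
    while i < len(cells):
        span = 1
        # Extend the run while the next cell repeats the current text.
        while i + span < len(cells) and cells[i + span] == cells[i]:
            span += 1
        attr = f' colspan="{span}"' if span > 1 else ""
        parts.append(f"<td{attr}>{escape(cells[i])}</td>")
        i += span
    return "<tr>" + "".join(parts) + "</tr>"


def table_to_html(rows: List[List[str]]) -> str:
    # An empty table simply yields "<table></table>".
    return "<table>" + "".join(row_to_html(r) for r in rows) + "</table>"


table_html = table_to_html([["Article", "Article", "Penalty"], ["1", "2", "Fine"]])
```

One consequence of comparing text only is that two distinct neighbouring cells with identical content would also be collapsed; a parser with access to the underlying cell objects could compare identity instead.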
Jin Hai	760229d917	Go CLI: admin list configs (#16221 ) ### What problem does this PR solve? - list configs; ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) --------- Signed-off-by: Jin Hai <haijin.chn@gmail.com> dev-20260622	2026-06-22 08:19:23 +08:00
Jin Hai	5039f46999	Go CLI: refactor commands (#16213 ) ### What problem does this PR solve? As title. ### Type of change - [x] Refactoring --------- Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-06-21 16:50:02 +08:00
Jin Hai	1b712be599	Go CLI: refactor some commands (#16204 ) ### What problem does this PR solve? - list resources ### Type of change - [x] New Feature (non-breaking change which adds functionality) - [x] Refactoring --------- Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-06-20 02:31:07 +08:00
Jin Hai	11499f7bb3	Go CLI: add list user commands framework (#16201 )	2026-06-19 15:09:54 +08:00
Jin Hai	7214a23614	Go: fix duplicate models (#16197 ) ### What problem does this PR solve? 1. Remove unused file 2. Remove duplicate models 3. Re-sort the functions ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) - [x] Refactoring Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-06-19 09:57:58 +08:00
buua436	b409cfc3d5	feat: add dingtalk chat channel (#16183 ) ### What does this PR do? This PR adds a new DingTalk chat channel integration and hardens the inbound callback path. ### Summary - Adds DingTalk as a selectable chat channel in the UI and backend channel registry. - Adds the DingTalk chat channel icon asset. - Acknowledges DingTalk Stream callbacks and deduplicates repeated inbound messages to avoid duplicate replies.	2026-06-18 20:06:00 +08:00
Wang Qi	5ca1686ac7	Fix that agent cannot be the same name (#16192 )	2026-06-18 19:10:21 +08:00
Haruko386	eb5fcce1ca	fix: hard-coded paths for Windows C compiler (#16193 ) ### What problem does this PR solve? As title ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-06-18 18:55:02 +08:00
qinling0210	563d855780	Implement OpenAI chat completions in Go (#16177 ) ### What problem does this PR solve? Implement OpenAI chat completions in Go: `POST /api/v1/openai/<chat_id>/chat/completions`. OpenAI chat CLI: internal/development.md ### Type of change - [x] Refactoring	2026-06-18 18:07:27 +08:00
Haruko386	b53b5bf12c	Json add paddleOCR models (#16156 ) close #15853 ### What problem does this PR solve? As title ### Type of change - [x] Other (add models):	2026-06-18 17:57:41 +08:00
Haruko386	217c2a94c2	feat[Go]: implement datasets/<dataset_id>/index P/G (#16153 ) ### What problem does this PR solve?
```
POST: http://localhost:9384/api/v1/datasets/433b390c630411f1a13eab5f89540b2a/index?type=graph
Output:
{
  "code": 0,
  "data": {
    "task_id": "ff5a3546bafa49d794a9a050d99c4a52"
  },
  "message": "success"
}
```
---
```
GET: http://localhost:9384/api/v1/datasets/433b390c630411f1a13eab5f89540b2a/index?type=graph
Output:
{
  "code": 0,
  "data": {
    "id": "ff5a3546bafa49d794a9a050d99c4a52",
    "doc_id": "graph_raptor_x",
    "from_page": 100000000,
    "to_page": 100000000,
    "task_type": "graphrag",
    "priority": 0,
    "begin_at": "2026-06-17T18:07:45+08:00",
    "process_duration": 4.108135,
    "progress": -1,
    "progress_msg": "18:07:45 created task graphrag\n18:07:47 Task has been received.\n18:07:49 [ERROR][Exception]: Model config not found: Qwen/Qwen3-235B-A22B@test@SILICONFLOW",
    "retry_count": 1,
    "digest": "f16fd067d5c92cec",
    "create_time": 1781690865552,
    "create_date": "2026-06-17T18:07:45+08:00",
    "update_time": 1781690869108,
    "update_date": "2026-06-17T18:07:49+08:00"
  },
  "message": "success"
}
```
### Type of change - [x] New Feature (non-breaking change which adds functionality)	2026-06-18 17:57:24 +08:00
Haruko386	5f6ebc97c6	feat[go]: implement /api/v1/datasets/<dataset_id> PUT (#16122 ) ### What problem does this PR solve? As pic shows ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2026-06-18 17:57:07 +08:00
Haruko386	6beae949d8	feat[Go]: add modelID for delete_model and update_status (#16025 ) ### What problem does this PR solve? 1. add modelID for delete_model and update_status 2. fix the bug when update-status delete model ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) - [x] New Feature (non-breaking change which adds functionality)	2026-06-18 17:56:51 +08:00
Jin Hai	3eb49ca7f8	Go: add command, list, remove, stop tasks (#16190)

### What problem does this PR solve?

```
RAGFlow(admin)> stop user 'abc' ingestion tasks;
+-----------------------------------+-------+--------------------------------------------------------------------------+-------+
| command                           | email | error                                                                    | tasks |
+-----------------------------------+-------+--------------------------------------------------------------------------+-------+
| stop_ingestion_tasks_by_condition | abc   | 'Stop ingestion tasks by condition' is implemented in enterprise edition |       |
+-----------------------------------+-------+--------------------------------------------------------------------------+-------+
RAGFlow(admin)> stop user 'abc' ingestion tasks 'created;
+-----------------------------------+-------+--------------------------------------------------------------------------+----------+-------+
| command                           | email | error                                                                    | status   | tasks |
+-----------------------------------+-------+--------------------------------------------------------------------------+----------+-------+
| stop_ingestion_tasks_by_condition | abc   | 'Stop ingestion tasks by condition' is implemented in enterprise edition | created; |       |
+-----------------------------------+-------+--------------------------------------------------------------------------+----------+-------+
RAGFlow(admin)> stop user 'abc' ingestion tasks 'create';
+-----------------------------------+-------+--------------------------------------------------------------------------+--------+-------+
| command                           | email | error                                                                    | status | tasks |
+-----------------------------------+-------+--------------------------------------------------------------------------+--------+-------+
| stop_ingestion_tasks_by_condition | abc   | 'Stop ingestion tasks by condition' is implemented in enterprise edition | create |       |
+-----------------------------------+-------+--------------------------------------------------------------------------+--------+-------+
RAGFlow(admin)> remove user 'abc' ingestion tasks 'create';
+-------------------------------------+-------+----------------------------------------------------------------------------+--------+-------+
| command                             | email | error                                                                      | status | tasks |
+-------------------------------------+-------+----------------------------------------------------------------------------+--------+-------+
| remove_ingestion_tasks_by_condition | abc   | 'Remove ingestion tasks by condition' is implemented in enterprise edition | create |       |
+-------------------------------------+-------+----------------------------------------------------------------------------+--------+-------+
RAGFlow(admin)> remove user 'abc' ingestion tasks;
+-------------------------------------+-------+----------------------------------------------------------------------------+-------+
| command                             | email | error                                                                      | tasks |
+-------------------------------------+-------+----------------------------------------------------------------------------+-------+
| remove_ingestion_tasks_by_condition | abc   | 'Remove ingestion tasks by condition' is implemented in enterprise edition |       |
+-------------------------------------+-------+----------------------------------------------------------------------------+-------+
```

### Type of change

- [x] New Feature (non-breaking change which adds functionality)

---------

Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-06-18 17:50:21 +08:00
Haruko386	1a8ee8ba61	fix: wrong clang/toolchain for windows (#16191) ### What problem does this PR solve? As title ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-06-18 17:49:55 +08:00
buua436	a2de7d0060	fix: chat channel defaults and feishu shutdown (#16176) This PR keeps the chat-channel default values and Feishu shutdown behavior consistent after the rebase.	2026-06-18 17:44:48 +08:00
Jin Hai	5eedd13d49	Go: add command, show tasks summary (#16187)

### What problem does this PR solve?

RAGFlow(admin)> show tasks summary;
+---------+-----------------------------------------------------------------+
| field   | value                                                           |
+---------+-----------------------------------------------------------------+
| command | show_users_quota_summary                                        |
| error   | 'Show users quota summary' is implemented in enterprise edition |
+---------+-----------------------------------------------------------------+

### Type of change

- [x] New Feature (non-breaking change which adds functionality)

Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-06-18 17:09:20 +08:00

6896 Commits