ragflow

mirror of https://github.com/infiniflow/ragflow.git synced 2026-07-04 18:45:38 +08:00

Author	SHA1	Message	Date
Liu An	0639dba89a	Docs: Update version references to v0.25.6 in READMEs and docs (#15248 ) ### What problem does this PR solve? - Update version tags in README files (including translations) from v0.25.5 to v0.25.6 - Modify Docker image references and documentation to reflect new version - Update version badges and image descriptions - Maintain consistency across all language variants of README files ### Type of change - [x] Documentation Update	2026-05-26 19:45:43 +08:00
Haruko386	3619ceca01	Go: implement provider: OrcaRouter (#15235 ) ### What problem does this PR solve? implement provider `OrcaRouter` The following functionalities are now supported: Cohere: - [x] Chat / Think Chat / Stream Chat / Stream Think Chat - [x] Model listing - [x] TTS - [ ] Balance ### Type of change - [x] New Feature (non-breaking change which adds functionality) --------- Co-authored-by: Jin Hai <haijin.chn@gmail.com>	2026-05-26 18:20:33 +08:00
dripsmvcp	a48bcf814d	Go: implement provider: ModelScope (#15041 ) Closes #15040. ModelScope was listed unchecked in the Go-rewrite tracker #14736 and already had an llm_factories.json entry (tags: LLM) but no Go driver, so the new Go API server could not route ModelScope instances. The Python side has supported it through the OpenAI-compatible base at rag/llm/chat_model.py:618 (ModelScopeChat), which requires a user-supplied base URL and appends /v1. This adds: - internal/entity/models/modelscope.go: self-hosted OpenAI-compatible driver with chat (sync + SSE stream with idle-timeout cancellation), list_models, and check_connection. Auth header is optional, matching the xinference pattern, so deployments without auth and auth-enabled deployments both work. Base URL is normalized so users can configure either the root endpoint or the /v1 endpoint. - internal/entity/models/modelscope_test.go: 12 tests covering name, URL normalization, factory routing, chat happy path / auth header / reasoning_content extraction, stream happy path / stream=false rejection / idle cancellation, list_models + check_connection, missing-base-URL clear error, and the no-such-method sentinels. - conf/models/modelscope.json: shipped config (class: "local", url_suffix v1/chat/completions and v1/models). - internal/entity/models/factory.go: case "modelscope" → ModelScopeModel. - internal/service/llm.go: ModelScope added to the selfDeployed map alongside Ollama, Xinference, LocalAI, LM-Studio, GPUStack — the Python side requires user-supplied URL with no default, so the Go side classifies it the same way. Follow-on issues will add Embed and Rerank, in line with how Novita, NVIDIA, TogetherAI, and other providers landed method-by-method. --------- Co-authored-by: Jin Hai <haijin.chn@gmail.com>	2026-05-26 18:18:46 +08:00
Hz_	84add43208	Add HuaweiCloud model provider (#15237 ) ### What problem does this PR solve? This PR adds HuaweiCloud provider integration in RAGFlow. Supported capabilities: - [x] Chat / Think Chat / Stream Chat / Stream Think Chat - [x] Embedding - [x] Rerank - [x] Model listing - [x] Provider connection checking Verified examples from the CLI: ``` check instance 'test' from 'HuaweiCloud'; chat with 'deepseek-v4-flash@test@HuaweiCloud' message 'hello'; think chat with 'deepseek-v4-flash@test@HuaweiCloud' message 'hello'; stream chat with 'deepseek-v4-flash@test@HuaweiCloud' message 'hello'; stream think chat with 'deepseek-v4-flash@test@HuaweiCloud' message 'hello'; embed text 'what is rag' 'who are you' with 'bge-m3@test@HuaweiCloud' dimension 1024; rerank query 'what is rag' document 'rag is retrieval augmented generation' 'rag need llm' 'famous rag project includes ragflow' with 'bge-reranker-v2-m3@test@HuaweiCloud' top 3; list supported models from 'HuaweiCloud' 'test'; LIST MODELS FROM 'HuaweiCloud' 'test'; ``` ### Type of change - [x] New Feature - [x] Provider integration	2026-05-26 17:13:15 +08:00
ghost	a7d25391dc	fix(tokenhub): wire Go driver and harden requests (#15224 ) ## Summary - Wire the Go TokenHub provider through the model factory. - Harden TokenHub request handling for chat, streaming, embeddings, and model listing. - Add focused TokenHub unit coverage for factory wiring and provider behavior. ## Notes - Refs #14736. - Follows up #15159. Co-authored-by: Jin Hai <haijin.chn@gmail.com>	2026-05-26 17:12:37 +08:00
Jake Armstrong	0fb85a66bc	feat(go-models): add AWS Bedrock provider driver (#15166 ) ## Summary Closes #15165. Implements the AWS Bedrock model provider for the Go API server, tracked under #14736. Adds Converse + Converse-Stream chat and foundation-model listing, with SigV4 signing over a hand-rolled `net/http` path that matches the established pattern in `internal/entity/models/` (no new direct `go.mod` deps). ## Linked tracker Tracked under #14736 (Implement model providers of RAGFlow API server in Go). Closes #15165.	2026-05-26 17:10:06 +08:00
Idriss Sbaaoui	036ed5b236	Feat: add new tests and tescases for restful api suite (#15230 ) ### What problem does this PR solve? extend restful api suite ### Type of change - [x] New Feature (non-breaking change which adds functionality) - [x] Other (please describe): test	2026-05-26 13:24:22 +08:00
chanx	bce11527c3	Fix: Fixed metadata issue (#15226 ) ### What problem does this PR solve? Fix: Fixed metadata issue - The dataset's built-in metadata is now active, but it appears to be disabled in the individual file configuration. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-05-26 13:16:15 +08:00
Wang Qi	619b971785	Fix: empty file with better message (#15232 ) Fix: empty file with better message	2026-05-26 12:28:53 +08:00
天海蒼灆	0d2a17254c	fix(api): allow canvas_type in agent create and update APIs (#15201 ) ### What problem does this PR solve? Creating or updating an agent via `POST /api/v1/agents` and `PUT /api/v1/agents/{agent_id}` did not persist `canvas_type` because the handler `req` dict never assigned the field before `UserCanvasService.save` / `update_by_id`. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) - [ ] New Feature (non-breaking change which adds functionality) - [ ] Documentation Update - [ ] Refactoring - [ ] Performance Improvement - [ ] Other (please describe): Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-26 11:31:46 +08:00
glorydavid03023	3dbd874a79	Go: implement Rerank in DeepInfra driver (#15185 ) ### What problem does this PR solve? The Go DeepInfra driver returned a stub error for `Rerank()` even though DeepInfra serves reranker models at `POST /v1/inference/{model}` with `query`, `documents`, and a `scores[]` response. ### Type of change - [x] New Feature (non-breaking change which adds functionality) Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-26 10:52:09 +08:00
sxxtony	67f7d87dff	Go: implement provider: FuturMix (#15013 ) ### What problem does this PR solve? Add a Go driver for FuturMix (https://futurmix.ai/docs), one of the unchecked providers on the umbrella tracking issue #14736. FuturMix is documented as an "OpenAI-compatible API" aggregator over Claude / GPT / Gemini / DeepSeek (~22 models per their `/models` page). Until this PR, a tenant who configured `futurmix` as a model provider in the Go layer fell through to the default branch of `internal/entity/models/factory.go` and got the dummy driver. --------- Co-authored-by: sxxtony <sxxtony@users.noreply.github.com> Co-authored-by: Jin Hai <haijin.chn@gmail.com>	2026-05-26 10:51:29 +08:00
Renzo	806414df43	Go: validate Baidu OCR inputs (#15168 ) ### What problem does this PR solve? Closes #15167. The Baidu Go provider advertises OCR support through `paddleocr-vl-0.9b`, but `BaiduModel.OCRFile` dereferenced required inputs before validating them. Calling OCR with a missing API config, API key, or model name could panic instead of returning a normal error. This PR adds explicit input validation for those required values. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) --------- Co-authored-by: Jin Hai <haijin.chn@gmail.com>	2026-05-26 10:51:05 +08:00
Jake Armstrong	b961810e79	Go: implement OCR in ZhipuAI driver (#15143 ) ### What problem does this PR solve? Closes #15142. ZhipuAI lists `glm-ocr` as an OCR model, but the Go driver still returned `no such method` from `OCRFile`. This wires the advertised model to Z.AI's documented `layout_parsing` endpoint and returns the `md_results` Markdown output through the existing `OCRFileResponse.Text` field. This PR also adds focused tests for URL input, raw file-content base64 input, and validation errors. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) - [ ] New Feature (non-breaking change which adds functionality) - [ ] Documentation Update - [ ] Refactoring - [ ] Performance Improvement - [ ] Other (please describe): ### Test - [x] `go test -vet=off ./internal/entity/models -run 'TestZhipuAIOCRFile'`	2026-05-26 10:50:06 +08:00
Idriss Sbaaoui	c3b38d397f	Feat: add new tests and tescases for restful api suite (#15223 ) ### What problem does this PR solve? extend restful api suite ### Type of change - [x] New Feature (non-breaking change which adds functionality) - [x] Other (please describe): test	2026-05-26 10:08:45 +08:00
Jay Xu	54c3d23513	Fix [Bug]: Save parser configs in dataset configuration page is not working #15175 (#15177 ) ### What problem does this PR solve? Fix [Bug]: Save parser configs in dataset configuration page is not working #15175 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-05-26 10:04:43 +08:00
wdeveloper16	4b36801b53	fix: resolve asyncio correctness issues (fire-and-forget tasks, event loop nesting) (#14761 ) ## Summary Fixes the confirmed asyncio anti-patterns from #14755. Only the three verified bugs are addressed; patterns already correctly using `asyncio.new_event_loop()` in a fresh thread are left untouched. ### Changes `api/apps/restful_apis/tenant_api.py` — fire-and-forget `send_invite_email` `asyncio.create_task()` was called without storing the `Task` reference. CPython's GC can collect an unfinished task, silently cancelling it and swallowing exceptions. Fixed by storing the task in a module-level `_background_tasks: set[Task]` with a `done_callback` to discard it on completion — the standard Python idiom for safe background tasks. `api/apps/restful_apis/agent_api.py` — fire-and-forget `background_run` Same root cause in the webhook "Immediately" execution path. Same fix applied. `rag/llm/chat_model.py` (`LocalLLM._stream_response`) — `asyncio.get_event_loop()` on running loop `asyncio.get_event_loop()` returns Quart's running event loop when called from an async context. Calling `loop.run_until_complete()` on it raises `RuntimeError`. Replaced with `asyncio.new_event_loop()` so the generator uses a dedicated fresh loop, closed in a `finally` block. ## What was NOT changed - `llm_service._sync_from_async_stream` and `evaluation_service._sync_from_async_gen`: both already correctly use `asyncio.new_event_loop()` inside a fresh thread. - `llm_service._run_coroutine_sync`: only caller is `rag/app/resume.py` (sync context), so `thread.join()` is correct there. - `requests` in agent tools: sync methods dispatched through thread pools; httpx migration is a separate, larger refactor. ## Test plan - [ ] Invite a team member and confirm the email is sent with no task warnings in logs. - [ ] Trigger a webhook agent in "Immediately" mode; confirm canvas state is persisted after background run. - [ ] Verify `LocalLLM` (Jina backend) chat and streaming work end-to-end. Closes #14755 --------- Co-authored-by: Zhichang Yu <yuzhichang@gmail.com>	2026-05-25 22:45:40 +08:00
balibabu	ed179ce684	Fix: The prompt variable for the agent operator disappears after input. (#15218 ) ### What problem does this PR solve? Fix: The prompt variable for the agent operator disappears after input. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-05-25 20:36:51 +08:00
writinwaters	67e43e7df7	Docs: Minimum required Python version increased to 3.13. (#15219 ) ### What problem does this PR solve? Minimum Python version increased to 3.13. ### Type of change - [x] Documentation Update	2026-05-25 20:23:30 +08:00
qinling0210	af85aa9c7b	Implement Elasticsearch functions in GO (#15160 ) ### What problem does this PR solve? Implement Elasticsearch functions in GO (except for Search) ### Type of change - [x] Refactoring	2026-05-25 19:15:07 +08:00
Idriss Sbaaoui	7d200d5bd7	Feat: add new tests and tescases for restful api suite (#15208 ) ### What problem does this PR solve? extend restful api suite ### Type of change - [x] New Feature (non-breaking change which adds functionality) - [x] Other (please describe): test	2026-05-25 19:03:56 +08:00
balibabu	c7c75c0a87	Feat: Enable agent messages to display base64 images (#15212 ) ### What problem does this PR solve? Feat: Enable agent messages to display base64 images ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2026-05-25 19:02:03 +08:00
Wang Qi	f4d36f7082	Fix #15170 cannot filter document status (#15216 ) Fix #15170 cannot filter document status ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-05-25 18:58:37 +08:00
Haruko386	4783ce9951	fix(Go): rewrite chat, listmodels, embed for Ollama (#15213 ) ### What problem does this PR solve? IDK how to implement `Ollama` on #14580 but it's totally wrong. This is the rewrite version for `Ollama` Verified from CLI ``` # Embed RAGFlow(user)> embed text 'what is rag' 'who are you' with 'nomic-embed-text:latest@test12@ollama' dimension 1024; +-----------+-------+ \| dimension \| index \| +-----------+-------+ \| 768 \| 0 \| \| 768 \| 1 \| +-----------+-------+ # Chat RAGFlow(user)> think chat with 'qwen3:0.6b@test12@ollama' message 'who r u' Thinking: Okay, the user asked, "Who r u?" I need to respond appropriately. First, I should acknowledge their question. Since I'm an AI, I don't have a physical form, but I can confirm that I'm a large language model. I should keep the response friendly and offer help. Let me make sure I'm not making up any information and that the response is natural. Also, I should check for any typos and ensure clarity. Alright, that should cover it. Answer: I'm an AI language model, and I don't have a physical form. However, I can tell you that I'm designed to assist with questions and tasks. How can I help you today? Time: 2.914285 RAGFlow(user)> stream think chat with 'qwen3:0.6b@test12@ollama' message 'who r u' Thinking: , the user asked, "Who are you?" I need to respond appropriately. Since I'm an AI assistant, I should mention that I don't have a physical form or a mind. I should also clarify that I can help with various tasks like answering questions or providing information. It's important to keep the response friendly and informative while maintaining the correct tone. Answer: don't have a physical form or a mind, but I'm here to help with your questions or tasks! What can I do for you today? Time: 1.740047 # LisyModels RAGFlow(user)> list supported models from 'ollama' 'test12' +-------------------------+ \| model_name \| +-------------------------+ \| nomic-embed-text:latest \| \| qwen3:0.6b \| +-------------------------+ ``` ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) - [x] Refactoring	2026-05-25 18:55:03 +08:00
balibabu	0f92353bd9	Fix: Replace the red highlight at the top of the PDF document with yellow. (#15203 ) ### What problem does this PR solve? Fix: Replace the red highlight at the top of the PDF document with yellow. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-05-25 17:21:36 +08:00
Wang Qi	4776bfa8a2	Fix: Correct the API path (#15204 ) Follow on PR #15146 to reslove the backwad compatability issue. 1. /agents/<attachment_id>/download -> /agents/attachments/<attachment_id>/download ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-05-25 17:11:24 +08:00
Jonathan Chang	9d1006e4ec	fix: The output of the parser in the ingestion pipeline contains HTML tags (#14920 ) ## Summary This change fixes ingestion quality issues where MinerU parser output may contain HTML fragments (for example, table-related tags like `<tr>`, `<td>`, `<br>`), which were previously passed directly into chunking/tokenization and degraded chunk quality. The fix adds a sanitization step in the MinerU parser path so parsed sections are normalized to clean text before chunking. ## Change Type (select all) - [x] Bug fix - [x] Ingestion pipeline improvement - [x] Parser/chunking quality fix ## Related Issue - https://github.com/infiniflow/ragflow/issues/14831	2026-05-25 16:06:36 +08:00
Ahmad Intisar	e6068a7f7e	Fix: table parser metadata (#15127 ) ### What problem does this PR solve? This PR improves the table upload flow for CSV/Excel files by allowing table column role configuration at upload time. Previously, users had to: 1. Upload and parse a table file. 2. Open parser settings and manually set table column roles. 3. Re-parse the file for the roles to take effect. This was inefficient and required an unnecessary second parse. With this change: 1. When the knowledge base uses table parsing, the upload dialog extracts CSV/Excel headers client-side. 2. Users can choose Auto mode or Manual mode. 3. In Manual mode, users can assign per-column roles before upload. 4. The selected parser config is sent with the upload request and applied server-side during document creation. Result: configured table column roles are applied from the first parse. ### Type of change - [x] New Feature (non-breaking change which adds functionality) Co-authored-by: Ahmad Intisar <ahmadintisar@Ahmads-MacBook-M4-Pro.local>	2026-05-25 16:05:38 +08:00
nickmopen	e7d45dd645	Feat: Expose Doc Generator file metadata as discrete outputs (#15080 ) Declare doc_id, filename, mime_type, and size as separate outputs on the Document Generation component so downstream nodes (e.g., the Code component) can consume them via the variable picker. The existing download JSON blob is preserved unchanged for the Message component's download-chip rendering. ### What problem does this PR solve? The Document Generation component previously exposed only a single `download` output — a JSON-encoded blob containing the file's `doc_id`, `filename`, `mime_type`, `size`, and base64 payload. On top of that, the variable picker actively hides this `download` entry from every consumer except the Message component (because the embedded base64 is too heavy to splat into arbitrary downstream nodes). The combined effect: users wiring the Doc Generator's output into a Code component had no way to retrieve basic file info such as `file_name` or `doc_id` from the picker, blocking workflows that need to post-process the generated file (e.g., registering it elsewhere, custom delivery, follow-up API calls). This PR declares `doc_id`, `filename`, `mime_type`, and `size` as discrete outputs on the Document Generation component, alongside the existing `download` blob. The new fields: - Appear in the variable picker for all downstream nodes, including the Code component, so users can bind them directly to script arguments. - Are cheap scalars only — no base64 payload leaks into other components. - Leave the existing `download` JSON blob completely untouched, so the Message component's download-chip rendering (which parses that blob via `_is_download_info`) keeps working with no behavior change. Changes: - `agent/component/docs_generator.py` — declare the four new outputs in `DocGeneratorParam` and emit them via `set_output(...)` in `_invoke`. - `web/src/pages/agent/constant/index.tsx` — extend `initialDocGeneratorValues.outputs` with the new keys. - `web/src/pages/agent/form/doc-generator-form/index.tsx` — mirror the new outputs in the zod schema so the form is valid. No changes needed to the picker's existing `download`-hiding filter — it matches only on the literal output name `download`, so the new metadata entries fall through naturally. Reported in: https://github.com/infiniflow/ragflow/issues/14461. ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2026-05-25 16:05:00 +08:00
Haruko386	69f301b84a	Go: implement embed for Tencent Hunyuan (#15207 ) ### What problem does this PR solve? Implement embed for Tencent Hunyuan Verified from CLI ``` RAGFlow(user)> embed text 'what is rag' 'who are you' with 'hunyuan-embedding@test1@hunyuan' dimension 16; +-----------+-------+ \| dimension \| index \| +-----------+-------+ \| 1024 \| 0 \| \| 1024 \| 1 \| +-----------+-------+ ``` ### Type of change - [x] New Feature (non-breaking change which adds functionality) - [x] Refactoring	2026-05-25 16:04:17 +08:00
ちー	bb6cfc14e6	feat[go]: implement provider: TokenHub (#15159 ) ### What problem does this PR solve? implement provider TokenHub ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2026-05-25 16:02:50 +08:00
Wang Qi	5069561abc	Fix /chat/completions to allow send only the latest message (#15197 ) ### What problem does this PR solve? 1. Fix /chat/completions to send only the latest message 2. Allo chat stream=False ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-05-25 14:23:33 +08:00
Wang Qi	bb148edf4c	Revert "Fix: /openai/<chat_id>/chat/completions not aware of session_id" (#15205 ) Reverts infiniflow/ragflow#15155 because this is never supported, keep it as it is.	2026-05-25 14:23:10 +08:00
Jin Hai	f8c626bbc8	Go: add ingestion server (#15094 ) ### What problem does this PR solve? 1. Go ingestion server will connected with admin server with gRPC stream 2. Go ingestion server will be responsible for ingestion tasks ``` RAGFlow(admin)> list ingestors; +-----------------+-----------+----------------------------------+---------------------------+----------+------------+--------------+--------+------------+---------------+ \| address \| cpu_usage \| id \| last_heartbeat \| name \| process_id \| rss_usage \| status \| task_count \| vms_usage \| +-----------------+-----------+----------------------------------+---------------------------+----------+------------+--------------+--------+------------+---------------+ \| 127.0.0.1:58564 \| 0 \| bdd1870eea2646e0aacb8a2cd3307aa2 \| 2026-05-24T18:16:17+08:00 \| ingestor \| 680152 \| 212.72265625 \| active \| 0 \| 2589.12109375 \| +-----------------+-----------+----------------------------------+---------------------------+----------+------------+--------------+--------+------------+---------------+ RAGFlow(admin)> start ingestion 'abc'; +----------------------------------+ \| task_id \| +----------------------------------+ \| e714777639ca4760ab427b5f211e81ad \| +----------------------------------+ RAGFlow(admin)> stop ingestion 'f7bd39d0a724457eb5fdce6d81699776'; +----------------------------------+ \| task_id \| +----------------------------------+ \| f7bd39d0a724457eb5fdce6d81699776 \| +----------------------------------+ RAGFlow(admin)> list tasks; +-----+----------------------------------+-------+------+----------------------------------+---------------------------+------------+------------+ \| ETA \| assign_to \| error \| from \| id \| last_update \| start_time \| status \| +-----+----------------------------------+-------+------+----------------------------------+---------------------------+------------+------------+ \| 0 \| 17937da188b84f23a5c10bb87588944b \| \| CLI \| eae6431da72a40e796cff3a03008091b \| 2026-05-24T19:46:03+08:00 \| \| COMPLETED \| \| 0 \| 17937da188b84f23a5c10bb87588944b \| \| CLI \| 6cccdd174bd049ecb05a774bbb47593f \| 2026-05-24T19:46:03+08:00 \| \| COMPLETED \| \| 0 \| 17937da188b84f23a5c10bb87588944b \| \| CLI \| ef360d777e57485799adb96b30f2b4b8 \| 2026-05-24T19:46:03+08:00 \| \| CANCELED \| \| 0 \| 17937da188b84f23a5c10bb87588944b \| \| CLI \| bcc5c5448cb64de48b6b6171c36fb790 \| 2026-05-24T19:46:03+08:00 \| \| CANCELED \| \| 0 \| 17937da188b84f23a5c10bb87588944b \| \| CLI \| bfc25384c43a443294fe2da979a38ac2 \| 2026-05-24T19:46:03+08:00 \| \| DISPATCHED \| \| 0 \| 17937da188b84f23a5c10bb87588944b \| \| CLI \| 84960537b85d413b8990a9efd5952d67 \| 2026-05-24T19:46:04+08:00 \| \| DISPATCHED \| \| 0 \| 17937da188b84f23a5c10bb87588944b \| \| CLI \| 3d223c1b51e24b36861a3bfb2f1d58d4 \| 2026-05-24T19:46:03+08:00 \| \| CANCELED \| \| 0 \| 17937da188b84f23a5c10bb87588944b \| \| CLI \| e433b0e356b846c89c301621a3c54494 \| 2026-05-24T19:46:03+08:00 \| \| COMPLETED \| \| 0 \| 17937da188b84f23a5c10bb87588944b \| \| CLI \| 7c93a3880f074ebd8eca14e6b51bb7ef \| 2026-05-24T19:46:03+08:00 \| \| COMPLETED \| \| 0 \| 17937da188b84f23a5c10bb87588944b \| \| CLI \| df2e4ef51aaf4390bff9a23f2692486e \| 2026-05-24T19:46:04+08:00 \| \| DISPATCHED \| \| 0 \| 17937da188b84f23a5c10bb87588944b \| \| CLI \| 7377c53010194ef7a83aa206698d66ff \| 2026-05-24T19:46:05+08:00 \| \| DISPATCHED \| \| 0 \| 17937da188b84f23a5c10bb87588944b \| \| CLI \| df64d1a1f9d348e3a2f174c4d7d69e73 \| 2026-05-24T19:46:05+08:00 \| \| DISPATCHED \| \| 0 \| 17937da188b84f23a5c10bb87588944b \| \| CLI \| b59834512e2847e1bdf13ace04b8a456 \| 2026-05-24T19:46:06+08:00 \| \| DISPATCHED \| \| 0 \| 17937da188b84f23a5c10bb87588944b \| \| CLI \| 0064bb0ab69344028d1ecfda053826f4 \| 2026-05-24T19:46:03+08:00 \| \| QUEUED \| +-----+----------------------------------+-------+------+----------------------------------+---------------------------+------------+------------+ ``` ### Type of change - [x] New Feature (non-breaking change which adds functionality) --------- Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-05-25 14:00:08 +08:00
Haruko386	5d022d83e8	Go: implement provider: PaddleOCR_Local (#15158 ) ### What problem does this PR solve? Go: implement provider: PaddleOCR_Local Verified from CLI ``` RAGFlow(user)> ocr with 'PaddleOCR-VL@test@paddleocr_local' file './internal/test1.jpg' +----------------------+ \| text \| +----------------------+ \| ## Parallel to these \| +----------------------+ ``` ### Type of change - [X] Bug Fix (non-breaking change which fixes an issue) - [X] New Feature (non-breaking change which adds functionality) - [X] Refactoring	2026-05-25 12:12:57 +08:00
dripsmvcp	8d8ea71877	Go: implement provider: Tencent Hunyuan (#15092 ) ## Summary - Adds a `Hunyuan` Go driver so the new API server can route Tencent Hunyuan chat instances (registered in `conf/llm_factories.json:3830` as `Tencent Hunyuan`). Follows the same SaaS-driver shape used for Astraflow, Avian, Novita, TogetherAI, Replicate, DeepInfra, Upstage, and LongCat. Closes #15087 --------- Co-authored-by: Jin Hai <haijin.chn@gmail.com>	2026-05-25 11:04:39 +08:00
Wang Qi	0ce6655789	Fix: /chat/completions not aware of conversation_id (#15162 ) ### What problem does this PR solve? Fix /chat/completions not aware of conversation_id ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-05-25 10:47:08 +08:00
VincentLambert	50424df48e	feat(i18n): complete French translation — add ~1400 missing keys (#15192 ) ## Summary - Brings the French locale (`web/src/locales/fr.ts`) to full parity with the English reference - Adds ~1400 missing translation keys across all sections: `common`, `chat`, `header`, `login`, `admin`, `setting`, `flow`, `knowledgeDetails`, `knowledgeConfiguration`, `memory`, `skills`, `skillSearch`, `chunk`, `mcp`, `fileManager`, `search`, `dataflowParser`, `datasetOverview`, `deleteModal`, `empty`, `explore`, `memories`, `pagination`, `language`, `knowledgeList` - All strings containing French apostrophes use double-quote delimiters (prevents JS syntax errors) ## Test plan - [ ] `npx esbuild src/locales/fr.ts --bundle=false` — no errors - [ ] `npx eslint src/locales/fr.ts` — no errors - [ ] Switch UI language to French and verify key sections render correctly (chat, knowledge base, admin panel, agent flow) 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-25 10:32:45 +08:00
bitloi	432e966414	fix(go): support OpenAI audio endpoints (#15104 ) ### What problem does this PR solve? Closes #15102. OpenAI's Go provider config advertises `whisper-1` as ASR and `tts-1` as TTS, but the Go driver returned `openai, no such method` for both audio paths and did not define `url_suffix.asr` / `url_suffix.tts`. This PR: - adds OpenAI audio URL suffixes for `audio/transcriptions` and `audio/speech` - implements non-streaming `TranscribeAudio` using multipart form uploads - implements non-streaming `AudioSpeech` using the OpenAI speech JSON request shape - keeps streaming TTS explicitly unsupported instead of sending binary audio through the text SSE sender - adds focused tests for config coverage, ASR/TTS request shape, required TTS voice validation, and unsupported streaming TTS ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) --------- Co-authored-by: Jin Hai <haijin.chn@gmail.com>	2026-05-25 10:25:53 +08:00
Wang Qi	e6dd397531	Fix: /openai/<chat_id>/chat/completions not aware of session_id (#15155 ) ### What problem does this PR solve? Fix: /openai/<chat_id>/chat/completions not aware of session_id ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-05-22 20:38:56 +08:00
Tohka	302f97de50	Go: implement reasoning_chat, TTS, ASR for Groq (#15153 ) ### What problem does this PR solve? Go: implement reasoning_chat, TTS, ASR for Groq Verify from CLI ``` RAGFlow(user)> think chat with 'qwen/qwen3-32b@test@groq' message 'who r u' Thinking: Okay, the user asked, who r u. I need to determine what the user is asking. They may be asking about my identity. I should introduce my name and basic functions. The user might want to know what I can do, so I should list some common use cases, such as answering questions, creating writing, coding, and expressing opinions. The user may be curious about how they can interact with me, so they can be advised to ask any questions or provide instructions. Keep your answers conversational, avoid overly technical terms, keep answers concise, and encourage further interaction. Check if there's any ambiguity in the answer and make sure it's accurate and meets the user's needs. Also consider if there are other aspects the user may be interested in, such as my training data or performance. But since the question is basic, I'll focus on the essentials first and invite the user to ask more. In summary, respond to the user's questions by introducing yourself, your functions, and encouraging further interaction. Answer: Hello! I'm Qwen. I am a large-scale language model developed by Tongyi Lab, designed to assist you in various ways, such as answering questions, creating text, logical reasoning, programming, and more. I aim to provide clear, accurate, and helpful information and support. How can I assist you today? Feel free to ask any questions or give me tasks! 😊 Time: 2.199908 RAGFlow(user)> stream think chat with 'openai/gpt-oss-20b@test@groq' message 'who r u' Thinking: to respond politely. Answer: ’m ChatGPT—an AI language model created by OpenAI. I’m here to answer questions, offer explanations, and help with a wide range of topics. How can I assist you today? RAGFlow(user)> tts with 'canopylabs/orpheus-arabic-saudi@test@groq' text 'hello? show yourself' play format 'wav' param '{"voice": "fahad"}' SUCCESS RAGFlow(user)> asr with 'whisper-large-v3-turbo@test@groq' audio './internal/test.wav' param '{"language": "en"}' +----------------------------------------------------------------------------------------------------------------------+ \| text \| +----------------------------------------------------------------------------------------------------------------------+ \| The examination and testimony of the experts enabled the Commission to conclude that five shots may have been fired \| +----------------------------------------------------------------------------------------------------------------------+ ``` ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2026-05-22 18:02:30 +08:00
Haruko386	3f02ca7ba1	Go: implement embed, rerank, tts for AstraFlow (#15135 ) ### What problem does this PR solve? implement embed, rerank, tts for AstraFlow Verify from CLI ``` # Astraflow RAGFlow(user)> tts with 'IndexTeam/IndexTTS-2@test3@astraflow' text 'hello? show yourself' play format 'wav' param '{"voice": "jack_cheng"}' SUCCESS RAGFlow(user)> rerank query 'what is rag' document 'rag is retrieval augment generation' 'rag need llm' 'famous rag project includes ragflow' with 'bge-reranker-v2-m3@test3@astraflow' top 3; +-------+---------------------+ \| index \| relevance_score \| +-------+---------------------+ \| 0 \| 0.9837390184402466 \| \| 2 \| 0.06322699040174484 \| \| 1 \| 0.04663187265396118 \| +-------+---------------------+ RAGFlow(user)> embed text 'walkerwhat' 'jumperwho' with 'text-embedding-3-large@test3@astraflow' dimension 16 +-----------+-------+ \| dimension \| index \| +-----------+-------+ \| 3072 \| 0 \| \| 3072 \| 1 \| +-----------+-------+ # Xinference ``` ### Type of change - [x] New Feature (non-breaking change which adds functionality) - [x] Refactoring	2026-05-22 18:02:01 +08:00
writinwaters	bf9297a343	Docs: Added a guide on integrating Discord. (#15156 ) ### What problem does this PR solve? How to ingest messages from your Discord server. ### Type of change - [x] Documentation Update	2026-05-22 17:49:18 +08:00
Wang Qi	87918650ff	Refactor: Move API files (#15151 ) Refactor: Move API files	2026-05-22 17:44:05 +08:00
Wang Qi	7e6844118b	Fix search vector_similarity_weight (#15108 ) ### What problem does this PR solve? Fix search vector_similarity_weight ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-05-22 16:05:13 +08:00
ghost	f9ce07ced1	feat(go-models): add Groq provider driver (#15097 ) ### What problem does this PR solve? Closes #15088. Adds Groq support to the Go model-provider layer so Groq instances can be routed through the Go API server with the same OpenAI-compatible chat, streaming, model listing, and connection-check flow used by other SaaS providers. ### Type of change - [x] New Feature (non-breaking change which adds functionality) ## Summary - Added a Groq Go model driver. - Added the Groq provider catalog and default OpenAI-compatible API URL. - Registered Groq in the model factory. - Added focused provider tests. ## What changed - Implemented chat completions, SSE streaming, ListModels, and CheckConnection for Groq. - Covered request shape, stream termination, reasoning fallback, model listing, custom base URLs, safe transport setup, and unsupported methods. - Kept the provider catalog scoped to current Groq chat-capable model IDs. - Cleaned up pre-existing Go model package validation blockers so the package can be tested normally with vet enabled. ## Why The existing Python/provider catalog path includes Groq, but the Go model-provider layer did not have a Groq driver, so the Go API server could not instantiate or use Groq as requested in #15088. ## Notes The model package now validates without disabling vet. --------- Co-authored-by: Jin Hai <haijin.chn@gmail.com>	2026-05-22 15:24:52 +08:00
Lynn	893980ed8f	Fix: add model_type into llm_setting (#15141 ) ### What problem does this PR solve? Add model_type into llm_setting ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-05-22 15:23:07 +08:00
buua436	71a52d579c	fix: move agent attachment download api (#15146 ) ### What problem does this PR solve? move agent attachment download api to the correct route and update frontend callers ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) ### Notes - Move the attachment download endpoint from document routes to agent routes. - Update frontend download callers to use the agent attachment endpoint. - Reuse the shared file response header helper instead of duplicating it in `agent_api.py`.	2026-05-22 15:22:05 +08:00
dripsmvcp	ed04893415	Go: implement provider: TokenPony (#15091 ) ## Summary - Adds a `TokenPony` Go driver so the new API server can route TokenPony chat instances, matching the existing Python `TokenPonyChat` (`rag/llm/chat_model.py:1210`). Follows the same SaaS-driver shape used for Astraflow, Avian, Novita, TogetherAI, Replicate, DeepInfra, Upstage, and LongCat. Closes #15086 --------- Co-authored-by: Jin Hai <haijin.chn@gmail.com>	2026-05-22 15:21:45 +08:00
kpdev	faf77a5a8a	feat(evaluation): track token usage in evaluation results (#13487 ) ## Summary Implements the TODO in `evaluation_service.py`: Track token usage in evaluation results. ## Changes - Import `num_tokens_from_string` from `common.token_utils` - Prompt tokens: Use the full prompt returned by `async_chat` when available (includes system prompt + knowledge base + query), otherwise fall back to the question token count - Completion tokens: Count tokens in the generated answer - Storage: Store `token_usage` as `{prompt_tokens, completion_tokens, total_tokens}` in each `EvaluationResult` instead of `None` ## Why The evaluation pipeline previously saved `token_usage: None` for every result. This change allows downstream consumers (e.g. evaluation dashboards, cost tracking) to see approximate token usage per test case using the same tokenizer (tiktoken cl100k_base) used elsewhere in RAGFlow. ## Testing - No new tests added; existing evaluation flow unchanged - Token counting uses existing `num_tokens_from_string` utility --------- Co-authored-by: kiannidev <kiannidev@users.noreply.github.com>	2026-05-22 15:19:53 +08:00

1 2 3 4 5 ...

6396 Commits