ragflow

mirror of https://github.com/infiniflow/ragflow.git synced 2026-06-29 23:41:12 +08:00

Author	SHA1	Message	Date
kpdev	de18313f97	fix(api): POST /documents/stop removes partial chunks and resets counters (#15789 ) ### What problem does this PR solve? `POST /api/v1/datasets/{dataset_id}/documents/stop` (`stop_parse_documents`) cancels parsing tasks and sets `run` to `CANCEL`, but it does not remove chunks already indexed in the doc store or reset `progress` / `chunk_num`. REST callers can end up with a “cancelled” document that still returns partial chunks in `GET .../chunks` and in retrieval. Legacy `DELETE /api/v1/datasets/{dataset_id}/chunks` (`stop_parsing`) already performs full cleanup: it resets counters and calls `docStoreConn.delete`. This PR aligns the newer stop endpoint with that behavior so both paths leave the dataset consistent. Fixes [#15788](https://github.com/infiniflow/ragflow/issues/15788). ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) - [ ] New Feature (non-breaking change which adds functionality) - [ ] Documentation Update - [ ] Refactoring - [ ] Performance Improvement - [ ] Other (please describe): ### Changes - Update `stop_parse_documents` in `document_api.py` to reset `progress` and `chunk_num` to `0` and delete partial chunks via `docStoreConn.delete` after `cancel_all_task_of`. - Add unit test `test_stop_parse_documents_cleans_partial_chunks` to assert counters reset and doc store delete is invoked. ### Test plan - [x] Unit test: `pytest test/testcases/test_http_api/test_file_management_within_dataset/test_doc_sdk_routes_unit.py::TestDocRoutesUnit::test_stop_parse_documents_cleans_partial_chunks -v` - [ ] Manual: upload a slow document, start parse, call `POST .../documents/stop` while `RUNNING`, verify `GET .../chunks` returns zero chunks and UI `chunk_count` is 0 - [ ] Control: legacy `DELETE .../chunks` behavior unchanged --------- Co-authored-by: Wang Qi <wangq8@outlook.com>	2026-06-11 15:51:32 +08:00
oktofeesh	c15b2b3f66	fix(connectors): enforce WebDAV numeric string size limits (#15731 ) ## Summary - Normalize WebDAV file-size metadata before applying the sync size threshold. - Enforce the same threshold for numeric string sizes in both document sync and slim snapshot paths. - Add focused WebDAV unit coverage for size parsing and over-threshold skips. ## Why Some WebDAV servers return file sizes from PROPFIND metadata as strings. The previous threshold check only handled integer values, so oversized files could still be downloaded and sent into the chunking pipeline. Closes #15724. ## Validation - `uv run --no-project --with pytest --with pytest-asyncio pytest test/unit_test/data_source/test_webdav_connector_unit.py -q` - `uvx ruff check common/data_source/webdav_connector.py test/unit_test/data_source/test_webdav_connector_unit.py` - `python -m compileall -q common/data_source/webdav_connector.py test/unit_test/data_source/test_webdav_connector_unit.py` - `git diff --check` --------- Co-authored-by: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-11 15:47:54 +08:00
Rene Arredondo	b978e26208	fix(db): drop Peewee-auto-named unique index on tenant_model_instance (#15699 ) (#15879 ) ## Summary Fixes #15699. User upgrades to v0.25.6 against an existing MySQL database, tries to add an Ollama provider instance, and gets: ``` MySQL IntegrityError: Duplicate entry 'dbaafbfe608a11f1a5516d6066988224' for key 'tenant_model_instance.tenantmodelinstance_api_key_provider_id' ``` The route at [api/apps/restful_apis/provider_api.py:354](api/apps/restful_apis/provider_api.py#L354) catches it and returns `get_error_data_result(message="Internal server error")` — which by RAGFlow's convention is HTTP 200 with an error `code` on the body — hence the reporter's "200 status code but the database errored" complaint. ### Root cause The provider-instance refactor in [PR #15460](https://github.com/infiniflow/ragflow/pull/15460) dropped the unique-compound-index tuple from `TenantModelInstance`: ```python # Removed in #15460 class Meta: db_table = "tenant_model_instance" indexes = ( (("api_key", "provider_id"), True), # unique ) ``` and added a one-shot drop in `migrate_db()` for existing databases. But the drop targets the wrong index name: ```python # Before this PR — wrong name for table_name, index_name in [ ("tenant_model_instance", "idx_api_key_provider_id"), # ← doesn't exist ("tenant_model", "idx_provider_model_instance"), ]: ``` Peewee's auto-derived index name is `<lowercase classname>_<col1>_<col2>` → `tenantmodelinstance_api_key_provider_id`, which matches the user's error verbatim. The drop raises `OperationalError: 1091 (HY000): Can't DROP …`, the surrounding `except` clause at [db_models.py:1736](api/db/db_models.py#L1736) swallows it as expected-on-fresh-installs, and the legacy unique index lives on indefinitely. ### Why Ollama hits it specifically Ollama doesn't require an API key. The form posts `api_key: ""`. The app-layer dedupe at [provider_api_service.py:288-292](api/apps/services/provider_api_service.py#L288-L292): ```python api_key_str = "" if api_key: # ← skipped for "" ... same_key_instance = TenantModelInstanceService.get_by_provider_id_and_api_key(...) if same_key_instance: return False, f"Already exist instance: ... with api_key {api_key}" ``` falls through for empty keys. Control reaches `TenantModelInstanceService.create_instance(..., api_key="")` which inserts a row whose `(api_key, provider_id) = ("", <provider_uuid>)` collides with any prior Ollama row that already shipped that same pair → the still-present unique index throws. (`dbaafbfe608a11f1a5516d6066988224` in the user's error is the duplicated `provider_id` UUID, paired with the empty `api_key`.) ### Fix Add the Peewee auto-name alongside the existing `idx_` entry so the migration finally drops the obsolete index on next restart: ```python legacy_indexes = [ ("tenant_model_instance", "idx_api_key_provider_id"), ("tenant_model_instance", "tenantmodelinstance_api_key_provider_id"), # ← added ("tenant_model", "idx_provider_model_instance"), ] ``` The surrounding `try/except (OperationalError, ProgrammingError)` matches `1091` / `can't DROP` / `does not exist` and treats them as success, so every state is idempotent (see Test plan). ### Idempotency matrix \| Database state \| First entry (`idx_api_key_provider_id`) \| New entry (`tenantmodelinstance_api_key_provider_id`) \| \| --- \| --- \| --- \| \| Fresh install (≥ #15460) — neither index exists \| `1091` → swallowed \| `1091` → swallowed \| \| Upgraded from before `dc4b82523` (the user's case) — auto-name present \| `1091` → swallowed \| drops the index* \| \| Upgraded after a manual rename to `idx_` \| drops the index \| `1091` → swallowed \| \| Re-run of `migrate_db()` after either of the above \| `1091` → swallowed \| `1091` → swallowed \| No rollback hazard: nothing depends on this unique constraint anymore (`create_instance` dedupes by `instance_name` via `duplicate_name`, see [tenant_model_instance_service.py:27](api/db/services/tenant_model_instance_service.py#L27)). ### What this PR does NOT change - `provider_api_service.create_provider_instance`* — its `if api_key:` gate is correct for the post-migration world: multiple Ollama instances with empty keys under one provider are legitimate, so we shouldn't tighten the app-layer check. - `TenantModelInstance` Peewee model — the `indexes` tuple was already removed in #15460. New databases never get the constraint in the first place. - The `except → get_error_data_result` → HTTP 200 pattern at `provider_api.py:354` — that's a project-wide convention; changing one route to HTTP 500 would be inconsistent and out of scope. ## Test plan - [ ] Reproducer (pre-fix): on a database originally created before #15460, configure an Ollama provider with an empty `api_key`, then try to create a second instance under the same provider — confirm the `Duplicate entry … 'tenantmodelinstance_api_key_provider_id'` error in the server log. - [ ] Verify the index is present pre-restart: `SHOW INDEX FROM tenant_model_instance WHERE Key_name = 'tenantmodelinstance_api_key_provider_id';` — non-empty result. - [ ] Restart with the fix applied: server starts cleanly, `migrate_db()` runs, no `Failed to drop index` in critical logs. - [ ] Verify the index is gone post-restart: same `SHOW INDEX` query — empty result. - [ ] Re-run the reproducer: two Ollama instances under the same provider, both `api_key=""`, both succeed. - [ ] Restart a second time — no new errors; the matching `1091` swallow keeps the migration idempotent. - [ ] Fresh install smoke test: drop the DB volume, start clean — no `1091` noise (the new index never existed), no functional regression. ## Files changed - [api/db/db_models.py](api/db/db_models.py) — extend the legacy-index drop list with `tenantmodelinstance_api_key_provider_id`; refactor the inline list to a named `legacy_indexes` local with a comment pointing at #15460 and #15699. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) - [ ] New Feature (non-breaking change which adds functionality) - [ ] Documentation Update - [ ] Refactoring - [ ] Performance Improvement - [ ] Other (please describe): Co-authored-by: Wang Qi <wangq8@outlook.com>	2026-06-11 15:47:12 +08:00
monsterDavid	a851228ded	fix(preview): authenticate markdown document preview requests (#15589 ) ## Summary Fixes [#15585](https://github.com/infiniflow/ragflow/issues/15585). - Route markdown preview through the shared `request` client (same as txt/image previewers) so `Authorization` headers and interceptors are applied consistently. - Add a unit test covering `AUTH_BETA` token loading for embedded search auth. ## Root cause Search result preview for `.md`/`.mdx` used raw `fetch`, which did not apply the same auth path as other preview types. That led to `401` on `GET /api/v1/documents/{id}/preview` even when the user was logged in or using an embedded search `auth` query param. ## Test plan - [ ] Log in, run a search, open a markdown citation link — preview loads (no 401). - [ ] Open an embedded shared search URL with `auth` query param, preview a markdown file — preview loads. - [ ] Confirm PDF/txt preview still works in the same search UI. --------- Co-authored-by: MkDev11 <89318445+bitloi@users.noreply.github.com> Co-authored-by: Wang Qi <wangq8@outlook.com>	2026-06-11 15:46:20 +08:00
bohdansolovie	47fb462e46	fix(api): guard dataset delete when File2Document row is missing (#15533 ) ## Summary Fixes #15532 — `delete_datasets()` crashes with `IndexError` when a document has no `File2Document` row. `delete_datasets()` in `dataset_api_service.py` called `File2DocumentService.get_by_document_id()` and immediately accessed `f2d[0].file_id` without checking whether the lookup returned any rows. Documents created via API ingestion or connector sync may exist without a linked file record, causing dataset deletion to abort with HTTP 500. This PR mirrors the existing guard already used in `file_service.py` and `document_api_service.py`.	2026-06-11 15:18:08 +08:00
Idriss Sbaaoui	9871a7e0b6	fix: replicate model provider (#15933 ) ### What problem does this PR solve? FIx replicate model provider failing with valid api key ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) --------- Co-authored-by: Wang Qi <wangq8@outlook.com>	2026-06-11 15:08:33 +08:00
Rene Arredondo	3f929e3904	fix(es): downgrade LLM-generated invalid SQL to WARNING in ES sql() (#15409 ) (#15709 ) ## Summary Fixes #15409. Reporter sees scary ERROR-level stack traces in `ragflow_server.log` on every chat turn against a knowledge base whose spreadsheet has many columns with embedded IDs (e.g. `id-wstc-bios fvt-322-wstc-bios fvt-323`). Simple queries work; complex ones return "No answer" with logs that look like a hard crash. ### What's actually happening 1. The user uploads a wide Excel/CSV. [rag/app/table.py:477-493](rag/app/table.py#L477-L493) turns each header into an ES field with a type suffix, e.g. `id-wstc-bios fvt-322-wstc-bios fvt-323_tks`. This is correct — the parser faithfully encodes the user's column names. 2. The user asks about test case `fvt-085`. The SQL chat path in [api/db/services/dialog_service.py:914 use_sql](api/db/services/dialog_service.py#L914) asks the LLM to write SQL using the field list. The LLM sees the `id-wstc-bios fvt-NNN-wstc-bios fvt-MMM_tks` pattern and pattern-completes a plausible-but-nonexistent column. 3. Elasticsearch rejects with `BadRequestError(400, 'verification_exception')`: `Unknown column [id-wstc-bios fvt-085-wstc-bios fvt-086_tks]` and suggests the closest valid column. 4. The recovery path already exists: `use_sql` catches the exception, re-prompts the LLM with the error text (which contains ES's "did you mean" hint), and on second failure the caller at [api/db/services/dialog_service.py:626](api/db/services/dialog_service.py#L626) falls back to vector search. The chat does produce an answer — it's just generated from the vector hits instead of SQL. The only real bug is logging: - [common/doc_store/es_conn_base.py:399](common/doc_store/es_conn_base.py#L399) catches every exception with `self.logger.exception(...)`, which writes a full traceback at ERROR level. - For LLM-generated SQL this is the hot path, not an exceptional condition — it can fire twice per turn before the fallback runs. ### Fix Catch `elasticsearch.BadRequestError` (the parent class of `verification_exception` / `parsing_exception` / similar SQL-validity errors) separately and log it at WARNING with the SQL plus ES error message. The message still carries the unknown column name and ES's suggested alternative, so it's actionable for anyone investigating "why is my LLM producing bad SQL?" — just without the misleading stack trace. Other exception types (`ConnectionTimeout`, generic `Exception`) keep their original `ERROR`-level traceback treatment; those represent real connectivity / library bugs. This is a one-file, two-line-net change. The retry loop in `use_sql`, the `add_kb_filter` injection, and the vector-search fallback are all unchanged. ### What this PR does NOT change - The LLM prompts in `use_sql` — they already specify `Use EXACT field names from the schema` and pass the field list explicitly. Strengthening them risks regressing well-behaved cases and is out of scope for #15409. - The single-retry policy — extending it to multi-retry with extracted ES suggestions is a separate enhancement. - The parser at `rag/app/table.py` — the field names match the user's actual column headers; the parser is doing its job. ## Files changed - [common/doc_store/es_conn_base.py](common/doc_store/es_conn_base.py) - Add `BadRequestError` to the `elasticsearch` import. - In `ESConnectionBase.sql()`, add an `except BadRequestError` arm above the generic `except Exception` that logs at WARNING and re-raises (so `use_sql` retry/fallback still triggers).	2026-06-11 15:04:52 +08:00
zaviermeekz-cpu	a1dc2da7b4	fix: add model_name to embed completion request (#15883 ) (#15888 ) ### What problem does this PR solve? When embedding a chatbot, the API returned `"Model Name is required"`. The embed widget now includes the assistant's `llm_id` as `model_name` in the completion request. ### Type of change - [x] Bug Fix ### How has this been tested? - Created a chatbot with a default model. - Embedded it and sent a message – the error is gone and the assistant replies correctly. ### Related Issue Closes #15883 Co-authored-by: RAGFlow Dev <dev@ragflow.local> Co-authored-by: Wang Qi <wangq8@outlook.com>	2026-06-11 14:38:37 +08:00
balibabu	5d3f8bbf32	Fix: The regular expression configuration for pipeline header-based chunking will be reset. (#15935 ) ### What problem does this PR solve? Fix: The regular expression configuration for pipeline header-based chunking will be reset. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-06-11 14:12:24 +08:00
Wang Qi	906618fb30	Fix Agent chat Minimax content in thinking (#15937 ) Fix Agent chat Minimax content in thinking	2026-06-11 14:09:57 +08:00
Jin Hai	ca00d23aac	Go: add parse and chunk command (#15936 ) ### What problem does this PR solve? Two commands are used for ingestion file testing ``` RAGFlow(api/default)> chunk 'file' with 'dsl'; Chunk file: file, DSL: dsl SUCCESS RAGFlow(api/default)> parse file 'filename' chat 'xxx'; Success to parse local file "filename", vision: , chat: xxx, asr: , ocr: , embedding: , doc_parse: SUCCESS ``` ### Type of change - [x] New Feature (non-breaking change which adds functionality) Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-06-11 13:33:26 +08:00
Haruko386	84edf539e7	Go: Refactor list-models func (#15900 ) ### What problem does this PR solve? As title Issue: #15853 ### Type of change - [x] Refactoring	2026-06-11 13:32:50 +08:00
JPette1783	4b10c0b885	fix(go-models): guard nil pointers in DeepSeek and VolcEngine streaming (#15817 ) ### What problem does this PR solve? `ChatStreamlyWithSender` in two Go model drivers could panic on nil pointer dereferences when a caller passes a nil model config or omits the reasoning `Effort`: - deepseek.go - `switch chatModelConfig.Effort` dereferenced `Effort` without a nil check. It now defaults to `"high"` when nil. - volcengine.go* - the `modelConfig` pointer itself was dereferenced (`Stream`, `MaxTokens`, `Temperature`, .) with no guard, and `Effort` was dereferenced unchecked. `modelConfig` now defaults to an empty `&ChatConfig{}` when nil so the optional-field accesses are safe, and `Effort` defaults to `"medium"` when nil. Addresses the CodeRabbit review on `volcengine.go` `ChatStreamlyWithSender`. Per maintainer feedback ("one PR do one thing"), the unrelated `handler/auth.go` and `service/heartbeat_sender.go` changes were removed so this PR is scoped to the model-provider fixes. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-06-11 13:32:24 +08:00
Rene Arredondo	19104168a6	fix(sync): tolerate list inputs for Discord server_ids / channels (#15790 ) (#15809 ) ## Summary Fixes #15790. Every Discord sync launched from the current Web UI crashes immediately with: ``` 'list' object has no attribute 'split' ``` The error is raised in [rag/svr/sync_data_source.py:650-651](rag/svr/sync_data_source.py#L650-L651): ```python server_ids=server_ids.split(",") if server_ids else [], channel_names=channel_names.split(",") if channel_names else [], ``` ### Root cause Three independent bugs stack here, all in the Discord branch of `sync_data_source.py`: 1. Type mismatch (the user's exact error). The current form at [web/src/pages/user-setting/data-source/constant/index.tsx:833-843](web/src/pages/user-setting/data-source/constant/index.tsx#L833-L843) uses `FormFieldType.Tag` for both Server IDs and Channels: ```tsx { label: 'Server IDs', name: 'config.server_ids', type: FormFieldType.Tag, required: false }, { label: 'Channels', name: 'config.channels', type: FormFieldType.Tag, required: false }, ``` Tag inputs serialise to lists, not comma-separated strings. The backend `.split(",")` then explodes on the very first sync. 2. Field-name mismatch. The form writes `config.channels`. The backend reads `self.conf.get("channel_names", None)`. Even if `.split(",")` were fixed, channels would silently be empty for every UI-created source. 3. Int conversion missing. [common/data_source/discord_connector.py:82](common/data_source/discord_connector.py#L82) types `server_ids` as `list[int]` (Discord guild IDs are integers); the previous `.split(",")` produced strings, so the `channel.guild.id not in server_ids` filter at [discord_connector.py:92](common/data_source/discord_connector.py#L92) silently never matched. So even the configurations that didn't crash were also broken — there is no path through the current code that actually filtered by server id from a UI-created source. ### Fix A 39-line patch in one function: - New `Discord._coerce_str_list` static method: accepts `None` / `""` / `list` / `tuple` / `set` / scalar / comma-separated str, returns a clean `list[str]` with whitespace trimmed and empty entries dropped. Smoke-tested against the 10 input shapes that can hit it (see Test plan). - `_generate` reads `config.channels` first (the form's actual key) and falls back to `config.channel_names`, so SDK callers and legacy configs that already shipped with the old key keep working. - `server_ids` is coerced to `list[int]`. Non-integer entries are logged and dropped instead of crashing the sync, so a single malformed tag from the form doesn't tank the rest of the run. ### What this PR does NOT change - Web form key (`config.channels`) — kept as-is. Renaming it to `channel_names` would force a UI migration and break in-flight configs; the backend fallback solves the same problem more safely. - `common/data_source/discord_connector.py` — its signature was already correct. - Other connectors (Slack, Gmail, Confluence, etc.) — they don't crash today and were not in the issue's scope. ## Test plan `Discord._coerce_str_list` has been exercised against all ten realistic input shapes — list, tuple, set, comma-separated string, str with extra whitespace, empty entries, integers from a Tag input, None, empty list, single trailing comma. All pass.	2026-06-11 13:27:42 +08:00
zaviermeekz-cpu	c50f9c59aa	fix: allow zero message history window and clear history for new sessions (#15897 ) (#15902 ) ### What problem does this PR solve? Two bugs in the Agent Categorize component: 1. The backend rejected `message_history_window_size = 0` while frontend allowed it, causing API errors. 2. When calling the agent API without a `session_id`, a new session was created but retained history from previous conversations. ### Type of change - [x] Bug Fix ### How has this been tested? - Issue 1: `CategorizeParam().check()` now accepts `0` and rejects negative values. - Issue 2: `canvas.clear_history()` is called for new sessions (no `session_id`), ensuring fresh conversation state. Verified via UI and API that a second call without `session_id` does not remember the first conversation. ### Related Issue Closes #15897 Co-authored-by: RAGFlow Dev <dev@ragflow.local> Co-authored-by: Wang Qi <wangq8@outlook.com>	2026-06-11 13:24:48 +08:00
Rene Arredondo	a079c08594	fix(deps): exclude litellm 1.82.6 (internal ImportError) — #15916 (#15920 ) ## Summary Fixes #15916. A fresh `docker compose -f docker-compose-macos.yml up -d` against v0.25.6 errors out on container start with	2026-06-11 11:40:07 +08:00
Wang Qi	238a01d9e3	Fix multiple tags (#15931 ) Fix multiple tags	2026-06-11 10:55:28 +08:00
Lynn	32559d2dfc	Fix: model list (#15914 ) ### What problem does this PR solve? Display OCR tag for model providers. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-06-11 09:40:45 +08:00
Rene Arredondo	bf59eb77cc	feat(go-api): port forgot-password flow to Go (#15282 ) (#15290 ) ## Summary Implements chunk 1 of #15282 — the four `/api/v1/auth/password/...` endpoints from the login-page Go port. Chunk 2 (OAuth/OIDC) is deferred to its own subtask, matching the issue author's own confidence-low recommendation ("multi-provider, stateful redirect flow with external dependencies; recommend its own subtask"). New endpoints, all registered under `apiNoAuth` (forgot-password users are unauthenticated by definition): \| Method \| Path \| Status \| \|--------\|------\|--------\| \| `POST` \| `/api/v1/auth/password/forgot/captcha` \| new \| \| `POST` \| `/api/v1/auth/password/forgot/otp` \| new \| \| `POST` \| `/api/v1/auth/password/forgot/otp/verify` \| new \| \| `POST` \| `/api/v1/auth/password/reset` \| new \| ## Wire compatibility with the Python backend The two backends share state through Redis, so the Go port had to use identical keys, encodings, and constants. Either backend can now validate a code the other minted. - Redis keys: `captcha:<email>`, `otp:<email>`, `otp_attempts:<email>`, `otp_last_sent:<email>`, `otp_lock:<email>`, `otp:verified:<email>` — same as `api/utils/web_utils.py`. - Stored OTP value: `"<hex_hash>:<hex_salt>"` — same as Python. - Hash: HMAC-SHA256 with a `crypto/rand` 16-byte salt — same as `hash_code()`. - Constants: `OTP_LENGTH=4`, `OTP_TTL=5min`, `ATTEMPT_LIMIT=5`, `ATTEMPT_LOCK_SECONDS=30min`, `RESEND_COOLDOWN_SECONDS=60s` — all match `api/utils/web_utils.py`. - Email body: matches `RESET_CODE_EMAIL_TMPL` byte-for-byte. ## Files ### New \| File \| Purpose \| \|---\|---\| \| `internal/utility/otp.go` \| OTP/captcha constants, Redis key builders (`CaptchaRedisKey`, `OTPRedisKeys`, `OTPVerifiedRedisKey`), `HashOTPCode`, `GenerateOTPCode` / `GenerateCaptchaCode` / `GenerateOTPSalt` via `crypto/rand`, and `EncodeOTPStorageValue` / `DecodeOTPStorageValue` matching Python's storage shape. \| \| `internal/utility/smtp.go` \| Minimal stdlib `net/smtp` sender. `SendResetCodeEmail(to, otp, ttlMin)` builds an RFC 5322 plain-text message and dispatches via implicit TLS / STARTTLS / plain — same selectors as Python `aiosmtplib`. Returns `SMTPNotConfiguredError` if the config block is empty. \| ### Modified \| File \| Change \| \|---\|---\| \| `internal/server/config.go` \| New `SMTPConfig` struct + `Config.SMTP` field. Field names mirror the `smtp:` keys in `common/settings.py` (`mail_server`, `mail_port`, `mail_use_ssl`, `mail_use_tls`, `mail_username`, `mail_password`, `mail_from_name`, `mail_from_address`, `mail_frontend_url`) so a single `conf/service_conf.yaml` powers both backends. \| \| `internal/service/user.go` \| Four methods — `ForgotIssueCaptcha`, `ForgotSendOTP`, `ForgotVerifyOTP`, `ForgotResetPassword`. Reuses the existing `decryptPassword`, `HashPassword`, `userDAO.Update`, and `utility.GenerateToken` so the reset+auto-login path is identical to `LoginByEmail`. \| \| `internal/handler/user.go` \| Four handlers in the same `c.JSON` shape as `LoginByEmail`. The reset handler rotates the access token and emits an `Authorization` header for auto-login (matches Python `construct_response(auth=user.get_id())`). \| \| `internal/router/router.go` \| Routes registered under `apiNoAuth`, with an explanatory comment on why they sit outside the auth middleware. \| ## Known divergence — captcha rendering The Python endpoint returns a rendered `image/JPEG` from the `python-captcha` library. The Go side has no image-captcha dependency vendored in `go.mod`, and hand-rolling a raster generator was out of scope for this PR. This commit returns JSON `{captcha: "<text>"}` instead. Implications: - Backend gate is identical — the OTP step still verifies the user-submitted captcha string against the Redis value, so the security model is unchanged. - Frontend impact: the password-reset page rendering needs a small tweak (text display instead of `<img>`) until a Go captcha library is wired in. - The handler comments call this out explicitly so the next PR knows what to swap. Possible follow-ups (any one closes the gap): 1. Add `github.com/mojocn/base64Captcha` or `github.com/dchest/captcha` to `go.mod` and replace the JSON response with an `image/JPEG`. 2. Hand-roll a 5x7 bitmap font + `image/png` writer using only the stdlib. 3. Render a server-side SVG (cheap, but trivially OCR-able — only useful as a UI shim). ## Test plan - [ ] Captcha: `POST /api/v1/auth/password/forgot/captcha?email=<existing>` returns `{code: 0, data: {captcha: "ABCD"}}`. Redis shows `captcha:<email>` with that value and ~60s TTL. Unknown email returns `code: CodeDataError`. - [ ] OTP send: `POST /api/v1/auth/password/forgot/otp` with the right captcha mints an OTP, stores `<hash>:<salt>` under `otp:<email>` for 5 min, sends an email, returns success. With a wrong captcha returns `CodeAuthenticationError`. Hitting it again within 60s returns "you still have to wait …" with `CodeNotEffective`. - [ ] OTP verify: correct OTP → `code: 0`, OTP keys cleared, `otp:verified:<email>` = `"1"`. Wrong OTP → `code: CodeAuthenticationError`, attempt counter bumped; after 5 wrong tries `otp_lock:<email>` is set and further attempts hit `CodeNotEffective`. - [ ] Reset: with the verified flag set, supply a new password (RSA-encrypted+base64, same as `LoginByEmail`). Returns `code: 0`, `Authorization` header set, verified flag deleted. Without the verified flag returns `CodeAuthenticationError`. - [ ] Wire-compat smoke: mint an OTP from the Python backend, verify it via the Go endpoint, and vice versa. Should both succeed. - [ ] SMTP misconfigured: drop `smtp.mail_server` from `conf/service_conf.yaml`. The OTP-send endpoint should now return "failed to send email" without panicking; check the log for the `SMTPNotConfiguredError` warning. - [ ] End-to-end FE: hit the password-reset flow from `web/src/pages/login-next/`. Confirm the text-captcha shim works after the FE tweak. - [ ] `go build ./...` and `go vet ./...` — I could not run these in the sandbox; please confirm a clean build before merging. - [ ] `uv run pytest` to confirm no Python regressions (shared Redis schema). ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2026-06-10 21:27:56 +08:00
Jonathan Chang	dfcf226ba3	feat: Implement API of ragflow server in Go (#15256 ) ## Summary - Implemented the Go API endpoint for Memory message forgetting: - `DELETE /api/v1/messages/{memory_id}:{message_id}` - Added route registration for the Memory message DELETE endpoint only. - Added request path validation for `memory_id:message_id`. - Added service logic to mark a message as forgotten by setting `forget_at`. - Preserved Python-compatible response behavior: - Success returns `code: 0`, `message: true`, `data: null`. - Added focused unit tests for message path parsing and invalid message ID handling. - Fixed Linux cgo linker config to use the installed shared PCRE2 library so Go tests/builds can run in this environment. ## Related Issue Closes: #15240 ## Change Type - [x] Feature - [x] Test - [x] Build / CI compatibility ## Implemented API - `DELETE /api/v1/messages/{memory_id}:{message_id}` ## Real Behavior Proof Validated with targeted Go tests: ```bash /tmp/go1.25.0/bin/go test ./internal/handler ./internal/router ``` Result: ```text ok ragflow/internal/handler ? ragflow/internal/router [no test files] ``` Validated server entrypoint build: ```bash /tmp/go1.25.0/bin/go build -o /tmp/ragflow-server-main ./cmd/server_main.go ``` Result: ```text build succeeded ``` Validated patch formatting: ```bash git diff --check ``` Result: ```text no whitespace errors ``` ## Checklist - [x] Implemented only `DELETE /api/v1/messages/{memory_id}:{message_id}`. - [x] Did not implement unrelated Memory message APIs. - [x] Added route registration. - [x] Added handler validation. - [x] Added service-level memory access check. - [x] Added tests. - [x] Ran targeted Go tests. - [x] Ran server build validation. - [x] Ran `git diff --check`.	2026-06-10 21:27:35 +08:00
Jin Hai	3e4fb8cf1c	Go: fix test and remove unused code (#15909 ) ### What problem does this PR solve? 1. Fix go test, some cases still failed. 2. Remove unused code. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) --------- Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-06-10 20:38:43 +08:00
Hz_	e132173d1a	feat(go): Update Qwen models in all_models.json (#15910 ) ## Summary - Add official Qwen models to `conf/all_models.json` with `qwen/` canonical names - Include verified aliases from official Qwen/Hugging Face model IDs and common provider naming - Add metadata for context length, model types, thinking support, and embedding dimensions ## Details - Added Qwen model families from the official Hugging Face Qwen organization - Normalized canonical model names to the `qwen/...` format - Preserved official HF IDs and lowercase/common aliases for lookup compatibility - Added `dimension` for Qwen embedding models - Added or corrected `max_tokens` for Qwen model families, including: - Qwen2.5 Instruct variants - Qwen3 original, 2507, VL, Coder, Coder-Next, Next, Embedding, and Reranker models - Qwen3.5 and Qwen3.6 models - QwQ models - Added verified `thinking` metadata where officially supported - Corrected `model_types` for Qwen Image, Omni, Audio, VL, embedding, reranker, benchmark, and tokenizer entries	2026-06-10 20:37:01 +08:00
Hz_	515acf4f60	fix(go): Fix case-insensitive model alias lookup (#15911 ) ## Summary - Normalize model alias index keys to lowercase - Detect lowercase alias collisions during provider manager initialization - Fix ListModels metadata mapping for mixed-case provider aliases	2026-06-10 20:36:43 +08:00
chanx	dfa4c5a795	Fix: add image2text/speech2text/ocr support (#15915 ) ### What problem does this PR solve? Fix: add image2text/speech2text/ocr support ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-06-10 20:28:25 +08:00
Wang Qi	acaeb416ca	Fix cannot add fish audio (#15913 ) Fix cannot add fish audio	2026-06-10 20:27:43 +08:00
chanx	1fd9e1df8e	Fix: add thin scrollbar styling for x-spreadsheet component (#15912 ) ### What problem does this PR solve? Fix: add thin scrollbar styling for x-spreadsheet component ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-06-10 19:39:00 +08:00
balibabu	aafe6c5534	Fix: The dataset retrieval test returned an incorrect total number. (#15901 ) ### What problem does this PR solve? Fix: The dataset retrieval test returned an incorrect total number. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) --------- Co-authored-by: balibabu <assassin_cike@163.com>	2026-06-10 19:11:31 +08:00
buua436	2980981da2	fix: route visual agent calls to image model (#15906 ) ### What problem does this PR solve? Ensure agent components with image inputs route to `image2text` models instead of staying on the chat path, so visual requests use the CV wrapper when supported. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-06-10 19:09:18 +08:00
Jack	0d3e410826	fix: strip Ollama-style tag suffix from LocalAI model names (#15908 ) ## Summary LocalAI exposes two API surfaces with conflicting naming conventions: - `GET /api/tags` returns model names with `:latest` suffix (Ollama format) - `POST /v1/chat/completions` expects names without `:latest` (OpenAI format) RAGFlow discovered models via `/api/tags` and stored the tagged name, then used it with `/v1/chat/completions`, causing a 404 error because LocalAI didn't recognize `model:latest`. ## Fix In `LocalAI.get_model_list()`, strip the tag suffix from model names using `model["name"].rsplit(":", 1)[0]`, so stored names match what the OpenAI-compatible endpoints expect.	2026-06-10 19:05:05 +08:00
Lynn	7355db183f	Fix: model list (#15905 ) ### What problem does this PR solve? Set OpenDataLoader and call in parser and naive ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-06-10 17:44:50 +08:00
Wang Qi	ad63877f04	Fix cannot add bedrock (#15904 ) Fix cannot add bedrock	2026-06-10 17:08:15 +08:00
少卿	8e17a12990	fix: remove think text buffering for real-time reasoning stream (#15891 ) Fix: remove think text buffering for real-time reasoning stream	2026-06-10 16:55:57 +08:00
Wang Qi	3091d91cf7	Fix no need to put inactive models to bottom (#15903 ) Fix no need to put inactive models to bottom	2026-06-10 16:55:02 +08:00
Hunnyboy1217	16d5b4fa02	feat[Go]: implement POST /api/v1/files/link-to-datasets (#15674 ) ### What problem does this PR solve? Closes #15673 — ports the Python `file2document_api.py` `convert()` endpoint to Go. \| Method \| Path \| Handler \| \|--------\|------\|---------\| \| POST \| `/api/v1/files/link-to-datasets` \| `FileHandler.LinkToDatasets` \| ### Type of change - [x] New Feature (non-breaking change which adds functionality) --- #### Implementation notes Files changed: ``` internal/service/file2document.go – new service (File2DocumentService) internal/dao/file2document.go – added Create method internal/handler/file.go – FileHandler gains file2DocumentService; LinkToDatasets HTTP handler internal/router/router.go – route registered ``` Functional parity table: \| Concern \| Go behaviour \| \|---------\|-------------\| \| Required fields \| `file_ids` and `kb_ids` both required; missing either → `CodeDataError` mirroring Python `@validate_request` \| \| File existence \| `fileDAO.GetByIDs(fileIDs)` builds a set; any missing ID → `"File not found!"` \| \| KB existence \| `kbDAO.GetByID(kbID)` per KB; missing → `"Can't find this dataset!"` \| \| Folder expansion \| `getAllInnermostFileIDs` recursively calls `fileDAO.ListByParentID` — mirrors `FileService.get_all_innermost_file_ids` \| \| File permissions \| `checkFileTeamPermission`: `file.TenantID == userID` OR user in tenant's team — mirrors `check_file_team_permission` \| \| KB permissions \| `checkKBTeamPermission`: `kb.TenantID == userID` OR user in tenant's team — mirrors `check_kb_team_permission` \| \| Fire-and-forget \| `go convertFiles(...)` goroutine after all validation passes — mirrors `loop.run_in_executor(None, _convert_files, …)` \| \| Conversion \| `convertFiles`: for each file → delete existing mappings + hard-delete old documents → create new `Document` in each target KB → create `File2Document` mapping — mirrors Python `_convert_files` \| \| `getParser` \| Extension-based lookup with fallback to `kb.ParserID` — mirrors `FileService.get_parser` \| \| Immediate return \| `true` returned to caller as soon as goroutine is scheduled \| --------- Co-authored-by: Yingfeng <yingfeng.zhang@gmail.com>	2026-06-10 16:46:55 +08:00
Hz_	3796835c4d	feat(go-api): migrate agent file download handler to Go with strict P… (#15769 ) ## What does this PR do? This PR migrates the Agent Temporary File Download endpoint (`GET /api/v1/agents/download`) from the Python backend to the Go backend, optimizing the data retrieval flow and maintaining strict functional parity. It also fixes a persistent parsing error in the Sandbox code execution node. ## Checklist - [x] Code logic matches Python implementation - [x] All local unit tests passed - [x] No breaking changes to existing router interfaces Co-authored-by: Yingfeng <yingfeng.zhang@gmail.com>	2026-06-10 16:09:36 +08:00
Jin Hai	139f4515e8	Go: refactor CLI (#15898 ) ### What problem does this PR solve? 1. remove unused code 2. fix login issue ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) - [x] Refactoring Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-06-10 16:06:30 +08:00
Idriss Sbaaoui	357cb84cd4	Fix: cohere call failing (#15899 ) ### What problem does this PR solve? cohere api call failing because of missing prefix ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-06-10 15:57:10 +08:00
buua436	dcf623d60d	feat: support multi-type factory models (#15893 ) ### What problem does this PR solve? Support factory models with multiple model types, so visual chat models can be exposed as both image2text and chat while preserving the database model-type-per-record design. This also updates the SILICONFLOW model list and adds a helper script to refresh SiliconFlow models from the provider API. ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2026-06-10 15:35:21 +08:00
Lynn	478c9846a1	Fix: model list (#15860 ) ### What problem does this PR solve? Remove tenant_llm call in rag. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-06-10 14:59:57 +08:00
Wang Qi	899f76af6b	Fix add OpenRouter base_url, UI need to select at least one model to verify (#15894 ) Fix add OpenRouter base_url, UI need to select at least one model to verify	2026-06-10 14:59:27 +08:00
chanx	6822307436	fix: rename ark_api_key to api_key for volcengine provider config (#15896 ) ### What problem does this PR solve? fix: rename ark_api_key to api_key for volcengine provider config ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-06-10 14:56:38 +08:00
Lynn	f632bb4a85	Fix: tenant_model migrate (#15886 ) ### What problem does this PR solve? Find instance for models. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-06-10 14:06:23 +08:00
chanx	c23809a4bd	Fix: Fix some model provider-related UI issues (#15884 ) ### What problem does this PR solve? Fix: Fix some model provider-related UI issues ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-06-10 14:05:57 +08:00
Hz_	38755c705a	feat(go): Add DeepSeek models and Gitee alias metadata tests (#15885 ) This PR expands conf/all_models.json with DeepSeek model entries and provider aliases. Changes: - Added DeepSeek model entries across `V4`, `V3.2`, `V3.1`, `V3`, `R1`, `Coder`, `Math`, `VL`, `OCR`, `Prover`, `MoE`, and `LLM` series. - Normalized model name values to lowercase canonical IDs. - Added alias values for official DeepSeek/Hugging Face names and provider-specific names from OpenRouter, VolcEngine, SiliconFlow, HuaweiCloud, and QiniuCloud. - Preserved model metadata such as max_tokens, model_types, and thinking where applicable. - Added Gitee ListModels tests to verify DeepSeek aliases map back to model metadata from all_models.json. - Added an optional Gitee integration test gated by GITEE_LIST_MODELS_INTEGRATION=1. Test: /usr/local/go/bin/go clean -cache /usr/local/go/bin/go test ./internal/entity/models -run 'TestGiteeListModels(MapsAllDeepSeekAliasesToModelMetadata\|KeepsOwnedBySuffixAfterAliasMetadataLookup\| Integration)'	2026-06-10 13:59:23 +08:00
buua436	093eec3105	fix: handle qwen rerank error response (#15881 ) ### What problem does this PR solve? Fix QWen rerank error handling so DashScope error responses without a text attribute do not raise a secondary KeyError and hide the real provider error. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-06-10 13:05:24 +08:00
Wang Qi	9aa81e7cad	Fix paddle ocr / minerU cannot add (#15858 ) Fix paddle ocr / minerU cannot add	2026-06-10 13:04:13 +08:00
Idriss Sbaaoui	7f4bf69f05	Enhancement: slim Docker image, add .dockerignore, fix Go binary shipping (#15880 ) ### What problem does this PR solve? The RAGFlow Docker image was 9.06 GB with build-only compiler packages leaking into the runtime, duplicate frontend source shipped alongside compiled assets, and no .dockerignore causing ~6 GB of unnecessary context transfer per build. ### Type of change - [x] Performance Improvement	2026-06-10 11:44:22 +08:00
oktofeesh	bbc1f2ecec	feat(go-api): add RAG retrieval to chat completions (#15739 ) ## Summary - Add knowledge-base retrieval support to Go chat completions. ## What changed - Routes KB-backed chat sessions through the Go retrieval service instead of falling back to solo chat. - Resolves embedding and rerank models, validates accessible knowledge bases, and preserves tenant-aware retrieval. - Rejects mixed embedding models across selected knowledge bases before retrieval to avoid incompatible vector dimensions. - Threads the HTTP request context into streaming retrieval so cancelled requests can stop downstream retrieval work. - Applies metadata filters and message-level `doc_ids` before retrieval. - Expands parent/child chunks before building references and prompt context. - Injects retrieved knowledge through a copied dialog prompt config so the caller's original dialog is not mutated. - Honors configured empty responses when no chunks are found. - Names the metadata no-match sentinel and reuses it across retrieval/handler paths. - Adds a defensive content cast while appending streamed answers. - Adds focused unit coverage for retrieval, metadata filtering, authorization, multimodal messages, references, empty-response behavior, prompt immutability, and mixed embedding models. --------- Co-authored-by: Yingfeng <yingfeng.zhang@gmail.com> Co-authored-by: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-10 11:07:45 +08:00
Jin Hai	7c1bd9a5a5	Go CLI: switch to admin/api server (#15861 ) ### What problem does this PR solve? ``` RAGFlow(api/default)> use admin SUCCESS RAGFlow(api/default)> use api 'abc'; SUCCESS ``` ### Type of change - [x] New Feature (non-breaking change which adds functionality) --------- Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-06-10 10:57:00 +08:00
writinwaters	9d9c2dc92c	Docs: Supported model providers and URLs updated (#15866 ) ### What problem does this PR solve? Updated supported model providers and the corresponding URLs. ~~Synced supported model providers and base URLs with llm_factories.json, while keeping the AI Badgr configuration example via the OpenAI-API-Compatible provider.~~ ### Type of change - [x] Documentation Update	2026-06-10 10:18:14 +08:00

1 2 3 4 5 ...

6708 Commits