ragflow

mirror of https://github.com/infiniflow/ragflow.git synced 2026-06-29 23:41:12 +08:00

Author	SHA1	Message	Date
Hz_	e290a0d23e	feat(go-api): Langfuse API key migration behavior (#16356 ) ## Summary - Align Langfuse API key set/get/delete behavior with the Python implementation. - Improve DAO handling for Langfuse credential save/delete flows. - Add tests for Langfuse service error handling and API key lifecycle behavior.	2026-06-25 19:25:55 +08:00
Rander	017adf841f	fix(paddleocr): support PP-OCRv6 ocrResults fallback and integrate image parsing (#16150 ) ## Summary This PR fixes two issues discovered during testing of the PaddleOCR async API refactoring: ### 1. PP-OCRv6 returns `ocrResults` instead of `layoutParsingResults` Models like PP-OCRv6 are pure text recognition models that return results in `ocrResults.prunedResult.rec_texts` format rather than the `layoutParsingResults.prunedResult.parsing_res_list` format used by layout-aware models (PaddleOCR-VL series). Changes: - `deepdoc/parser/paddleocr_parser.py`: Extract `ocrResults` alongside `layoutParsingResults` in `_send_request()`, add fallback logic in `_transfer_to_sections()` and `parse_image()` - `internal/entity/models/paddleocr.go`: Add `ocrResults` struct and fallback extraction in Go OCR handler ### 2. Image parsing not integrated into picture chunker The `parse_image()` method existed in PaddleOCRParser but was never called from `rag/app/picture.py` (the module that handles image file uploads). Users configuring PaddleOCR as their layout recognizer would still get local deepdoc OCR for images. Changes: - `rag/app/picture.py`: When `layout_recognize` is set to PaddleOCR, use `PaddleOCROcrModel.parse_image()` instead of local OCR. Falls back gracefully to local OCR on failure. ## Testing Verified end-to-end in Docker: - PaddleOCR-VL-1.6 PDF parsing: ✅ (10 text blocks with bbox) - PaddleOCR-VL-1.6 image parsing: ✅ (219 chars) - PP-OCRv6 PDF parsing with ocrResults fallback: ✅ (10 text blocks) - PP-OCRv6 image parsing with ocrResults fallback: ✅ (136 chars) ## Related PRs - #15967 (merged) - PaddleOCR async Job API refactoring + new models - #16086 (merged) - PaddleOCR image parsing support	2026-06-23 22:02:54 +08:00
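The `ocrResults` fallback described in 017adf841f can be sketched roughly as below. This is a minimal illustration, not the code in `paddleocr_parser.py`: it assumes both result fields are lists of per-page objects, and the `block_content` key is an assumed name for the text of a layout block. `extract_texts` is a hypothetical helper.

```python
from typing import Any


def extract_texts(result: dict[str, Any]) -> list[str]:
    """Return recognized text from one result payload.

    Prefers the layout-aware shape and falls back to the pure-OCR shape.
    """
    texts: list[str] = []

    # Layout-aware models (PaddleOCR-VL series):
    # layoutParsingResults[].prunedResult.parsing_res_list[]
    for page in result.get("layoutParsingResults") or []:
        pruned = page.get("prunedResult") or {}
        for block in pruned.get("parsing_res_list") or []:
            content = block.get("block_content")  # assumed key name
            if content:
                texts.append(content)
    if texts:
        return texts

    # Pure text recognition models (PP-OCRv6):
    # ocrResults[].prunedResult.rec_texts[]
    for page in result.get("ocrResults") or []:
        pruned = page.get("prunedResult") or {}
        texts.extend(t for t in pruned.get("rec_texts") or [] if t)
    return texts
```

Trying the layout shape first keeps the bbox-carrying path as the default, so the OCR-only path is taken only when no layout blocks were returned.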
Zhichang Yu	06ededb26a	test(go): ensure go unit tests pass (#16241) ## Summary Stabilizes the Go unit-test surface so the test suite can run reliably in CI and locally via `bash build.sh --test`. ## Verification `bash build.sh --test -- -count=10 -run TestWithCancel_SequentialAgent ./internal/harness/core/`, `bash build.sh --test -- -count=5 -run TestSiliconflowChatExtracts ./internal/entity/models/`, and `bash build.sh --test` (full suite). All previously failing packages (`admin`, `cli`, `handler`, `parser`, `router`, `service`, `service/chunk`) now build and test successfully. `TestWithCancel_SequentialAgent` passes 10/10 (was flaky). SiliconFlow reasoning test passes after switching the assertion to the SiliconFlow wire format. --------- Co-authored-by: Claude <noreply@anthropic.com>	2026-06-22 20:43:29 +08:00
Zhichang Yu	3f805a64f1	feat(agent): align Go agent behavior with Python (except retrieval component) (#16225 ) ## Summary Aligns the Go agent runtime/canvas/components/tools behavior with the Python `agent/` implementation so the same stored canvas DSL produces the same execution result on either side. Every component, tool, and runtime primitive in `internal/agent/` is now driven by the same semantics as its Python counterpart — variable resolution, template substitution, control flow, error reporting, retry/cancel, and stream event shapes. The retrieval component is the one explicit exception in this PR. It is being reworked in a separate change and is excluded from this alignment pass; the wrapper slot (`universe_a_wrappers.go → newRetrievalComponent`) is preserved. ## Scope of alignment ### Components (all aligned with `agent/component/`) `Begin` · `Message` · `LLM` (incl. ChatTemplateKwargs, MessageHistoryWindowSize, VisualFiles, Cite, OutputStructure, JSONOutput, TopP, MaxRetries, DelayAfterError, credentials) · `Agent` (react + tool artifact capture + `Reset()` interface-assert) · `Switch` (12/12 operators, Python-equivalent semantics) · `Categorize` · `Invoke` · `Iteration` · `Loop` (macro-expansion through `workflowx.AddLoopNode`) · `UserFillUp` (Python-equivalent interrupt/resume via eino `compose.Interrupt`/`ResumeWithData`) · `FillUp` · `DataOperations` · `ListOperations` · `StringTransform` · `VariableAggregator` · `VariableAssigner` · `Browser` (full stagehand runtime parity) · `DocsGenerator` · `ExcelProcessor`. 
### Tools (all aligned with `agent/tools/`) `Retrieval` (wrapper slot only — logic out of scope) · `MCPToolAdapter` (streamable-HTTP) · `CodeExec` (sandbox bridge with `code_exec_contract.go` matching Python contract) · `AkShare` · `ArXiv` · `Crawler` · `DeepL` · `DuckDuckGo` · `Email` · `ExeSQL` · `GitHub` · `Google` · `GoogleScholar` · `Jin10` · `PubMed` · `QWeather` · `SearXNG` · `Tavily` · `Tushare` · `Wencai` · `Wikipedia` · `YahooFinance` — uniform `eino tool.InvokableTool` interface, SSRF protection, shared HTTP client. ### Canvas execution engine (`internal/agent/canvas/`) Aligned with Python's `agent/canvas.py`: - Scheduler (`scheduler.go`): state pre/post handlers, node lambdas, per-component timeout resolver (4-level: per-class env → per-class table → uniform env → 600s fallback), `legacyNoOpNames`. - Loop subgraph (`loop_subgraph.go`): Python-equivalent `AddLoopNode` macro expansion + condition translation. - Multibranch (`multibranch.go`): `Switch` / `Categorize` routing via `compose.NewGraphMultiBranch` — same branch selection semantics as Python. - Parallel subgraph (`parallel_subgraph.go`): matches Python's parallel fan-out contract. - Interrupt/Resume (`interrupt_resume.go`): `UserFillUpNodeBody` / `IsInterruptError` / `ExtractInterruptContexts` — replaces the deprecated Python sentinel chain with eino's native interrupt API, preserving the same external behavior. - Checkpoint (`checkpoint_store.go`): `RedisCheckPointStore` Get/Set/Delete, with business metadata (status / canvas_id / parent_run_id) on a parallel Redis Hash. - RunTracker (`run_tracker.go`): Start / MarkSucceeded / MarkFailed / MarkCancelled / AttachCheckpoint — same lifecycle as the Python run record. - Cancel (`cancel.go`): Redis pub/sub watch. - Stream (`stream.go`): SSE channel with `messages` / `waiting` / `errors` / `done` events, same shape as Python's `agent.canvas.RunEvent` payload. 
### DSL bridge (`internal/agent/dsl/`) - `normalize.go`: v1↔v2 collapsed into a single wire format — Python and Go consume the same stored JSON. - `reset.go`: per-run state reset matches Python's `Canvas.reset()` semantics. - Testdata mirrors Python's `agent_msg.json` / `all.json` / etc. ### Runtime (`internal/agent/runtime/`) - `CanvasState` / `NewCanvasState` / `GetVar` / `SetVar` / `ReadVars`: same `{{cpn_id@param}}` resolution model. - `ResolveTemplate` (regex fast path + gonja fallback) — Python Jinja-style semantics. - `selector.go`, `metrics.go`, `component.go`: shared runtime contracts. ## Out of scope (intentionally) - `Retrieval` component logic — wrapped only; full parity lands in a follow-up PR. - Frontend — only minor dsl-bridge / canvas UX fixes ride along. - CLI / admin / model registry — orthogonal to agent behavior. ## How alignment is verified `internal/service/agent_run_e2e_test.go` exercises the full production chain against real Python-shaped DSL fixtures: ``` loadCanvasForUser → versionDAO.GetLatest → decodeCanvasFromDSL → canvas.Compile → cc.Workflow.Invoke → answer extraction ``` using in-memory SQLite + miniredis (no Docker). Covers: - `TestRunAgent_RealCanvas_BeginMessage` — happy path, `{{sys.query}}` resolution - `TestRunAgent_RealCanvas_WaitForUserResume` — two-run resume cycle (Python-equivalent) - `TestRunAgent_RealCanvas_CompileFails` — unknown component name → sanitized error (Python-equivalent) - `TestRunAgent_RealCanvas_InvokeFails` — unresolvable template ref (Python-equivalent) - `TestRunAgent_RunTracker_AttachCheckpoint_CallSequence` — Start→AttachCheckpoint→MarkSucceeded lifecycle `internal/handler/agent_test.go` — SSE streaming parity (`Content-Type: text/event-stream`, `data: {…}\n\n`, trailing `data: [DONE]\n\n`, OpenAI-compatible non-stream `choices`). `internal/agent/canvas/fixture_compile_test.go` + per-component tests pin the Python-equivalent outputs. 
``` go test -count=1 -v -run 'TestRunAgent_RealCanvas\|TestRunAgent_RunTracker' ./internal/service/ ``` ## Design reference `docs/develop/agent-go-port-design.md` (1329 lines, last cross-checked 2026-06-17) — module layout, per-component / per-tool inventory, corner-case catalogue, and the actionable backlog (Section 14, including the retrieval alignment follow-up). --------- Co-authored-by: Claude <noreply@anthropic.com>	2026-06-22 11:58:29 +08:00
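The four-level per-component timeout resolution mentioned for `scheduler.go` in 3f805a64f1 (per-class env, per-class table, uniform env, 600s fallback) might look like the sketch below. The environment variable names and the table contents are assumptions made up for illustration; only the lookup order and the 600s fallback come from the commit message.

```python
import os
from typing import Mapping, Optional

DEFAULT_TIMEOUT_S = 600.0

# Hypothetical per-class defaults; the real table is in the Go scheduler.
CLASS_TIMEOUTS_S = {"LLM": 300.0, "Browser": 900.0}


def _env_seconds(env: Mapping[str, str], key: str) -> Optional[float]:
    """Parse a positive number of seconds from env, or None if unset or invalid."""
    raw = env.get(key, "").strip()
    try:
        value = float(raw)
    except ValueError:
        return None
    return value if value > 0 else None


def resolve_timeout(component_class: str, env: Mapping[str, str] = os.environ) -> float:
    # 1. Per-class environment override (variable name is an assumption).
    per_class = _env_seconds(env, f"COMPONENT_TIMEOUT_{component_class.upper()}")
    if per_class is not None:
        return per_class
    # 2. Per-class table.
    if component_class in CLASS_TIMEOUTS_S:
        return CLASS_TIMEOUTS_S[component_class]
    # 3. Uniform environment override (variable name is an assumption).
    uniform = _env_seconds(env, "COMPONENT_TIMEOUT")
    if uniform is not None:
        return uniform
    # 4. Fallback stated in the commit message.
    return DEFAULT_TIMEOUT_S
```

Passing `env` explicitly keeps the resolver testable without mutating the process environment.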
qinling0210	563d855780	Implement OpenAI chat completions in GO (#16177) ### What problem does this PR solve? Implements OpenAI chat completions in Go: `POST /api/v1/openai/<chat_id>/chat/completions`. OpenAI chat CLI: see internal/development.md. ### Type of change - [x] Refactoring	2026-06-18 18:07:27 +08:00
BitToby	2ab9256e8a	fix(go): correct OpenRouter streaming URL routing and reasoning parameter (#16111 ) ### What problem does this PR solve? Fixes two bugs in the OpenRouter streaming chat request builder (`internal/entity/models/openrouter.go`, `ChatStreamlyWithSender`): 1. qwen/glm models streamed to a broken URL. The code routed any `qwen`/`glm` model to `URLSuffix.AsyncChat`, but `conf/models/openrouter.json` defines no `async_chat` suffix (empty), so the request was POSTed to `<base>/` instead of `<base>/chat/completions` — breaking streaming for every qwen/glm model. The non-stream path has no such branch. Fix: all models use the standard `Chat` suffix, consistent with the non-stream path. 2. Streaming reasoning was never enabled. The request set reasoning via a non-standard `thinking` key, which OpenRouter ignores. OpenRouter's API — and this provider's own non-stream request (line ~110) and its streamed `delta.reasoning` parser (line ~311) — use the `reasoning` object. Fix: send `reasoning: {"enabled": <thinking>}` (and `{"effort": ...}` when set, taking precedence as in the non-stream path). Closes #16110 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-06-17 19:14:13 +08:00
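The two fixes in 2ab9256e8a can be illustrated with a small request builder. This is a sketch under assumptions rather than a port of `openrouter.go`: `build_stream_payload` and `chat_url` are hypothetical helpers, and only the `reasoning` object shape, the precedence of `effort`, and the `chat/completions` suffix are taken from the commit message.

```python
from typing import Any, Optional


def build_stream_payload(
    model: str,
    messages: list[dict[str, str]],
    thinking: bool,
    effort: Optional[str] = None,
) -> dict[str, Any]:
    """Build a streaming chat request body using the `reasoning` object."""
    payload: dict[str, Any] = {"model": model, "messages": messages, "stream": True}
    if effort:
        # An explicit effort takes precedence, as in the non-stream path.
        payload["reasoning"] = {"effort": effort}
    else:
        payload["reasoning"] = {"enabled": thinking}
    return payload


def chat_url(base_url: str, chat_suffix: str = "chat/completions") -> str:
    """Every model uses the standard chat suffix; there is no per-model branch."""
    return f"{base_url.rstrip('/')}/{chat_suffix.lstrip('/')}"
```

Because the URL no longer depends on the model name, an empty `async_chat` suffix in the provider config cannot produce a request to `<base>/`.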
Hunnyboy1217	e178c81bb4	refactor(go-models): harden Ollama ListModels and route through ParseListModel (#15853 ) (#15955 ) ### What problem does this PR solve? Part of #15853 (provider model-list refactor). Refactors Ollama `ListModels` onto the shared `ParseListModel` pattern and fixes two correctness issues: - Endpoint: switch the models suffix from `api/ps` (only currently-running models) to `api/tags` (all installed models) — the latter is what a model picker should show. - Parsing: Ollama returns `{"models":[{"name","model"}]}`, a non-OpenAI shape. Decode it into a typed struct, map the names into `ModelList`, then enrich through `ParseListModel`. This removes the previous unchecked type assertions (`result["models"].([]interface{})` / `.(map[string]interface{})` / `.(string)`) that panicked when the body was missing the `models` array or any field, and adds a fallback to the `model` field when `name` is blank. - Drops the no-op GET request body and a dead base-URL reassignment. #### Drive-by fix Shared gitee_test.go `DSModelList` -> `ModelList` compile fix (renamed in #15900) so the models test package builds; auto-resolves against the sibling #15853 PRs. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) - [x] Refactoring	2026-06-17 18:47:27 +08:00
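The defensive parsing described in e178c81bb4 (typed decoding of `{"models":[{"name","model"}]}` with a fallback to `model` when `name` is blank) could be sketched like this. `parse_ollama_models` is a hypothetical name, and returning an empty list on malformed input is a simplification chosen for the example; the Go code may report an error instead.

```python
import json


def parse_ollama_models(body: str) -> list[str]:
    """Extract model names from an Ollama `api/tags`-style response body."""
    try:
        data = json.loads(body)
    except json.JSONDecodeError:
        return []
    if not isinstance(data, dict):
        return []
    models = data.get("models")
    if not isinstance(models, list):
        # Missing or malformed `models` array: nothing to list, no crash.
        return []

    names: list[str] = []
    for entry in models:
        if not isinstance(entry, dict):
            continue
        # Prefer `name`; fall back to `model` when `name` is blank.
        name = str(entry.get("name") or "").strip()
        if not name:
            name = str(entry.get("model") or "").strip()
        if name:
            names.append(name)
    return names
```

Each shape check replaces one of the unchecked type assertions that previously panicked.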
Hunnyboy1217	fd196f694e	feat(go-models): harden ListModels for FishAudio (#15853 ) (#15957 ) ### What problem does this PR solve? Part of #15853 (provider model-list refactor). Final two providers. - voyage: Voyage AI exposes no live model-list endpoint — its public API only has `/v1/embeddings` and `/v1/rerank` — so the previous `ListModels` was a `no such method` stub. Replace it with a static-catalog listing sourced from the loaded provider definition, carrying each model's `max_tokens`, `model_types`, and embedding `dimensions`. `list models from voyage` now returns the 13-model catalog instead of erroring. - fishaudio: route the existing `/model` voice listing through the shared `ParseListModel` helper for consistency; keep the human-readable `title` as the model name and fall back to `_id` when a title is blank. #### Drive-by fix Shared gitee_test.go `DSModelList` -> `ModelList` compile fix (renamed in #15900); auto-resolves against the sibling #15853 PRs. ### Type of change - [x] New Feature (non-breaking change which adds functionality) - [x] Refactoring Co-authored-by: Haruko386 <tryeverypossible@163.com>	2026-06-17 11:56:20 +08:00
Hz_	b48f03d0f5	feat(go/dao): migrate chat channel database entity and DAO to Go (#16055 ) ## Changes 1. Entity (`internal/entity/chat_channel.go`): - Implemented `ChatChannel` struct mapping the `chat_channel` database table. - Declared `ChatChannelListResponse` as a DTO to filter out sensitive credentials (`config` field) and fetch the associated `dialog_name` via left join. 2. GORM Migration (`internal/dao/database.go`): - Registered `&entity.ChatChannel{}` in the `dataModels` array inside `InitDB()` to enable safe GORM schema synchronization. 3. DAO (`internal/dao/chat_channel.go`): - Implemented `ChatChannelDAO` wrapping GORM CRUD methods (`Create`, `GetByID`, `UpdateByID`, `DeleteByID`). - Implemented `ListByTenantID` performing a `LEFT JOIN` on the `dialog` table to retrieve `dialog_name` while excluding `config` values to avoid credential leaks. 4. Test (`internal/dao/chat_channel_test.go`): - Added integration unit tests testing the full CRUD lifecycle and GORM left-join mapping list querying.	2026-06-17 11:26:13 +08:00
Rander	1235da7093	refactor(paddleocr): migrate from sync API to async Job API (#15967 ) ## Summary Migrate PaddleOCR integration from the deprecated synchronous HTTP API to the new asynchronous Job API (`submit → poll → fetch`), aligning with PaddleOCR 3.6.0+ architecture. ## Changes ### Python (`deepdoc/parser/paddleocr_parser.py`) - Replace synchronous `requests.post()` with async Job API flow (submit → poll → fetch) - Authentication: `token {token}` → `Bearer {token}` - File transfer: base64 JSON body → multipart file upload - Polling: exponential backoff (initial 3s, ×1.5, max 15s, timeout controlled by `request_timeout`) - Result: fetch full JSONL from result URL, preserving `prunedResult` with bbox info for crop functionality - Rename `api_url` → `base_url` (backward compatible: `api_url` still accepted as fallback) ### Python (`rag/llm/ocr_model.py`) - Prefer `paddleocr_base_url` / `PADDLEOCR_BASE_URL`, fallback to `paddleocr_api_url` / `PADDLEOCR_API_URL` ### Go (`internal/entity/models/paddleocr.go`) - Add `Client-Platform: ragflow` header to submit and poll requests - Change polling from fixed 3s to exponential backoff (initial 3s, ×1.5, max 15s) ### Python (`common/constants.py`) - Add `PADDLEOCR_BASE_URL` to env keys and default config ## Backward Compatibility - Old env var `PADDLEOCR_API_URL` still works (used as fallback) - Frontend field `paddleocr_api_url` still works (backend reads it as fallback) - No user-facing configuration changes required for existing setups ## Why not use the `paddleocr` SDK package directly? RAGFlow's `_transfer_to_sections()` relies on `prunedResult` (containing `block_bbox`, `block_label`, `parsing_res_list`) from the raw API response for PDF crop functionality. The SDK's public `parse_document()` API only returns `DocParsingResult` with `markdown_text`, discarding the bbox data. Therefore we implement the async Job API flow directly via HTTP, following the same logic as the SDK internally.	2026-06-16 19:34:21 +08:00
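The polling schedule from 1235da7093 (initial 3s, ×1.5, max 15s, bounded by `request_timeout`) can be sketched as follows. `fetch_status` is a hypothetical callable standing in for the poll request; it is assumed to return `None` while the job is still running.

```python
import time
from typing import Callable, Iterator, Optional, TypeVar

T = TypeVar("T")


def backoff_delays(initial: float = 3.0, factor: float = 1.5, cap: float = 15.0) -> Iterator[float]:
    """Yield an endless sequence of capped exponential delays in seconds."""
    delay = initial
    while True:
        yield min(delay, cap)
        delay = min(delay * factor, cap)


def poll_until_done(
    fetch_status: Callable[[], Optional[T]],
    timeout: float,
    sleep: Callable[[float], None] = time.sleep,
    clock: Callable[[], float] = time.monotonic,
) -> T:
    """Poll until fetch_status returns a non-None result or the timeout elapses."""
    deadline = clock() + timeout
    for delay in backoff_delays():
        status = fetch_status()
        if status is not None:
            return status
        if clock() + delay > deadline:
            raise TimeoutError("job did not finish before the request timeout")
        sleep(delay)
    raise AssertionError("unreachable: backoff_delays is infinite")
```

With the defaults, the delays are 3, 4.5, 6.75, 10.125 and then 15 seconds for every later attempt. Injecting `sleep` and `clock` makes the loop testable without real waiting.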
Jin Hai	509e5b0fed	Fix auto migration issue (#16081 ) ### What problem does this PR solve? Fix DB migration issue. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) --------- Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-06-16 17:02:35 +08:00
Jin Hai	fad82fd1c0	Go: fix register user (#16058 ) ### What problem does this PR solve? Fix register user ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) --------- Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-06-16 14:03:53 +08:00
Yingfeng	b5bea72e4b	Add git-like file commit API (#15978 ) ### What problem does this PR solve? \| # \| Method \| Endpoint \| Description \| Git Equivalent \| \|---\|--------\|----------\|-------------\|----------------\| \| 1 \| `POST` \| `/api/v1/{prefix}/{folder_id}/commits` \| Create a snapshot commit with file changes (add/modify/delete/rename) \| `git add` + `git commit` \| \| 2 \| `GET` \| `/api/v1/{prefix}/{folder_id}/commits` \| List commit history (paginated) \| `git log` \| \| 3 \| `GET` \| `/api/v1/{prefix}/{folder_id}/commits/{commit_id}` \| Get commit detail with file changes \| `git show` \| \| 4 \| `GET` \| `/api/v1/{prefix}/{folder_id}/commits/{commit_id}/files` \| List file changes in a commit \| `git show --name-status` \| \| 5 \| `GET` \| `/api/v1/{prefix}/{folder_id}/commits/diff?from=...&to=...` \| Compare two commits and return differences \| `git diff` \| \| 6 \| `GET` \| `/api/v1/{prefix}/{folder_id}/changes` \| Get uncommitted changes (add/modify/delete) \| `git status` \| \| 7 \| `GET` \| `/api/v1/{prefix}/{folder_id}/commits/{commit_id}/tree` \| Get the folder tree snapshot at commit time \| `git ls-tree` \| \| 8 \| `GET` \| `/api/v1/{prefix}/{folder_id}/commits/{commit_id}/files/{file_id}/content` \| Get a file's content as it existed in a specific commit \| `git show HEAD:file` \| \| 9 \| `GET` \| `/api/v1/{prefix}/{file_id}/versions` \| Get version history for a specific file across all commits \| `git log -- file` \| Where `{prefix}/{id}` can be: - `folders/{folder_id}` — direct folder access - `workspaces/{workspace_id}` — alias of `folders/{folder_id}` - `datasets/{dataset_id}` — resolves to the dataset's folder - `memories/{memory_id}` — resolves to the memory's folder - `skills/{skill_id}` — resolves to the skill's folder ### Type of change - [x] New Feature (non-breaking change which adds functionality) - [x] Documentation Update	2026-06-15 11:19:56 +08:00
Zhichang Yu	3fa15c0e2f	feat(agent): Go port — canvas engine, 22 components, DSL v2, 13 endpoints (#15952 ) Ports the agent canvas subsystem from Python to Go. ## What's included ### Canvas Engine (Phase 0/1) - State engine, scheduler, variable resolver, Redis checkpoint store, cancel protocol - 209 tests across canvas / component / io packages ### 22 Components (P0–P4) \| Tier \| Components \| \|---\|---\| \| P0 T1+T2+T3 \| LLM, Agent, ExitLoop, Switch, Categorize, Begin, Message, Invoke \| \| P1 T3 \| VariableAggregator, VariableAssigner, StringTransform, ListOperations, DataOperations \| \| P2 T3 \| Iteration, IterationItem, Loop, LoopItem \| \| P3 T3 \| UserFillUp, Fillup \| \| P4 T5 \| Browser, ExcelProcessor, DocsGenerator \| ### DSL v2 Schema (Phase 2.5) - Typed v2 in-memory model with v1-to-v2 auto-detect converter - v1 legacy field stripping per plan §2.11.7 ### HTTP Endpoints & Bug Fixes (Plans PR1–PR3) - DELETE SQL bug fix: gorm v2 `Where("id = ?", id).Delete(...)` pattern - CreateAgent validation: title/DSL required, duplicate check, 103 envelope - 13 new endpoints: templates, prompts, tags, sessions CRUD, chat/completions (SSE + non-stream stubs), rerun, test_db_connection, logs, webhook/logs - 756 Go unit tests (745 → 756, +18) - 17 → 0 Python integration test failures (test_agents.py + test_session_management/) ### Tools 21 eino tools: HTTPHelper, search tools, financial/data tools, mandatory stubs ### Infrastructure OTel observability, NATS message queue, DeepDoc gRPC client, SSRF guards, IDOR mitigation	2026-06-12 22:58:28 +08:00
Haruko386	547139da29	fix(Go-models): preserve model name lookup when aliases exist (#15969 ) ### What problem does this PR solve? As title ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) - [x] Documentation Update	2026-06-12 19:15:28 +08:00
Jin Hai	e96bc37d06	Go: use NATS as the message queue (#15327 ) ### What problem does this PR solve? ``` RAGFlow(admin)> mq publish 'msg2'; SUCCESS RAGFlow(admin)> mq publish 'msg3'; SUCCESS RAGFlow(admin)> mq list; +---------+---------------+ \| message \| subject \| +---------+---------------+ \| msg1 \| tasks.RAGFLOW \| \| msg2 \| tasks.RAGFLOW \| \| msg3 \| tasks.RAGFLOW \| +---------+---------------+ RAGFlow(admin)> mq pull 2; +---------+---------------+ \| message \| subject \| +---------+---------------+ \| msg1 \| tasks.RAGFLOW \| \| msg2 \| tasks.RAGFLOW \| +---------+---------------+ RAGFlow(admin)> mq pull noack; +---------+---------------+ \| message \| subject \| +---------+---------------+ \| abc \| tasks.RAGFLOW \| +---------+---------------+ RAGFlow(admin)> mq show +-------------------+----------------+--------+---------------+---------------+-------------------+---------------+ \| ack_pending_count \| consumer_count \| memory \| message_count \| pending_count \| redelivered_count \| waiting_count \| +-------------------+----------------+--------+---------------+---------------+-------------------+---------------+ \| 2 \| 1 \| 0 \| 2 \| 0 \| 1 \| 0 \| +-------------------+----------------+--------+---------------+---------------+-------------------+---------------+ RAGFlow(admin)> list ingestors; +--------------+-------------------------------------------+--------+ \| host \| name \| status \| +--------------+-------------------------------------------+--------+ \| 192.168.1.38 \| ingestor-8f0e4bd5650a4ac58b0151969fbf6935 \| alive \| +--------------+-------------------------------------------+--------+ RAGFlow(admin)> list ingestion tasks; +----------------------------------+----------------------------------+-----------+------+-------------+----------------------------------+ \| document_id \| id \| status \| step \| user \| user_id \| 
+----------------------------------+----------------------------------+-----------+------+-------------+----------------------------------+ \| ffe64fae423411f1a2d938a74640adcc \| 90d3d0f6528941c1ac8eb0360effccc4 \| COMPLETED \| 5 \| aaa@aaa.com \| 2ba4881420fa11f19e9c38a74640adcc \| +----------------------------------+----------------------------------+-----------+------+-------------+----------------------------------+ RAGFlow(admin)> remove ingestion tasks '90d3d0f6528941c1ac8eb0360effccc4'; +---------+----------------------------------+ \| delete \| task_id \| +---------+----------------------------------+ \| success \| 90d3d0f6528941c1ac8eb0360effccc4 \| +---------+----------------------------------+ RAGFlow(admin)> stop ingestion tasks 'e89e20d9a25848a1b79bd9345ddbfe1d'; +----------+----------------------------------+ \| status \| task_id \| +----------+----------------------------------+ \| STOPPING \| e89e20d9a25848a1b79bd9345ddbfe1d \| +----------+----------------------------------+ # Publish a message RAGFlow(admin)> mq publish 'cdd'; SUCCESS # List current tasks in the message queue RAGFlow(admin)> mq list +----------------------------------+---------------+ \| message \| subject \| +----------------------------------+---------------+ \| 7ce392a3c1624cd2be4b5276e8825059 \| tasks.RAGFLOW \| +----------------------------------+---------------+ # Consume a task from the message queue RAGFlow(admin)> mq pull +------+-----+----------------+ \| ack \| id \| type \| +------+-----+----------------+ \| true \| cdd \| ingestion_test \| +------+-----+----------------+ # User mode # List ingestion tasks, followed by dataset id RAGFlow(user)> list ingestion tasks from '0abe79f9423311f1ad8d38a74640adcc'; +---------------------------+---------------+----------------------------------+----------------------------------+----------------------------------+--------+-----------+---------------------------+---------------+----------------------------------+ \| create_date \| 
create_time \| dataset_id \| document_id \| id \| schema \| status \| update_date \| update_time \| user_id \| +---------------------------+---------------+----------------------------------+----------------------------------+----------------------------------+--------+-----------+---------------------------+---------------+----------------------------------+ \| 2026-05-30T20:21:06+08:00 \| 1780143666289 \| 0abe79f9423311f1ad8d38a74640adcc \| ffe64fae423411f1a2d938a74640adcc \| 8d758cd14a8b4ba8ab505003fb52017d \| \| COMPLETED \| 2026-05-30T20:21:26+08:00 \| 1780143686431 \| 2ba4881420fa11f19e9c38a74640adcc \| +---------------------------+---------------+----------------------------------+----------------------------------+----------------------------------+--------+-----------+---------------------------+---------------+----------------------------------+ RAGFlow(user)> list ingestion tasks; +---------------------------+---------------+----------------------------------+----------------------------------+----------------------------------+--------+-----------+---------------------------+---------------+----------------------------------+ \| create_date \| create_time \| dataset_id \| document_id \| id \| schema \| status \| update_date \| update_time \| user_id \| +---------------------------+---------------+----------------------------------+----------------------------------+----------------------------------+--------+-----------+---------------------------+---------------+----------------------------------+ \| 2026-06-02T19:02:31+08:00 \| 1780398151417 \| 0abe79f9423311f1ad8d38a74640adcc \| ffe64fae423411f1a2d938a74640adcc \| e89e20d9a25848a1b79bd9345ddbfe1d \| \| COMPLETED \| 2026-06-02T19:02:52+08:00 \| 1780398172208 \| 2ba4881420fa11f19e9c38a74640adcc \| 
+---------------------------+---------------+----------------------------------+----------------------------------+----------------------------------+--------+-----------+---------------------------+---------------+----------------------------------+ # Create an ingestion task # First argument is document id, second argument is dataset id RAGFlow(user)> start ingestion 'ffe64fae423411f1a2d938a74640adcc' from '0abe79f9423311f1ad8d38a74640adcc'; +----------------------------------+-------------------------------------------+ \| document_id \| result \| +----------------------------------+-------------------------------------------+ \| ffe64fae423411f1a2d938a74640adcc \| task_id: 8d758cd14a8b4ba8ab505003fb52017d \| +----------------------------------+-------------------------------------------+ # Pause an ingestion task, first argument is ingestion id RAGFlow(user)> stop ingestion '8d758cd14a8b4ba8ab505003fb52017d'; +---------------------------+---------------+----------------------------------+----------------------------------+----------------------------------+--------+-----------+---------------------------+---------------+----------------------------------+ \| create_date \| create_time \| dataset_id \| document_id \| id \| schema \| status \| update_date \| update_time \| user_id \| +---------------------------+---------------+----------------------------------+----------------------------------+----------------------------------+--------+-----------+---------------------------+---------------+----------------------------------+ \| 2026-05-30T20:21:06+08:00 \| 1780143666289 \| 0abe79f9423311f1ad8d38a74640adcc \| ffe64fae423411f1a2d938a74640adcc \| 8d758cd14a8b4ba8ab505003fb52017d \| \| COMPLETED \| 2026-05-30T20:21:26+08:00 \| 1780143686431 \| 2ba4881420fa11f19e9c38a74640adcc \| 
+---------------------------+---------------+----------------------------------+----------------------------------+----------------------------------+--------+-----------+---------------------------+---------------+----------------------------------+ # Delete an ingestion task RAGFlow(api/default)> remove ingestion tasks 'f366450a27d54677aec1c7090add30f0'; +---------+----------------------------------+ \| remove \| task_id \| +---------+----------------------------------+ \| success \| f366450a27d54677aec1c7090add30f0 \| +---------+----------------------------------+ ``` ### Type of change - [x] New Feature (non-breaking change which adds functionality) --------- Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-06-12 14:56:44 +08:00
JPette1783	daa3811165	feat(models): add shared HTTP client, SSE parser, and stub helpers for Go model drivers (#15821 ) ### What problem does this PR solve? The Go model-driver layer () has ~38,700 lines across 109 files. Roughly 74% of that is boilerplate duplicated into every driver: identical HTTP client setup, the same 65-line SSE scanner loop, and 10-11 one-line "not supported" stub methods per driver. Any fix must be manually propagated to every file. Closes #15820. This PR establishes the three shared utility files that form the foundation for incremental driver migration: --- ### Type of change - [x] New Feature (non-breaking change which adds functionality) - [x] Refactoring --------- Co-authored-by: Haruko386 <tryeverypossible@163.com>	2026-06-11 19:20:12 +08:00
Haruko386	9c30557ef7	Go: add dimensions for list models and fix some embed-bug in providers (#15940 ) ### What problem does this PR solve? As title ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) - [x] New Feature (non-breaking change which adds functionality) - [x] Refactoring	2026-06-11 19:18:49 +08:00
Hz_	312514c032	feat(go): Add embedding dimension metadata and validation (#15939 ) ### What problem does this PR solve? - Replace embedding model `dimension` metadata with `max_dimension`. - Add optional `dimensions` metadata for models with fixed selectable output dimensions. - Include `max_dimension` and `dimensions` in model list responses. - Validate requested embedding dimensions before calling provider embedding APIs. - Forward SiliconFlow embedding dimensions with the correct `dimensions` request field. - Add unit coverage for embedding dimension validation rules.	2026-06-11 17:55:13 +08:00
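The commit message for 312514c032 does not spell out the validation rules, so the sketch below is one plausible reading and not the implemented behavior: when a model lists fixed `dimensions`, the request must match one of them; otherwise it must not exceed `max_dimension`. `validate_dimension` is a hypothetical helper.

```python
from typing import Optional, Sequence


def validate_dimension(
    requested: Optional[int],
    max_dimension: Optional[int] = None,
    dimensions: Optional[Sequence[int]] = None,
) -> None:
    """Raise ValueError if the requested embedding dimension is not allowed.

    Assumed rules (not confirmed by the source):
    - no requested dimension means the provider default, so nothing to check;
    - a fixed `dimensions` list, when present, is authoritative;
    - otherwise the request is bounded by `max_dimension`.
    """
    if requested is None:
        return
    if requested <= 0:
        raise ValueError(f"dimension must be positive, got {requested}")
    if dimensions:
        if requested not in dimensions:
            raise ValueError(f"dimension {requested} not in supported set {list(dimensions)}")
        return
    if max_dimension is not None and requested > max_dimension:
        raise ValueError(f"dimension {requested} exceeds max_dimension {max_dimension}")
```

Running this check before the provider call turns an opaque upstream error into a local one that names the offending value.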
Haruko386	84edf539e7	Go: Refactor list-models func (#15900 ) ### What problem does this PR solve? As title Issue: #15853 ### Type of change - [x] Refactoring	2026-06-11 13:32:50 +08:00
JPette1783	4b10c0b885	fix(go-models): guard nil pointers in DeepSeek and VolcEngine streaming (#15817) ### What problem does this PR solve? `ChatStreamlyWithSender` in two Go model drivers could panic on nil pointer dereferences when a caller passes a nil model config or omits the reasoning `Effort`: - deepseek.go - `switch chatModelConfig.Effort` dereferenced `Effort` without a nil check. It now defaults to `"high"` when nil. - volcengine.go - the `modelConfig` pointer itself was dereferenced (`Stream`, `MaxTokens`, `Temperature`, ...) with no guard, and `Effort` was dereferenced unchecked. `modelConfig` now defaults to an empty `&ChatConfig{}` when nil so the optional-field accesses are safe, and `Effort` defaults to `"medium"` when nil. Addresses the CodeRabbit review on `volcengine.go` `ChatStreamlyWithSender`. Per maintainer feedback ("one PR do one thing"), the unrelated `handler/auth.go` and `service/heartbeat_sender.go` changes were removed so this PR is scoped to the model-provider fixes. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-06-11 13:32:24 +08:00
Hz_	515acf4f60	fix(go): Fix case-insensitive model alias lookup (#15911 ) ## Summary - Normalize model alias index keys to lowercase - Detect lowercase alias collisions during provider manager initialization - Fix ListModels metadata mapping for mixed-case provider aliases	2026-06-10 20:36:43 +08:00
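The lowercase alias index with collision detection from 515acf4f60 can be illustrated with the sketch below. The input shape (canonical name mapped to a list of aliases) and the function names are assumptions for the example, not the provider manager's actual types.

```python
from typing import Optional


def build_alias_index(models: dict[str, list[str]]) -> dict[str, str]:
    """Map every lowercased name and alias to its canonical model name.

    Raises ValueError when two different models claim the same
    case-insensitive key.
    """
    index: dict[str, str] = {}
    for canonical, aliases in models.items():
        for key in [canonical, *aliases]:
            folded = key.lower()
            owner = index.setdefault(folded, canonical)
            if owner != canonical:
                raise ValueError(
                    f"alias collision: {key!r} maps to both {owner!r} and {canonical!r}"
                )
    return index


def lookup(index: dict[str, str], name: str) -> Optional[str]:
    """Case-insensitive lookup; returns None for unknown names."""
    return index.get(name.lower())
```

Detecting collisions while the index is built surfaces ambiguous aliases at initialization instead of as a wrong metadata match at lookup time.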
Hz_	38755c705a	feat(go): Add DeepSeek models and Gitee alias metadata tests (#15885 ) This PR expands conf/all_models.json with DeepSeek model entries and provider aliases. Changes: - Added DeepSeek model entries across `V4`, `V3.2`, `V3.1`, `V3`, `R1`, `Coder`, `Math`, `VL`, `OCR`, `Prover`, `MoE`, and `LLM` series. - Normalized model name values to lowercase canonical IDs. - Added alias values for official DeepSeek/Hugging Face names and provider-specific names from OpenRouter, VolcEngine, SiliconFlow, HuaweiCloud, and QiniuCloud. - Preserved model metadata such as max_tokens, model_types, and thinking where applicable. - Added Gitee ListModels tests to verify DeepSeek aliases map back to model metadata from all_models.json. - Added an optional Gitee integration test gated by GITEE_LIST_MODELS_INTEGRATION=1. Test: /usr/local/go/bin/go clean -cache /usr/local/go/bin/go test ./internal/entity/models -run 'TestGiteeListModels(MapsAllDeepSeekAliasesToModelMetadata\|KeepsOwnedBySuffixAfterAliasMetadataLookup\| Integration)'	2026-06-10 13:59:23 +08:00
Jack	2f99d52fb5	fix(ci): re-enable Go tests and fix compilation errors after ListModels signature change (#15862 ) ## Summary This PR re-enables the Go test steps in CI that were previously commented out, and fixes all compilation errors that have accumulated in `internal/entity/models/` since the `ListModels` return type was changed from `[]string` to `[]ListModelResponse`. ## Changes ### CI (`.github/workflows/tests.yml`) - Re-enable Prepare test resources step (clones resource repo with WordNet data) - Re-enable Test Go packages step (runs `go test ./internal/...`) - Fix resource path race condition by using `/tmp/resource-${GITHUB_RUN_ID}` instead of `/tmp/resource` - Exclude `/cli` package from Go tests (contains `main` redeclarations) ### Test fixes (16 model provider test files) All errors were caused by the upstream change from `[]string` to `[]ListModelResponse` in the `ListModels` interface: - Add `joinModelNames` test helper to extract `.Name` from `[]ListModelResponse` slices - `strings.Join(models, ",")` → `joinModelNames(models, ",")` (11 files) - `ids[i] != "..."` → `ids[i].Name != "..."` (cometapi, mistral) - `got[i] != want[i]` → `got[i].Name != want[i]` (bedrock) - `[]string` return types → `[]ListModelResponse` (google) ### Pre-existing bugs in model_test.go Bugs introduced by the upstream `entity/` → `entity/models/` directory rename: - Add missing `pm := GetProviderManager()` calls in 3 test functions - Fix `InitProviderManager` signature (`_, err :=` → `err :=`) - Fix `MaxTokens` `*int` dereference (6 comparisons) - Fix `readProviderConfig` relative path (3 levels up instead of 2) ### model.go - Add `findRepoRoot()` to make `conf/all_models.json` resolution work from any CWD, fixing `TestSiliconFlowProviderConfigLoadsLatestProModels` ### Test validation ```bash go build ./internal/... # ✅ go test ./internal/entity/models/... 
-count=1 # ✅ all pass ``` 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-authored-by: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-09 21:12:15 +08:00
JPette1783	e050f1816e	fix(models): guard unsafe index access in Google and Ollama drivers (#15819 ) ### What problem does this PR solve? Fixes four panic / spurious-error paths in the Go model layer. Closes #15818. \| # \| File \| Bug \| Fix \| \|---\|------\|-----\|-----\| \| 1 \| \| Thinking-mode streaming path: accessed unconditionally; Gemini emits usage-only chunks with an empty slice, causing a runtime panic \| Guard each step: , , before indexing \| \| 2 \| \| is a plain for ordinary requests; the cast to silently returns , then panics immediately \| Switch on concrete type; handle both and \| \| 3 \| \| Identical panic on the streaming path \| Same switch-on-type fix \| \| 4 \| \| The field is optional (absent for non-thinking models) but the code returned an error when it was missing, breaking every ordinary Ollama completion \| Change to a silent comma-ok assertion; is empty string when the field is absent \| ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) Co-authored-by: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-09 19:26:52 +08:00
Jin Hai	719ce15c95	Go CLI: update list supported models (#15845 ) ### What problem does this PR solve? Now list supported models will show more info. ``` RAGFlow(api/default)> list supported models from 'gitee' 'test'; +-----------+------------+-------------+----------------------------------------------------------+---------------------------------------------+ \| dimension \| max_tokens \| model_types \| name \| thinking \| +-----------+------------+-------------+----------------------------------------------------------+---------------------------------------------+ \| \| \| \| Wan2.7 \| \| \| \| \| \| HappyHorse-1.0 \| \| \| \| \| \| Qwen3.6-27B@Qwen \| \| \| \| \| \| Qwen3.6-35B-A3B@Qwen \| \| \| \| 1048576 \| [chat] \| DeepSeek-V4-Flash@deepseek-ai \| map[clear_thinking:true default_value:true] \| \| \| 1048576 \| [chat] \| DeepSeek-V4-Pro@deepseek-ai \| map[clear_thinking:true default_value:true] \| +-----------+------------+-------------+----------------------------------------------------------+---------------------------------------------+ ``` ### Type of change - [x] New Feature (non-breaking change which adds functionality) Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-06-09 19:01:00 +08:00
Jin Hai	55abf4f565	Go: new CLI command, list all models and show model (#15786 ) ### What problem does this PR solve? ``` RAGFlow(user)> list models; +---------------------------+------------+-------------+--------------------+---------------------------------------------+ \| alias \| max_tokens \| model_types \| name \| thinking \| +---------------------------+------------+-------------+--------------------+---------------------------------------------+ \| \| 1048576 \| [chat] \| deepseek-v4-flash \| map[clear_thinking:true default_value:true] \| \| \| 1048576 \| [chat] \| deepseek-v4-pro \| map[clear_thinking:true default_value:true] \| \| \| 1024000 \| [chat] \| minimax-m3 \| map[clear_thinking:true default_value:true] \| \| \| 64000 \| [vision] \| glm-4.5v \| map[clear_thinking:true default_value:true] \| \| [baai/bge-m3] \| 8192 \| [embedding] \| bge-m3 \| \| \| [baai/bge-reranker-v2-m3] \| 1024 \| [rerank] \| bge-reranker-v2-m3 \| \| \| \| \| [tts] \| step-audio-tts-3b \| \| \| [qwen/qwen3-asr-1.7b] \| \| [asr] \| qwen3-asr-1.7b \| \| \| [paddleocr-vl-1.5] \| \| [ocr] \| paddleocr-vl-0.9b \| \| +---------------------------+------------+-------------+--------------------+---------------------------------------------+ RAGFlow(user)> show model 'minimax-m3'; +--------------+---------------------------------------------+ \| field \| value \| +--------------+---------------------------------------------+ \| name \| minimax-m3 \| \| max_tokens \| 1024000 \| \| model_types \| [chat] \| \| thinking \| map[clear_thinking:true default_value:true] \| \| class \| \| \| alias \| \| \| ModelTypeMap \| \| +--------------+---------------------------------------------+ RAGFlow(user)> show model 'baai/bge-m3'; +--------------+---------------+ \| field \| value \| +--------------+---------------+ \| model_types \| [embedding] \| \| thinking \| \| \| class \| \| \| alias \| [baai/bge-m3] \| \| ModelTypeMap \| \| \| name \| bge-m3 \| \| max_tokens \| 8192 \| 
+--------------+---------------+ ``` --------- Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-06-08 21:38:15 +08:00
Jack	35527f6755	fix: guard http.DefaultTransport type assertion in xiaomi for Go 1.25 (#15787 ) ## Problem `TestXiaomiNewModelWithCustomDefaultTransport` panics on Go 1.25: ``` panic: interface conversion: http.RoundTripper is models.roundTripperFunc, not http.Transport ``` In Go 1.25, `http.DefaultTransport` is no longer `http.Transport`, so the unchecked type assertion in `NewXiaomiModel` panics when the test replaces it with a `roundTripperFunc`. ## Fix Use a safe type assertion with fallback to a new `http.Transport`, matching the pattern already used in `modelscope.go`. ## Verification ```bash go test -run TestXiaomiNewModelWithCustomDefaultTransport ./internal/entity/models/... # PASS ``` Internal contributors only. 🤖 Generated with [Claude Code](https://claude.com/claude-code) --------- Co-authored-by: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-08 21:11:21 +08:00
Jack	338fdb65fb	feat(ci): enable go test in CI pipeline (#15750 ) ## What problem does this PR solve? Go test files are never compiled in CI — only production binaries via `go build`. This allowed a missing `"sort"` import in `metadata_filter_test.go` to be merged without detection. ## Type of change - [x] Bug Fix (non-breaking change which fixes an issue) - [x] New Feature (non-breaking change which adds functionality) ## Changes - Add `go test -count=1 ./internal/...` step after Go build in CI workflow - Fix missing `"sort"` import in `metadata_filter_test.go` (pre-existing compile error) 🤖 Generated with [Claude Code](https://claude.com/claude-code) --------- Co-authored-by: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-08 20:06:57 +08:00
oktofeesh	6fc3955cab	fix(go-models): normalize Qwen reasoning families (#15735 ) ## Summary Normalizes Qwen model-family names before reasoning extraction so provider-prefixed Qwen models use the existing `<think>...</think>` fallback.	2026-06-08 19:32:19 +08:00
oktofeesh	e0dc7af5dd	fix(go-models): fix MiniMax driver requests (#15527 ) ## Summary - keep MiniMax chat calls in non-streaming mode and streaming calls in SSE mode - make MiniMax model listing and connection checks use a bodyless GET /v1/models - add focused MiniMax request/response regression tests	2026-06-08 19:32:01 +08:00
oktofeesh	25df0a6725	fix(go-models): validate URL suffix config keys (#15734 ) ## Summary Fixes typoed model-provider URL suffix keys and adds strict nested decoding so future URL suffix config mistakes fail during provider loading instead of being silently ignored.	2026-06-08 19:29:36 +08:00
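One way to make a typoed nested key fail at load time, as #15734 describes, is the standard library's strict decoding mode. The sketch below is an assumption about the approach, not the PR's code; the `providerConfig` and `urlSuffix` types and their JSON keys are illustrative.

```go
package main

import (
	"encoding/json"
	"fmt"
	"strings"
)

// urlSuffix and providerConfig are illustrative shapes only; the real
// provider config in RAGFlow has more fields.
type urlSuffix struct {
	Chat      string `json:"chat"`
	Embedding string `json:"embedding"`
}

type providerConfig struct {
	Name      string    `json:"name"`
	URLSuffix urlSuffix `json:"url_suffix"`
}

// decodeProvider rejects any key that does not match a struct field,
// including keys inside the nested url_suffix object.
func decodeProvider(raw string) (*providerConfig, error) {
	dec := json.NewDecoder(strings.NewReader(raw))
	dec.DisallowUnknownFields()
	var cfg providerConfig
	if err := dec.Decode(&cfg); err != nil {
		return nil, err
	}
	return &cfg, nil
}

func main() {
	_, err := decodeProvider(`{"name":"demo","url_suffix":{"caht":"chat/completions"}}`)
	fmt.Println(err != nil) // the misspelled "caht" key is reported as an error
}
```

Without `DisallowUnknownFields`, `encoding/json` drops unknown keys silently, which is how a misspelled suffix key can go unnoticed until a request is built with an empty path.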
Haruko386	8dc7f1d95e	Go: implement ASR and TTS for xiaomi (#15765 ) ### What problem does this PR solve? Verified from CLI ``` RAGFlow(user)> chat with 'mimo-v2.5@test@xiaomi' message 'who r u' Answer: Hello! I'm MiMo-v2.5, a large language model developed by Xiaomi's LLM Core Team. You can think of me as a friendly AI assistant ready to help you answer questions, have conversations, or work on creative tasks. My context window can handle up to 1 million tokens, so we can dive into pretty long discussions or documents if you'd like. What can I help you with today? Time: 3.831830 RAGFlow(user)> stream chat with 'mimo-v2.5@test@xiaomi' message 'who r u' Answer: there! I'm MiMo-v2.5, an AI assistant created by the Xiaomi LLM Core Team. I'm here to chat, help out, answer questions, or just have a friendly conversation. Think of me as a helpful buddy with a pretty big memory (1 million tokens worth!). What can I do for you today?😊 Time: 2.421630 RAGFlow(user)> think chat with 'mimo-v2.5@test@xiaomi' message 'who r u' Thinking: The user is asking a simple question about who I am. According to my system prompt, I should: - Identify myself as MiMo-v2.5 - State that I was developed by the Xiaomi LLM Core Team - Answer in first person and be warm and conversational Answer: Hey there! 👋 I'm MiMo, an AI assistant created by the Xiaomi LLM Core Team. Think of me as a friendly chat buddy who's here to help you with all sorts of questions and tasks! I love having conversations, answering questions, brainstorming ideas, and helping people figure things out. Whether you want to chat, need help with something specific, or just want to explore ideas together — I'm here for it! 😊 What can I help you with today? Time: 6.651589 RAGFlow(user)> tts with 'mimo-v2.5-tts@test@xiaomi' text 'hello? 
show yourself' play format 'wav' param '{"voice": "Chloe"}' SUCCESS RAGFlow(user)> asr with 'mimo-v2.5-asr@test@xiaomi' audio './internal/test.wav' param '{"language": "zh"}' +------------------------------------------------------------------------------------------------------------------------+ \| text \| +------------------------------------------------------------------------------------------------------------------------+ \| 1 The examination and testimony of the experts enabled the commission to conclude that five shots may have been fired. \| +------------------------------------------------------------------------------------------------------------------------+ ``` ### Type of change - [x] New Feature (non-breaking change which adds functionality) - [x] Refactoring	2026-06-08 19:27:45 +08:00
oktofeesh	d63bd81d0d	fix(go-models): fix Moonshot model and balance requests (#15528 ) ## Summary - keep Moonshot chat calls in non-streaming mode and streaming calls in SSE mode - make Moonshot model listing and balance checks use bodyless GET requests - add focused Moonshot request/response regression tests	2026-06-08 19:27:19 +08:00
Haruko386	67ce0c896d	feat[Go]: implement /api/v1/agents/<agent_id>/sessions (#15705 ) ### What problem does this PR solve? As the title says. The code was tested with Postman. ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2026-06-08 16:26:27 +08:00
bitloi	220ee9dbfb	fix: normalize reasoning model families (#15612 ) ### What problem does this PR solve? Closes #15611. RAGFlow's fallback reasoning parser only recognized the exact model family `qwen3`. For provider-prefixed Qwen model names such as SiliconFlow's `qwen/qwen3-8b`, the derived model class can be `qwen/qwen3`, so inline `<think>...</think>` content was not split from the visible answer when `reasoning_content` was absent. This PR normalizes model-family detection before fallback reasoning extraction, keeps the parser nil-safe, and adds focused tests for Qwen3 variants plus Gitee and SiliconFlow chat responses. It also makes SiliconFlow propagate `ChatConfig.Thinking` into the chat request body, matching the existing Gitee behavior, so Qwen thinking mode is actually enabled when requested. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) - [x] Refactoring ### Validation - `/root/go/bin/gofmt -l internal/entity/models/common.go internal/entity/models/common_test.go internal/entity/models/reasoning_family_provider_test.go internal/entity/models/siliconflow.go` - `git diff --check` - `/root/go/bin/go test ./internal/entity/models -run 'Test(NormalizeModelFamily\|GetThinkingAndAnswer\|GiteeChatExtractsQwenThinkingFromInlineContent\|SiliconflowChatExtractsProviderPrefixedQwenThinkingFromInlineContent)' -vet=off -count=1` Note: the full package command `/root/go/bin/go test ./internal/entity/models -vet=off -count=1` now runs locally, but it currently fails on an unrelated existing `TestAstraflowEmbedReturnsNoSuchMethod` panic in `internal/entity/models/astraflow.go:482`.	2026-06-08 13:32:52 +08:00
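The normalization and fallback extraction described in #15612 can be sketched as follows. `normalizeFamily` and `splitThinking` are hypothetical stand-ins, and the rule of dropping everything up to the last `/` is an assumption about how a provider prefix such as `qwen/` is stripped.

```go
package main

import (
	"fmt"
	"strings"
)

// normalizeFamily lowercases a model class and drops any provider prefix,
// so "Qwen/Qwen3" and "qwen3" compare equal. (Assumed rule, for illustration.)
func normalizeFamily(class string) string {
	class = strings.ToLower(strings.TrimSpace(class))
	if i := strings.LastIndex(class, "/"); i >= 0 {
		class = class[i+1:]
	}
	return class
}

// splitThinking separates the first inline <think>...</think> block from
// the visible answer. Content without a complete block is returned
// unchanged as the answer.
func splitThinking(content string) (thinking, answer string) {
	const openTag, closeTag = "<think>", "</think>"
	start := strings.Index(content, openTag)
	if start < 0 {
		return "", content
	}
	end := strings.Index(content[start:], closeTag)
	if end < 0 {
		return "", content
	}
	end += start
	thinking = strings.TrimSpace(content[start+len(openTag) : end])
	answer = strings.TrimSpace(content[:start] + content[end+len(closeTag):])
	return thinking, answer
}

func main() {
	if normalizeFamily("qwen/qwen3") == "qwen3" {
		thinking, answer := splitThinking("<think>check the docs</think>Hello!")
		fmt.Printf("thinking=%q answer=%q\n", thinking, answer)
	}
}
```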
oktofeesh	b1a2210d06	fix(go-models): increase JieKouAI SSE scanner buffer (#15737 ) ## Summary - Raise the JieKouAI streaming SSE scanner buffer to handle larger data chunks without truncation.	2026-06-08 13:10:10 +08:00
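For context on #15737: `bufio.Scanner` stops with `bufio.ErrTooLong` when a single line exceeds its maximum token size, which defaults to 64 KiB. A minimal sketch of raising that limit follows; `scanSSE`, `demoLargeChunk`, and the 1 MiB figure are illustrative assumptions, not the values used in the JieKouAI driver.

```go
package main

import (
	"bufio"
	"fmt"
	"io"
	"strings"
)

// scanSSE collects the payload of every "data: " line, allowing single
// lines of up to maxLine bytes.
func scanSSE(r io.Reader, maxLine int) ([]string, error) {
	sc := bufio.NewScanner(r)
	// Start with a small buffer and let the scanner grow it up to maxLine.
	sc.Buffer(make([]byte, 0, 4096), maxLine)
	var payloads []string
	for sc.Scan() {
		line := sc.Text()
		if strings.HasPrefix(line, "data: ") {
			payloads = append(payloads, strings.TrimPrefix(line, "data: "))
		}
	}
	return payloads, sc.Err()
}

// demoLargeChunk feeds one 100 KiB event through scanSSE and reports the
// payload size that was read.
func demoLargeChunk(maxLine int) (int, error) {
	stream := "data: " + strings.Repeat("x", 100*1024) + "\n\n"
	payloads, err := scanSSE(strings.NewReader(stream), maxLine)
	if err != nil {
		return 0, err
	}
	return len(payloads[0]), nil
}

func main() {
	n, err := demoLargeChunk(1 << 20)
	fmt.Println(n, err)
	_, err = demoLargeChunk(bufio.MaxScanTokenSize)
	fmt.Println(err)
}
```

With the default-sized limit the 100 KiB line makes the scanner stop with an error, while the larger limit lets the same line through intact.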
tmimmanuel	5e25e2600b	Go: implement Xiaomi chat provider (#15626 ) ### What problem does this PR solve? Implements the Xiaomi MiMo chat provider for the Go model provider layer. Reference issue: #14736 Official docs used: - Xiaomi MiMo OpenAI-compatible chat API: https://platform.xiaomimimo.com/docs/en-US/api/chat/openai-api - Xiaomi MiMo model and rate limits: https://platform.xiaomimimo.com/docs/en-US/quick-start/model - Xiaomi MiMo model hyperparameters: https://platform.xiaomimimo.com/docs/en-US/quick-start/model-hyperparameters	2026-06-08 13:09:36 +08:00
qinling0210	c960dc2a4c	Refine handling of POST /api/v1/datasets/search in Go (#15583 ) ### What problem does this PR solve? Refine handling of POST /api/v1/datasets/search in Go ### Type of change - [x] Refactoring	2026-06-08 11:49:37 +08:00
tmimmanuel	f78ef328bb	Go: implement Bedrock embeddings (#15543 ) ### What problem does this PR solve? Fixes #15542. AWS Bedrock support for the Go model provider layer was added in #15166, but embedding support was intentionally left out of scope and `BedrockModel.Embed(...)` still returned the `no such method` sentinel. This PR implements Bedrock text embeddings under the umbrella provider tracker #14736. ### What this PR includes - `internal/entity/models/bedrock.go`: implement `BedrockModel.Embed(...)` through Bedrock Runtime `InvokeModel` with existing SigV4 auth, region resolution, and runtime URL helpers. - Titan embeddings: supports `amazon.titan-embed-text-v1` and `amazon.titan-embed-text-v2:0`; v2 forwards `EmbeddingConfig.Dimension` as `dimensions` when provided, while v1 keeps the payload minimal. - Cohere embeddings: supports `cohere.embed-english-v3`, `cohere.embed-multilingual-v3`, and `cohere.embed-v4:0`; batches input texts and maps returned vectors to RAGFlow `EmbeddingData` in input order. - `conf/models/bedrock.json`: adds the `embedding` URL suffix (`invoke`) and Bedrock embedding model entries. - `internal/entity/models/bedrock_test.go`: adds unit tests for Titan, Cohere, typed Cohere responses, validation, empty input, unsupported models, and HTTP error propagation. Reference docs: - Bedrock InvokeModel API: https://docs.aws.amazon.com/bedrock/latest/APIReference/API_runtime_InvokeModel.html - Titan Text Embeddings: https://docs.aws.amazon.com/bedrock/latest/userguide/titan-embedding-models.html - Cohere Embed models on Bedrock: https://docs.aws.amazon.com/bedrock/latest/userguide/model-parameters-embed.html ### Type of change - [x] New Feature (non-breaking change which adds functionality) ### How was this tested? - [x] `jq empty conf/models/bedrock.json` - [x] `git diff --check` - [x] `go test ./internal/entity/models/... -run Bedrock -count=1` - [x] `go test ./internal/entity/models/... 
-run '^$' -count=1` - [x] `go test ./internal/entity/models/... -run Bedrock -race -count=1` Note: `go test ./internal/entity/models/... -count=1` currently fails in unrelated existing Astraflow coverage (`TestAstraflowEmbedReturnsNoSuchMethod` panics in `internal/entity/models/astraflow.go`). The Bedrock-specific tests and compile-only package check pass.	2026-06-05 13:26:32 +08:00
Haruko386	4b2af1347c	feat[Go]: implement Agent/Workflow PUT /api/v1/agents/<canvas_id>/tags (#15641 )	2026-06-05 13:22:23 +08:00
Haruko386	baeb0c0431	Refactor[Go Model Provider]: refactor baseURL and modelConfig (#15627 ) ### What problem does this PR solve? As Title ### Type of change - [x] Refactoring	2026-06-04 17:50:22 +08:00
bitloi	2eed0d4679	refactor(go-models): add unsupported model driver defaults (#15431 ) ### What problem does this PR solve? Adds a shared safe default implementation for unsupported Go model-driver capability methods and migrates the confirmed panic-stub providers to use it. The Go `ModelDriver` interface requires providers to implement many capability methods even when the provider does not support them. XunFei had unsupported capability methods implemented as `panic("implement me")`, Mistral still had a panic in `ParseFile`, and HuaweiCloud carried an unreachable `panic("implement me")` after a normal chat return. ### Type of change - [x] Refactoring Co-authored-by: Haruko386 <tryeverypossible@163.com>	2026-06-03 19:16:28 +08:00
Jin Hai	d736f358ba	Go: refactor model provider (#15568 ) ### What problem does this PR solve? 1. Add license announcement 2. Add sanity check on API config 3. Add base class: BaseModel 4. Add GetBaseURL ### Type of change - [x] Refactoring --------- Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-06-03 16:33:58 +08:00
Haruko386	473d06d1ad	feat[Go]: implement add multi_models (#15563 )	2026-06-03 15:26:46 +08:00
Jin Hai	dbebc66ba8	Go: refactor provider code (#15564 ) Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-06-03 14:09:07 +08:00
Jin Hai	e1f19f6679	Go: fix gitee balance api (#15554 ) ``` RAGFlow(user)> create provider 'gitee' instance 'intl' key 'api-token' url 'https://ai.gitee.com/v1' region 'intl'; SUCCESS ``` --------- Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-06-03 13:23:20 +08:00
Dexterity	2819d0ea24	fix(go-models): use per call context timeouts so long streaming responses are not truncated (#15380 ) ### What problem does this PR solve? Closes #15379 Around 29 Go model providers in `internal/entity/models/` share an `http.Client` configured with `Timeout: 120 * time.Second`, and reuse that same client for `ChatStreamlyWithSender`. Go's `http.Client.Timeout` is a hard ceiling on the whole request that also covers reading the response body, so it behaves as a wall clock on streaming. Any streamed chat response that lasts longer than 120 seconds gets cut off in the middle with a timeout error. Long generations, reasoning model outputs, and slow or overloaded upstreams are the common victims. The providers that already behave correctly (`groq`, `mistral`, `voyage`, `anthropic`) set no client `Timeout` and instead wrap each request in a `context.WithTimeout`. This change converges the affected providers onto that same pattern. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) --------- Co-authored-by: Jin Hai <haijin.chn@gmail.com>	2026-06-02 15:27:26 +08:00
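The pattern described in #15380, a shared client with no `Timeout` plus a deadline chosen per call, can be sketched as below. The names `sharedClient`, `fetch`, and `runDemo`, and the durations, are illustrative assumptions and do not mirror the provider code.

```go
package main

import (
	"context"
	"fmt"
	"io"
	"net/http"
	"net/http/httptest"
	"time"
)

// sharedClient deliberately sets no Timeout, so the client imposes no
// fixed wall-clock ceiling on reading a response body.
var sharedClient = &http.Client{}

// fetch bounds one request, including the body read, with a deadline
// chosen by the caller. A streaming call can pass a much longer timeout
// than a short metadata call while reusing the same client.
func fetch(url string, timeout time.Duration) (string, error) {
	ctx, cancel := context.WithTimeout(context.Background(), timeout)
	defer cancel()

	req, err := http.NewRequestWithContext(ctx, http.MethodGet, url, nil)
	if err != nil {
		return "", err
	}
	resp, err := sharedClient.Do(req)
	if err != nil {
		return "", err
	}
	defer resp.Body.Close()

	body, err := io.ReadAll(resp.Body)
	return string(body), err
}

// runDemo calls a deliberately slow local server twice: once with a
// generous deadline and once with a deadline shorter than the delay.
func runDemo() (generousErr, tightErr error) {
	srv := httptest.NewServer(http.HandlerFunc(func(w http.ResponseWriter, r *http.Request) {
		time.Sleep(200 * time.Millisecond)
		fmt.Fprint(w, "done")
	}))
	defer srv.Close()

	_, generousErr = fetch(srv.URL, 5*time.Second)
	_, tightErr = fetch(srv.URL, 50*time.Millisecond)
	return generousErr, tightErr
}

func main() {
	generousErr, tightErr := runDemo()
	fmt.Println("generous deadline:", generousErr)
	fmt.Println("tight deadline:", tightErr)
}
```

A context deadline still covers the body read, so this does not make streaming unbounded. It moves the limit from one value fixed on the client to a value each call site can choose.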
glorydavid03023	5733e0624c	fix(go-models): harden N1N default transport handling (#15351 ) ## Summary - Harden `NewN1NModel` to avoid panics when `http.DefaultTransport` is a custom non-`*http.Transport` RoundTripper. - Fallback to a safe transport (`ProxyFromEnvironment`) while preserving existing pooling/timeout settings. - Add `n1n_test.go` with coverage for name/factory plus `TestN1NNewModelWithCustomDefaultTransport`. Co-authored-by: Cursor <cursoragent@cursor.com>	2026-06-02 13:40:10 +08:00
Lynn	3bc5ed282e	Fix: model-provider bugs (#15460 ) ### What problem does this PR solve? Fix: - Use `@` to avoid splitting on `_` in model_name. - Verify api_key when adding an instance. - Pop api_key from the list instances response. - Remove useless index. - Sort providers, instances and models by name. - Get `is_tools` from llm_factories.json ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-06-02 13:24:53 +08:00

216 Commits