ragflow

mirror of https://github.com/infiniflow/ragflow.git synced 2026-06-29 23:41:12 +08:00

Author	SHA1	Message	Date
Jin Hai	760229d917	Go CLI: admin list configs (#16221 ) ### What problem does this PR solve? - list configs; ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) --------- Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-06-22 08:19:23 +08:00
Jin Hai	5039f46999	Go CLI: refactor commands (#16213 ) ### What problem does this PR solve? As title. ### Type of change - [x] Refactoring --------- Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-06-21 16:50:02 +08:00
Jin Hai	1b712be599	Go CLI: refactor some commands (#16204 ) ### What problem does this PR solve? - list resources ### Type of change - [x] New Feature (non-breaking change which adds functionality) - [x] Refactoring --------- Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-06-20 02:31:07 +08:00
Jin Hai	11499f7bb3	Go CLI: add list user commands framework (#16201 )	2026-06-19 15:09:54 +08:00
Jin Hai	7214a23614	Go: fix duplicate models (#16197 ) ### What problem does this PR solve? 1. Remove unused file 2. Remove duplicate models 3. Resort the function order ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) - [x] Refactoring Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-06-19 09:57:58 +08:00
qinling0210	563d855780	Implement OpenAI chat completions in GO (#16177 ) ### What problem does this PR solve? Implement OpenAI chat completions in GO POST /api/v1/openai/<chat_id>/chat/completions OpenAI chat cli: internal/development.md ### Type of change - [x] Refactoring	2026-06-18 18:07:27 +08:00
Haruko386	217c2a94c2	feat[Go]: implement datasets/<dataset_id>/index P/G (#16153 ) ### What problem does this PR solve? ``` POST: http://localhost:9384/api/v1/datasets/433b390c630411f1a13eab5f89540b2a/index?type=graph Output: { "code": 0, "data": { "task_id": "ff5a3546bafa49d794a9a050d99c4a52" }, "message": "success" } ``` --- ``` GET: http://localhost:9384/api/v1/datasets/433b390c630411f1a13eab5f89540b2a/index?type=graph Output: { "code": 0, "data": { "id": "ff5a3546bafa49d794a9a050d99c4a52", "doc_id": "graph_raptor_x", "from_page": 100000000, "to_page": 100000000, "task_type": "graphrag", "priority": 0, "begin_at": "2026-06-17T18:07:45+08:00", "process_duration": 4.108135, "progress": -1, "progress_msg": "18:07:45 created task graphrag\n18:07:47 Task has been received.\n18:07:49 [ERROR][Exception]: Model config not found: Qwen/Qwen3-235B-A22B@test@SILICONFLOW", "retry_count": 1, "digest": "f16fd067d5c92cec", "create_time": 1781690865552, "create_date": "2026-06-17T18:07:45+08:00", "update_time": 1781690869108, "update_date": "2026-06-17T18:07:49+08:00" }, "message": "success" } ``` ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2026-06-18 17:57:24 +08:00
Haruko386	5f6ebc97c6	feat[go]: implement /api/v1/datasets/<dataset_id> PUT (#16122 ) ### What problem does this PR solve? As pic shows ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2026-06-18 17:57:07 +08:00
Haruko386	6beae949d8	feat[Go]: add modelID for delete_model and update_status (#16025 ) ### What problem does this PR solve? 1. add modelID for delete_model and update_status 2. fix the bug when update-status delete model ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) - [x] New Feature (non-breaking change which adds functionality)	2026-06-18 17:56:51 +08:00
Jin Hai	3eb49ca7f8	Go: add command, list, remove, stop tasks (#16190 ) ### What problem does this PR solve? ``` RAGFlow(admin)> stop user 'abc' ingestion tasks; +-----------------------------------+-------+--------------------------------------------------------------------------+-------+ \| command \| email \| error \| tasks \| +-----------------------------------+-------+--------------------------------------------------------------------------+-------+ \| stop_ingestion_tasks_by_condition \| abc \| 'Stop ingestion tasks by condition' is implemented in enterprise edition \| \| +-----------------------------------+-------+--------------------------------------------------------------------------+-------+ RAGFlow(admin)> stop user 'abc' ingestion tasks 'created; +-----------------------------------+-------+--------------------------------------------------------------------------+----------+-------+ \| command \| email \| error \| status \| tasks \| +-----------------------------------+-------+--------------------------------------------------------------------------+----------+-------+ \| stop_ingestion_tasks_by_condition \| abc \| 'Stop ingestion tasks by condition' is implemented in enterprise edition \| created; \| \| +-----------------------------------+-------+--------------------------------------------------------------------------+----------+-------+ RAGFlow(admin)> stop user 'abc' ingestion tasks 'create'; +-----------------------------------+-------+--------------------------------------------------------------------------+--------+-------+ \| command \| email \| error \| status \| tasks \| +-----------------------------------+-------+--------------------------------------------------------------------------+--------+-------+ \| stop_ingestion_tasks_by_condition \| abc \| 'Stop ingestion tasks by condition' is implemented in enterprise edition \| create \| \| +-----------------------------------+-------+--------------------------------------------------------------------------+--------+-------+ RAGFlow(admin)> remove user 'abc' ingestion tasks 'create'; +-------------------------------------+-------+----------------------------------------------------------------------------+--------+-------+ \| command \| email \| error \| status \| tasks \| +-------------------------------------+-------+----------------------------------------------------------------------------+--------+-------+ \| remove_ingestion_tasks_by_condition \| abc \| 'Remove ingestion tasks by condition' is implemented in enterprise edition \| create \| \| +-------------------------------------+-------+----------------------------------------------------------------------------+--------+-------+ RAGFlow(admin)> remove user 'abc' ingestion tasks; +-------------------------------------+-------+----------------------------------------------------------------------------+-------+ \| command \| email \| error \| tasks \| +-------------------------------------+-------+----------------------------------------------------------------------------+-------+ \| remove_ingestion_tasks_by_condition \| abc \| 'Remove ingestion tasks by condition' is implemented in enterprise edition \| \| +-------------------------------------+-------+----------------------------------------------------------------------------+-------+ ``` ### Type of change - [x] New Feature (non-breaking change which adds functionality) --------- Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-06-18 17:50:21 +08:00
Jin Hai	5eedd13d49	Go: add command, show tasks summary (#16187 ) ### What problem does this PR solve? RAGFlow(admin)> show tasks summary; +---------+-----------------------------------------------------------------+ \| field \| value \| +---------+-----------------------------------------------------------------+ \| command \| show_users_quota_summary \| \| error \| 'Show users quota summary' is implemented in enterprise edition \| +---------+-----------------------------------------------------------------+ ### Type of change - [x] New Feature (non-breaking change which adds functionality) Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-06-18 17:09:20 +08:00
Jin Hai	20d11648a4	Go: add statistics command (#16119 ) ### What problem does this PR solve? Prepare for enterprise command ### Type of change - [x] New Feature (non-breaking change which adds functionality) --------- Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-06-18 15:21:44 +08:00
Haruko386	27d723e13a	fix: fix some bugs in check_conn and drop_inst (#16180 ) ### What problem does this PR solve? As title: ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-06-18 14:19:46 +08:00
Hz_	69dbc44983	feat(go-api): migrate MCP server detail and download API to Go (#16113 ) ### What problem does this PR solve? - Migrated MCP server detail and export (download) API from Python to Go. - Registered route: `GET /api/v1/mcp/servers/:mcp_id` (supporting `?mode=download` query parameter).	2026-06-18 11:09:22 +08:00
Hz_	f59332bc37	feat(go-api): implement Go-side document PATCH API & align parsing/metadata sync behavior (#15975 ) ### What problem does this PR solve? This PR implements the Go backend counterpart for the document partial update API: `PATCH /api/v1/datasets/:dataset_id/documents/:document_id` ### Type of change - [x] New Feature (non-breaking change which adds functionality) - [x] Refactoring	2026-06-18 11:08:47 +08:00
Hz_	065797b047	Refactor(go-cli): improve variable and label naming in CLI parseAddModel (#16145 ) ### What problem does this PR solve? This PR improves code readability in the CLI parser by renaming the loop index `i` to `modelIndex`. It also renames the loop label `A` to `optionsLoop` to align with standard Go naming conventions. ### Type of change - [x] Refactoring	2026-06-17 20:21:42 +08:00
BitToby	2ab9256e8a	fix(go): correct OpenRouter streaming URL routing and reasoning parameter (#16111 ) ### What problem does this PR solve? Fixes two bugs in the OpenRouter streaming chat request builder (`internal/entity/models/openrouter.go`, `ChatStreamlyWithSender`): 1. qwen/glm models streamed to a broken URL. The code routed any `qwen`/`glm` model to `URLSuffix.AsyncChat`, but `conf/models/openrouter.json` defines no `async_chat` suffix (empty), so the request was POSTed to `<base>/` instead of `<base>/chat/completions` — breaking streaming for every qwen/glm model. The non-stream path has no such branch. Fix: all models use the standard `Chat` suffix, consistent with the non-stream path. 2. Streaming reasoning was never enabled. The request set reasoning via a non-standard `thinking` key, which OpenRouter ignores. OpenRouter's API — and this provider's own non-stream request (line ~110) and its streamed `delta.reasoning` parser (line ~311) — use the `reasoning` object. Fix: send `reasoning: {"enabled": <thinking>}` (and `{"effort": ...}` when set, taking precedence as in the non-stream path). Closes #16110 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-06-17 19:14:13 +08:00
Hunnyboy1217	e178c81bb4	refactor(go-models): harden Ollama ListModels and route through ParseListModel (#15853 ) (#15955 ) ### What problem does this PR solve? Part of #15853 (provider model-list refactor). Refactors Ollama `ListModels` onto the shared `ParseListModel` pattern and fixes two correctness issues: - Endpoint: switch the models suffix from `api/ps` (only currently-running models) to `api/tags` (all installed models) — the latter is what a model picker should show. - Parsing: Ollama returns `{"models":[{"name","model"}]}`, a non-OpenAI shape. Decode it into a typed struct, map the names into `ModelList`, then enrich through `ParseListModel`. This removes the previous unchecked type assertions (`result["models"].([]interface{})` / `.(map[string]interface{})` / `.(string)`) that panicked when the body was missing the `models` array or any field, and adds a fallback to the `model` field when `name` is blank. - Drops the no-op GET request body and a dead base-URL reassignment. #### Drive-by fix Shared gitee_test.go `DSModelList` -> `ModelList` compile fix (renamed in #15900) so the models test package builds; auto-resolves against the sibling #15853 PRs. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) - [x] Refactoring	2026-06-17 18:47:27 +08:00
Zhichang Yu	e45659868a	feat(agent): ship the Go agent canvas port — eino interrupt/resume + Redis check-pointing (#16035 ) Replaces the Python agent canvas runtime with a Go implementation that runs inside `cmd/server_main`. The canvas compiles into an eino Workflow that pauses on wait-for-user via native Interrupt/Resume (no sentinel flag) and resumes from a Redis-backed CheckPointStore. All 21 Python agent components and ~35 tools are ported with functional parity. Sandbox providers now read their JSON config from the admin-panel system_settings table with env fallback. 234 files / +35,413 / -6,111. All Go files are gofmt-clean (CI gate added); drops the v2 DSL E2E step and the gap-analysis plan (both redundant after the port ships). ## Type of change - [x] Refactoring - [x] New feature - [x] Bug fix 🤖 Generated with [Claude Code](https://claude.com/claude-code) --------- Co-authored-by: Claude <noreply@anthropic.com>	2026-06-17 13:24:03 +08:00
euvre	9bd53ce675	fix: return full record in get_ingestion_log (#16120 ) ### What problem does this PR solve? The `get_ingestion_log` endpoint (both Python `dataset_api_service.get_ingestion_log` and Go `DatasetService.GetIngestionLog`) was returning only the dataset-level field set, which omits critical fields such as `dsl`, `document_id`, `parser_id`, `document_name`, `pipeline_id`, etc. This caused the front-end dataflow-result page to be unable to render the pipeline timeline and chunks when viewing a single ingestion log, regardless of whether the log was a dataset-level operation (graph/raptor/mindmap) or a per-file parse. ### Background `PipelineOperationLogService` provides two field sets: \| Method \| Fields \| \|---\|---\| \| `get_dataset_logs_fields` \| Minimal set (progress, status, timestamps, etc.) \| \| `get_file_logs_fields` \| Superset — includes `document_id`, `dsl`, `parser_id`, `document_name`, `pipeline_id`, … \| When listing logs, the API correctly distinguishes dataset-level vs file-level logs and uses the appropriate converter. However, when fetching a single log by ID, both the Python and Go implementations were hardcoded to the dataset-level set, dropping the extra fields that the front-end needs.	2026-06-17 13:03:51 +08:00
Hunnyboy1217	fd196f694e	feat(go-models): harden ListModels for FishAudio (#15853 ) (#15957 ) ### What problem does this PR solve? Part of #15853 (provider model-list refactor). Final two providers. - voyage: Voyage AI exposes no live model-list endpoint — its public API only has `/v1/embeddings` and `/v1/rerank` — so the previous `ListModels` was a `no such method` stub. Replace it with a static-catalog listing sourced from the loaded provider definition, carrying each model's `max_tokens`, `model_types`, and embedding `dimensions`. `list models from voyage` now returns the 13-model catalog instead of erroring. - fishaudio: route the existing `/model` voice listing through the shared `ParseListModel` helper for consistency; keep the human-readable `title` as the model name and fall back to `_id` when a title is blank. #### Drive-by fix Shared gitee_test.go `DSModelList` -> `ModelList` compile fix (renamed in #15900); auto-resolves against the sibling #15853 PRs. ### Type of change - [x] New Feature (non-breaking change which adds functionality) - [x] Refactoring Co-authored-by: Haruko386 <tryeverypossible@163.com>	2026-06-17 11:56:20 +08:00
Hz_	b48f03d0f5	feat(go/dao): migrate chat channel database entity and DAO to Go (#16055 ) ## Changes 1. Entity (`internal/entity/chat_channel.go`): - Implemented `ChatChannel` struct mapping the `chat_channel` database table. - Declared `ChatChannelListResponse` as a DTO to filter out sensitive credentials (`config` field) and fetch the associated `dialog_name` via left join. 2. GORM Migration (`internal/dao/database.go`): - Registered `&entity.ChatChannel{}` in the `dataModels` array inside `InitDB()` to enable safe GORM schema synchronization. 3. DAO (`internal/dao/chat_channel.go`): - Implemented `ChatChannelDAO` wrapping GORM CRUD methods (`Create`, `GetByID`, `UpdateByID`, `DeleteByID`). - Implemented `ListByTenantID` performing a `LEFT JOIN` on the `dialog` table to retrieve `dialog_name` while excluding `config` values to avoid credential leaks. 4. Test (`internal/dao/chat_channel_test.go`): - Added integration unit tests testing the full CRUD lifecycle and GORM left-join mapping list querying.	2026-06-17 11:26:13 +08:00
Jin Hai	6865039a22	Go: add more start server parameters (#16093 ) ### What problem does this PR solve? ``` $ ./bin/ragflow_server --version RAGFlow version: v0.26.0-65-g549f6109c $ ./bin/ragflow_server --debug # start server with debug log level $ ./bin/admin_server --version RAGFlow version: v0.26.0-65-g549f6109c $ ./bin/admin_server --debug # start server with debug log level $ ./bin/admin_server --init-superuser # init default superuser $ ./bin/ingestor --version RAGFlow version: v0.26.0-68-g6f6c39706 $ ./bin/ingestor --debug ``` ### Type of change - [x] New Feature (non-breaking change which adds functionality) --------- Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-06-16 20:27:37 +08:00
Rander	1235da7093	refactor(paddleocr): migrate from sync API to async Job API (#15967 ) ## Summary Migrate PaddleOCR integration from the deprecated synchronous HTTP API to the new asynchronous Job API (`submit → poll → fetch`), aligning with PaddleOCR 3.6.0+ architecture. ## Changes ### Python (`deepdoc/parser/paddleocr_parser.py`) - Replace synchronous `requests.post()` with async Job API flow (submit → poll → fetch) - Authentication: `token {token}` → `Bearer {token}` - File transfer: base64 JSON body → multipart file upload - Polling: exponential backoff (initial 3s, ×1.5, max 15s, timeout controlled by `request_timeout`) - Result: fetch full JSONL from result URL, preserving `prunedResult` with bbox info for crop functionality - Rename `api_url` → `base_url` (backward compatible: `api_url` still accepted as fallback) ### Python (`rag/llm/ocr_model.py`) - Prefer `paddleocr_base_url` / `PADDLEOCR_BASE_URL`, fallback to `paddleocr_api_url` / `PADDLEOCR_API_URL` ### Go (`internal/entity/models/paddleocr.go`) - Add `Client-Platform: ragflow` header to submit and poll requests - Change polling from fixed 3s to exponential backoff (initial 3s, ×1.5, max 15s) ### Python (`common/constants.py`) - Add `PADDLEOCR_BASE_URL` to env keys and default config ## Backward Compatibility - Old env var `PADDLEOCR_API_URL` still works (used as fallback) - Frontend field `paddleocr_api_url` still works (backend reads it as fallback) - No user-facing configuration changes required for existing setups ## Why not use the `paddleocr` SDK package directly? RAGFlow's `_transfer_to_sections()` relies on `prunedResult` (containing `block_bbox`, `block_label`, `parsing_res_list`) from the raw API response for PDF crop functionality. The SDK's public `parse_document()` API only returns `DocParsingResult` with `markdown_text`, discarding the bbox data. Therefore we implement the async Job API flow directly via HTTP, following the same logic as the SDK internally.	2026-06-16 19:34:21 +08:00
Jin Hai	3d8bc76e27	Go refactor: merge similar functions (#16098 ) ### What problem does this PR solve? Merge password related functions ### Type of change - [x] Refactoring Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-06-16 19:26:42 +08:00
Jin Hai	509e5b0fed	Fix auto migration issue (#16081 ) ### What problem does this PR solve? Fix DB migration issue. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) --------- Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-06-16 17:02:35 +08:00
Jin Hai	fad82fd1c0	Go: fix register user (#16058 ) ### What problem does this PR solve? Fix register user ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) --------- Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-06-16 14:03:53 +08:00
Hz_	0c92a38055	feat(go-cli): support add models with embedding type (#16020 ) ### What problem does this PR solve? This PR enhances the CLI parser to support dimension configurations for custom embedding models. Users can now specify the maximum dimension and other supported dimensions directly after the embedding keyword. ``` add model 'x1 x2 x3 x4 x5' to provider 'vllm' instance 'test' with tokens 1024 chat think vision, token 2048 chat, token 1024 think vision, token 0 embedding 2048 64 1024 2048, token 0 embedding 2048; ``` - The first integer following embedding represents the max_dimension. - Any subsequent integers represent specific alternative dimensions. - If no subsequent integers are provided, dimensions defaults to empty, indicating all sizes under max_dimension are supported.	2026-06-16 12:53:43 +08:00
Hz_	3d7b45bbd7	feat(go-api): support setting tenant default models by model_id (#16030 ) ### Description Currently, when setting tenant default models (e.g., chat, embedding, rerank), the API only accepts the composite name (`model_name@model_instance@model_provider`). However, some integrations and front-end features prefer using the database `model_id` (UUID) directly. This PR adds support for `model_id` in default model configuration: 1. Request Binding: Added `model_id` (optional field) to the request body schema in the handler. 2. Database Lookup: If `model_id` is supplied, the service queries the database to resolve the respective provider, instance, and model names. 3. Security Validation: Verified that the provider associated with the resolved `model_id` belongs to the requesting tenant. 4. Unit Tests: Added `TestSetTenantDefaultModels_WithModelID` to verify DB ID resolution and tenant mapping.	2026-06-16 12:53:03 +08:00
Yingfeng	956357b997	Feat: add harness-go framework —— agent core (#16045 ) ### What problem does this PR solve? core module for agent layer built on top of graph engine #16039 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2026-06-16 11:39:48 +08:00
Haruko386	efdd58df66	feat[Go] add max_dimension and dimensions for ModelRequest (#16019 ) ### What problem does this PR solve? As title ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2026-06-16 10:31:27 +08:00
Yingfeng	e7c068747e	Feat: add harness-go framework —— graph engine (#16039 ) ### What problem does this PR solve? go-version of Pregel-based BSP engine ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2026-06-15 21:36:39 +08:00
Jin Hai	417f805bd9	Go: add API mode check in file system command (#16022 ) ### What problem does this PR solve? As title. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-06-15 16:37:47 +08:00
Jin Hai	e3cb86d540	Go: parse HTML file (#16018 ) ### What problem does this PR solve? ``` RAGFlow(api/default)> parse file 'test.html'; Parsing HTML file: test.html <html> ...... ``` Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-06-15 15:49:17 +08:00
Haruko386	0480dee83f	fix: output 2 lines when list-supported models (#16015 ) ### What problem does this PR solve? ``` RAGFlow(api/default)> list supported models from 'longcat' 'test' +-----------+------------+---------------+------------+-------------+-----------------------------+----------+ \| dimension \| dimensions \| max_dimension \| max_tokens \| model_types \| name \| thinking \| +-----------+------------+---------------+------------+-------------+-----------------------------+----------+ \| \| \| \| \| \| LongCat-2.0-Preview@LongCat \| \| \| \| \| \| \| \| LongCat-2.0-Preview@LongCat \| \| +-----------+------------+---------------+------------+-------------+-----------------------------+----------+ # Fixed: RAGFlow(api/default)> list supported models from 'longcat' 'test' +------------+---------------+------------+-------------+-----------------------------+----------+ \| dimensions \| max_dimension \| max_tokens \| model_types \| name \| thinking \| +------------+---------------+------------+-------------+-----------------------------+----------+ \| \| \| \| \| LongCat-2.0-Preview@LongCat \| \| +------------+---------------+------------+-------------+-----------------------------+----------+ ``` ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-06-15 15:26:35 +08:00
Jin Hai	2846216674	Go: add Markdown parser (#16016 ) ### What problem does this PR solve? ``` RAGFlow(api/default)> parse file 'README.md'; Parsing Markdown file: README.md --- AST tree: HTMLBlock '<div align="center">\n<a href="https:…' ``` ### Type of change - [x] New Feature (non-breaking change which adds functionality) Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-06-15 15:07:29 +08:00
Jin Hai	fcebcebe1e	Move REDIS to engine dir (#16006 ) ### What problem does this PR solve? as title. ### Type of change - [x] Refactoring Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-06-15 14:44:16 +08:00
Hz_	bc963f8cf2	refactor(go): replace GenerateUUID1 with GenerateToken for entity IDs (#16010 ) ### Description - Refactor: Replaced `utility.GenerateUUID1` (UUID v1) with `utility.GenerateToken` (UUID v4) for generating entity IDs (`userID`, `kbID`, `modelID`, etc.). - Cleanup: Removed the unused `GenerateUUID1` function from `utility` package. - Improvement: Simplified ID generation logic and eliminated unnecessary error handling boilerplate since `GenerateToken` cannot fail.	2026-06-15 14:06:07 +08:00
Yingfeng	b5bea72e4b	Add git-like file commit API (#15978 ) ### What problem does this PR solve? \| # \| Method \| Endpoint \| Description \| Git Equivalent \| \|---\|--------\|----------\|-------------\|----------------\| \| 1 \| `POST` \| `/api/v1/{prefix}/{folder_id}/commits` \| Create a snapshot commit with file changes (add/modify/delete/rename) \| `git add` + `git commit` \| \| 2 \| `GET` \| `/api/v1/{prefix}/{folder_id}/commits` \| List commit history (paginated) \| `git log` \| \| 3 \| `GET` \| `/api/v1/{prefix}/{folder_id}/commits/{commit_id}` \| Get commit detail with file changes \| `git show` \| \| 4 \| `GET` \| `/api/v1/{prefix}/{folder_id}/commits/{commit_id}/files` \| List file changes in a commit \| `git show --name-status` \| \| 5 \| `GET` \| `/api/v1/{prefix}/{folder_id}/commits/diff?from=...&to=...` \| Compare two commits and return differences \| `git diff` \| \| 6 \| `GET` \| `/api/v1/{prefix}/{folder_id}/changes` \| Get uncommitted changes (add/modify/delete) \| `git status` \| \| 7 \| `GET` \| `/api/v1/{prefix}/{folder_id}/commits/{commit_id}/tree` \| Get the folder tree snapshot at commit time \| `git ls-tree` \| \| 8 \| `GET` \| `/api/v1/{prefix}/{folder_id}/commits/{commit_id}/files/{file_id}/content` \| Get a file's content as it existed in a specific commit \| `git show HEAD:file` \| \| 9 \| `GET` \| `/api/v1/{prefix}/{file_id}/versions` \| Get version history for a specific file across all commits \| `git log -- file` \| Where `{prefix}/{id}` can be: - `folders/{folder_id}` — direct folder access - `workspaces/{workspace_id}` — alias of `folders/{folder_id}` - `datasets/{dataset_id}` — resolves to the dataset's folder - `memories/{memory_id}` — resolves to the memory's folder - `skills/{skill_id}` — resolves to the skill's folder ### Type of change - [x] New Feature (non-breaking change which adds functionality) - [x] Documentation Update	2026-06-15 11:19:56 +08:00
Jin Hai	32d5c0039b	Go: refactor model API to accept model id (#15999 ) ### What problem does this PR solve? Not not only model_name@instance_name@provider_name is acceptable, but also model_id is acceptable. ### Type of change - [x] New Feature (non-breaking change which adds functionality) --------- Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-06-15 10:10:14 +08:00
Jin Hai	e89afbae21	Go: file parser config (#15989 ) ### What problem does this PR solve? Add parser config ### Type of change - [x] New Feature (non-breaking change which adds functionality) Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-06-13 19:40:43 +08:00
Jin Hai	d32e05d560	Go: add more file parser (#15979 ) ### What problem does this PR solve? Now we can parse 'pptx', 'ppt', 'doc', 'xls', 'xlsx' ``` RAGFlow(api/default)> parse file 'test.pptx'; Parsing PPTX file: test.pptx Document format: pptx ``` ### Type of change - [x] New Feature (non-breaking change which adds functionality) Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-06-12 23:28:14 +08:00
Zhichang Yu	3fa15c0e2f	feat(agent): Go port — canvas engine, 22 components, DSL v2, 13 endpoints (#15952 ) Ports the agent canvas subsystem from Python to Go. ## What's included ### Canvas Engine (Phase 0/1) - State engine, scheduler, variable resolver, Redis checkpoint store, cancel protocol - 209 tests across canvas / component / io packages ### 22 Components (P0–P4) \| Tier \| Components \| \|---\|---\| \| P0 T1+T2+T3 \| LLM, Agent, ExitLoop, Switch, Categorize, Begin, Message, Invoke \| \| P1 T3 \| VariableAggregator, VariableAssigner, StringTransform, ListOperations, DataOperations \| \| P2 T3 \| Iteration, IterationItem, Loop, LoopItem \| \| P3 T3 \| UserFillUp, Fillup \| \| P4 T5 \| Browser, ExcelProcessor, DocsGenerator \| ### DSL v2 Schema (Phase 2.5) - Typed v2 in-memory model with v1-to-v2 auto-detect converter - v1 legacy field stripping per plan §2.11.7 ### HTTP Endpoints & Bug Fixes (Plans PR1–PR3) - DELETE SQL bug fix: gorm v2 `Where("id = ?", id).Delete(...)` pattern - CreateAgent validation: title/DSL required, duplicate check, 103 envelope - 13 new endpoints: templates, prompts, tags, sessions CRUD, chat/completions (SSE + non-stream stubs), rerun, test_db_connection, logs, webhook/logs - 756 Go unit tests (745 → 756, +18) - 17 → 0 Python integration test failures (test_agents.py + test_session_management/) ### Tools 21 eino tools: HTTPHelper, search tools, financial/data tools, mandatory stubs ### Infrastructure OTel observability, NATS message queue, DeepDoc gRPC client, SSRF guards, IDOR mitigation	2026-06-12 22:58:28 +08:00
bitloi	cafa0f2e4f	fix: SSE write timeout (#15852 ) ### What problem does this PR solve? Fixes #15840. The Go HTTP server sets `WriteTimeout: 120s`, which also applies to long-lived SSE responses. Existing Go streaming handlers did not clear the per-response write deadline, so streams that run longer than the server timeout can be terminated mid-response. This PR adds a small handler helper that clears the response write deadline for SSE requests and calls it only in existing Go streaming branches: - conversation completion streaming - provider chat streaming - provider transcription streaming - provider speech streaming The global server `WriteTimeout` remains unchanged for non-streaming requests. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) ### Test plan - `/root/go/bin/go test ./internal/handler -run TestDisableWriteDeadlineForSSEAllowsLongLivedStream -count=1` - `/root/go/bin/go test ./internal/handler -count=1`	2026-06-12 20:49:34 +08:00
Jin Hai	234f1b7cff	Go: add office_oxide and parse docx file. (#15976 ) ### What problem does this PR solve? As title. ### Type of change - [x] New Feature (non-breaking change which adds functionality) --------- Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-06-12 20:28:15 +08:00
Haruko386	547139da29	fix(Go-models): preserve model name lookup when aliases exist (#15969 ) ### What problem does this PR solve? As title ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) - [x] Documentation Update	2026-06-12 19:15:28 +08:00
Yingfeng	5a7d7771a3	Decouple skill space from Python API (#15971 ) ### What problem does this PR solve? Make skill space independent of Python filesystem API ### Type of change - [x] Refactoring	2026-06-12 18:18:55 +08:00
Jin Hai	115b730d07	Go: parse ingestion DSL (#15938 ) PR #15938 ### Type of change - [x] New Feature (non-breaking change which adds functionality) --------- Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-06-12 17:58:36 +08:00
bitloi	22a058f56c	fix(go): redact internal handler errors (#15746 ) ### What problem does this PR solve? Refs #15743 Some Go API handlers return raw `err.Error()` strings in `CodeServerError` responses. Those errors can include internal backend details such as database, storage, search engine, or host information. This PR adds a small shared `jsonInternalError` helper for handler-level internal failures. The helper logs the raw error server-side with request method/path context, then returns the existing generic `common.CodeServerError.Message()` to API clients. This first slice migrates the existing `jsonError(c, common.CodeServerError, err.Error())` production call sites in agent, dataset graph, file, and system handlers. It intentionally does not close the full issue because direct `c.JSON` error responses in other handlers remain for follow-up PRs. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) - [ ] New Feature (non-breaking change which adds functionality) - [ ] Documentation Update - [ ] Refactoring - [ ] Performance Improvement - [ ] Other (please describe): ### Tests - `/root/go/bin/go test ./internal/handler -count=1` --------- Co-authored-by: Yingfeng <yingfeng.zhang@gmail.com>	2026-06-12 16:09:10 +08:00
Jin Hai	e96bc37d06	Go: use NATS as the message queue (#15327 ) ### What problem does this PR solve? ``` RAGFlow(admin)> mq publish 'msg2'; SUCCESS RAGFlow(admin)> mq publish 'msg3'; SUCCESS RAGFlow(admin)> mq list; +---------+---------------+ \| message \| subject \| +---------+---------------+ \| msg1 \| tasks.RAGFLOW \| \| msg2 \| tasks.RAGFLOW \| \| msg3 \| tasks.RAGFLOW \| +---------+---------------+ RAGFlow(admin)> mq pull 2; +---------+---------------+ \| message \| subject \| +---------+---------------+ \| msg1 \| tasks.RAGFLOW \| \| msg2 \| tasks.RAGFLOW \| +---------+---------------+ RAGFlow(admin)> mq pull noack; +---------+---------------+ \| message \| subject \| +---------+---------------+ \| abc \| tasks.RAGFLOW \| +---------+---------------+ RAGFlow(admin)> mq show +-------------------+----------------+--------+---------------+---------------+-------------------+---------------+ \| ack_pending_count \| consumer_count \| memory \| message_count \| pending_count \| redelivered_count \| waiting_count \| +-------------------+----------------+--------+---------------+---------------+-------------------+---------------+ \| 2 \| 1 \| 0 \| 2 \| 0 \| 1 \| 0 \| +-------------------+----------------+--------+---------------+---------------+-------------------+---------------+ RAGFlow(admin)> list ingestors; +--------------+-------------------------------------------+--------+ \| host \| name \| status \| +--------------+-------------------------------------------+--------+ \| 192.168.1.38 \| ingestor-8f0e4bd5650a4ac58b0151969fbf6935 \| alive \| +--------------+-------------------------------------------+--------+ RAGFlow(admin)> list ingestion tasks; +----------------------------------+----------------------------------+-----------+------+-------------+----------------------------------+ \| document_id \| id \| status \| step \| user \| user_id \| +----------------------------------+----------------------------------+-----------+------+-------------+----------------------------------+ \| ffe64fae423411f1a2d938a74640adcc \| 90d3d0f6528941c1ac8eb0360effccc4 \| COMPLETED \| 5 \| aaa@aaa.com \| 2ba4881420fa11f19e9c38a74640adcc \| +----------------------------------+----------------------------------+-----------+------+-------------+----------------------------------+ RAGFlow(admin)> remove ingestion tasks '90d3d0f6528941c1ac8eb0360effccc4'; +---------+----------------------------------+ \| delete \| task_id \| +---------+----------------------------------+ \| success \| 90d3d0f6528941c1ac8eb0360effccc4 \| +---------+----------------------------------+ RAGFlow(admin)> stop ingestion tasks 'e89e20d9a25848a1b79bd9345ddbfe1d'; +----------+----------------------------------+ \| status \| task_id \| +----------+----------------------------------+ \| STOPPING \| e89e20d9a25848a1b79bd9345ddbfe1d \| +----------+----------------------------------+ # Publish a message RAGFlow(admin)> mq publish 'cdd'; SUCCESS # List current tasks in the message queue RAGFlow(admin)> mq list +----------------------------------+---------------+ \| message \| subject \| +----------------------------------+---------------+ \| 7ce392a3c1624cd2be4b5276e8825059 \| tasks.RAGFLOW \| +----------------------------------+---------------+ # Consume a task from the message queue RAGFlow(admin)> mq pull +------+-----+----------------+ \| ack \| id \| type \| +------+-----+----------------+ \| true \| cdd \| ingestion_test \| +------+-----+----------------+ # User mode # List ingestion tasks, followed by dataset id RAGFlow(user)> list ingestion tasks from '0abe79f9423311f1ad8d38a74640adcc'; +---------------------------+---------------+----------------------------------+----------------------------------+----------------------------------+--------+-----------+---------------------------+---------------+----------------------------------+ \| create_date \| create_time \| dataset_id \| document_id \| id \| schema \| status \| update_date \| update_time \| user_id \| +---------------------------+---------------+----------------------------------+----------------------------------+----------------------------------+--------+-----------+---------------------------+---------------+----------------------------------+ \| 2026-05-30T20:21:06+08:00 \| 1780143666289 \| 0abe79f9423311f1ad8d38a74640adcc \| ffe64fae423411f1a2d938a74640adcc \| 8d758cd14a8b4ba8ab505003fb52017d \| \| COMPLETED \| 2026-05-30T20:21:26+08:00 \| 1780143686431 \| 2ba4881420fa11f19e9c38a74640adcc \| +---------------------------+---------------+----------------------------------+----------------------------------+----------------------------------+--------+-----------+---------------------------+---------------+----------------------------------+ RAGFlow(user)> list ingestion tasks; +---------------------------+---------------+----------------------------------+----------------------------------+----------------------------------+--------+-----------+---------------------------+---------------+----------------------------------+ \| create_date \| create_time \| dataset_id \| document_id \| id \| schema \| status \| update_date \| update_time \| user_id \| +---------------------------+---------------+----------------------------------+----------------------------------+----------------------------------+--------+-----------+---------------------------+---------------+----------------------------------+ \| 2026-06-02T19:02:31+08:00 \| 1780398151417 \| 0abe79f9423311f1ad8d38a74640adcc \| ffe64fae423411f1a2d938a74640adcc \| e89e20d9a25848a1b79bd9345ddbfe1d \| \| COMPLETED \| 2026-06-02T19:02:52+08:00 \| 1780398172208 \| 2ba4881420fa11f19e9c38a74640adcc \| +---------------------------+---------------+----------------------------------+----------------------------------+----------------------------------+--------+-----------+---------------------------+---------------+----------------------------------+ # Create an ingestion task # First argument is document id, second argument is dataset id RAGFlow(user)> start ingestion 'ffe64fae423411f1a2d938a74640adcc' from '0abe79f9423311f1ad8d38a74640adcc'; +----------------------------------+-------------------------------------------+ \| document_id \| result \| +----------------------------------+-------------------------------------------+ \| ffe64fae423411f1a2d938a74640adcc \| task_id: 8d758cd14a8b4ba8ab505003fb52017d \| +----------------------------------+-------------------------------------------+ # Pause an ingestion task, first argument is ingestion id RAGFlow(user)> stop ingestion '8d758cd14a8b4ba8ab505003fb52017d'; +---------------------------+---------------+----------------------------------+----------------------------------+----------------------------------+--------+-----------+---------------------------+---------------+----------------------------------+ \| create_date \| create_time \| dataset_id \| document_id \| id \| schema \| status \| update_date \| update_time \| user_id \| +---------------------------+---------------+----------------------------------+----------------------------------+----------------------------------+--------+-----------+---------------------------+---------------+----------------------------------+ \| 2026-05-30T20:21:06+08:00 \| 1780143666289 \| 0abe79f9423311f1ad8d38a74640adcc \| ffe64fae423411f1a2d938a74640adcc \| 8d758cd14a8b4ba8ab505003fb52017d \| \| COMPLETED \| 2026-05-30T20:21:26+08:00 \| 1780143686431 \| 2ba4881420fa11f19e9c38a74640adcc \| +---------------------------+---------------+----------------------------------+----------------------------------+----------------------------------+--------+-----------+---------------------------+---------------+----------------------------------+ # Delete an ingestion task RAGFlow(api/default)> remove ingestion tasks 'f366450a27d54677aec1c7090add30f0'; +---------+----------------------------------+ \| remove \| task_id \| +---------+----------------------------------+ \| success \| f366450a27d54677aec1c7090add30f0 \| +---------+----------------------------------+ ``` ### Type of change - [x] New Feature (non-breaking change which adds functionality) --------- Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-06-12 14:56:44 +08:00

1 2 3 4 5 ...

421 Commits