ragflow

mirror of https://github.com/infiniflow/ragflow.git synced 2026-06-29 23:41:12 +08:00

Author	SHA1	Message	Date
writinwaters	d4147efc66	Docs: (#14492 ) ### What problem does this PR solve? Added v0.25.1 release notes ### Type of change - [x] Documentation Update	2026-04-29 20:29:58 +08:00
Wang Qi	c4d0b0ebcf	Fix visit dataset error (#14490 ) ### What problem does this PR solve? Fix visit dataset error ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-04-29 20:17:00 +08:00
balibabu	1692f0928f	Fix: The pipeline column header in the FileLogsTable is displaying incorrectly. (#14489 ) ### What problem does this PR solve? Fix: The pipeline column header in the FileLogsTable is displaying incorrectly. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-04-29 19:52:28 +08:00
writinwaters	9280c64518	Docs: Updated Title chunker references (#14483 ) ### What problem does this PR solve? Updated Title chunker references ### Type of change - [x] Documentation Update	2026-04-29 19:37:24 +08:00
Jin Hai	261be81127	Go: add drop instance models (#14485 ) ### What problem does this PR solve? 1. drop instance model 2. Fix issue of drop instance but not drop models. ### Type of change - [x] New Feature (non-breaking change which adds functionality) Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-04-29 19:18:49 +08:00
Haruko386	0e1477eb23	Go: implement provider: MiniMax (#14478 ) ### What problem does this PR solve? implement MiniMax provider ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) - [x] New Feature (non-breaking change which adds functionality)	2026-04-29 19:06:40 +08:00
Magicbook1108	de8c6ad0f3	Feat: enable sync deleted file for Discord (#14451 ) ### What problem does this PR solve? Feat: enable sync deleted file for Discord ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2026-04-29 19:05:40 +08:00
bitloi	2bc8c6d35e	feat(dropbox): support deleted-file sync (#14476 ) ### What problem does this PR solve? Partially addresses #14362 by adding deleted-file sync support for the Dropbox data source. Dropbox previously did not provide the slim current-file snapshot required by stale document reconciliation, and its sync runner returned only document batches. As a result, enabling deleted-file sync could not remove local documents that had been deleted from Dropbox. This PR: - Adds `retrieve_all_slim_docs_perm_sync()` to `DropboxConnector`. - Reuses Dropbox metadata traversal to collect current remote file IDs without downloading file contents. - Wires incremental Dropbox sync to return `(document_generator, file_list)` when `sync_deleted_files` is enabled. - Enables the deleted-file sync toggle for Dropbox in the data source settings UI. - Adds regression coverage for slim snapshots, nested folders, paginated listings, duplicate filenames, and full reindex behavior. Tests: - `uv run pytest test/unit_test/common/test_dropbox_connector.py -q` - `uv run pytest test/unit_test/rag/test_sync_data_source.py -q` - `uv run pytest test/unit_test/common/test_dropbox_connector.py test/unit_test/rag/test_sync_data_source.py -q` - `uv run ruff check common/data_source/dropbox_connector.py rag/svr/sync_data_source.py test/unit_test/common/test_dropbox_connector.py test/unit_test/rag/test_sync_data_source.py` - `./node_modules/.bin/eslint src/pages/user-setting/data-source/constant/index.tsx` ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2026-04-29 19:05:11 +08:00
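The return shape described in the commit above, `(document_generator, file_list)` when `sync_deleted_files` is enabled, can be sketched as follows. This is only an illustration: `load_batches`, `list_remote_ids`, `run_sync`, and `stale_ids` are hypothetical names, not functions from `DropboxConnector` or `rag/svr/sync_data_source.py`.

```python
from typing import Iterator


def load_batches() -> Iterator[list[str]]:
    """Hypothetical stand-in for a connector that yields document batches."""
    yield ["doc-1", "doc-2"]
    yield ["doc-3"]


def list_remote_ids() -> list[str]:
    """Hypothetical metadata-only listing: IDs only, no file contents."""
    return ["doc-1", "doc-2", "doc-3"]


def run_sync(sync_deleted_files: bool):
    """Return batches alone, or (batches, file_list) when deletion sync is on."""
    batches = load_batches()
    if sync_deleted_files:
        return batches, list_remote_ids()
    return batches


def stale_ids(local_ids: set[str], file_list: list[str]) -> set[str]:
    """Local IDs missing from the remote snapshot are candidates for removal."""
    return local_ids - set(file_list)


if __name__ == "__main__":
    _, file_list = run_sync(sync_deleted_files=True)
    print(stale_ids({"doc-1", "doc-9"}, file_list))
```

Without the snapshot, the caller only sees documents that still exist remotely, so it has nothing to compare local documents against.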
Magicbook1108	db1a73b255	Feat: enable sync deleted files in gitlab (#14481 ) ### What problem does this PR solve? Feat: enable sync deleted files in gitlab ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2026-04-29 19:04:10 +08:00
euvre	a0f9ae16d2	Fix: RAPTOR "Generation scope" reset to "Single file" when selecting "Dataset" (#14477 ) ## Problem In the Dataset Configuration page, changing the RAPTOR Generation scope from "Single file" to "Dataset" and clicking Save did not persist the change. After refreshing or re-entering the page, the scope always reverted to "Single file". ## Root Cause 1. Backend: The `RaptorConfig` Pydantic model in `api/utils/validation_utils.py` was configured with `extra="forbid"` but did not declare a `scope` field. When the frontend sent `"scope": "dataset"`, Pydantic rejected the request. 2. Frontend: The `extractRaptorConfigExt` utility in `web/src/hooks/parser-config-utils.ts` treated `scope` as an unknown field and moved it into the nested `ext` object. Consequently, the backend could not read `raptor_config.get("scope", "file")` correctly, so the default `"file"` was always used. ## Changes - Added `scope: Literal["file", "dataset"]` to the backend `RaptorConfig` model with a default of `"file"`. - Added `scope` to the known-field whitelist in the frontend `extractRaptorConfigExt` helper so it is transmitted as a top-level raptor field instead of being buried in `ext`. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) --------- Signed-off-by: noob <yixiao121314@outlook.com>	2026-04-29 18:46:28 +08:00
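The frontend half of the root cause above can be reproduced with a small whitelist split. This is a Python sketch of the idea only; the real helper is the TypeScript `extractRaptorConfigExt`, and `split_raptor_config` and the field set used here are hypothetical.

```python
def split_raptor_config(raw: dict, known_fields: set[str]) -> dict:
    """Keep known fields at the top level and move everything else into 'ext'."""
    top = {k: v for k, v in raw.items() if k in known_fields}
    ext = {k: v for k, v in raw.items() if k not in known_fields}
    if ext:
        top["ext"] = ext
    return top


raw = {"use_raptor": True, "scope": "dataset"}

# Before the fix: 'scope' is not whitelisted, so it is buried in 'ext'
# and a top-level lookup falls back to the default.
before = split_raptor_config(raw, known_fields={"use_raptor"})
print(before.get("scope", "file"))

# After the fix: 'scope' is whitelisted and stays at the top level.
after = split_raptor_config(raw, known_fields={"use_raptor", "scope"})
print(after.get("scope", "file"))
```

The first lookup falls back to `"file"` even though the user chose `"dataset"`, which matches the symptom the commit describes.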
Wang Qi	1b84892e3a	Fix delete graph (#14484 ) ### What problem does this PR solve? Fix delete graph ### Type of change - [ ] Bug Fix (non-breaking change which fixes an issue)	2026-04-29 18:09:10 +08:00
Wang Qi	3991bdfaf5	Fix graph task type (#14475 ) ### What problem does this PR solve? Fix graph task type ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-04-29 17:05:56 +08:00
Jin Hai	bb05a8bd7e	Update create model instance command (#14441 ) ### What problem does this PR solve? 1. support command: ``` RAGFlow(user)> create provider 'vllm' instance 'test' key 'test-key' url 'base-url' region 'abc'; SUCCESS RAGFlow(user)> list instances from 'vllm'; +----------+----------------------------------------+----------------------------------+--------------+----------------------------------+--------+ \| apiKey \| extra \| id \| instanceName \| providerID \| status \| +----------+----------------------------------------+----------------------------------+--------------+----------------------------------+--------+ \| test-key \| {"base_url":"base-url","region":"abc"} \| 40213c89430311f1a7cf38a74640adcc \| test \| b4d40e6142d311f1a4f938a74640adcc \| enable \| +----------+----------------------------------------+----------------------------------+--------------+----------------------------------+--------+ ``` 2. support add vllm model ``` RAGFlow(user)> add model 'Qwen/Qwen2-0.5B' to provider 'vllm' instance 'test' with tokens 131072 chat; SUCCESS ``` 3. add vllm chat ### Type of change - [x] New Feature (non-breaking change which adds functionality) - [x] Refactoring --------- Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-04-29 17:05:08 +08:00
qinling0210	486ca463aa	Port PR14454 to GO (PruneDeletedChunks) (#14463 ) ### What problem does this PR solve? Port PR14454 to GO (PruneDeletedChunks) ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-04-29 17:04:22 +08:00
Magicbook1108	e0b3070012	Feat: enable sync deleted files for Gmail && fix google drive issues (#14462 ) ### What problem does this PR solve? Feat: enable sync deleted files for Gmail && fix google drive issues ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) --------- Co-authored-by: bill <yibie_jingnian@163.com> Co-authored-by: balibabu <assassin_cike@163.com>	2026-04-29 17:03:56 +08:00
balibabu	a736948493	Fix: Clicking the button in the bottom-right corner of the `/chats/widget` page fails to display the dialog box. (#14465 ) ### What problem does this PR solve? Fix: Clicking the button in the bottom-right corner of the `/chats/widget` page fails to display the dialog box. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-04-29 17:03:33 +08:00
Wang Qi	6afb1957d8	Fix query param type (#14471 ) ### What problem does this PR solve? Fix query param type ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-04-29 16:53:28 +08:00
Wang Qi	9690923516	Fix delete graphrag raptor (#14469 ) ### What problem does this PR solve? Fix delete graphrag raptor ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-04-29 16:47:42 +08:00
Haruko386	decf673049	Go: implement provider: volcengine (#14460 ) ### What problem does this PR solve? implement `volcengine` provider ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2026-04-29 15:45:08 +08:00
Wang Qi	b684c89950	Add backward compat APIs (#14427 ) ### What problem does this PR solve? Add backward compat APIs: ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-04-29 15:15:49 +08:00
buua436	c08ced09a7	Fix: add retrieval fallback comments (#14457 ) ### What problem does this PR solve? add retrieval fallback comments ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-04-29 14:44:31 +08:00
qinling0210	f3c232cf47	Remove model_bundle.go, modify chat_session.go (#14458 ) ### What problem does this PR solve? Remove model_bundle.go, modify chat_session.go ### Type of change - [x] Refactoring	2026-04-29 14:44:12 +08:00
balibabu	ce933357c6	Fix: Dataset: When configuring the "general chunk method," options such as chunk size and parent-child slicing are unavailable. (#14459 ) ### What problem does this PR solve? Fix: Dataset: When configuring the "general chunk method," options such as chunk size and parent-child slicing are unavailable. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) --------- Co-authored-by: balibabu <assassin_cike@163.com>	2026-04-29 14:37:48 +08:00
buua436	a7ce1b1677	Fix: prune deleted doc chunks from retrieval (#14454 ) ### What problem does this PR solve? prune deleted doc chunks from retrieval ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-04-29 13:03:09 +08:00
Jin Hai	b493a33316	Go: update chat URL (#14453 ) ### What problem does this PR solve? Update the URL to: /api/v1/chat/completions ### Type of change - [x] Refactoring Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-04-29 11:45:06 +08:00
Magicbook1108	3b7a6eaa6c	Feat: sync deleted files in Bitbucket (#14450 ) ### What problem does this PR solve? Feat: sync deleted files in Bitbucket ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2026-04-29 11:29:17 +08:00
Paras Sondhi	74fa54f122	feat(google-drive): optimize memory payload and enable sync deletion (#14372 ) Addresses the Google Drive integration for #14362 This PR completely overhauls the Google Drive sync logic to accurately detect remote deletions, while drastically reducing the memory footprint during the snapshot phase. ### What changed under the hood: * Killed the memory bloat: Swapped out the massive document dictionary objects for a lightweight `collections.namedtuple` (`SlimDoc = namedtuple('SlimDoc', ['id'])`). This prevents RAM spikes during `retrieve_all_slim_docs_perm_sync` on massive enterprise drives. * Flawless downstream integration: The `SlimDoc` object relies on simple duck typing. It perfectly delivers the `.id` attribute required by `ConnectorService.cleanup_stale_documents_for_task`, meaning your core `hash128` vector cleanup logic runs natively without modification. * Fixed the Shared Drive blindspot: The standard API query was missing team folders. Injected the `corpora="allDrives"` and `includeItemsFromAllDrives=True` override flags so the connector now accurately maps state across both personal workspaces and organizational Shared Drives. ### Testing: Isolated the Google API retrieval logic locally to prove the `SlimDoc` mapping works and correctly registers state drops when a file is trashed remotely. ### Type of change - [x] New Feature (non-breaking change which adds functionality) - [x] Performance Improvement	2026-04-29 10:04:36 +08:00
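A minimal sketch of the duck typing mentioned in the commit above: any object exposing `.id` works for ID collection. `SlimDoc` is defined exactly as quoted in the commit; `collect_ids` and `FullDoc` are hypothetical and only stand in for a consumer and for a heavier document object.

```python
from collections import namedtuple

# Defined as quoted in the commit message.
SlimDoc = namedtuple("SlimDoc", ["id"])


class FullDoc:
    """Hypothetical heavier document object that also exposes .id."""

    def __init__(self, id: str, content: bytes):
        self.id = id
        self.content = content


def collect_ids(docs) -> set[str]:
    """Hypothetical consumer: reads only the .id attribute of each item."""
    return {doc.id for doc in docs}


if __name__ == "__main__":
    slim = [SlimDoc(id="a"), SlimDoc(id="b")]
    full = [FullDoc("a", b"..."), FullDoc("b", b"...")]
    print(collect_ids(slim) == collect_ids(full))
```

Because the consumer never touches anything except `.id`, the snapshot phase does not need to hold file contents or full metadata in memory.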
Stephen Hu	345bec812d	refactor: improve QwenRerank logic (#14388 ) ### What problem does this PR solve? improve QwenRerank logic ### Type of change - [x] Refactoring	2026-04-28 20:17:34 +08:00
Magicbook1108	0d18b293f5	Fix: enable sync deleted file in airtable (#14438 ) ### What problem does this PR solve? Fix: enable sync deleted file in airtable ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-04-28 20:09:08 +08:00
Magicbook1108	926efbd29b	Fix: update based on #14436 (#14440 ) ### What problem does this PR solve? Fix: update based on #14436 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-04-28 20:08:42 +08:00
euvre	35f6d81b73	Refactor: migrate chunk retrieval_test and knowledge_graph to REST API endpoints (#14402 ) ### What problem does this PR solve? ## Summary Migrate two web API endpoints to REST-style HTTP API endpoints, following the pattern established in #14222: \| Old Endpoint \| New Endpoint \| \|---\|---\| \| `POST /v1/chunk/retrieval_test` \| `POST /api/v1/datasets/<dataset_id>/search` \| \| `GET /v1/chunk/knowledge_graph` \| `GET /api/v1/datasets/<dataset_id>/graph` \|	2026-04-28 20:00:26 +08:00
Magicbook1108	85575259ac	Fix: google authentication - gmail && google-drive (#14422 ) ### What problem does this PR solve? Fix: google authentication - gmail && google-drive ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-04-28 18:09:02 +08:00
qinling0210	dcce864d4c	Simplify Encode (#14437 ) ### What problem does this PR solve? Simplify Encode ### Type of change - [x] Refactoring	2026-04-28 18:07:42 +08:00
Magicbook1108	d532151be0	Feat: more model for paddle (#14436 ) ### What problem does this PR solve? Feat: more model for paddle ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2026-04-28 18:07:00 +08:00
Haruko386	4e5a093ac5	Go: implement provider: Moonshot (#14433 ) ### What problem does this PR solve? implement `Moonshot` provider ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2026-04-28 18:06:25 +08:00
Jack	c330005659	Fix: document level auto metadata config missing after save (#14421 ) ### What problem does this PR solve? Steps to reproduce (existing bug before API migration): 1. create a new dataset 2. upload a file 3. click on "General" in "Parse" column and then click on "switch or configure ingestion pipeline" 4. click on "Settings" (at right of "Auto metadata") 5. click "Add" to add new metadata 6. click on "Save" 7. re-open "Settings" and the newly added metadata is not there ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-04-28 17:09:23 +08:00
buua436	e6e80041f5	Fix: agent toolcall null response & schema validation & DeepSeek think history (#14425 ) ### What problem does this PR solve? agent toolcall null response & schema validation & DeepSeek think history ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-04-28 17:09:08 +08:00
Jin Hai	f670913bb4	Refactor model type to model class (#14426 ) ### What problem does this PR solve? As title ### Type of change - [x] Refactoring Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-04-28 16:05:15 +08:00
Jin Hai	7c25870923	Go: update db model (#14423 ) ### What problem does this PR solve? As title. ### Type of change - [x] Refactoring Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-04-28 16:04:55 +08:00
Magicbook1108	18fbfafca6	Feat: enable sync deleted files for more connectors (#14353 ) ### What problem does this PR solve? Feat: enable sync deleted files for connectors ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2026-04-28 15:07:14 +08:00
NeedmeFordev	0df65d358a	Fix case-insensitive matching for manual meta_data_filter in / not in list values (#14397 ) ## Summary Fixes case-asymmetric matching for manual `meta_data_filter` when using `in` / `not in` with a list `value`. Document metadata strings were lowercased, but list elements were not, so values like `"F2"` failed to match `["F2", "F11"]` even though `=` behaved correctly. Closes #14389 ## Changes - `common/metadata_utils.py`: For `in` / `not in`, normalize string elements when `value` and/or `input` is a list, consistent with scalar string lowercasing. - `test/unit_test/common/test_metadata_filter_operators.py`: Regression tests for list `value` case-insensitivity and `not in`. ## Type of change - [x] Bug fix (non-breaking)	2026-04-28 14:51:48 +08:00
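The asymmetry described in the commit above comes from lowercasing one side of the comparison but not the other. A sketch of symmetric normalization for `in` / `not in` is shown below; `matches` and `_norm` are hypothetical helpers, not the code in `common/metadata_utils.py`.

```python
def _norm(value):
    """Lowercase strings; leave other types (numbers, None) untouched."""
    return value.lower() if isinstance(value, str) else value


def matches(input_value, operator: str, value) -> bool:
    """Case-insensitive membership test where either side may be a list."""
    inputs = input_value if isinstance(input_value, list) else [input_value]
    values = value if isinstance(value, list) else [value]
    normalized_values = [_norm(v) for v in values]
    hit = any(_norm(i) in normalized_values for i in inputs)
    if operator == "in":
        return hit
    if operator == "not in":
        return not hit
    raise ValueError(f"unsupported operator: {operator}")


if __name__ == "__main__":
    # The stored metadata was lowercased to "f2"; the filter list was not.
    print(matches("f2", "in", ["F2", "F11"]))
    print(matches("f3", "not in", ["F2", "F11"]))
```

Normalizing both the document value and each list element gives `in` and `not in` the same case behavior the commit reports for `=`.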
Idriss Sbaaoui	2a37562791	Fix manual naive parser position extraction fallback (#14420 ) ### What problem does this PR solve? This PR fixes a regression where Manual pipeline + Naive (Plain Text) PDF parsing crashed with `AttributeError: 'PlainParser' object has no attribute 'extract_positions'` in `rag/app/manual.py`. fixes #14411 ### Type of change: - [x] Bug Fix (non-breaking change which fixes an issue)	2026-04-28 14:21:30 +08:00
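The `AttributeError` quoted above arises when a caller assumes every parser has `extract_positions`. One general way to guard such a call is sketched below. It is not the actual change made in `rag/app/manual.py`: both parser classes are hypothetical stand-ins, and returning an empty list as the fallback is an assumption.

```python
class PlainParser:
    """Hypothetical plain-text parser with no layout information."""


class LayoutParser:
    """Hypothetical layout-aware parser that can report positions."""

    def extract_positions(self, text: str) -> list[tuple[int, int]]:
        # Illustrative only: pretend each text maps to one (page, offset) pair.
        return [(1, len(text))]


def positions_for(parser, text: str) -> list[tuple[int, int]]:
    """Use extract_positions when the parser has it, else return no positions."""
    extract = getattr(parser, "extract_positions", None)
    if callable(extract):
        return extract(text)
    return []


if __name__ == "__main__":
    print(positions_for(LayoutParser(), "abc"))
    print(positions_for(PlainParser(), "abc"))
```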
Jin Hai	ae420f6358	Go: fix compilation (#14418 ) ### What problem does this PR solve? Add methods to volcengine ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-04-28 13:21:05 +08:00
qinling0210	effc84a042	Refactor model in GO (#14398 ) ### What problem does this PR solve? Refactor model in GO ### Type of change - [x] Refactoring	2026-04-28 12:59:01 +08:00
Wang Qi	5885691c68	Always return success if no such task id (#14417 ) ### What problem does this PR solve? Always return success if no such task id to follow existing code logic. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-04-28 12:55:24 +08:00
buua436	444e564329	Fix: align chat recommendation and thumbup APIs (#14413 ) ### What problem does this PR solve? align chat recommendation and thumbup APIs ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-04-28 12:55:16 +08:00
buua436	7a70a0fd85	Fix: preserve infinity available_int zero filter (#14416 ) ### What problem does this PR solve? preserve infinity available_int zero filter ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-04-28 12:54:32 +08:00
Jin Hai	819257f257	Go: add volcengine (#14409 ) ### What problem does this PR solve? 1. Refactor server_main 2. Add volcengine ### Type of change - [x] Refactoring --------- Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-04-28 12:12:58 +08:00
Jack	2d522ccb36	Fix: thumbnails issue in chat (#14415 ) [Uploading part_4-13.pdf…]() ### What problem does this PR solve? In chat, the thumbnails didn't display correctly ### Type of change - [ ] Bug Fix (non-breaking change which fixes an issue) Steps to reproduce: 1. create dataset and upload a file (see attached) 2. parse the document 3. once parsing completed, create a chat and associate it with the dataset 4. ask a question (DAP VS DAPE comparison) 5. check result	2026-04-28 11:39:29 +08:00
writinwaters	0cf105da8d	Doc: Added a database schema and migration guide. (#14404 ) ### What problem does this PR solve? Added a database schema and migration guide. ### Type of change - [x] Documentation Update	2026-04-28 09:54:33 +08:00


5970 Commits