ragflow

mirror of https://github.com/infiniflow/ragflow.git synced 2026-07-04 18:45:38 +08:00

Author	SHA1	Message	Date
CaptainTimon	2717ee283f	feat(raptor): add Psi tree builder with original-space ranking and safe migration (#14679 ) ### What problem does this PR solve? Closes #14674. This PR improves RAPTOR configuration and tree construction while preserving the existing RAPTOR behavior as the default. RAPTOR currently builds summary layers with the original UMAP + GMM clustering path. This PR keeps that default path, and adds: - A hidden backend tree-builder option: - `tree_builder="raptor"`: default, existing RAPTOR behavior. - `tree_builder="psi"`: rank-aware Psi-style tree builder using original embedding-space cosine ranking. - A user-facing clustering method option for the default RAPTOR builder: - `clustering_method="gmm"`: existing default. - `clustering_method="ahc"`: agglomerative hierarchical clustering path. - A RAPTOR UI setting for `Clustering method` and `Max cluster`. ### What changed #### Backend - Added `tree_builder` support for RAPTOR/Psi. - Added `clustering_method` support for GMM/AHC. - Kept existing RAPTOR + GMM as the default. - Added Psi tree building from original-space cosine similarity. - Added bucketed Psi building controls for large inputs: - `raptor.ext.psi_exact_max_leaves` - `raptor.ext.psi_bucket_size` - Added method-aware RAPTOR summary metadata using existing `extra.raptor_method`. - Avoided adding a dedicated DB schema field for experimental method tracking. - Added cleanup/migration logic to avoid mixing stale RAPTOR summary trees. - Added defensive checks for Psi tree construction and summary failures. #### Frontend/UI - Added `Clustering method` in RAPTOR settings with `GMM` and `AHC`. - Added/kept `Max cluster` in RAPTOR settings. - Enlarged max cluster UI limit to `1024`, matching backend validation. - Kept AHC editable even when a RAPTOR task has already finished. - Fixed the UI save payload so `clustering_method` and `tree_builder` are serialized through `parser_config.raptor.ext`, avoiding backend validation errors for extra top-level RAPTOR fields. Example saved RAPTOR config: ```json { "raptor": { "max_cluster": 317, "ext": { "clustering_method": "ahc", "tree_builder": "raptor" } } } Co-authored-by: CaptainTimon <CaptainTimon@users.noreply.github.com>	2026-05-12 09:42:31 +08:00
yuch85	3ad3241ae0	feat: persist RAPTOR layer metadata on summary chunks (#13286 ) ## Summary RAPTOR's recursive clustering builds a `layers` list tracking `(start_idx, end_idx)` boundaries per level, but currently discards this information — only the flat `chunks` list is returned. This makes it impossible to distinguish leaf-level summaries from top-level ones. This PR: - Returns `(chunks, layers)` tuple from `raptor.py`'s `__call__` - Annotates each RAPTOR summary chunk with `raptor_layer_int` (1 = first summary level, 2 = summary-of-summaries, etc.) - Adds `raptor_layer_int` to `infinity_mapping.json` (Elasticsearch handles it via existing `_int` dynamic template) ### Why this matters Downstream features need to know which RAPTOR layer a summary belongs to: - Retrieving the top-level document summary* for entity extraction, search snippets, or document comparison - Filtering by abstraction level — users may want only high-level summaries or only leaf-level cluster summaries - RAPTOR recall quality — #10951 reports summaries not being recalled for definition queries; layer metadata enables targeted retrieval ### Changes \| File \| Change \| LOC \| \|------\|--------\|-----\| \| `rag/raptor.py` \| Return `(chunks, layers)` tuple \| ~3 \| \| `rag/svr/task_executor.py` \| Build `chunk_layer` mapping, set `raptor_layer_int` \| ~12 \| \| `conf/infinity_mapping.json` \| Add `raptor_layer_int` integer field \| ~1 \| ### Backward compatibility - Additive only — no existing fields or behavior changed - Existing RAPTOR chunks continue to work (they'll have `raptor_layer_int = 0` by default) - New RAPTOR chunks get layer metadata automatically ## Test plan - [ ] Parse a document with RAPTOR enabled, verify `raptor_layer_int` is set on indexed chunks - [ ] Verify `raptor_layer_int` values increase with abstraction level (layer 1 < layer 2 < ...) - [ ] Verify existing RAPTOR deletion (`delete by raptor_kwd`) still works - [ ] Verify Infinity backend accepts the new field Fixes #7488 Related: #4104, #11191, #10951 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-authored-by: yuch85 <yuch85.1@gmail.com> Co-authored-by: Wang Qi <wangq8@outlook.com>	2026-04-27 10:20:46 +08:00
Ricardo-M-L	09a09a5b20	fix: correct typo in IterationItem name check and incomplete error message (#13890 ) Two small fixes: 1. iterationitem.py line 72: Typo "interationitem" → "iterationitem" (missing 't'). The component name check never matched IterationItem components. 2. raptor.py line 94: Error message "Embedding error: " had a trailing colon with no details. Changed to "Embedding error: empty embeddings returned".	2026-04-02 10:35:28 +08:00
Stephen Hu	77483b1e58	refactor: remove useless variable in raptor (#13648 ) ### What problem does this PR solve? remove useless variable in raptor ### Type of change - [x] Refactoring	2026-03-17 15:56:51 +08:00
Kevin Hu	32c0161ff1	Refa: Clean the folders. (#12890 ) ### Type of change - [x] Refactoring	2026-01-29 14:23:26 +08:00
Stephen Hu	0782a7d3c6	Refactor: improve task cancellation checks in RAPTOR (#12813 ) ### What problem does this PR solve? Introduced a helper method _check_task_canceled to centralize and simplify task cancellation checks throughout RecursiveAbstractiveProcessing4TreeOrganizedRetrieval. This reduces code duplication and improves maintainability. ### Type of change - [x] Refactoring	2026-01-26 11:34:54 +08:00
Kevin Hu	927db0b373	Refa: asyncio.to_thread to ThreadPoolExecutor to break thread limitat… (#12716 ) ### Type of change - [x] Refactoring	2026-01-20 13:29:37 +08:00
Yongteng Lei	2b260901df	Fix: raptor don't have attribute chat (#11936 ) ### What problem does this PR solve? Raptor don't have attribute chat. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-12-12 20:08:18 +08:00
buua436	65a5a56d95	Refa:replace trio with asyncio (#11831 ) ### What problem does this PR solve? change: replace trio with asyncio ### Type of change - [x] Refactoring	2025-12-09 19:23:14 +08:00
Yongteng Lei	908450509f	Feat: add fault-tolerant mechanism to RAPTOR (#11206 ) ### What problem does this PR solve? Add fault-tolerant mechanism to RAPTOR. ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-11-13 18:48:07 +08:00
Kevin Hu	c30ffb5716	Fix: ollama model list issue. (#11175 ) ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-11-11 19:46:41 +08:00
Kevin Hu	f441f8ffc2	Fix: waitForResponse component. (#11172 ) ### What problem does this PR solve? #10056 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) - [x] New Feature (non-breaking change which adds functionality)	2025-11-11 16:58:47 +08:00
Yongteng Lei	0cd8024c34	Feat: RAPTOR handle cancel gracefully (#11074 ) ### What problem does this PR solve? RAPTOR handle cancel gracefully. ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-11-06 17:18:03 +08:00
Jin Hai	1e45137284	Move 'timeout' to common folder (#10983 ) ### What problem does this PR solve? As title. ### Type of change - [x] Refactoring Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2025-11-04 11:51:12 +08:00
Jin Hai	360f5c1179	Move token related functions to common (#10942 ) ### What problem does this PR solve? As title ### Type of change - [x] Refactoring Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2025-11-03 08:50:05 +08:00
Yongteng Lei	cd77425b87	Fix: potential negative max_tokens in RAPTOR (#10701 ) ### What problem does this PR solve? Fix potential negative max_tokens in RAPTOR. #10235. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue	2025-10-21 15:49:51 +08:00
Stephen Hu	312635cb13	Refactor: based on async await to handle Redis when raptor (#9576 ) ### What problem does this PR solve? based on async await to handle Redis when raptor ### Type of change - [x] Refactoring - [x] Performance Improvement	2025-08-22 10:58:02 +08:00
Kevin Hu	929dc97509	Fix: duplicated role... (#9622 ) ### What problem does this PR solve? #9611 #9603 #9597 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-08-21 12:14:43 +08:00
Kevin Hu	312f1a0477	Fix: enlarge raptor timeout limits. (#9600 ) ### What problem does this PR solve? ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-08-20 17:29:15 +08:00
Kevin Hu	935ce872d8	Refa: remove temperature since some LLMs fail to support. (#8981 ) ### What problem does this PR solve? ### Type of change - [x] Refactoring	2025-07-23 10:17:04 +08:00
Kevin Hu	24c41d2a61	Perf: make `do_cancel` quicker. (#8846 ) ### What problem does this PR solve? ### Type of change - [x] Performance Improvement	2025-07-15 14:35:00 +08:00
Kevin Hu	c642dbefca	Perf: Enhance timeout handling. (#8826 ) ### What problem does this PR solve? ### Type of change - [x] Performance Improvement	2025-07-15 09:36:45 +08:00
Kevin Hu	e441c17c2c	Refa: limit embedding concurrency and fix `chat_with_tool` (#8543 ) ### What problem does this PR solve? #8538 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) - [x] Refactoring	2025-06-27 19:28:41 +08:00
Zhichang Yu	1ed0b25910	Fix task_limiter in raptor.py (#8124 ) ### What problem does this PR solve? Fix task_limiter in raptor.py ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-06-09 10:18:03 +08:00
WhiteBear	2c62652ea8	<think> tag is missing. (#7256 ) ### What problem does this PR solve? Some models force thinking, resulting in the absence of the think tag in the returned content ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-04-24 11:44:10 +08:00
aniaan	8b8a2f2949	fix(nursery): Fix Closure Trap Issues in Trio Concurrent Tasks (#7106 ) ## Problem Description Multiple files in the RAGFlow project contain closure trap issues when using lambda functions with `trio.open_nursery()`. This problem causes concurrent tasks created in loops to reference the same variable, resulting in all tasks processing the same data (the data from the last iteration) rather than each task processing its corresponding data from the loop. ## Issue Details When using a `lambda` to create a closure function and passing it to `nursery.start_soon()` within a loop, the lambda function captures a reference to the loop variable rather than its value. For example: ```python # Problematic code async with trio.open_nursery() as nursery: for d in docs: nursery.start_soon(lambda: doc_keyword_extraction(chat_mdl, d, topn)) ``` In this pattern, when concurrent tasks begin execution, `d` has already become the value after the loop ends (typically the last element), causing all tasks to use the same data. ## Fix Solution Changed the way concurrent tasks are created with `nursery.start_soon()` by leveraging Trio's API design to directly pass the function and its arguments separately: ```python # Fixed code async with trio.open_nursery() as nursery: for d in docs: nursery.start_soon(doc_keyword_extraction, chat_mdl, d, topn) ``` This way, each task uses the parameter values at the time of the function call, rather than references captured through closures. ## Fixed Files Fixed closure traps in the following files: 1. `rag/svr/task_executor.py`: 3 fixes, involving document keyword extraction, question generation, and tag processing 2. `rag/raptor.py`: 1 fix, involving document summarization 3. `graphrag/utils.py`: 2 fixes, involving graph node and edge processing 4. `graphrag/entity_resolution.py`: 2 fixes, involving entity resolution and graph node merging 5. `graphrag/general/mind_map_extractor.py`: 2 fixes, involving document processing 6. `graphrag/general/extractor.py`: 3 fixes, involving content processing and graph node/edge merging 7. `graphrag/general/community_reports_extractor.py`: 1 fix, involving community report extraction ## Potential Impact This fix resolves a serious concurrency issue that could have caused: - Data processing errors (processing duplicate data) - Performance degradation (all tasks working on the same data) - Inconsistent results (some data not being processed) After the fix, all concurrent tasks should correctly process their respective data, improving system correctness and reliability.	2025-04-18 18:00:20 +08:00
dylan	e54c0e39b5	fix bug [ERROR][Exception]: 8 vs. 9 (#6955 ) ### What problem does this PR solve? Sometimes, the s in chunks (s, a) is an empty string. This causes the condition if s and len(a) > 0 in the line chunks = [(s, a) for s, a in chunks if s and len(a) > 0] to fail, which changes the length of the new chunks. As a result, the final assertion assert len(chunks) - end == n_clusters, "{} vs. {}".format(len(chunks) - end, n_clusters) fails and raises a confusing error like 7 vs. 8 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) - [ ] New Feature (non-breaking change which adds functionality) - [ ] Documentation Update - [ ] Refactoring - [ ] Performance Improvement - [ ] Other (please describe):	2025-04-11 17:01:49 +08:00
Zhichang Yu	6ec6ca6971	Refactor graphrag to remove redis lock (#5828 ) ### What problem does this PR solve? Refactor graphrag to remove redis lock ### Type of change - [x] Refactoring	2025-03-10 15:15:06 +08:00
Zhichang Yu	c813c1ff4c	Made task_executor async to speedup parsing (#5530 ) ### What problem does this PR solve? Made task_executor async to speedup parsing ### Type of change - [x] Performance Improvement	2025-03-03 18:59:49 +08:00
Kevin Hu	96e9d50060	Let parallism of RAPTOR controlable. (#5379 ) ### What problem does this PR solve? #4874 ### Type of change - [x] Refactoring	2025-02-26 15:58:06 +08:00
Kevin Hu	9aa222f738	Let list_chat go without kb checking. (#5280 ) ### What problem does this PR solve? #5278 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-02-24 13:21:05 +08:00
Kevin Hu	e6c024f8bf	Fix too many clause while searching. (#5119 ) ### What problem does this PR solve? #5100 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-02-19 13:18:39 +08:00
Kevin Hu	29ceeba95f	Fix hit cache error while raptoring. (#4955 ) ### What problem does this PR solve? #4126 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-02-14 12:00:19 +08:00
Kevin Hu	c354239b79	Make infinity adapt to condition `exist`. (#4657 ) ### What problem does this PR solve? ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-01-26 18:45:36 +08:00
Kevin Hu	0e5124ec99	Show the errors out. (#4305 ) ### What problem does this PR solve? ### Type of change - [x] Refactoring	2024-12-31 15:32:02 +08:00
Kevin Hu	cb6e9ce164	Cache the result from llm for graphrag and raptor (#4051 ) ### What problem does this PR solve? #4045 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2024-12-17 09:48:03 +08:00
Zhichang Yu	0d68a6cd1b	Fix errors detected by Ruff (#3918 ) ### What problem does this PR solve? Fix errors detected by Ruff ### Type of change - [x] Refactoring	2024-12-08 14:21:12 +08:00
Kevin Hu	27cd765d6f	Fix raptor issue (#3737 ) ### What problem does this PR solve? #3732 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2024-11-29 11:55:41 +08:00
Zhichang Yu	4413683898	Introduced beartype (#3460 ) ### What problem does this PR solve? Introduced [beartype](https://github.com/beartype/beartype) for runtime type-checking. ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2024-11-18 17:38:17 +08:00
Kevin Hu	a1d01a1b2f	enlarge the default token length of RAPTOR summarization (#3454 ) ### What problem does this PR solve? #3426 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2024-11-18 10:15:26 +08:00
Zhichang Yu	30f6421760	Use consistent log file names, introduced initLogger (#3403 ) ### What problem does this PR solve? Use consistent log file names, introduced initLogger ### Type of change - [ ] Bug Fix (non-breaking change which fixes an issue) - [ ] New Feature (non-breaking change which adds functionality) - [ ] Documentation Update - [x] Refactoring - [ ] Performance Improvement - [ ] Other (please describe):	2024-11-14 17:13:48 +08:00
Zhichang Yu	a2a5631da4	Rework logging (#3358 ) Unified all log files into one. ### What problem does this PR solve? Unified all log files into one. ### Type of change - [x] Refactoring	2024-11-12 17:35:13 +08:00
Kevin Hu	b9fa00f341	add API for tenant function (#2866 ) ### What problem does this PR solve? feat: API access key management https://github.com/infiniflow/ragflow/issues/2846 feat: Render markdown file with remark-loader https://github.com/infiniflow/ragflow/issues/2846 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2024-10-16 16:10:24 +08:00
KevinHuSh	46454362d7	fix raptor bugs (#928 ) ### What problem does this PR solve? #922 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2024-05-27 11:01:20 +08:00
KevinHuSh	6f99bbbb08	add raptor (#899 ) ### What problem does this PR solve? #882 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2024-05-23 14:31:16 +08:00

45 Commits