ragflow

mirror of https://github.com/infiniflow/ragflow.git synced 2026-07-03 01:01:56 +08:00

Author	SHA1	Message	Date
Jack	b363146997	refactor: overhaul task executor with layered architecture and comprehensive test suite (#15471 ) ## Summary Decomposes the monolithic `task_executor.py` (1945 lines) into a 6-layer architecture with clear separation of concerns. The refactored code is functionally equivalent to the original, verified through 400 passing tests and a production-vs-dry-run comparison framework. ## Architecture ``` entry (task_manager) └─ orchestration (task_handler) ├─ services (chunk_service, embedding_service, dataflow_service, raptor_service, post_processor) │ └─ utilities (chunk_builder, chunk_post_processor, embedding_utils) └─ infrastructure (task_context, recording_context, interceptor) ``` Key design decisions: - TaskContext — typed facade over raw task dict, injects rate limiters + callbacks via composition - RecordingContext + Comparator — enables side-by-side production vs dry-run execution for safe migration - NullRecordingContext — zero-allocation no-op for production, uses `__slots__` - WriteOperationInterceptor — FIFO replay of previous runs function returns for comparison mode ## Migration Strategy The original `handle_task()` in `task_executor.py` uses a 3-way switch via `TE_RUN_MODE`: - `TE_RUN_MODE=0` (default) → runs refactored code - `TE_RUN_MODE=1` → runs both original + refactored, compares all intermediate results - `TE_RUN_MODE=2` → runs original code (fallback) The comparison mode (`TE_RUN_MODE=1`) records ~40 intermediate values (chunks, vectors, token counts, func return values) from the production run and replays them during dry-run, then uses `ContextComparator` to report mismatches. ## Functional Equivalence Fixes All divergences between original and refactored code were identified and fixed: - Timeout decorators (handle/build_chunks/raptor/embedding) - NullRecordingContext leak in finally block causing RuntimeError - MinIO None-binary check with proper FileNotFoundError - Dataflow dispatch after embedding binding + init_kb - Memory task missing return after processing - RAPTOR checkpoint progress reporting - Tag cache (get_tags_from_cache/set_tags_to_cache) restoration - dataflow_id correction in _load_dsl - Language default Chinese, dead code guard removal - embed_chunks made async with proper thread_pool_exec - Full GraphRAG default configuration (10 parameters) - Hardcoded q_768_vec fallback removal in RAPTOR ## Test Changes - 20 new tests covering table parser manual mode, tag cache, embedding edge cases, RAPTOR checkpoint, dataflow_id correction, storage binary None, cancel cleanup, metadata=None boundary - Unified `make_task_context`/`make_task_dict` factories eliminated 10+ duplicated helpers - DataflowService tests migrated from internal method mocks to IO boundary mocks (real orchestration code executes) - Parametrized duplicate build_chunks post-processor tests - 7 raptor tests modernized to @pytest.mark.asyncio - Mock count per test reduced through boundary-level mocking strategy Test count: 400 passing, 0 warnings, 0 skips ## Files Changed \| File \| Change \| \|------\|--------\| \| `rag/svr/task_executor.py` \| +1 line (NullRecordingContext fix) \| \| `rag/svr/task_executor_refactor/task_handler.py` \| Orchestration layer, 8 logic fixes \| \| `rag/svr/task_executor_refactor/chunk_service.py` \| +timeout + None-check \| \| `rag/svr/task_executor_refactor/embedding_service.py` \| sync→async rewrite \| \| `rag/svr/task_executor_refactor/dataflow_service.py` \| dataflow_id fix + timeout \| \| `rag/svr/task_executor_refactor/raptor_service.py` \| checkpoint fix + assert \| \| `rag/svr/task_executor_refactor/chunk_post_processor.py` \| tag cache restore \| \| `rag/svr/task_executor_refactor/task_context.py` \| language default fix \| \| `test/.../conftest.py` \| +294 lines shared helpers \| \| `test/.../*.py` \| 15 test files refactored, 20 new tests \| --------- Co-authored-by: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-03 17:18:31 +08:00
wdeveloper16	14c0985182	feat: bump Python minimum from 3.12 to 3.13, drop strenum backport (#14767 ) Closes #14753 ## What changed \| File \| Change \| \|---\|---\| \| `pyproject.toml` \| `requires-python` → `>=3.13,<3.15`; remove `strenum==0.4.15` \| \| `Dockerfile` \| `uv python install 3.13`, `uv sync --python 3.13` \| \| `.github/workflows/tests.yml` \| `uv sync --python 3.13` on both matrix legs \| \| `CLAUDE.md` \| dev setup command + requirements note updated \| \| `deepdoc/parser/mineru_parser.py` \| `from strenum import StrEnum` → `from enum import StrEnum` \| \| `agent/tools/code_exec.py` \| same \| `StrEnum` has been in the stdlib since Python 3.11 — the `strenum` backport package is no longer needed once the floor is 3.13. ## Why uv.lock is not regenerated `uv lock --python 3.13` fails because: 1. The infiniflow/graspologic fork pins `numpy>=1.26.4,<2.0.0` 2. `tensorflow-cpu>=2.20.0` (the first release with cp313 wheels) depends on `ml-dtypes>=0.5.1`, which requires `numpy>=2.1.0` 3. These two constraints are irreconcilable on Python 3.13 The lockfile regeneration requires loosening the `numpy` upper bound in the `infiniflow/graspologic` fork. Once that fork commit is updated and the SHA in `pyproject.toml:49` is bumped, `uv lock --python 3.13` will succeed. ## RFC corrections Two claims in the original RFC (#14753) did not hold up under code review: - "graspologic hard-blocks 3.13" — the infiniflow fork at the pinned commit has no `<3.13` Python constraint. The blocker is the transitive `numpy<2.0.0` conflict with tensorflow-cpu's test dependency, not a direct Python version cap. - "free-threading throughput gains for I/O-bound workload" — Python 3.13 free-threading requires a special `--disable-gil` build and provides no benefit for async I/O code (the GIL is already released during I/O). The real motivation is forward compatibility and improved error messages.	2026-05-15 14:40:53 +08:00
Wang Qi	7fb6a12067	Update API document (#14364 ) ### What problem does this PR solve? Update API document ### Type of change - [ ] Documentation Update	2026-04-24 20:36:47 +08:00
balibabu	a2bea30749	Fix: Editing an empty response in the retrieval operator will cause the focus to shift to the metadata input box. (#14253 ) ### What problem does this PR solve? Fix: Editing an empty response in the retrieval operator will cause the focus to shift to the metadata input box. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-04-21 16:19:55 +08:00
Kevin Hu	32c0161ff1	Refa: Clean the folders. (#12890 ) ### Type of change - [x] Refactoring	2026-01-29 14:23:26 +08:00
Zhichang Yu	f128a1fa9e	Bump python to >=3.12 (#11846 ) ### What problem does this PR solve? Bump python to >=3.12 ### Type of change - [x] Refactoring	2025-12-09 19:55:25 +08:00
Zhichang Yu	bb6022477e	Bump infinity to v0.6.11. Requires python>=3.11 (#11814 ) ### What problem does this PR solve? Bump infinity to v0.6.11. Requires python>=3.11 ### Type of change - [x] Refactoring	2025-12-09 16:23:37 +08:00
Marvion	395ce16b3c	Fix: correct MCP server authentication header format in frontend (#9819 ) - Fix MCP test connection authentication issues by updating frontend request format - Add variables field with authorization_token for template substitution - Change headers to use proper Authorization Bearer format with template variable 🤖 Generated with [Claude Code](https://claude.ai/code) ### What problem does this PR solve? correct MCP server authentication header format in frontend ### Type of change * [x] Bug Fix (non-breaking change which fixes an issue) --------- Co-authored-by: Marvion <marvionliu@wukongjx.cn> Co-authored-by: Claude <noreply@anthropic.com>	2025-11-03 20:00:27 +08:00

8 Commits