ragflow

mirror of https://github.com/infiniflow/ragflow.git synced 2026-06-29 23:41:12 +08:00

Author	SHA1	Message	Date
akie	35b2a714f9	Fix: tag datasets not visible in tag sets dropdown (#13921 ) ## Problem Description When a user creates Dataset A using the Tag parser (for CSV/Excel files with tag definitions), and then creates Dataset B, the Tag Sets dropdown in Dataset B's Configuration page cannot display Dataset A. ### Steps to Reproduce 1. Create Dataset A with Tag as the chunking method 2. Upload a CSV file to Dataset A to generate tags 3. Create Dataset B 4. Navigate to Dataset B → Configuration → Tag Sets 5. Expected: Dataset A should appear in the dropdown 6. Actual: The dropdown is empty, Dataset A is not visible --- ## Root Cause Analysis After thorough code review, the original code logic is correct. The `chunk_method` field flows properly through the system: ### Data Flow ```mermaid sequenceDiagram participant Frontend participant Pydantic participant API participant Database Note over Frontend,Database: Creating a Tag Dataset Frontend->>Pydantic: POST {chunk_method: "tag"} Pydantic->>API: serialization_alias converts<br/>chunk_method → parser_id API->>Database: INSERT {parser_id: "tag"} Note over Frontend,Database: Querying Datasets Frontend->>API: GET /api/v1/datasets API->>Database: SELECT parser_id, ... Database-->>API: Returns {parser_id: "tag"} API->>API: remap_dictionary_keys()<br/>parser_id → chunk_method API-->>Frontend: {chunk_method: "tag"} Note over Frontend: Filter: x.chunk_method === 'tag' Note over Frontend: ✅ Match found! ``` ### Field Mapping Location: `api/utils/api_utils.py:657-662` ```python DEFAULT_KEY_MAP = { "chunk_num": "chunk_count", "doc_num": "document_count", "parser_id": "chunk_method", # Maps DB field to API response "embd_id": "embedding_model", } ``` ### Frontend Filtering (Already Correct) Location: `web/src/pages/dataset/dataset-setting/components/tag-item.tsx:24` ```typescript const knowledgeOptions = knowledgeList .filter((x) => x.chunk_method === 'tag') // ✅ Correct field .map((x) => ({...})); ``` --- ## Actual Issue The most likely causes for the "bug" are: 1. Browser Cache: Old data cached before proper deployment 2. Stale Data: Datasets created before the code was fully deployed 3. Container Not Restarted: Changes not applied to running container --- ## Resolution No code changes are needed. The existing code correctly: 1. Accepts `chunk_method` from frontend 2. Converts to `parser_id` via Pydantic serialization_alias 3. Stores in database as `parser_id` 4. Maps back to `chunk_method` in API response 5. Frontend filters by `chunk_method === 'tag'`	2026-04-03 17:29:10 +08:00
LeonTung	0b724be521	chore(templates): Update the customer feedback dispatcher template (#13919 ) ### What problem does this PR solve? Update the customer feedback dispatcher template and introduce a new operator `Variable Aggregator`. ### Type of change - [x] Other (please describe): Template change --------- Co-authored-by: Jin Hai <haijin.chn@gmail.com>	2026-04-03 16:51:39 +08:00
balibabu	5b43c7cf16	Feat: Place the language configuration in web/.env for easy user configuration. (#13920 ) ### What problem does this PR solve? Feat: Place the language configuration in web/.env for easy user configuration. ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2026-04-03 16:50:18 +08:00
Ricardo-M-L	354108922b	fix: use f-string with separator in switch operator error message (#13915 ) \`switch.py\` line 137 concatenates the operator directly after the text without separator: \`'Not supported operator' + operator\` → produces \`"Not supported operatorXXX"\` Changed to: \`f'Not supported operator: {operator}'\`	2026-04-03 16:49:28 +08:00
chanx	21af67f6f9	feat(File Management): Refactor File List API and Add Knowledge Base Document Initialization (#13914 ) ### What problem does this PR solve? feat(File Management): Refactor File List API and Add Knowledge Base Document Initialization - Migrate the file list API endpoint from `/v1/file/list` to `/api/v1/files` to align with the Python implementation. - Add logic for initializing knowledge base documents; automatically create the `.knowledgebase` folder and associated documents when retrieving the root directory. - Enhance parameter validation and error handling, including the introduction of a new `CodeParamError` error code. - Optimize the file list response structure to match the implementation on the Python side. - Update the Vite configuration to support proxying the new `/api/v1/files` endpoint. ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2026-04-03 15:08:43 +08:00
writinwaters	6263857c1e	Agent templates regrouped and renamed (#13873 ) ### What problem does this PR solve? Regrouped and renamed agent templates to increase user engagement. ### Type of change - [x] Refactoring	2026-04-03 13:43:25 +08:00
Zhichang Yu	ab358fe949	feat: make Azure cloud authority configurable for SPN auth (#13898 ) ## Summary - The Azure SPN storage handler hardcoded `AzureAuthorityHosts.AZURE_CHINA`, preventing users in Azure Public Cloud regions (UK-South, EU, US, etc.) from authenticating - Add a `cloud` config option (env: `AZURE_CLOUD`) supporting all four Azure sovereignties: `public`, `china`, `government`, `germany` - Defaults to `public` (global Azure) — the most common international use case Closes #13259 ## Test plan - [ ] Verify default (`cloud: public`) connects to Azure Public Cloud endpoints - [ ] Verify `cloud: china` retains existing behavior for Azure China users - [ ] Verify `AZURE_CLOUD` env var overrides the config file value 🤖 Generated with [Claude Code](https://claude.com/claude-code) --------- Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-03 12:51:26 +08:00
Zhichang Yu	384fa6fc6e	Replace MinIO official image with pgsty/minio fork (#13896 ) ## Summary - Replace `quay.io/minio/minio` with `pgsty/minio` community fork in `docker/docker-compose-base.yml` MinIO stopped distributing pre-built Docker images and changed its license. The pgsty/minio fork provides drop-in compatible images under AGPLv3. Closes #13840 ## Test plan - [x] Verify `docker compose -f docker/docker-compose-base.yml up -d` pulls the pgsty/minio image successfully - [ ] Verify MinIO console accessible on port 9001 - [ ] Verify RAGFlow backend can connect to MinIO and perform file operations normally 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-02 22:03:02 +08:00
Yongteng Lei	b7daf6285b	Refa: Chat conversations /convsersation API to RESTFul (#13893 ) ### What problem does this PR solve? Chat conversations /convsersation API to RESTFul. ### Type of change - [x] Refactoring	2026-04-02 20:49:23 +08:00
chanx	bbb9b1df85	feat: Implement file upload and folder creation features by GO (#13903 ) ### What problem does this PR solve? feat: Implement file upload and folder creation features - Add file upload route in router.go - Add file operation methods in dao/file.go - Add util/file.go for file type detection and filename handling - Implement file upload and folder creation endpoints in handler/file.go - Implement file upload and folder creation logic in service/file.go - Modify response message format in memory.go - Add document count method in dao/document.go ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2026-04-02 20:21:04 +08:00
Jin Hai	6c29128de1	Refactor model provider and command (#13887 ) ### What problem does this PR solve? Introduce 5 new tables, including model groups and provider instance. ### Type of change - [x] New Feature (non-breaking change which adds functionality) - [x] Refactoring --------- Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-04-02 20:20:35 +08:00
qinling0210	f02f5fa435	Get ROW_ID from search() in Infinity (#13901 ) ### What problem does this PR solve? 1. Search() in Infinity can return row_id now 2. To Get ROW_ID from search(), refer to handling of retrieval_test. example ``` $ curl -s -X POST "http://localhost:$PORT/v1/chunk/retrieval_test" -H "Authorization: $TOKEN" -H "Content-Type: application/json" -d '{"kb_id": "4fcd01582ca911f1954184ba59049aa3", "question": "曹操"}' ``` ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-04-02 18:56:43 +08:00
Idriss Sbaaoui	ee1bb8a8b5	Fix: overlapping document parse race that can clear chunks (#13900 ) ### What problem does this PR solve? This PR fixes a race in batch document parsing where overlapping parse requests for the same document could clear/rewrite chunk state and make previously parsed content appear lost. It adds an atomic per-document parse guard so only one parse can run at a time for that document (Fixes #13864 ). ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-04-02 18:50:56 +08:00
writinwaters	3b96cedece	Docs: Updated chat-specific APIs (#13888 ) ### What problem does this PR solve? Chat-specific API descriptions updated. ### Type of change - [x] Documentation Update	2026-04-02 14:15:09 +08:00
NeedmeFordev	6b7989b4b4	Add file type validation (#13802 ) ### What problem does this PR solve? This PR fixes WebDAV sync behavior for unsupported file types ([#13795](https://github.com/infiniflow/ragflow/issues/13795)). Previously, the WebDAV connector selected files primarily by modified time (and size threshold) and could still pass unsupported extensions into the download/document-generation path. This caused unnecessary processing and inconsistent behavior compared with connectors that validate file type earlier. This change adds extension validation in two places: 1. Early filter during recursive listing to skip unsupported files before they enter the download flow. 2. Defensive filter before download/document creation to prevent unsupported files from being processed if any listing edge case slips through. It also wires `allow_images` into the WebDAV sync path so image extension handling follows connector policy. Scope is intentionally limited to WebDAV for a focused bug-fix PR. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) ### How was this tested? - Manual verification with mixed file types under the configured WebDAV path: - supported: `.pdf`, `.txt`, `.md` - unsupported: `.exe`, `.bin`, `.dat` - Triggered full sync and polling sync. - Confirmed unsupported files are skipped before download. - Confirmed supported files are still indexed normally. - Confirmed image handling follows `allow_images` setting. Fixes: #13795	2026-04-02 14:12:27 +08:00
Idriss Sbaaoui	dd529137eb	Fix: markdown table double extraction in parser (#13892 ) ### What problem does this PR solve? Fixes markdown tables being parsed twice (once as markdown and again as generated HTML), which caused duplicate table chunks in the chunk list UI. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-04-02 13:31:56 +08:00
Sank	1c2c4b337e	[RU] Add schema synchronization and translate (#13891 ) ### What problem does this PR solve? Add schema synchronization and translate ### Type of change - [x] Translation into Russian	2026-04-02 11:18:27 +08:00
Ricardo-M-L	09a09a5b20	fix: correct typo in IterationItem name check and incomplete error message (#13890 ) Two small fixes: 1. iterationitem.py line 72: Typo "interationitem" → "iterationitem" (missing 't'). The component name check never matched IterationItem components. 2. raptor.py line 94: Error message "Embedding error: " had a trailing colon with no details. Changed to "Embedding error: empty embeddings returned".	2026-04-02 10:35:28 +08:00
balibabu	af40be68c3	Fix: The dataset on the list page cannot be renamed. (#13886 ) ### What problem does this PR solve? Fix: The dataset on the list page cannot be renamed. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-04-01 20:53:05 +08:00
Yongteng Lei	b622c47ed6	Refa: Chats /chat API to RESTFul (#13881 ) ### What problem does this PR solve? Refactor Chats /chat API to RESTFul. ### Type of change - [x] Refactoring	2026-04-01 20:10:37 +08:00
qinling0210	bb4a06f759	Implement InsertDataset and InsertMetadata in GO (#13883 ) ### What problem does this PR solve? Implement InsertDataset and InsertMetadata in GO new internal cli for go: INSERT DATASET FROM FILE "file_name" INSERT METADATA FROM FILE "file_name" ### Type of change - [x] Refactoring	2026-04-01 16:16:25 +08:00
Liu An	b1d28b5898	Revert "Refa: Chats /chat API to RESTFul (#13871 )" (#13877 ) ### What problem does this PR solve? This reverts commit `1a608ac411`. ### Type of change - [x] Other (please describe):	2026-04-01 11:05:29 +08:00
Yongteng Lei	1a608ac411	Refa: Chats /chat API to RESTFul (#13871 ) ### What problem does this PR solve? Chats /chat API to RESTFul. ### Type of change - [x] Refactoring	2026-04-01 10:50:22 +08:00
balibabu	00b62dd587	Feat: If a model configured in the agent is deleted from the user center, a notification will be displayed on the canvas with a red border. (#13872 ) ### What problem does this PR solve? Feat: If a model configured in the agent is deleted from the user center, a notification will be displayed on the canvas with a red border. ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2026-03-31 18:43:24 +08:00
Jin Hai	efd6ecc3e5	New provider and models API and CLI (#13865 ) ### What problem does this PR solve? As title. ### Type of change - [x] New Feature (non-breaking change which adds functionality) - [x] Refactoring --------- Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-03-31 18:42:12 +08:00
Sank	68b4287892	Add translate [RU] for MinerU (#13832 ) add translate for MinerU in knowledgeConfiguration ### Type of change - [X] Other (please describe):	2026-03-31 17:03:31 +08:00
balibabu	36513313f8	Fix: The agent form sheet will be obscured by the message log sheet. (#13870 ) ### What problem does this PR solve? Fix: The agent form sheet will be obscured by the message log sheet. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-03-31 16:18:43 +08:00
Paul Y Hui	3e702c6265	fix: guard against missing/malformed Authorization header in apikey_required (#13860 ) ### What problem does this PR solve? Previously, `apikey_required` called `request.headers.get('Authorization').split()[1]` without checking for None or insufficient parts, causing an unhandled AttributeError or IndexError (500) instead of a proper 403 JSON response. This applies the same guarding pattern already used by `token_required` in the same file. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) - [x] Refactoring	2026-03-31 15:25:00 +08:00
balibabu	4f27090289	Fix: Unable to reconnect after deleting the connection between begin and parser #13868 (#13869 ) ### What problem does this PR solve? Fix: Unable to reconnect after deleting the connection between begin and parser #13868 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-03-31 14:44:47 +08:00
writinwaters	db5ab7bbe8	Docs: Image2text is supported by GPUStack. (#13856 ) ### What problem does this PR solve? Image2text is supported by GPUStack. #9515 ### Type of change - [x] Documentation Update	2026-03-30 20:39:02 +08:00
balibabu	3a4f0d1a83	Fix: The chat settings are not displayed correctly on the first page load. (#13855 ) ### What problem does this PR solve? Fix: The chat settings are not displayed correctly on the first page load. ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2026-03-30 20:16:52 +08:00
qinling0210	620fe215a4	Fix python metadata search (#13727 ) ### What problem does this PR solve? Fix python metadata search ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-03-30 19:37:19 +08:00
qinling0210	0462c20113	Fix special characters in matching text of search() (#13852 ) ### What problem does this PR solve? Fix special characters in matching text of search(). We should escape some special characters(such as ?, *,:) before passing to matching_text of search() Fix https://github.com/infiniflow/ragflow/issues/13729 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-03-30 18:47:10 +08:00
Zhichang Yu	0d85a8e7aa	feat: add dynamic log level adjustment APIs (#13850 ) Add REST APIs to dynamically query and modify log levels at runtime for both Python (Flask) and Go servers. Changes: - common/log_utils.py: add set_log_level() and get_log_levels() functions - admin/server/routes.py: add GET/PUT /api/v1/admin/log_levels endpoints - api/apps/system_app.py: add GET/PUT /api/{version}/system/log_levels endpoints - internal/logger/logger.go: add GetLevel() and SetLevel() with atomic level support - internal/handler/system.go: add GetLogLevel, SetLogLevel, Health handlers - internal/router/router.go: route /health to systemHandler - internal/admin/handler.go: add GetLogLevel, SetLogLevel handlers - internal/admin/router.go: add /api/v1/admin/log_level routes ### What problem does this PR solve? _Briefly describe what this PR aims to solve. Include background context that will help reviewers understand the purpose of the PR._ ### Type of change - [x] New Feature (non-breaking change which adds functionality) Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-30 18:40:58 +08:00
黄圣祺	534729546e	fix(html-parser): correct h4 heading mapping from ##### to #### (#13833 ) ## Summary - Fix incorrect Markdown heading mapping for `h4` in `TITLE_TAGS` dictionary - `h4` was mapped to `"#####"` (h5 level) instead of `"####"` (correct h4 level) Closes #13819 ## Details In `deepdoc/parser/html_parser.py`, the `TITLE_TAGS` dictionary had a typo where `h4` was assigned 5 `#` characters instead of 4, causing h4 headings to be converted to h5-level Markdown headings during HTML parsing. ## Test plan - [ ] Parse an HTML document containing `<h4>` tags and verify the output uses `####` (4 hashes) - [ ] Verify other heading levels remain correct 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-authored-by: Asksksn <Asksksn@noreply.gitcode.com> Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-30 13:17:32 +08:00
Jin Hai	2faaa9f9ce	Update docker container start printout (#13847 ) ### What problem does this PR solve? Printout RAGFlow version ### Type of change - [x] Refactoring Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-03-30 12:59:42 +08:00
Jin Hai	e20cf39735	Refactor Go server model provider reading and access (#13831 ) ### What problem does this PR solve? 1. Refactor model provider json file format 2. Use memory data structure to replace database 3. Add CLI command to access ``` RAGFlow(user)> list pool models from 'xai'; +-------------------------------------------------------------------------------------+------------+-------------+-----------------------+ \| features \| max_tokens \| model_types \| name \| +-------------------------------------------------------------------------------------+------------+-------------+-----------------------+ \| map[] \| 256000 \| [llm] \| grok-4 \| \| map[] \| 131072 \| [llm] \| grok-3 \| \| map[] \| 131072 \| [llm] \| grok-3-fast \| \| map[] \| 131072 \| [llm] \| grok-3-mini \| \| map[] \| 131072 \| [llm] \| grok-3-mini-mini-fast \| \| map[multimodal:map[enabled:true input_modalities:[image] output_modalities:[text]]] \| 32768 \| [vlm] \| grok-2-vision \| +-------------------------------------------------------------------------------------+------------+-------------+-----------------------+ RAGFlow(user)> show pool model 'grok-2-vision' from 'xai'; +-------------------------------------------------------------------------------------+------------+-------------+---------------+ \| features \| max_tokens \| model_types \| name \| +-------------------------------------------------------------------------------------+------------+-------------+---------------+ \| map[multimodal:map[enabled:true input_modalities:[image] output_modalities:[text]]] \| 32768 \| [vlm] \| grok-2-vision \| +-------------------------------------------------------------------------------------+------------+-------------+---------------+ RAGFlow(user)> list pool providers; +--------+------------------------------------------------------------+---------------------------+ \| name \| tags \| url \| +--------+------------------------------------------------------------+---------------------------+ \| OpenAI \| LLM,TEXT EMBEDDING,TTS,TEXT RE-RANK,SPEECH2TEXT,MODERATION \| https://api.openai.com/v1 \| \| xAI \| LLM \| https://api.x.ai/v1 \| +--------+------------------------------------------------------------+---------------------------+ RAGFlow(user)> show pool provider 'openai'; +---------------------------+--------+------------------------------------------------------------+--------------+ \| base_url \| name \| tags \| total_models \| +---------------------------+--------+------------------------------------------------------------+--------------+ \| https://api.openai.com/v1 \| OpenAI \| LLM,TEXT EMBEDDING,TTS,TEXT RE-RANK,SPEECH2TEXT,MODERATION \| 27 \| +---------------------------+--------+------------------------------------------------------------+--------------+ ``` ### Type of change - [x] New Feature (non-breaking change which adds functionality) - [x] Refactoring --------- Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-03-30 12:00:49 +08:00
qinling0210	a8bbe167a9	Bump to infinity v0.7.0-dev5 (#13846 ) ### What problem does this PR solve? Bump to infinity v0.7.0-dev5 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2026-03-30 10:19:06 +08:00
Heyang Wang	641b319647	feat: support reading tags via API (#12891 ) (#13732 ) ### What problem does this PR solve? Enable reading Tag Set tags via API (expose tag_kwd field). The result of the queried list chunks is as shown below: <img width="1422" height="818" alt="image" src="https://github.com/user-attachments/assets/abd1960a-fe34-489e-9d72-525f8e574938" /> ### Type of change - [x] New Feature (non-breaking change which adds functionality) Co-authored-by: heyang.why <heyang.why@alibaba-inc.com>	2026-03-29 20:17:01 +08:00
KeJun	cb78ce0a7b	feat: support rss datasource (#13721 ) ### What problem does this PR solve? Supporting public RSS/Atom feed URLs as data sources for RagFlow. link https://github.com/infiniflow/ragflow/issues/12313 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2026-03-27 22:58:44 +08:00
Jin Hai	f32a832f92	Add rename model directory to entity to avoid name misunderstanding (#13829 ) ### What problem does this PR solve? Model-> entity ### Type of change - [x] Refactoring Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-03-27 19:25:18 +08:00
balibabu	308b3a1299	Feat: Remove antd-related code and upgrade lucide-react to the latest version. (#13830 ) ### What problem does this PR solve? Feat: Remove antd-related code and upgrade lucide-react to the latest version. ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2026-03-27 19:24:52 +08:00
Jin Hai	1fff48b656	Add minio go test (#13800 ) ### What problem does this PR solve? 1. Add go test 2. Update CI process ### Type of change - [x] New Feature (non-breaking change which adds functionality) --------- Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-03-27 18:12:56 +08:00
Liu An	2240fc778c	Fix: add missing "mom" field to infinity_mapping.json for parent-child chunker (#13821 ) ### What problem does this PR solve? When using Infinity as DOC_ENGINE with parent-child chunker enabled, vector insertion fails because the "mom" field is missing from the index mapping. This fix adds the required field to resolve the issue. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-03-27 13:06:18 +08:00
Krishna Chaitanya	cdbbd2620c	Fix: upgrade pyasn1 from 0.6.2 to 0.6.3 to address CVE-2026-30922 (#13773 ) ## Summary - Adds `pyasn1>=0.6.3` as a `[tool.uv.constraint-dependencies]` entry to mitigate CVE-2026-30922 (CVSS 7.5 HIGH) - Regenerates `uv.lock` so the resolved pyasn1 version moves from 0.6.2 to 0.6.3 ## Details CVE-2026-30922 is a Denial of Service vulnerability in pyasn1 caused by unbounded recursion when decoding ASN.1 data with deeply nested structures. An attacker can send crafted payloads with thousands of nested SEQUENCE or SET tags to trigger a `RecursionError` crash or memory exhaustion. - Severity: HIGH (CVSS 7.5) - Affected versions: pyasn1 < 0.6.3 - Fixed in: pyasn1 >= 0.6.3 - NVD: https://nvd.nist.gov/vuln/detail/CVE-2026-25769 `pyasn1` is not a direct dependency of RAGFlow but is pulled in transitively via `google-auth` -> `rsa` -> `pyasn1-modules` -> `pyasn1`. The `constraint-dependencies` mechanism in uv is the correct way to enforce a minimum version for transitive dependencies without polluting the direct dependency list. ## Test plan - [x] `pyproject.toml` passes TOML validation - [x] `uv lock` resolves successfully with the new constraint - [x] pyasn1 version in `uv.lock` is now 0.6.3 - [ ] Existing CI/CD tests continue to pass Closes #13686	2026-03-27 10:37:34 +08:00
chanx	8a9bbf3d6d	Feat: add memory function by go (#13754 ) ### What problem does this PR solve? Feat： Add Memory function by go ### Type of change - [x] New Feature (non-breaking change which adds functionality) --------- Co-authored-by: Yingfeng <yingfeng.zhang@gmail.com>	2026-03-27 09:49:50 +08:00
黄圣祺	406339af1f	Fix(paddleocr): load all PDF pages for image cropping instead of first 100 (#13811 ) ## Summary Closes #13803 The `__images__` method in `paddleocr_parser.py` defaulted to `page_to=100`, only loading the first 100 pages for image cropping. However, the PaddleOCR API processes all pages of the PDF. For PDFs with more than 100 pages, page indices beyond 99 were rejected as out of range during crop validation, causing content loss. ## Root Cause ``` __images__(page_to=100) → loads pages 0-99 → page_images has 100 entries PaddleOCR API → processes all 226 pages → tags reference pages 1-226 extract_positions() → converts tag "101" to index 100 crop() validation → 0 <= 100 < 100 → False → "All page indices [100] out of range" ``` ## Fix Changed `page_to` default from `100` to `10**9`, so all PDF pages are loaded for cropping. Python's list slicing safely handles oversized indices. ## Test plan - [ ] Parse a PDF with >100 pages using PaddleOCR — no more "out of range" warnings - [ ] Parse a PDF with <100 pages — behavior unchanged - [ ] Verify cropped images are generated correctly for all pages 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-authored-by: Asksksn <Asksksn@noreply.gitcode.com> Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-27 09:33:11 +08:00
Sank	992a15146d	Add translate autoMetadata (#13807 ) ### What problem does this PR solve? add translate autoMetadata in Russia ### Type of change - [x] Other	2026-03-27 09:31:20 +08:00
Renzo	f3aa3381a2	Fix username line break in SharedBadge component (#13794 ) ## Summary - Added Tailwind truncation classes (`inline-block max-w-[120px] truncate align-middle`) to the username `<span>` in `SharedBadge` to prevent long usernames from wrapping onto multiple lines - Added `title` attribute to show the full username on hover when truncated ![ragflow](https://github.com/user-attachments/assets/8b3d8c03-d605-4957-bcf0-8b4d81fc4e70) ## Test plan - [x] Verify long usernames display truncated with ellipsis (`...`) - [x] Verify hovering over a truncated username shows the full name as a tooltip - [x] Verify short usernames display normally without truncation Closes #13748	2026-03-27 09:31:08 +08:00
Yingfeng	6e309f9d0a	Feat: Initialize context engine CLI (#13776 ) ### What problem does this PR solve? - Add multiple output format to ragflow_cli - Initialize contextengine to Go module - ls datasets/ls files - cat file - search -d dir -q query issue: #13714 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2026-03-26 21:07:06 +08:00

1 2 3 4 5 ...

5660 Commits