ragflow/api/db/services at fd196f694ecceef8d527d2fa25b1797e42896f52 - ragflow - GetSkill.work

zlei6/ragflow

mirror of https://github.com/infiniflow/ragflow.git synced 2026-06-29 23:41:12 +08:00

Files

History

euvre fe46244d30 fix: paginate non-DeepDOC PDF parsing tasks to prevent OOM (#16106 )

The parser pods suffer from OOM kills when processing large PDF
documents. The root cause is in api/db/services/task_service.py: when
layout_recognize is not DeepDOC (e.g. Plain Text), page_size was set to
MAXIMUM_TASK_PAGE_NUMBER (100 million), causing the entire PDF to be
processed as a single task with all pages loaded into memory
simultaneously.

This PR fixes the issue by paginating non-DeepDOC PDF parsing tasks the
same way DeepDOC already does.

2026-06-17 09:33:53 +08:00

..

__init__.py

Feat: tenant llm provider (#14595 )

2026-05-29 17:39:41 +08:00

api_service.py

Feat: Agent api (#14157 )

2026-04-24 10:02:22 +08:00

canvas_service.py

feat: pass chat_template_kwargs through agent chat completion (#14542 )

2026-05-22 15:15:49 +08:00

chat_channel_service.py

fix: rename dialog_id to chat_id in chat_channel (backend + frontend) (#16096 )

2026-06-16 19:02:20 +08:00

chunk_feedback_service.py

feat: Auto-adjust chunk recall weights based on user feedback (#12689 )

2026-04-08 09:52:18 +08:00

common_service.py

Revert "fix: duplicate document ingest guard" (#15707 )

2026-06-05 17:45:29 +08:00

connector_service.py

Fix one data source can be synced to multiple dataset (#16023 )

2026-06-15 16:54:25 +08:00

conversation_service.py

Feat: chat channels — connect assistants to external messaging bots (#15850 )

2026-06-12 18:21:30 +08:00

dialog_service.py

Fix: v0.26.1 model provider (#16073 )

2026-06-16 16:21:43 +08:00

doc_metadata_service.py

Refine handling of POST /api/v1/datasets/search in GO (#15583 )

2026-06-08 11:49:37 +08:00

document_service.py

Refactor: Task Executor (#15154 )

2026-05-27 21:54:17 +08:00

evaluation_service.py

feat(evaluation): track token usage in evaluation results (#13487 )

2026-05-22 15:19:53 +08:00

file2document_service.py

…

file_commit_service.py

Add git-like file commit API (#15978 )

2026-06-15 11:19:56 +08:00

file_service.py

Fix: table parser metadata (#15127 )

2026-05-25 16:05:38 +08:00

knowledgebase_service.py

Refact: Added a private helper _visibility_and_status_filter (#13627 )

2026-05-11 15:21:41 +08:00

langfuse_service.py

…

llm_service.py

Fix: v0.26.1 model provider (#16073 )

2026-06-16 16:21:43 +08:00

mcp_server_service.py

…

memory_service.py

Feat: tenant llm provider (#14595 )

2026-05-29 17:39:41 +08:00

pipeline_operation_log_service.py

fix: propagate memory tenant id in task collect (#15837 )

2026-06-09 17:47:48 +08:00

search_service.py

fix(api): guard missing row in SearchService.get_detail (#15622 )

2026-06-08 23:01:28 +08:00

system_settings_service.py

Fix admin CLI system variable commands (#14956 )

2026-05-18 19:08:45 +08:00

task_service.py

fix: paginate non-DeepDOC PDF parsing tasks to prevent OOM (#16106 )

2026-06-17 09:33:53 +08:00

tenant_llm_service.py

feat: Langfuse session grouping for multi-turn chat traces (#15679 )

2026-06-12 10:18:06 +08:00

tenant_model_group_mapping_service.py

Feat: tenant llm provider (#14595 )

2026-05-29 17:39:41 +08:00

tenant_model_group_service.py

Feat: tenant llm provider (#14595 )

2026-05-29 17:39:41 +08:00

tenant_model_instance_service.py

Feat: tenant llm provider (#14595 )

2026-05-29 17:39:41 +08:00

tenant_model_provider_service.py

Feat: tenant llm provider (#14595 )

2026-05-29 17:39:41 +08:00

tenant_model_service.py

Fix: v0.26.1 model provider (#16073 )

2026-06-16 16:21:43 +08:00

user_canvas_version.py

Fix tiny issues (#14006 )

2026-04-09 19:01:36 +08:00

user_service.py

Feat: tenant llm provider (#14595 )

2026-05-29 17:39:41 +08:00