ragflow

mirror of https://github.com/infiniflow/ragflow.git synced 2026-07-05 19:08:38 +08:00

Author	SHA1	Message	Date
Lynn	794c1f4b25	Fix: volc engine and other json key factories (#15653 ) ### What problem does this PR solve? Fix: - VolcEngine adapt to new api_key format - Save dict api_key as json ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-06-05 09:45:44 +08:00
Aeovy	600590cd18	Fix: disable thinking to avoid potential infinite loops in Qwen3.5/Qwen3.6 models (#15101 ) ### What problem does this PR solve? This PR fixes the issue where Qwen3.5/Qwen3.6 series models may spend excessive time on simple document-parsing tasks, such as Auto Metadata extraction, keyword extraction, question generation, and image description when using the MinerU parser. For these tasks, Qwen3.5/Qwen3.6 models may perform unnecessary reasoning by default, which can lead to very long response times, high token consumption, and, in some cases, potential infinite output loops. Since Qwen3.5/Qwen3.6 multimodal models are instantiated as `CvModel` when configured as `image2text`, the existing `enable_thinking=False` logic in `chat_model.py` does not apply to them. This PR adds the corresponding handling for the CV/image-to-text model path as well. This helps reduce unnecessary thinking time, avoid potential infinite loops, and improve parsing efficiency without noticeably affecting output quality for these simple extraction and image-description tasks. Fixes #15083.	2026-06-02 13:21:35 +08:00
Jonathan Hill	111cdc77b5	fix: guard LLM response against empty choices (fixes #14711 ) (#14988 ) ## Summary Fixes 10 unguarded `response.choices[0]` accesses that cause `IndexError` or `AttributeError` when the LLM returns an empty `choices` list — the scenario described in #14711. - `rag/llm/cv_model.py` - `rag/llm/chat_model.py` Each access site is now guarded with: ```python if not response.choices: raise ValueError("LLM returned empty response") ``` ## Verification Detected and verified by [pact](https://github.com/qizwiz/pact) — a sheaf-cohomological LLM contract checker using Z3 as a local theory solver. pact sheaf-cohomological proof status after fix: \| File \| Ȟ¹ (after) \| Z3 \| \|------\|-----------\|-----\| \| `rag/llm/cv_model.py` \| 0 \| UNSAT ✓ \| \| `rag/llm/chat_model.py` \| 0 \| UNSAT ✓ \| All access sites proven safe (Z3 UNSAT certificate). The checker was also used to verify the autogen streaming-None fix in [microsoft/autogen#7711](https://github.com/microsoft/autogen/pull/7711). ## Test plan - [ ] Existing test suite passes - [ ] Manually test with a provider that returns empty `choices` under load (e.g. Vertex AI) 🤖 Generated with [Claude Code](https://claude.com/claude-code) --------- Signed-off-by: Jonathan Hill <jonathan.f.hill@gmail.com>	2026-05-21 15:37:19 +08:00
Ricardo-M-L	13922209e6	fix(llm): add timeout to HTTP requests in LLM integration layer (#14313 ) ### What problem does this PR solve? Multiple `requests.post()` calls across the LLM integration layer lack a `timeout` parameter. Without a timeout, a single unresponsive upstream service can block the calling thread indefinitely, eventually exhausting the thread pool and degrading the entire system. This is a well-known issue — Python's `requests` library defaults to `timeout=None` (infinite wait), and [the library docs explicitly recommend](https://requests.readthedocs.io/en/latest/user/advanced/#timeouts) always setting a timeout. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) ### Change Added `timeout` to all `requests.post()` calls missing it: \| File \| Calls fixed \| Timeout \| \|------\|-------------\|---------\| \| `rag/llm/rerank_model.py` \| 9 \| 30s \| \| `rag/llm/embedding_model.py` \| 8 \| 30s \| \| `rag/llm/cv_model.py` \| 3 \| 60s \| \| `rag/llm/tts_model.py` \| 2 \| 60s \| \| `rag/llm/sequence2txt_model.py` \| 2 \| 60s \| Embedding/rerank calls use 30s (lightweight API calls). Vision, TTS, and audio transcription use 60s (heavier workloads with file uploads). Note: other files in the codebase (e.g. `check_minio_alive`, `check_ragflow_server_alive`) already use `timeout=10`, so this PR brings the LLM layer in line with existing practice. Signed-off-by: Ricardo-M-L <Sibyl_Hartmanbnb@webname.com> Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>	2026-05-11 11:19:07 +08:00
VincentLambert	08bb53bbb1	Feat: add BedrockCV for vision/image2text inference via LiteLLM (#14705 ) ## Summary - `CvModel["Bedrock"]` was absent from `rag/llm/cv_model.py`, causing `model_instance()` to return `None` when a Bedrock model was used as a PDF parser — even after correct model resolution. - This PR adds `BedrockCV`, enabling Bedrock vision models (e.g. `amazon.nova-pro-v1:0`, `anthropic.claude-3-5-sonnet`) to be used as PDF parsers. ## What problem does this PR solve? When a Bedrock model is selected as the PDF parser in a knowledge base, ingestion failed with: ``` 'LiteLLMBase' object has no attribute 'describe_with_prompt' ``` The root cause: `LiteLLMBase` (the Bedrock chat implementation) was the only registered handler for the Bedrock factory. It does not implement `describe_with_prompt`. `CvModel` had no Bedrock entry, so `model_instance()` returned `None` for `image2text` requests. ## Type of change - [x] New Feature (non-breaking change which adds functionality) ## Changes `rag/llm/cv_model.py` Adds `BedrockCV(Base)` with `_FACTORY_NAME = "Bedrock"`: - Uses `litellm.completion` with the `bedrock/` prefix (consistent with `LiteLLMBase`) - Parses AWS credentials from the JSON key assembled by `add_llm` (`auth_mode`, `bedrock_ak`, `bedrock_sk`, `bedrock_region`, `aws_role_arn`) - Supports three auth modes: `access_key_secret`, `iam_role` (via STS `assume_role`), and default credential chain (IRSA, instance profile) - Implements `describe_with_prompt` and `describe` ## Test plan - [ ] Configure a Bedrock vision model (e.g. `amazon.nova-pro-v1:0`) with valid AWS credentials - [ ] Select it as PDF parser in a knowledge base - [ ] Verify ingestion of a PDF document completes without errors - [ ] Verify `CvModel["Bedrock"]` resolves to `BedrockCV` 🤖 Generated with [Claude Code](https://claude.ai/claude-code) --------- Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-11 10:29:58 +08:00
Ricardo-M-L	1046042e01	fix(llm): replace mutable default `gen_conf={}` with None + defensive copy (#14566 ) ### What 19 methods across `rag/llm/chat_model.py` and `rag/llm/cv_model.py` declare `gen_conf={}` (or `gen_conf: dict = {}`) as a parameter default and then mutate `gen_conf` in place — typically `del gen_conf["max_tokens"]`, `gen_conf["penalty_score"] = ...`, or `gen_conf.pop(...)` as part of provider-specific normalization. ### The two bugs in this pattern 1. Mutable default argument (Python footgun). Python evaluates default values once at function-definition time, so the single `{}` dict is shared across every caller that doesn't pass `gen_conf`. The first such call's mutations leak into the default seen by every subsequent call. ```python # Before def chat_streamly(self, system, history, gen_conf={}, kwargs): if "max_tokens" in gen_conf: del gen_conf["max_tokens"] # mutates the SHARED default dict ... ``` After call N with `max_tokens` set, call N+1 that omits `gen_conf` no longer sees `max_tokens` — even though the caller never touched it. 2. Caller-dict pollution.** When the caller does pass a `gen_conf` dict, the same in-place mutations modify the caller's dict. A reused `gen_conf` (very common for chat-loop callers that build the config once and pass it on every turn) silently loses `max_tokens`, `presence_penalty`, etc. after the first round. ### The fix In every affected method: - Change `gen_conf={}` (or `gen_conf: dict = {}`) → `gen_conf=None`. - Add `gen_conf = dict(gen_conf or {})` as the first statement of the body so all subsequent mutations operate on a fresh local copy. ```python # After def chat_streamly(self, system, history, gen_conf=None, kwargs): gen_conf = dict(gen_conf or {}) if "max_tokens" in gen_conf: del gen_conf["max_tokens"] # local copy — safe ... ``` This is byte-for-byte identical provider-side behavior for callers that already pass a fresh `gen_conf` per call. The new `dict(...)` copy is O(small constant) per call. ### Files changed - `rag/llm/chat_model.py` — 17 methods - `rag/llm/cv_model.py` — 2 methods ### Tests Adds `test/unit_test/rag/llm/test_gen_conf_no_mutable_default.py` — an `ast`-based regression guard that walks both modules and asserts no parameter named `gen_conf` ever has a mutable literal (`{}` or `[]`) as its default. The test caught five additional `gen_conf: dict = {}` sites that an initial `gen_conf={}` text grep had missed (annotated parameters with whitespace), and would fail again if the pattern is ever reintroduced. ``` $ pytest test/unit_test/rag/llm/test_gen_conf_no_mutable_default.py -v ============================== 3 passed in 0.04s =============================== ``` `ruff check` passes on all touched files. ### Notes - This PR is intentionally focused on just** the `gen_conf` default + copy fix. There's a related (but separate) `history.insert(0, ...)` pattern in the same files that mutates the caller's history list in 12 places — left for a follow-up so this PR stays mechanical and easy to review. ### Latest revision (`700bb54a7`) — addresses CodeRabbit review - Type annotation: `gen_conf: dict = None` → `gen_conf: dict \| None = None` (5 occurrences in `chat_model.py`). The old annotation was a static-checker mismatch since `None` isn't a `dict`. - Regression test: the AST check accessed `default.keys` directly. `ast.List` has no `.keys` attribute — a future `gen_conf=[]` would crash with `AttributeError` instead of being caught. Use `getattr` for both `.keys` (Dict) and `.elts` (List). Manually verified the updated check correctly catches both `gen_conf={}` and `gen_conf=[]` while ignoring `gen_conf=None` and non-empty literals. --------- Co-authored-by: Ricardo <ricardo@example.com>	2026-05-09 13:11:44 +08:00
FuturMix	2548c28d65	feat: add FuturMix as model provider (#14419 ) ## Summary Add [FuturMix](https://futurmix.ai) as a new model provider. FuturMix is an OpenAI-compatible unified AI gateway that provides access to 22+ models (GPT, Claude, Gemini, DeepSeek, and more) through a single API endpoint and key. - API Base: `https://futurmix.ai/v1` (OpenAI-compatible) - Supported capabilities: Chat, Embedding, Image2Text, TTS, Speech2Text, Rerank ### Changes \| File \| Change \| \|------\|--------\| \| `rag/llm/__init__.py` \| Add `FuturMix` to `SupportedLiteLLMProvider` enum, `FACTORY_DEFAULT_BASE_URL`, and `LITELLM_PROVIDER_PREFIX` \| \| `rag/llm/chat_model.py` \| Add `FuturMixChat(Base)` — follows Astraflow/Avian pattern \| \| `rag/llm/embedding_model.py` \| Add `FuturMixEmbed(OpenAIEmbed)` — follows Astraflow pattern \| \| `rag/llm/cv_model.py` \| Add `FuturMixCV(GptV4)` — follows SILICONFLOW/OpenRouter pattern \| \| `rag/llm/tts_model.py` \| Add `FuturMixTTS(OpenAITTS)` — follows CometAPI/DeerAPI pattern \| \| `rag/llm/sequence2txt_model.py` \| Add `FuturMixSeq2txt(GPTSeq2txt)` — follows StepFun pattern \| \| `rag/llm/rerank_model.py` \| Add `FuturMixRerank(OpenAI_APIRerank)` \| \| `conf/llm_factories.json` \| Add factory config with 8 chat, 2 embedding, 1 image2text, 2 TTS, 1 speech2text models \| \| `docs/guides/models/supported_models.mdx` \| Add FuturMix to supported models table \| ### Models included - Chat: claude-sonnet-4-20250514, claude-3.5-haiku, gpt-4o, gpt-4o-mini, gemini-2.5-flash, gemini-2.0-flash, deepseek-chat, deepseek-reasoner - Embedding: text-embedding-3-small, text-embedding-3-large - Image2Text: gpt-4o - TTS: tts-1, tts-1-hd - Speech2Text: whisper-1 ## Test plan - [ ] Verify FuturMix appears in the model provider list in RAGFlow UI - [ ] Configure FuturMix with API key and test chat completion - [ ] Test embedding model with document indexing - [ ] Test image2text with a sample image 🤖 Generated with [Claude Code](https://claude.com/claude-code) --------- Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-30 10:59:37 +08:00
Jonah Hartmann	6023eb27ac	feat: add Ragcon provider (#13425 ) ### What problem does this PR solve? This PR aims to extend the list of possible providers. Adds new Provider "RAGcon" within the Ollama Modal. It provides all model types except OCR via Openai-compatible endpoints. ### Type of change - [x] New Feature (non-breaking change which adds functionality) --------- Co-authored-by: Jakob <16180662+hauberj@users.noreply.github.com>	2026-03-06 09:37:27 +08:00
Magicbook1108	98e1d5aa5c	Refact: switch from google-generativeai to google-genai (#13140 ) ### What problem does this PR solve? Refact: switch from oogle-generativeai to google-genai #13132 Refact: commnet out unused pywencai. ### Type of change - [x] Refactoring	2026-02-24 10:28:33 +08:00
Magicbook1108	109441628b	Fix: upload image files (#13071 ) ### What problem does this PR solve? Fix: upload image files ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-02-11 09:47:33 +08:00
Kevin Hu	927db0b373	Refa: asyncio.to_thread to ThreadPoolExecutor to break thread limitat… (#12716 ) ### Type of change - [x] Refactoring	2026-01-20 13:29:37 +08:00
Yongteng Lei	c51e6b2a58	Refa: migrate CV model chat to Async (#11828 ) ### What problem does this PR solve? Migrate CV model chat to Async. #11750 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) - [x] Refactoring	2025-12-09 13:08:37 +08:00
Kevin Hu	915e385244	Fix: uv lock updates (#11511 ) ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-11-25 16:01:12 +08:00
Kevin Hu	bcd70affb5	Fix: unexpected parameter. (#11497 ) ### What problem does this PR solve? #11489 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-11-25 11:17:27 +08:00
Yongteng Lei	e8fe580d7a	Feat: add Gemini 3 Pro preview (#11361 ) ### What problem does this PR solve? Add Gemini 3 Pro preview. Change `GenerativeModel` to `genai`. ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-11-19 13:17:22 +08:00
Billy Bao	0db00f70b2	Fix: add describe_image_with_prompt for ZHIPU AI (#11317 ) ### What problem does this PR solve? Fix: add describe_image_with_prompt for ZHIPU AI #11289 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-11-18 13:09:39 +08:00
Kevin Hu	dd5b8e2e1a	Fix: add auto_parse to kb detail. (#11153 ) ### What problem does this PR solve? ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-11-11 12:22:43 +08:00
Stephen Hu	82ca2e0378	Refactor: QWenCV release temp path (#11122 ) ### What problem does this PR solve? QWenCV release temp path ### Type of change - [x] Refactoring	2025-11-10 10:15:37 +08:00
Stephen Hu	660386d3b5	Fix: cannot parse images (#11044 ) ### What problem does this PR solve? https://github.com/infiniflow/ragflow/issues/11043 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-11-10 09:31:19 +08:00
Yongteng Lei	9fcc4946e2	Feat: add kimi-k2-thinking and moonshot-v1-vision-preview (#11110 ) ### What problem does this PR solve? Add kimi-k2-thinking and moonshot-v1-vision-preview. ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-11-07 19:52:57 +08:00
buua436	2d83c64eed	Fix:wrong describe_with_prompt() in ollama (#10963 ) ### What problem does this PR solve? change: wrong describe_with_prompt() in ollama ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-11-03 19:16:41 +08:00
Jin Hai	360f5c1179	Move token related functions to common (#10942 ) ### What problem does this PR solve? As title ### Type of change - [x] Refactoring Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2025-11-03 08:50:05 +08:00
Yongteng Lei	c0c2a10680	Feat: allow initialize Redis without password (#10856 ) ### What problem does this PR solve? Allow initialize Redis without password. ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-10-29 09:45:28 +08:00
Stephen Hu	d86d7061ea	Refactor: Improve how to get total token count for AnthropicCV (#10658 ) ### What problem does this PR solve? Improve how to get total token count for AnthropicCV ### Type of change - [x] Refactoring	2025-10-29 09:41:15 +08:00
Kevin Hu	3bd0b99495	Fix: gemini cv model chat issue. (#10799 ) ### What problem does this PR solve? #10787 #10781 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-10-27 11:43:56 +08:00
Billy Bao	a82e9b3d91	Fix: can't upload image in ollama model #10447 (#10717 ) ### What problem does this PR solve? Fix: can't upload image in ollama model #10447 ### Type of change - [X] Bug Fix (non-breaking change which fixes an issue) ### Change all `image=[]` to `image = None` Changing `image=[]` to `images=None` avoids Python’s mutable default parameter issue. If you keep `images=[]`, all calls share the same list, so modifying it (e.g., images.append()) will affect later calls. Using images=None and creating a new list inside the function ensures each call is independent. This change does not affect current behavior — it simply makes the code safer and more predictable. 把 `images=[]` 改成 `images=None` 是为了避免 Python 默认参数的可变对象问题。如果保留 `images=[]`，所有调用都会共用同一个列表，一旦修改就会影响后续调用。改成 None 并在函数内部重新创建列表，可以确保每次调用都是独立的。这个修改不会影响现有运行结果，只是让代码更安全、更可控。	2025-10-22 12:24:12 +08:00
Yongteng Lei	aaa4776657	Feat: Qwen-VL series supports video parsing (#10676 ) ### What problem does this PR solve? Qwen-VL series supports video parsing. #10617. ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-10-21 09:36:13 +08:00
Yongteng Lei	5b2e5dd334	Feat: Gemini supports video parsing (#10671 ) ### What problem does this PR solve? Gemini supports video parsing. ![img_v3_02r8_adbd5adc-d665-4756-9a00-3ae0f12224fg](https://github.com/user-attachments/assets/30d8d296-c336-4b55-9823-803979e705ca) ![img_v3_02r8_ab60c046-1727-4029-ad2e-66097fd3ccbg](https://github.com/user-attachments/assets/441b1487-a970-427e-98b6-6e1e002f2bad) Close: #10617 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-10-20 16:49:47 +08:00
buua436	b15643bd80	Feat:VolcEngine Model type add IMAGE2TEXT (#10629 ) ### What problem does this PR solve? issue: [#9004](https://github.com/infiniflow/ragflow/issues/9004) change: VolcEngine Model type add IMAGE2TEXT ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-10-17 11:43:22 +08:00
buua436	4e86ee4ff9	Feat: Support Specifying OpenRouter Model Provider (#10550 ) ### What problem does this PR solve? issue: [#5787](https://github.com/infiniflow/ragflow/issues/5787) change: Support Specifying OpenRouter Model Provider ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-10-16 09:39:59 +08:00
Stephen Hu	6ab4c1a6e9	Refactor: improve how NvidiaCV calculate res total token counts (#10455 ) ### What problem does this PR solve? improve how NvidiaCV calculate res total token counts ### Type of change - [x] Refactoring	2025-10-10 11:03:40 +08:00
Stephen Hu	4585edc20e	Refactor: improve cv model logics (#10414 ) 1. improve how to get total token count Improve how to get total token count ### Type of change - [x] Refactoring	2025-10-09 09:47:36 +08:00
Billy Bao	10cbbb76f8	revert gpt5 integration (#10228 ) ### What problem does this PR solve? Revert back to chat.completions. ### Type of change - [ ] Bug Fix (non-breaking change which fixes an issue) - [ ] New Feature (non-breaking change which adds functionality) - [ ] Documentation Update - [ ] Refactoring - [ ] Performance Improvement - [x] Other (please describe): Revert back to chat.completions.	2025-09-23 16:06:12 +08:00
Jin Hai	4eb7659499	Fix bug: broken import from rag.prompts.prompts (#10217 ) ### What problem does this PR solve? Fix broken imports ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) --------- Signed-off-by: jinhai <haijin.chn@gmail.com>	2025-09-23 10:19:25 +08:00
Billy Bao	da82566304	Fix: resolve hash collisions by switching to UUID &correct logic for always-true statements & Update GPT api integration & Support qianwen-deepresearch (#10208 ) ### What problem does this PR solve? Fix: resolve hash collisions by switching to UUID &correct logic for always-true statements, solved: #10165 Feat: Update GPT api integration, solved: #10204 Feat: Support qianwen-deepresearch, solved: #10163 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) - [x] New Feature (non-breaking change which adds functionality)	2025-09-23 09:34:30 +08:00
Stephen Hu	1936ad82d2	Refactor:Improve BytesIO usage for GeminiCV (#10042 ) ### What problem does this PR solve? Improve BytesIO usage for GeminiCV ### Type of change - [x] Refactoring	2025-09-11 11:07:15 +08:00
Stephen Hu	127af4e45c	Refactor:Improve BytesIO usage for image2base64 (#9997 ) ### What problem does this PR solve? Improve BytesIO usage for image2base64 ### Type of change - [x] Refactoring	2025-09-10 15:55:33 +08:00
Yongteng Lei	fe32952825	Fix: Gemini parameters error (#9520 ) ### What problem does this PR solve? Fix Gemini parameters error. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) --------- Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>	2025-08-18 14:51:10 +08:00
RuyXu	762aa4b8c4	fix: preserve correct MIME & unify data URL handling for vision inputs (relates #9248 ) (#9474 ) fix: preserve correct MIME & unify data URL handling for vision inputs (relates #9248) - Updated image2base64() to return a full data URL (data:image/<fmt>;base64,...) with accurate MIME - Removed hardcoded image/jpeg in Base._image_prompt(); pass through data URLs and default raw base64 to image/png - Set AnthropicCV._image_prompt() raw base64 media_type default to image/png - Ensures MIME type matches actual image content, fixing “cannot process base64 image” errors on vLLM/OpenAI-compatible backends ### What problem does this PR solve? This PR fixes a compatibility issue where base64-encoded images sent to vision models (e.g., vLLM/OpenAI-compatible backends) were rejected due to mismatched MIME type or incorrect decoding. Previously, the backend: - Always converted raw base64 into data:image/jpeg;base64,... even if the actual content was PNG. - In some cases, base64 decoding was attempted on the full data URL string instead of the pure base64 part. This caused errors like: ``` cannot process base64 image failed to decode base64 string: illegal base64 data at input byte 0 ``` by strict validators such as vLLM. With this fix, the MIME type in the request now matches the actual image content, and data URLs are correctly handled or passed through, ensuring vision models can decode and process images reliably. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-08-14 17:00:56 +08:00
Stephen Hu	f2806a8332	Update cv_model.py (#9472 ) ### What problem does this PR solve? https://github.com/infiniflow/ragflow/issues/9452 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-08-14 13:45:38 +08:00
Kevin Hu	a2e1f5618d	Fix: bytes style image issue. (#9304 ) ### What problem does this PR solve? #9302 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-08-07 15:20:01 +08:00
Kevin Hu	2124329e95	Fix: local variable issue. (#9255 ) ### What problem does this PR solve? #9227 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-08-05 19:24:34 +08:00
Stephen Hu	0a303d9ae1	Refactor:Improve the chat stream logic for NvidiaCV (#9242 ) ### What problem does this PR solve? Improve the chat stream logic for NvidiaCV ### Type of change - [x] Refactoring	2025-08-05 17:47:00 +08:00
Stephen Hu	1deb0a2d42	Fix:local variable 'response' referenced before assignment (#9230 ) ### What problem does this PR solve? https://github.com/infiniflow/ragflow/issues/9227 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) --------- Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>	2025-08-05 11:00:06 +08:00
Yongteng Lei	30ccc4a66c	Fix: correct single base64 image handling in image prompt (#9220 ) ### What problem does this PR solve? Correct single base64 image handling in image prompt. ![img_v3_02or_ec4757c2-a9d4-4774-9a76-f7c6be633ebg](https://github.com/user-attachments/assets/872a86bf-e2a8-48d1-9b71-2a0c7a35ba9e) ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-08-05 09:26:42 +08:00
Stephen Hu	5ccdb95008	Refactor:Introduce Image Close For GeminiCV (#9147 ) ### What problem does this PR solve? Introduce Image Close For GeminiCV ### Type of change - [x] Refactoring - [x] Performance Improvement	2025-08-01 12:38:13 +08:00
Kevin Hu	d9fe279dde	Feat: Redesign and refactor agent module (#9113 ) ### What problem does this PR solve? #9082 #6365 <u> WARNING: it's not compatible with the older version of `Agent` module, which means that `Agent` from older versions can not work anymore.</u> ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-07-30 19:41:09 +08:00
Stephen Hu	53b0b0e583	get keep alive from env (#9039 ) ### What problem does this PR solve? get keepalive from env ### Type of change - [x] Refactoring	2025-07-25 12:16:33 +08:00
Stephen Hu	07208e519b	Fix: Wrong_Input_type_for_Gemin (#8783 ) ### What problem does this PR solve? https://github.com/infiniflow/ragflow/issues/8763#issuecomment-3055317110 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-07-11 11:34:04 +08:00
Yongteng Lei	1895667573	Feat: add xAI provider (#8781 ) ### What problem does this PR solve? Add xAI provider (experimental feature, requires user feedback). ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-07-11 10:35:23 +08:00

1 2

100 Commits