docs: add FAQ entry for using Ollama with RAGFlow (#14557)

### What problem does this PR solve? Users frequently ask how to use Ollama for local LLM inference with RAGFlow. This FAQ entry provides step-by-step instructions for setting up Ollama as a local model provider. ### Type of change - [x] Documentation update ### Description Adds a new FAQ entry: "How do I use Ollama with RAGFlow for local LLM inference?" Covers: 1. Starting Ollama and pulling a model 2. Configuring Ollama as a model provider in RAGFlow Settings 3. Using the Ollama model in an assistant
2026-06-29 15:31:05 +08:00 · 2026-05-15 11:24:09 +05:30
parent 86bcf9767d
commit 1a25191b13
1 changed files with 23 additions and 0 deletions
--- a/docs/faq.mdx
+++ b/docs/faq.mdx
@@ -692,3 +692,26 @@ http://localhost:8080/layout-parsing
 | `PADDLEOCR_ACCESS_TOKEN` | Access token for official API | `None` | Only when using official API |

 Environment variables can be used for auto-provisioning, but are not required if configuring via UI. When environment variables are set, these values are used to auto-provision a PaddleOCR model for the tenant on first use.
+
+
+### How do I use Ollama with RAGFlow for local LLM inference?
+
+RAGFlow supports Ollama as a local model provider for private, offline inference.
+
+**Step 1: Start Ollama and pull a model**
+
+```bash
+export OLLAMA_HOST=0.0.0.0
+ollama serve
+ollama pull llama3
+```
+
+**Step 2: Add Ollama in RAGFlow**
+
+1. Go to **Settings** > **Model providers** > **Ollama**.
+2. Set the Base URL to `http://host.docker.internal:11434` (Docker) or `http://localhost:11434` (bare-metal).
+3. Enter the model name (e.g., `llama3`) and click **Save**.
+
+**Step 3: Use Ollama in your assistant**
+
+- Open an assistant's **Configuration** page and select the Ollama model under **Chat model**.