docs: add FAQ entry for using Ollama with RAGFlow (#14557)

### What problem does this PR solve?

Users frequently ask how to use Ollama for local LLM inference with
RAGFlow. This FAQ entry provides step-by-step instructions for setting
up Ollama as a local model provider.

### Type of change

- [x] Documentation update

### Description

Adds a new FAQ entry: "How do I use Ollama with RAGFlow for local LLM
inference?"

Covers:
1. Starting Ollama and pulling a model
2. Configuring Ollama as a model provider in RAGFlow Settings
3. Using the Ollama model in an assistant
This commit is contained in:
SnakeEye-sudo (Er. Sangam Krishna)
2026-05-15 11:24:09 +05:30
committed by GitHub
parent 86bcf9767d
commit 1a25191b13

View File

@@ -692,3 +692,26 @@ http://localhost:8080/layout-parsing
| `PADDLEOCR_ACCESS_TOKEN` | Access token for official API | `None` | Only when using official API |
Environment variables can be used for auto-provisioning, but are not required if configuring via UI. When environment variables are set, these values are used to auto-provision a PaddleOCR model for the tenant on first use.
### How do I use Ollama with RAGFlow for local LLM inference?
RAGFlow supports Ollama as a local model provider for private, offline inference.
**Step 1: Start Ollama and pull a model**
```bash
export OLLAMA_HOST=0.0.0.0
ollama serve
ollama pull llama3
```
**Step 2: Add Ollama in RAGFlow**
1. Go to **Settings** > **Model providers** > **Ollama**.
2. Set the Base URL to `http://host.docker.internal:11434` (Docker) or `http://localhost:11434` (bare-metal).
3. Enter the model name (e.g., `llama3`) and click **Save**.
**Step 3: Use Ollama in your assistant**
- Open an assistant's **Configuration** page and select the Ollama model under **Chat model**.