Jin Hai
74b44e1aa3
Go: add balance command (#14262)
### What problem does this PR solve?
```
RAGFlow(user)> list supported models from 'moonshot' 'test';
+---------------------------------+
| model_name                      |
+---------------------------------+
| moonshot-v1-32k-vision-preview  |
| kimi-k2.6                       |
| moonshot-v1-8k                  |
| moonshot-v1-auto                |
| moonshot-v1-128k                |
| moonshot-v1-32k                 |
| kimi-k2.5                       |
| moonshot-v1-8k-vision-preview   |
| moonshot-v1-128k-vision-preview |
+---------------------------------+
RAGFlow(user)> show balance from 'moonshot' 'test';
+---------+----------+
| balance | currency |
+---------+----------+
| 0       | CNY      |
+---------+----------+
```
### Type of change
- [x] New Feature (non-breaking change which adds functionality)
---------
Signed-off-by: Jin Hai <haijin.chn@gmail.com>
2026-04-21 21:31:50 +08:00
Jin Hai
e48d75987c
Go: add stream / think chat (#14242)
### What problem does this PR solve?
1. Support streaming and non-streaming chat.
2. Support thinking and non-thinking chat.
3. List the models supported by the DeepSeek service. (This command can be used to verify API validity.)
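
The two switches above (stream or not, think or not) can be pictured with a small sketch. The `Chunk` type, its `Reasoning` and `Content` fields, and the `collect` helper are hypothetical names chosen for illustration; they are not taken from the RAGFlow code base, and the real implementation may be organised differently.

```go
package main

import (
	"fmt"
	"strings"
)

// Chunk is a hypothetical unit of model output. In think mode a chunk may
// carry reasoning text; otherwise only Content is set.
type Chunk struct {
	Reasoning string
	Content   string
}

// collect drains a channel of chunks and returns the accumulated reasoning
// and answer. If onChunk is non-nil it is called for every chunk as it
// arrives (stream mode); passing nil gives non-stream behaviour.
func collect(chunks <-chan Chunk, onChunk func(Chunk)) (reasoning, answer string) {
	var r, a strings.Builder
	for c := range chunks {
		if onChunk != nil {
			onChunk(c)
		}
		r.WriteString(c.Reasoning)
		a.WriteString(c.Content)
	}
	return r.String(), a.String()
}

func main() {
	// Simulated model output; a real client would fill this from the network.
	chunks := make(chan Chunk, 3)
	chunks <- Chunk{Reasoning: "user greets; "}
	chunks <- Chunk{Reasoning: "reply politely."}
	chunks <- Chunk{Content: "Hello!"}
	close(chunks)

	reasoning, answer := collect(chunks, func(c Chunk) {
		fmt.Print(c.Reasoning, c.Content) // stream mode: print pieces as they arrive
	})
	fmt.Println()
	fmt.Println("reasoning:", reasoning)
	fmt.Println("answer:", answer)
}
```

Keeping reasoning and answer in separate fields lets a non-think caller ignore the first return value without any change to the streaming loop.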
### Type of change
- [x] New Feature (non-breaking change which adds functionality)
---------
Signed-off-by: Jin Hai <haijin.chn@gmail.com>
2026-04-21 16:52:32 +08:00
Jin Hai
f269ee9739
Go: add thinking features to zhipu-ai (#14234)
### What problem does this PR solve?
```
RAGFlow(user)> list models from 'zhipu-ai';
+------------+------------+---------------+----------------+
| features   | max_tokens | model_types   | name           |
+------------+------------+---------------+----------------+
| [thinking] | 128000     | [chat]        | glm-4.7        |
| [thinking] | 128000     | [chat]        | glm-4.5        |
| [thinking] | 128000     | [chat vision] | glm-4.6v-Flash |
| [thinking] | 128000     | [chat]        | glm-4.5-x      |
| [thinking] | 128000     | [chat]        | glm-4.5-air    |
| [thinking] | 128000     | [chat]        | glm-4.5-airx   |
| [thinking] | 128000     | [chat]        | glm-4.5-flash  |
| [thinking] | 64000      | [vision]      | glm-4.5v       |
|            | 128000     | [chat]        | glm-4-plus     |
|            | 128000     | [chat]        | glm-4-0520     |
|            | 128000     | [chat]        | glm-4          |
|            | 8000       | [chat]        | glm-4-airx     |
|            | 128000     | [chat]        | glm-4-air      |
|            | 128000     | [chat]        | glm-4-flash    |
|            | 128000     | [chat]        | glm-4-flashx   |
|            | 1000000    | [chat]        | glm-4-long     |
|            | 128000     | [chat]        | glm-3-turbo    |
|            | 2000       | [vision]      | glm-4v         |
|            | 8192       | [chat]        | glm-4-9b       |
|            | 512        | [embedding]   | embedding-2    |
|            | 512        | [embedding]   | embedding-3    |
|            | 4096       | [asr]         | glm-asr        |
|            | 0          | [tts]         | glm-tts        |
|            | 0          | [ocr]         | glm-ocr        |
|            | 0          | [rerank]      | glm-rerank     |
+------------+------------+---------------+----------------+
```
### Type of change
- [x] New Feature (non-breaking change which adds functionality)
Signed-off-by: Jin Hai <haijin.chn@gmail.com>
2026-04-20 21:53:27 +08:00
Jin Hai
af2ed416a7
Add extra field to model instance (#14203)
### What problem does this PR solve?
Each model now supports multiple regions, each with its own URL.
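
As a rough illustration of how an extra field on a model instance could select a per-region URL, consider the sketch below. Everything in it is an assumption made for the example: the `ModelInstance` struct, the `"region"` key inside `Extra`, the `regionURLs` table, and the `example-provider` endpoints are invented and do not come from the PR.

```go
package main

import (
	"errors"
	"fmt"
)

// ModelInstance is a hypothetical shape for a provider instance. Extra holds
// free-form settings; here it is assumed to carry a "region" key.
type ModelInstance struct {
	Provider string
	Name     string
	Extra    map[string]string
}

// regionURLs is made-up example data: provider -> region -> base URL.
var regionURLs = map[string]map[string]string{
	"example-provider": {
		"default": "https://api.example.com/v1",
		"cn":      "https://api.example.cn/v1",
	},
}

// baseURL resolves the endpoint for an instance, falling back to the
// "default" region when the instance does not name one.
func baseURL(inst ModelInstance) (string, error) {
	regions, ok := regionURLs[inst.Provider]
	if !ok {
		return "", errors.New("unknown provider: " + inst.Provider)
	}
	region := inst.Extra["region"] // reading a nil map yields ""
	if region == "" {
		region = "default"
	}
	url, ok := regions[region]
	if !ok {
		return "", fmt.Errorf("provider %s has no region %q", inst.Provider, region)
	}
	return url, nil
}

func main() {
	inst := ModelInstance{
		Provider: "example-provider",
		Name:     "test",
		Extra:    map[string]string{"region": "cn"},
	}
	url, err := baseURL(inst)
	if err != nil {
		fmt.Println("error:", err)
		return
	}
	fmt.Println("base URL:", url)
}
```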
### Type of change
- [x] New Feature (non-breaking change which adds functionality)
---------
Signed-off-by: Jin Hai <haijin.chn@gmail.com>
2026-04-20 15:31:12 +08:00
Jin Hai
94106646e7
Go: set and list default models (#14191)
### What problem does this PR solve?
```
RAGFlow(user)> set default vlm "zhipu-ai" "ccc" "glm-4.6v-flash";
SUCCESS
RAGFlow(user)> list default models;
+--------+----------------+----------------+----------------+------------+
| enable | model_instance | model_name     | model_provider | model_type |
+--------+----------------+----------------+----------------+------------+
| true   | ccc            | glm-4.6v-flash | zhipu-ai       | llm        |
| true   | ccc            | glm-4.6v-flash | zhipu-ai       | image2text |
+--------+----------------+----------------+----------------+------------+
```
### Type of change
- [x] New Feature (non-breaking change which adds functionality)
Signed-off-by: Jin Hai <haijin.chn@gmail.com>
2026-04-17 18:05:33 +08:00
Jin Hai
6d9430a125
Add think chat to CLI (#13922)
### What problem does this PR solve?
Users can now chat with the LLM in 'think mode'.
### Type of change
- [x] New Feature (non-breaking change which adds functionality)
---------
Signed-off-by: Jin Hai <haijin.chn@gmail.com>
2026-04-03 18:11:23 +08:00
Jin Hai
6c29128de1
Refactor model provider and command (#13887)
### What problem does this PR solve?
Introduces five new tables, including tables for model groups and provider instances.
### Type of change
- [x] New Feature (non-breaking change which adds functionality)
- [x] Refactoring
---------
Signed-off-by: Jin Hai <haijin.chn@gmail.com>
2026-04-02 20:20:35 +08:00
Jin Hai
e20cf39735
Refactor Go server model provider reading and access (#13831)
### What problem does this PR solve?
1. Refactor the model provider JSON file format.
2. Replace the database with an in-memory data structure.
3. Add CLI commands to access the in-memory data.
```
RAGFlow(user)> list pool models from 'xai';
+-------------------------------------------------------------------------------------+------------+-------------+-----------------------+
| features                                                                            | max_tokens | model_types | name                  |
+-------------------------------------------------------------------------------------+------------+-------------+-----------------------+
| map[]                                                                               | 256000     | [llm]       | grok-4                |
| map[]                                                                               | 131072     | [llm]       | grok-3                |
| map[]                                                                               | 131072     | [llm]       | grok-3-fast           |
| map[]                                                                               | 131072     | [llm]       | grok-3-mini           |
| map[]                                                                               | 131072     | [llm]       | grok-3-mini-mini-fast |
| map[multimodal:map[enabled:true input_modalities:[image] output_modalities:[text]]] | 32768      | [vlm]       | grok-2-vision         |
+-------------------------------------------------------------------------------------+------------+-------------+-----------------------+
RAGFlow(user)> show pool model 'grok-2-vision' from 'xai';
+-------------------------------------------------------------------------------------+------------+-------------+---------------+
| features                                                                            | max_tokens | model_types | name          |
+-------------------------------------------------------------------------------------+------------+-------------+---------------+
| map[multimodal:map[enabled:true input_modalities:[image] output_modalities:[text]]] | 32768      | [vlm]       | grok-2-vision |
+-------------------------------------------------------------------------------------+------------+-------------+---------------+
RAGFlow(user)> list pool providers;
+--------+------------------------------------------------------------+---------------------------+
| name   | tags                                                       | url                       |
+--------+------------------------------------------------------------+---------------------------+
| OpenAI | LLM,TEXT EMBEDDING,TTS,TEXT RE-RANK,SPEECH2TEXT,MODERATION | https://api.openai.com/v1 |
| xAI    | LLM                                                        | https://api.x.ai/v1       |
+--------+------------------------------------------------------------+---------------------------+
RAGFlow(user)> show pool provider 'openai';
+---------------------------+--------+------------------------------------------------------------+--------------+
| base_url                  | name   | tags                                                       | total_models |
+---------------------------+--------+------------------------------------------------------------+--------------+
| https://api.openai.com/v1 | OpenAI | LLM,TEXT EMBEDDING,TTS,TEXT RE-RANK,SPEECH2TEXT,MODERATION | 27           |
+---------------------------+--------+------------------------------------------------------------+--------------+
```
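
The idea of serving provider data from memory instead of a database might look roughly like the sketch below. It is only an illustration under stated assumptions: the `Pool`, `Provider`, and `Model` types and the JSON field names are invented here, and the actual provider file format is not shown in this PR description. The lookup is made case-insensitive because the CLI output above accepts `'xai'` for the provider listed as `xAI`; how the real server achieves that is not known.

```go
package main

import (
	"encoding/json"
	"fmt"
	"strings"
	"sync"
)

// Model and Provider mirror columns from the CLI output; the JSON field
// names are assumptions, not the project's actual file format.
type Model struct {
	Name       string   `json:"name"`
	MaxTokens  int      `json:"max_tokens"`
	ModelTypes []string `json:"model_types"`
}

type Provider struct {
	Name   string  `json:"name"`
	URL    string  `json:"url"`
	Models []Model `json:"models"`
}

// Pool keeps providers in memory, keyed by lower-cased provider name.
type Pool struct {
	mu        sync.RWMutex
	providers map[string]Provider
}

func NewPool() *Pool {
	return &Pool{providers: make(map[string]Provider)}
}

// Load parses one provider definition and stores it in the pool.
func (p *Pool) Load(data []byte) error {
	var prov Provider
	if err := json.Unmarshal(data, &prov); err != nil {
		return err
	}
	p.mu.Lock()
	defer p.mu.Unlock()
	p.providers[strings.ToLower(prov.Name)] = prov
	return nil
}

// Model looks up a model by provider (case-insensitive) and exact model name.
func (p *Pool) Model(provider, model string) (Model, bool) {
	p.mu.RLock()
	defer p.mu.RUnlock()
	prov, ok := p.providers[strings.ToLower(provider)]
	if !ok {
		return Model{}, false
	}
	for _, m := range prov.Models {
		if m.Name == model {
			return m, true
		}
	}
	return Model{}, false
}

// sampleJSON reuses values from the output above in an assumed layout.
const sampleJSON = `{"name":"xAI","url":"https://api.x.ai/v1","models":[
  {"name":"grok-2-vision","max_tokens":32768,"model_types":["vlm"]}]}`

func main() {
	pool := NewPool()
	if err := pool.Load([]byte(sampleJSON)); err != nil {
		fmt.Println("load failed:", err)
		return
	}
	if m, ok := pool.Model("xai", "grok-2-vision"); ok {
		fmt.Println(m.Name, m.MaxTokens, m.ModelTypes)
	}
}
```

A read-write mutex is used on the assumption that reads (CLI queries) far outnumber reloads of the provider files.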
### Type of change
- [x] New Feature (non-breaking change which adds functionality)
- [x] Refactoring
---------
Signed-off-by: Jin Hai <haijin.chn@gmail.com>
2026-03-30 12:00:49 +08:00
Jin Hai
f32a832f92
Add rename model directory to entity to avoid name misunderstanding (#13829)
### What problem does this PR solve?
Rename the `model` directory to `entity`.
### Type of change
- [x] Refactoring
Signed-off-by: Jin Hai <haijin.chn@gmail.com>
2026-03-27 19:25:18 +08:00