Files
ragflow/conf/models/stepfun.json
Haruko386 c2863173b0 Go: implement TTS, ASR for Siliconflow and TTs for StepFun (#14944)
### What problem does this PR solve?

This PRimplement TTS, ASR for Siliconflow and TTs for StepFun

**The following functionalities are now supported:**

**SiliConFlow:**
- [x] Text To Speech
- [x] Audio To Text
- [x] Stream Audio To Text

**StrepFun:**

- [x] Audio To Text
- [x] Stream Audio To Text

**Verified examples from the CLI:**
```plaintext
# SiliconFlow

RAGFlow(user)> tts with 'FunAudioLLM/CosyVoice2-0.5B@test@Siliconflow' text 'hello? show yourself' play format 'wav' param '{"voice": "fnlp/MOSS-TTSD-v0.5:alex"}'
SUCCESS

RAGFlow(user)> asr with 'FunAudioLLM/SenseVoiceSmall@test@siliconflow' audio './internal/test.wav' param ''
+----------------------------------------------------------------------------------------------------------------------+
| text                                                                                                                 |
+----------------------------------------------------------------------------------------------------------------------+
| The examination and testimony of the experts enabled the commission to conclude that five shots may have been fired. |
+----------------------------------------------------------------------------------------------------------------------+

RAGFlow(user)> stream asr with 'FunAudioLLM/SenseVoiceSmall@test@siliconflow' audio './internal/test.wav' param ''
+----------------------------------------------------------------------------------------------------------------------+
| text                                                                                                                 |
+----------------------------------------------------------------------------------------------------------------------+
| The examination and testimony of the experts enabled the commission to conclude that five shots may have been fired. |
+----------------------------------------------------------------------------------------------------------------------+
```

### Type of change

- [x] Bug Fix (non-breaking change which fixes an issue)
- [x] New Feature (non-breaking change which adds functionality)
2026-05-15 14:03:33 +08:00

116 lines
1.9 KiB
JSON

{
"name": "StepFun",
"url": {
"default": "https://api.stepfun.ai/v1"
},
"url_suffix": {
"chat": "chat/completions",
"models": "models",
"tts": "audio/speech"
},
"class": "step",
"models": [
{
"name": "step-3.5-flash",
"max_tokens": 32768,
"model_types": [
"chat"
]
},
{
"name": "step-3.5-flash-paid",
"max_tokens": 32768,
"model_types": [
"chat"
]
},
{
"name": "step-2-16k",
"max_tokens": 16384,
"model_types": [
"chat"
]
},
{
"name": "step-1-256k",
"max_tokens": 262144,
"model_types": [
"chat"
]
},
{
"name": "step-1-128k",
"max_tokens": 131072,
"model_types": [
"chat"
]
},
{
"name": "step-1-32k",
"max_tokens": 32768,
"model_types": [
"chat"
]
},
{
"name": "step-1-8k",
"max_tokens": 8192,
"model_types": [
"chat"
]
},
{
"name": "step-1-flash",
"max_tokens": 8192,
"model_types": [
"chat"
]
},
{
"name": "step-1v-32k",
"max_tokens": 32768,
"model_types": [
"chat",
"vision"
]
},
{
"name": "step-1v-8k",
"max_tokens": 8192,
"model_types": [
"chat",
"vision"
]
},
{
"name": "step-1o-vision-32k",
"max_tokens": 32768,
"model_types": [
"chat",
"vision"
]
},
{
"name": "step-tts-2 ",
"max_tokens": 8192,
"model_types": [
"tts"
]
},
{
"name": "stepaudio-2.5-tts",
"max_tokens": 8192,
"model_types": [
"tts"
]
},
{
"name": "step-tts-mini",
"max_tokens": 8192,
"model_types": [
"tts"
]
}
]
}