Go and Rust SDK for Volcengine Doubao Speech API (豆包语音).
Official API Documentation: api/README.md
- Dual API Version Support: V1 (Classic) and V2/V3 (BigModel) APIs
- Multiple Auth Methods: Bearer Token, API Key, V2 API Key
- Comprehensive Coverage: TTS, ASR, Voice Clone, Realtime, Meeting, Podcast, etc.
- Streaming-first: WebSocket-based streaming for real-time scenarios
Doubao Speech has two API generations:
| Version | Name | Features | Recommended |
| V1 | Classic | Basic TTS/ASR | Legacy use |
| V2/V3 | BigModel | Advanced TTS/ASR, Realtime | ✅ New projects |
| Feature | V1 (Classic) | V2 (BigModel) | Go | Rust |
| TTS Sync | ✅ | ✅ | ✅ | ✅ |
| TTS Stream | ✅ | ✅ | ✅ | ✅ |
| TTS Async (Long Text) | ✅ | ✅ | ✅ | ⚠️ |
| ASR One-sentence | ✅ | ✅ | ✅ | ✅ |
| ASR Stream | ✅ | ✅ | ✅ | ✅ |
| ASR File | ✅ | ✅ | ✅ | ⚠️ |
| Voice Clone | N/A | ✅ | ✅ | ✅ |
| Realtime Dialogue | N/A | ✅ | ✅ | ✅ |
| Meeting Transcription | N/A | ✅ | ✅ | ✅ |
| Podcast Synthesis | N/A | ✅ | ✅ | ✅ |
| Translation (SIMT) | N/A | ✅ | ✅ | ✅ |
| Media Subtitle | N/A | ✅ | ✅ | ✅ |
| Console API | N/A | ✅ | ✅ | ✅ |
graph TB
subgraph client["Client"]
subgraph v1["V1 Services (Classic)"]
tts1[TTS]
asr1[ASR]
end
subgraph v2["V2 Services (BigModel)"]
tts2[TTSV2]
asr2[ASRV2]
advanced["VoiceClone<br/>Realtime<br/>Meeting<br/>Podcast<br/>Translation<br/>Media"]
end
end
subgraph console["Console Client"]
aksig["AK/SK Signature<br/>Authentication"]
end
client --> api["Volcengine API"]
console --> api
| Method | Header | Use Case |
| API Key | x-api-key: {key} | Simplest, recommended |
| Bearer Token | Authorization: Bearer;{token} | V1 APIs |
| V2 API Key | X-Api-Access-Key, X-Api-App-Key | V2/V3 APIs |
Uses Volcengine OpenAPI AK/SK signature (HMAC-SHA256).
| Service | Resource ID |
| TTS 2.0 | seed-tts-2.0 |
| TTS 2.0 Concurrent | seed-tts-2.0-concurr |
| ASR Stream | volc.bigasr.sauc.duration |
| ASR File | volc.bigasr.auc.duration |
| Realtime | volc.speech.dialog |
| Podcast | volc.service_type.10050 |
| Translation | volc.megatts.simt |
| Voice Clone | seed-icl-2.0 |
| Cluster | Service |
volcano_tts | TTS Standard |
volcano_mega | TTS BigModel |
volcano_icl | Voice Clone |
volcengine_streaming_common | ASR Streaming |
examples/go/doubaospeech/ - Go SDK examples
examples/cmd/doubaospeech/ - CLI test scripts
- CLI tool:
go/cmd/doubaospeech/
- CLI tests:
examples/cmd/doubaospeech/