Doubao Speech SDK

Go and Rust SDKs for the Volcengine Doubao Speech API (豆包语音).

Official API Documentation: api/README.md

Design Goals

  1. Dual API Version Support: V1 (Classic) and V2/V3 (BigModel) APIs
  2. Multiple Auth Methods: Bearer Token, API Key, V2 API Key
  3. Comprehensive Coverage: TTS, ASR, Voice Clone, Realtime, Meeting, Podcast, etc.
  4. Streaming-first: WebSocket-based streaming for real-time scenarios

API Versions

Doubao Speech has two API generations:

Version | Name | Features | Recommended
V1 | Classic | Basic TTS/ASR | Legacy use
V2/V3 | BigModel | Advanced TTS/ASR, Realtime | ✅ New projects

API Coverage

Feature | V1 (Classic) | V2 (BigModel) | Go | Rust
TTS Sync
TTS Stream
TTS Async (Long Text) ⚠️
ASR One-sentence
ASR Stream
ASR File ⚠️
Voice Clone N/A
Realtime Dialogue N/A
Meeting Transcription N/A
Podcast Synthesis N/A
Translation (SIMT) N/A
Media Subtitle N/A
Console API N/A

Architecture

graph TB
    subgraph client["Client"]
        subgraph v1["V1 Services (Classic)"]
            tts1[TTS]
            asr1[ASR]
        end
        subgraph v2["V2 Services (BigModel)"]
            tts2[TTSV2]
            asr2[ASRV2]
            advanced["VoiceClone<br/>Realtime<br/>Meeting<br/>Podcast<br/>Translation<br/>Media"]
        end
    end
    
    subgraph console["Console Client"]
        aksig["AK/SK Signature<br/>Authentication"]
    end
    
    client --> api["Volcengine API"]
    console --> api
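
The layout in the diagram could be modeled in Go roughly as follows. This is only a sketch: every type, field, and constructor name is hypothetical and chosen to mirror the boxes in the graph, not taken from the SDK's exported API. Only the V1 and V2 TTS/ASR services are shown; the advanced V2 services would hang off the same client in the same way.

```go
package main

import "fmt"

// NOTE: all names below are hypothetical. They mirror the diagram,
// not the SDK's real exported API.

// speechAuth holds credentials shared by every speech service.
type speechAuth struct {
	apiKey string
}

// V1 (Classic) services.
type TTSV1 struct{ auth *speechAuth }
type ASRV1 struct{ auth *speechAuth }

// V2 (BigModel) services.
type TTSV2 struct{ auth *speechAuth }
type ASRV2 struct{ auth *speechAuth }

// Client groups the V1 and V2 speech services behind one entry point.
type Client struct {
	TTS   *TTSV1
	ASR   *ASRV1
	TTSV2 *TTSV2
	ASRV2 *ASRV2
}

// NewClient wires every service to the same credentials.
func NewClient(apiKey string) *Client {
	auth := &speechAuth{apiKey: apiKey}
	return &Client{
		TTS:   &TTSV1{auth: auth},
		ASR:   &ASRV1{auth: auth},
		TTSV2: &TTSV2{auth: auth},
		ASRV2: &ASRV2{auth: auth},
	}
}

// ConsoleClient is a separate client because it authenticates with an
// AK/SK signature instead of the speech API credentials.
type ConsoleClient struct {
	AccessKey string
	SecretKey string
}

// NewConsoleClient stores the AK/SK pair for later request signing.
func NewConsoleClient(ak, sk string) *ConsoleClient {
	return &ConsoleClient{AccessKey: ak, SecretKey: sk}
}

func main() {
	c := NewClient("demo-key")
	console := NewConsoleClient("demo-ak", "demo-sk")
	fmt.Println(c.TTS != nil, c.TTSV2 != nil, console.AccessKey)
}
```

Keeping the console client as its own type reflects the diagram, where it reaches the Volcengine API through a different authentication path than the speech services.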

Authentication Methods

Speech API Client

Method | Header | Use Case
API Key | x-api-key: {key} | Simplest, recommended
Bearer Token | Authorization: Bearer;{token} | V1 APIs
V2 API Key | X-Api-Access-Key, X-Api-App-Key | V2/V3 APIs
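
As an illustration of the three methods above, the following sketch sets the corresponding headers with Go's standard `net/http` package. The header names and the `Bearer;{token}` format come from the table; the `AuthMethod`, `Credentials`, `ApplyAuth`, and `HeadersFor` names are hypothetical, as is the assumption about which credential goes into which V2 header.

```go
package main

import (
	"fmt"
	"net/http"
)

// AuthMethod selects one of the three methods in the table.
// All names in this sketch are hypothetical, not the SDK's API.
type AuthMethod int

const (
	AuthAPIKey AuthMethod = iota
	AuthBearer
	AuthV2APIKey
)

// Credentials carries whichever values the chosen method needs.
type Credentials struct {
	APIKey    string // sent as x-api-key
	Token     string // sent as Authorization: Bearer;{token}
	AccessKey string // assumed value for X-Api-Access-Key
	AppKey    string // assumed value for X-Api-App-Key
}

// ApplyAuth writes the headers for method m into h.
// http.Header canonicalizes names, which is harmless because HTTP
// header names are case-insensitive.
func ApplyAuth(h http.Header, m AuthMethod, c Credentials) error {
	switch m {
	case AuthAPIKey:
		h.Set("x-api-key", c.APIKey)
	case AuthBearer:
		// The semicolon after "Bearer" follows the table above.
		h.Set("Authorization", "Bearer;"+c.Token)
	case AuthV2APIKey:
		h.Set("X-Api-Access-Key", c.AccessKey)
		h.Set("X-Api-App-Key", c.AppKey)
	default:
		return fmt.Errorf("unknown auth method %d", m)
	}
	return nil
}

// HeadersFor returns a fresh header set for the given method.
func HeadersFor(m AuthMethod, c Credentials) (http.Header, error) {
	h := make(http.Header)
	if err := ApplyAuth(h, m, c); err != nil {
		return nil, err
	}
	return h, nil
}

func main() {
	h, err := HeadersFor(AuthBearer, Credentials{Token: "demo-token"})
	if err != nil {
		fmt.Println("error:", err)
		return
	}
	fmt.Println(h.Get("Authorization"))
}
```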

Console Client

Uses Volcengine OpenAPI AK/SK signature (HMAC-SHA256).
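
To make the HMAC-SHA256 step concrete, here is a minimal sketch using only the Go standard library. The key-derivation chain (date, region, service, then the literal "request") and the placeholder region and service values are assumptions modeled on SigV4-style schemes; the exact canonical request and string-to-sign format should be taken from the Volcengine OpenAPI signing documentation. All function names are hypothetical.

```go
package main

import (
	"crypto/hmac"
	"crypto/sha256"
	"encoding/hex"
	"fmt"
)

// hmacSHA256 returns the raw HMAC-SHA256 of msg under key.
func hmacSHA256(key []byte, msg string) []byte {
	mac := hmac.New(sha256.New, key)
	mac.Write([]byte(msg))
	return mac.Sum(nil)
}

// hmacSHA256Hex is the hex-encoded form of hmacSHA256.
func hmacSHA256Hex(key []byte, msg string) string {
	return hex.EncodeToString(hmacSHA256(key, msg))
}

// deriveSigningKey chains HMACs over the request scope.
// ASSUMPTION: the order date -> region -> service -> "request" is a
// SigV4-style derivation and is not confirmed by this document.
func deriveSigningKey(secretKey, date, region, service string) []byte {
	k := hmacSHA256([]byte(secretKey), date)
	k = hmacSHA256(k, region)
	k = hmacSHA256(k, service)
	return hmacSHA256(k, "request")
}

// Sign returns the hex signature of stringToSign under the derived key.
func Sign(secretKey, date, region, service, stringToSign string) string {
	key := deriveSigningKey(secretKey, date, region, service)
	return hmacSHA256Hex(key, stringToSign)
}

func main() {
	// Region, service, and string-to-sign are placeholders.
	sig := Sign("demo-sk", "20240101", "example-region", "example-service", "example-string-to-sign")
	fmt.Println(len(sig), sig)
}
```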

Resource IDs (V2/V3)

Service | Resource ID
TTS 2.0 | seed-tts-2.0
TTS 2.0 Concurrent | seed-tts-2.0-concurr
ASR Stream | volc.bigasr.sauc.duration
ASR File | volc.bigasr.auc.duration
Realtime | volc.speech.dialog
Podcast | volc.service_type.10050
Translation | volc.megatts.simt
Voice Clone | seed-icl-2.0
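
One way to keep these identifiers in code is a set of constants plus a lookup, sketched below. The resource ID strings are taken from the table; the constant names, the short service keys, and the use of an `X-Api-Resource-Id` request header are assumptions for illustration and are not confirmed by this document.

```go
package main

import (
	"fmt"
	"net/http"
)

// Resource IDs from the table above. Constant names are hypothetical.
const (
	ResourceTTSV2           = "seed-tts-2.0"
	ResourceTTSV2Concurrent = "seed-tts-2.0-concurr"
	ResourceASRStream       = "volc.bigasr.sauc.duration"
	ResourceASRFile         = "volc.bigasr.auc.duration"
	ResourceRealtime        = "volc.speech.dialog"
	ResourcePodcast         = "volc.service_type.10050"
	ResourceTranslation     = "volc.megatts.simt"
	ResourceVoiceClone      = "seed-icl-2.0"
)

// resourceIDs maps hypothetical short service keys to resource IDs.
var resourceIDs = map[string]string{
	"tts":            ResourceTTSV2,
	"tts-concurrent": ResourceTTSV2Concurrent,
	"asr-stream":     ResourceASRStream,
	"asr-file":       ResourceASRFile,
	"realtime":       ResourceRealtime,
	"podcast":        ResourcePodcast,
	"translation":    ResourceTranslation,
	"voice-clone":    ResourceVoiceClone,
}

// ResourceIDFor looks up the resource ID for a service key.
func ResourceIDFor(service string) (string, bool) {
	id, ok := resourceIDs[service]
	return id, ok
}

// NewV2Headers returns headers carrying the resource ID.
// ASSUMPTION: the header name X-Api-Resource-Id is illustrative.
func NewV2Headers(service string) (http.Header, error) {
	id, ok := ResourceIDFor(service)
	if !ok {
		return nil, fmt.Errorf("no resource ID for service %q", service)
	}
	h := make(http.Header)
	h.Set("X-Api-Resource-Id", id)
	return h, nil
}

func main() {
	h, err := NewV2Headers("asr-stream")
	if err != nil {
		fmt.Println("error:", err)
		return
	}
	fmt.Println(h.Get("X-Api-Resource-Id"))
}
```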

Clusters (V1)

Cluster | Service
volcano_tts | TTS Standard
volcano_mega | TTS BigModel
volcano_icl | Voice Clone
volcengine_streaming_common | ASR Streaming
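
A V1 request typically has to name its cluster, which could look like the sketch below. The cluster strings come from the table; the short service keys, the `v1App` struct, and its JSON field names (`appid`, `token`, `cluster`) are assumptions for illustration and should be checked against the API documentation.

```go
package main

import (
	"encoding/json"
	"fmt"
)

// clusters maps hypothetical short service keys to V1 cluster names
// taken from the table above.
var clusters = map[string]string{
	"tts-standard": "volcano_tts",
	"tts-bigmodel": "volcano_mega",
	"voice-clone":  "volcano_icl",
	"asr-stream":   "volcengine_streaming_common",
}

// v1App is an assumed shape for the application part of a V1 request.
type v1App struct {
	AppID   string `json:"appid"`
	Token   string `json:"token"`
	Cluster string `json:"cluster"`
}

// ClusterFor returns the cluster name for a service key.
func ClusterFor(service string) (string, bool) {
	c, ok := clusters[service]
	return c, ok
}

// BuildV1AppJSON serializes the application block for a V1 request.
func BuildV1AppJSON(appID, token, service string) (string, error) {
	cluster, ok := ClusterFor(service)
	if !ok {
		return "", fmt.Errorf("no cluster for service %q", service)
	}
	b, err := json.Marshal(v1App{AppID: appID, Token: token, Cluster: cluster})
	if err != nil {
		return "", err
	}
	return string(b), nil
}

func main() {
	s, err := BuildV1AppJSON("demo-app", "demo-token", "tts-standard")
	if err != nil {
		fmt.Println("error:", err)
		return
	}
	fmt.Println(s)
}
```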

Examples Directory

  • examples/go/doubaospeech/ - Go SDK examples
  • examples/cmd/doubaospeech/ - CLI test scripts
  • go/cmd/doubaospeech/ - CLI tool