Skip to main content
LINEAR16 (16KHz sampling rate) over the mono channel. Audio should be chunked into 100ms packets for optimal performance. We only support 100ms audio chunks to achieve the right balance between quality, latency, and efficiency.
Network connection speed and consistency are important for Suki to perform well.Suki requires:
  • Upload speed: 1Mbps
  • Bitrate: 768kbps
  • Ping time: 150ms
  • Unloaded latency: <50ms
  • Loaded latency: <150ms
Client should set up a WebSocket Secure (wss://) request with the Suki endpoint. Refer to the Audio Stream API for implementation details.
Send LINEAR16 (16KHz sampling rate) audio over the mono channel. Audio should be chunked into 100ms packets for optimal performance.

Message Types

The streaming protocol supports two types of messages:1. AUDIO Message
  • Carries raw audio data in the supported format
2. EVENT Message
  • Signifies an event on the stream

Supported Events

  • PAUSE: Pause the stream
  • RESUME: Resume the stream
  • CANCEL: User cancels the stream
  • ABORT: Stream is aborted (interruption)
  • KEEP_ALIVE: Pings to sustain the stream during inactivity (such as during paused state). Should be sent every 15 seconds
  • EOF: Indicates closure of stream

Message Examples

{
  "type": "AUDIO",
  "data": "Base64EncodedAudioBytes"
}
The EOF data in the AUDIO message is sent as bytes, not as a string.