Views
4

Your rating
Rate update installation process

Log in to rate this update.
Login

Risk factor
No ratings yet. Be the first to rate this update.

Smooth installs 0%
Minor issues 0%
Major issues 0%

Update Summary

Grok Speech to Text and Text to Speech APIs add standalone audio endpoints for transcription and voice generation. The release highlights low-latency REST and WebSocket support, multilingual transcription, diarization, and simple usage-based pricing.

Update Details

New Features

  • Standalone Speech to Text (STT) API for batch and real-time transcription.
  • Standalone Text to Speech (TTS) API for REST and WebSocket speech generation.
  • Word-level timestamps, speaker diarization, and multichannel transcription.
  • Inverse Text Normalization for numbers, dates, currencies, and similar structured text.
  • Speech tags for fine-grained voice control, including emotion and pacing.

Hints

  • STT pricing is $0.10 per hour for batch and $0.20 per hour for streaming.
  • TTS pricing is $15.00 per 1 million characters.
  • The STT API supports 25+ languages.
  • Rate limits and full pricing details are available in the xAI API console.
Product Information

Vendor: x.ai

Product: grok

Release date: Apr 17, 2026